Search Discussions

225 discussions - 815 posts

  • I'm using CDH4.1.2 on my cluster and I'm trying to use the balancer. When I start it I get log output saying I h ave 117 gb to balance and it will balance 10gb this round, then does nothing. It will ...
    James PettyjohnJames Pettyjohn
    Jan 27, 2013 at 10:05 pm
    Jun 21, 2013 at 6:08 am
  • I'm trying to solve a bit of confusion on my part in setting up a Flume Agent that goes from Netcat- Avro- HDFS with the hopes of accessing via Hive. I've seen all sorts of examples for the Hive ...
    Jan 15, 2013 at 4:57 am
    Jan 20, 2013 at 4:20 am
  • Hi, I am running HBase CDH4 v4.1.2. I have a file with about 11000 put statements that I am using to populate a table. After inserting around 150 records it errors out with an OutOfMemory error. I ...
    Jan 11, 2013 at 8:18 pm
    Jan 14, 2013 at 8:15 pm
  • Hi Guys, I am getting the following exception of running a sqoop export node from Oozie. I have installed the MS SQL Server connector and placed the sql server jdbc jar as well as the connector jar ...
    Jan 4, 2013 at 10:26 pm
    Mar 1, 2013 at 3:30 am
  • Hi, I've been thinking of setting up a multiple node hadoop cluster at home to play around with. However, although I have spare machines they are not all the same spec w.r.t disk size; i.e. 1 has ...
    Yasin MustafaYasin Mustafa
    Jan 4, 2013 at 8:38 pm
    Jan 8, 2013 at 7:31 am
  • ERROR org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode: Exception in doCheckpoint. Please advise. 2013-01-22 19:24:29,867 ERROR org.apache.hadoop.security.UserGroupInformation ...
    Willy ChangWilly Chang
    Jan 24, 2013 at 5:59 pm
    Feb 5, 2013 at 3:35 am
  • Hi, I would like to know if there is a eclipse plugin for CDH 4.1.2 ( Hadoop 2.0.0+552 )?? Regards, Vijay Raajaa G S --
    G.S.Vijay raajaaG.S.Vijay raajaa
    Jan 29, 2013 at 4:11 am
    Jan 29, 2013 at 10:01 am
  • I am getting an error when trying to submit a query that launches a mapreduce job using the Hive Beeline interface. I am connected to a Hive Server2 service. The query is simple. SELECT COUNT(*) FROM ...
    Jan 22, 2013 at 7:36 pm
    Mar 24, 2014 at 11:06 am
  • Hello, I have a region stuck in transition. Restarting the master, the cluster, and clearing out the hbase znode had no effect. Whenever a regionserver tries to open the region I get a file not found ...
    Jan 24, 2013 at 7:13 pm
    Feb 1, 2013 at 9:35 pm
  • Hi experts, Have a question about the race between fair scheduler and speculative execution on CDH3 (any minor version) or older hadoop versions (e.g., 0.18.3). Let's say we have turned on both fair ...
    Manhee JoManhee Jo
    Jan 30, 2013 at 5:49 am
    Feb 1, 2013 at 12:44 am
  • Hi all, I'm trying to get JDBC connectivity up and running to hive-server2 but hitting an odd problem... Basic stuff that doesn't need MR to be worked out can be executed fine (eg count(*) from ...
    James HogarthJames Hogarth
    Jan 25, 2013 at 2:46 pm
    Jan 28, 2013 at 11:19 am
  • Patrick, What version of CDH are you using? It's most likely that the version is old and doesn't contain AbstractGenericUDAFResolver. In that case, you may have to upgrade CDH. Mark On Wed, Jan 9, ...
    Mark GroverMark Grover
    Jan 10, 2013 at 12:07 am
    Jan 18, 2013 at 4:49 pm
  • Hello, I'm trying to configure Hue per these instructions: https://ccp.cloudera.com/display/CDH4DOC/Hue+Installation#HueInstallation-TheHueDatabase Everything goes fine including through the syncdb ...
    Joe TravagliniJoe Travaglini
    Jan 24, 2013 at 10:44 pm
    Jan 28, 2013 at 7:17 pm
  • A general question: do we need any special settings in mapred-site.xml and hive-site.xml configs if we want to use Hive on top of CDH HA? Or the hdfs-site.xml config: <property <name ...
    Ken dengKen deng
    Jan 24, 2013 at 11:06 pm
    Jan 25, 2013 at 3:45 am
  • Hi, does anyone know how to configure multiple Journal nodes on one machine in CDH 4.1? an example from hdfs-site.xml would be very appreciated. Thanks --
    Ken dengKen deng
    Jan 18, 2013 at 4:36 am
    Jan 21, 2013 at 9:06 am
  • Hi, I would like to know the configuration settings when user wants to configure (2 NameNode) HA and kerberos secured cluster with CDH4 distribution. -- Cheers, *Subroto Sanyal* --
    Subroto SanyalSubroto Sanyal
    Jan 7, 2013 at 8:06 am
    Jan 9, 2013 at 9:42 am
  • Hello, I'm using oozie 3.2.0-cdh4.1.2 with MRv1. I can submit jobs through hue, but only when I disable lzo compression. It works without the following lines in core-site.xml: <property <name ...
    Martin Sp..Martin Sp..
    Jan 25, 2013 at 2:27 pm
    May 20, 2013 at 7:39 pm
  • How are you validating this? How many map tasks does your job have? Do you have some JT logs to share? And configs (what scheduler, etc.)? On Thu, Jan 31, 2013 at 10:07 PM, samir das mohapatra ...
    Harsh JHarsh J
    Jan 31, 2013 at 5:38 pm
    Jan 31, 2013 at 8:09 pm
  • Dears, Does anyone know HBase 0.90.6(CDH3U4) has limitation about the total regions per region server? I cannot find any setting in HBse's configuration file!? or I missed something? And when I try ...
    James ChangJames Chang
    Jan 28, 2013 at 5:03 am
    Jan 29, 2013 at 4:32 am
  • Hi, Something abnormal happened in my cluster. Actually the default location of snapshot & dataDir for zookeeper is /var/lib/zookeeper in cdh4. The disk at which /var location is configured became ...
    Kumar, Deepak8Kumar, Deepak8
    Jan 17, 2013 at 6:44 pm
    Jan 28, 2013 at 7:38 pm
  • Has Anybody faced this exception before? I am using the default SQL Server connector provided by Cloudera and SqlJdbc4.jar 2013-01-23 13:58:06,581 INFO org.apache.hadoop.mapred.TaskStatus ...
    Jan 23, 2013 at 9:45 pm
    Jan 24, 2013 at 3:48 pm
  • Hi, we want to start with hbase replication. To verify the replication i started the command: = hadoop jar /usr/lib/hbase/hbase.jar verifyrep --starttime=1358510934 --stoptime=1358521734 --families=m ...
    Elmar GroteElmar Grote
    Jan 18, 2013 at 3:46 pm
    Jan 22, 2013 at 9:41 pm
  • I am running CDH4.1.2 and Cloudera Manager 4.1.2 on CentOS 5. (although this also happened on CDH4.0.1 and Cloudera Manager 4.0.4) In Cloudera Manager I have the mapred.userlog.retain.hours set to ...
    Dan RichelsonDan Richelson
    Jan 14, 2013 at 7:13 pm
    Jan 18, 2013 at 8:25 pm
  • There is a flag for the escape sequence that you need to pass as well. Sent from my iPhone On Jan 16, 2013, at 5:25 PM, "nitin@quaero" wrote: Does anybody recognize this exception? ...
    Justin WorkmanJustin Workman
    Jan 17, 2013 at 12:27 am
    Jan 17, 2013 at 2:53 pm
  • I have set up a value of dfs.block.size to 128 MB in hdfs-site.xml on my client machine. The block size on the cluster Namenode is default - based on my understanding the client side *.site.xml takes ...
    Jan 6, 2013 at 8:08 pm
    Jan 15, 2013 at 1:39 pm
  • Hello dear ML, Happy new year ti everyone. We have some interrogation here about MapReduce (mr1). We have few third-party jars to use as dependencies for ou M/R job. To prevent from uploading them ...
    Damien HardyDamien Hardy
    Jan 8, 2013 at 11:25 am
    Jan 11, 2013 at 5:34 pm
  • Hi, I have a full cluster operational. My problem is HADOOP_HOME on cdh4, MR1.... HADOOP_HOME=/usr/lib/hadoop-0.20-mapreduce $ hadoop fsck Exception in thread "main" java.lang.NoClassDefFoundError ...
    Pixel MadnessPixel Madness
    Jan 28, 2013 at 10:33 pm
    Jan 31, 2013 at 7:41 pm
  • Hi Guys, I am using System.getProperty("oozie.action.conf.xml") to access configuration values in Java Class of an Oozie Java action node. Now, many instances of this class may be instantiated at one ...
    Jan 21, 2013 at 6:50 pm
    Jan 22, 2013 at 3:11 pm
  • Dears, I'm study CDH 4.1.2 release, I saw the following info in http://archive.cloudera.com/cdh4/cdh/4/hadoop-2.0.0-cdh4.1.2.CHANGES.txt ======================================================== ...
    James ChangJames Chang
    Jan 15, 2013 at 3:06 am
    Jan 15, 2013 at 7:27 am
  • Hi Guys, I have an one small query, please help me. Let's say i have 3 nodes, Size of each node is 1 TB and replica factor is 2. So ultimately i can store 150 GB data so that i stored 50 GB data in ...
    Ravi SharmaRavi Sharma
    Jan 11, 2013 at 9:23 am
    Jan 11, 2013 at 5:17 pm
  • For testing, I installed Cloudera mgr 4.1.3 and used it to installed CDH4 onto 7 datanodes. Each datanode has 6 drives (146GB) for local storage. After creating the filesys, we have ...
    Jan 31, 2013 at 6:16 am
    Jan 31, 2013 at 6:43 am
  • Hallo, My name is Bernhard Pflugfelder and I am currently evaluating CDH4 (Version 4.1.2) to possibly become the new standard hadoop distibution within my company. I used the Whirr installation mode ...
    Bernhard PflugfelderBernhard Pflugfelder
    Jan 14, 2013 at 3:55 pm
    Jan 30, 2013 at 11:37 pm
  • Hi All, I have a newly built CDH4 10 node cluster. I'm having an issue with my hive jobs. They seem to start correctly, but then I see the following: "Job running in-process (local Hadoop)" The job ...
    Mike DenningMike Denning
    Jan 29, 2013 at 9:37 pm
    Jan 30, 2013 at 6:41 pm
  • Hi Everyone, I have a problem with sqoop export for Oracle database from hbase. I am using this syntax to export data from hbase to Oracle database. ]# sqoop export --connect ...
    Jan 28, 2013 at 7:20 am
    Jan 30, 2013 at 4:22 pm
  • Moving to cdh-user@ as you seem to indicate this is CDH-specific. Lets continue there. My response below: From your question/code below, it is not clear _what_ you aren't seeing as working alright ...
    Harsh JHarsh J
    Jan 23, 2013 at 5:04 pm
    Jan 23, 2013 at 8:17 pm
  • Hi, I am facing the following problem while starting the oozie server... and i was not able to add the mysql and extjs lib also: These are my configurations in oozie-env.sh: export ...
    Shashwath shenoyShashwath shenoy
    Jan 21, 2013 at 5:34 am
    Jan 23, 2013 at 8:04 am
  • Hi there Is there any way to use arrayList of Puts in map function to insert data to hbase ? Because,the context.write method doesn't allow to use arraylist of puts,so in every map function I can ...
    Farrokh ShahriariFarrokh Shahriari
    Jan 20, 2013 at 11:46 am
    Jan 22, 2013 at 12:12 pm
  • Hi, About a month ago, I upgraded my org's cluster from CDH3 to CDH4. Everything went pretty smoothly, but I've now noticed that the namenode doesn't seem to be deleting old edit logs after what both ...
    Marshall Bockrath-VandegriftMarshall Bockrath-Vandegrift
    Jan 21, 2013 at 1:12 am
    Jan 21, 2013 at 9:30 am
  • Hi, I need to combine multiple tuples in output in a single one in pig script. Could you please guide me how we can do it? dump requestFile; Output: (Logging Transaction ...
    Kumar, Deepak8Kumar, Deepak8
    Jan 9, 2013 at 8:43 am
    Jan 14, 2013 at 6:01 pm
  • Hi, How do we gracefully stop a mapreduce job when we see the desired output in a mapper and say mission accomplished. The context is, we are searching through a million small 10k files for a ...
    Jimson K JamesJimson K James
    Jan 14, 2013 at 6:48 am
    Jan 14, 2013 at 10:35 am
  • Hi all For more than two days I fighting with the installation of CM 4.5 I solved almost all the problems remained one problem that I can not solve it the installation falls with "Installation ...
    Michael G.Michael G.
    Jan 23, 2013 at 4:32 pm
    Feb 6, 2013 at 4:44 pm
  • Hi all, I currently use Whirr to deploy a Hadoop cluster on EC2 with CDH 4. This Whirr approach works pretty good for my needs, but now I have a problem using Hive with such a cluster. After a ...
    Bernhard PflugfelderBernhard Pflugfelder
    Jan 25, 2013 at 10:00 am
    Feb 4, 2013 at 3:05 pm
  • Hey guys, Wondering about dfs.name.dir in a shared NFS HA setup (not HDFS quorum). So in the world before HA, it was recommended to keep a copy of the HDFS metadata in dfs.name.dir on a remote filer, ...
    Joe TravagliniJoe Travaglini
    Jan 30, 2013 at 3:25 pm
    Feb 1, 2013 at 8:57 am
  • Hi All, Any one knows, how to load data from one hadoop cluster(CDH4) to another Cluster (CDH4) . They way our project needs are 1) It should be delta load or incremental load. 2) It should be based ...
    Samir das mohapatraSamir das mohapatra
    Jan 31, 2013 at 5:44 am
    Jan 31, 2013 at 6:15 am
  • Hi , Please help me . I want to use Flume in the following case : Spooling directory source -- FileChannel -- HBase sink . But I have some problems with Spooling directory source : Here is my test ...
    NGuyen thi Kim TuyenNGuyen thi Kim Tuyen
    Jan 28, 2013 at 4:33 am
    Jan 28, 2013 at 7:07 am
  • PIDs for all hadoop/yarn services in CDH4.1, all go to /tmp. I believe it is possible to configure in hadoop or yarn env.sh file. Why things are directed to /tmp when we know the tmp folder gets ...
    Willy ChangWilly Chang
    Jan 24, 2013 at 6:31 pm
    Jan 26, 2013 at 4:19 pm
  • Hi all, I have a map/reduce task running on our cluster that is getting killed with the message: "failed to report status for 600 seconds. Killing!" When I look at the logs the map has completed and ...
    Rob StylesRob Styles
    Jan 14, 2013 at 11:24 am
    Jan 25, 2013 at 3:12 pm
  • Hi! How can a single datanode slow down a whole cluster to a crawl? I noticed a very unwelcome behaviour on a 30+ datanode HDFS cluster a few days ago. One of the datanodes was spending most of its ...
    David MorelDavid Morel
    Jan 24, 2013 at 3:23 pm
    Jan 25, 2013 at 6:47 am
  • Hey, so we are trying to restart our namenode and receive the following error with CDH4. When I look in the configuration file I don't see this directory anywhere. This is the first restart we had ...
    Joe SteinJoe Stein
    Jan 16, 2013 at 2:01 pm
    Jan 17, 2013 at 7:23 pm
  • Hi, I'm currently running CDH 4.1.2, and I've noticed that the task JVMs are all run as user "yarn" instead of the user who submits the job. I vaguely remember that the hadoop-2.0.2-alpha release ...
    Mark F.Mark F.
    Jan 14, 2013 at 8:37 pm
    Jan 15, 2013 at 5:29 pm
Group Navigation
period‹ prev | Jan 2013 | next ›
Group Overview
groupcdh-user @

204 users for January 2013

Harsh J: 86 posts Joey Echeverria: 26 posts Nitin: 24 posts Krishnanand Khambadkone: 18 posts Serega Sheypak: 15 posts Jaroslav Cecho: 13 posts Ken deng: 13 posts Aaron T. Myers: 11 posts Alejandro Abdelnur: 11 posts Dhanasekaran Anbalagan: 11 posts Mark Grover: 11 posts Corgone: 10 posts Jabir Ahmed: 10 posts James Chang: 9 posts Jimson K James: 9 posts Joe Travaglini: 9 posts Logan Hardy: 9 posts Farrokh Shahriari: 8 posts Jason King: 8 posts Kumar, Deepak8: 8 posts
show more