Search Discussions

138 discussions - 567 posts

  • Can I integrate HBase 0.90.4 with hadoop ? -Jignesh
    Jignesh PatelJignesh Patel
    Oct 11, 2011 at 9:12 pm
    Oct 14, 2011 at 9:39 pm
  • Hi , what is the way to execute hadoop job on remote cluster. I want to execute my hadoop job from remote web application , but I didn't find any hadoop client (remote API) to do it. Please advice. ...
    Oleg RuchovetsOleg Ruchovets
    Oct 18, 2011 at 10:41 am
    Oct 20, 2011 at 9:43 am
  • I am using following command to create a file in Unix(i.e. mac) system. bin/hadoop fs -mkdir /user/hadoop-user/citation/input While it creates the directory I need, I am struggling to figure out ...
    Jignesh PatelJignesh Patel
    Oct 10, 2011 at 6:16 pm
    Oct 11, 2011 at 2:04 am
  • I found a way to connect to hadoop via hftp, and it works fine, (read only) : uri = "hftp://172.16.xxx.xxx:50070/"; System.out.println( "uri: " + uri ); Configuration conf = new Configuration(); ...
    Jay VyasJay Vyas
    Oct 28, 2011 at 5:52 am
    Oct 29, 2011 at 4:18 am
  • Hi, When I was running a job on hadoop with 75% mappers finished, the jobtracker hung so that I cannot access jobtrackerserver:7845/jobtracker.jsp and hadoop job -status hung as well. Then I stopped ...
    Peng, WeiPeng, Wei
    Oct 21, 2011 at 7:48 am
    Oct 25, 2011 at 2:08 am
  • Hi All, I have a doubt in hadoop secondary namenode concept . Please correct if the following statements are wrong . The namenode hosts the fsimage and edit log files. The secondary namenode hosts ...
    Oct 6, 2011 at 5:58 am
    Oct 11, 2011 at 10:24 pm
  • I'm new to Hadoop. I've read a few articles and presentations which are directed at explaining what Hadoop is, and how it works. Currently my understanding is Hadoop is an MPP system which leverages ...
    Oct 23, 2011 at 2:59 pm
    Nov 22, 2011 at 1:46 am
  • /** Return the disk usage of the filesystem, including total capacity, * used space, and remaining space */ public DiskStatus getDiskStatus() throws IOException { return dfs.getDiskStatus(); } ...
    Uma Maheswara Rao G 72686Uma Maheswara Rao G 72686
    Oct 15, 2011 at 11:52 am
    Oct 20, 2011 at 2:31 pm
  • I've been doing some work on a Jira and want to assign it to myself but there doesn't seem to be an option to do this. I believe I need to be assigned the contributor role before I can have issues ...
    Jon AllenJon Allen
    Oct 16, 2011 at 4:34 pm
    Oct 18, 2011 at 9:25 pm
  • Hello Everyone, I've been having an issue in a hadoop environment (running cdh3u1) where any table declared in hive with the "STORED AS INPUTFORMAT "com.hadoop.mapred.DeprecatedLzoTextInputFormat"" ...
    Jessica OwensbyJessica Owensby
    Oct 5, 2011 at 6:31 pm
    Oct 17, 2011 at 12:05 am
  • Sending it to the hadoop mailing list - I think this is a hadoop related problem and not related to Cloudera distribution. Raj ----- Forwarded Message -----
    Raj VRaj V
    Oct 3, 2011 at 2:37 pm
    Oct 5, 2011 at 10:07 pm
  • Hello, I am trying to understand how data locality works in hadoop. If you run a map reduce job do the mappers only read data from the host on which they are running? Is there a communication ...
    Ivan NovickIvan Novick
    Oct 25, 2011 at 6:47 pm
    Oct 28, 2011 at 9:25 am
  • Currently, we've got defined: <property <name hadoop.tmp.dir</name <value /hadoop/hadoop-metadata/cache/</value </property In our experiments with SOLR, the intermediate files are so large that they ...
    Meng MaoMeng Mao
    Oct 5, 2011 at 5:33 am
    Oct 26, 2011 at 3:50 pm
  • we are using hadoop on virtual box. when it is a single node then it works fine for big dataset larger than the default block size. but in case of multinode cluster (2 nodes) we are facing some ...
    Humayun gmailHumayun gmail
    Oct 16, 2011 at 8:30 am
    Oct 16, 2011 at 5:29 pm
  • Another point concerning the Combiners, the grouping is currently done using the RawComparator used for sorting the Mapper's output. Wouldn't it be useful to be able to set a custom ...
    Mathias HerbertsMathias Herberts
    Oct 29, 2011 at 11:35 am
    Oct 31, 2011 at 10:16 pm
  • Hi, I installed cygwin on win7, when i run hadoop examples its makes /tmp dir in C:/ (win install dir) not in c:/cygwin (cygwin install dir), so java IOException happened. any solution? Thanks, BS
    Oct 28, 2011 at 6:34 am
    Oct 29, 2011 at 4:04 am
  • Hi guys, I'm realy new to hadoop. I have configured a single node hadoop cluster. but seems that my data node is not working. job tracker log file shows this message(alot of them per 10 second): ...
    Majid AzimiMajid Azimi
    Oct 15, 2011 at 8:51 pm
    Oct 17, 2011 at 5:19 am
  • Mapred LearnMapred Learn
    Oct 26, 2011 at 2:04 pm
    Dec 21, 2011 at 9:02 pm
  • I'm investigating a bug where my mapper and reducer tasks run out of memory. It only reproduces when I run on large data sets, so the best way to dig in is to launch my job with sufficiently large ...
    W.P. McNeillW.P. McNeill
    Oct 17, 2011 at 6:34 pm
    Oct 19, 2011 at 3:12 am
  • I know that hadoop0.19.0 supports append option, but not stable. Does the latest version support append option? Is it stable? Thanks for help. bourne
    Oct 17, 2011 at 7:07 am
    Oct 18, 2011 at 4:58 pm
  • Hi, I'm in the process of putting together a 'Hadoop MapReduce Poster' so my students can better understand the various steps of a MapReduce job as ran by Hadoop. I intend to release the Poster under ...
    Mathias HerbertsMathias Herberts
    Oct 31, 2011 at 1:15 pm
    Nov 1, 2011 at 10:55 am
  • How can one test that LZO compression is configured correctly? I can find
    Oct 30, 2011 at 2:27 am
    Oct 30, 2011 at 5:05 pm
  • What is the correct way to reserve space for hdfs? I currently have 2 filesystem, /fs1 and /fs2 and I would like to reserve space for non-dfs operations. For example, for /fs1 i would like to reserve ...
    Oct 27, 2011 at 12:24 am
    Oct 28, 2011 at 11:24 am
  • hi hadoopers, how's your weekend going? i do run out of idea at this point abt behavior of capacity scheduler; been stuck with this for a day and night. I referred to this doc: ...
    Patrick sangPatrick sang
    Oct 15, 2011 at 11:46 pm
    Oct 20, 2011 at 12:35 am
  • I also found another problem if I directly export from eclipse as a jar file then while trying javac -jar or hadoop -jar doesn't recognize that jar. However same jar works well with windows.
    Jignesh PatelJignesh Patel
    Oct 6, 2011 at 12:14 am
    Oct 6, 2011 at 2:21 pm
  • Hello, We have 12 node Hadoop Cluster that is running Hadoop 0.20.2-cdh3u0. Each node has 8 core and 144GB RAM (don't ask). So, I want to take advantage of this huge RAM and run the map-reduce jobs ...
    N.N. GesliN.N. Gesli
    Oct 28, 2011 at 7:09 am
    Nov 4, 2011 at 11:24 am
  • Hi, I wrote a small test program to perform a simple database extraction of information from a simple table on a remote cluster. However, it fails to execute successfully when I run from eclipse it ...
    Jamal xJamal x
    Oct 28, 2011 at 5:18 pm
    Nov 2, 2011 at 4:51 am
  • Hi, I'm designing a 'Hadoop MapReduce Poster', putting all pieces together so people will easily be able to visualize the full M/R flow. Concerning the combiners, I have a few points I'd like to have ...
    Mathias HerbertsMathias Herberts
    Oct 29, 2011 at 10:53 am
    Oct 31, 2011 at 5:42 pm
  • Hi guys : What is the meaning of an EOF exception when trying to connect to Hadoop by creating a new FileSystem object ? Does this simply mean the system cant be read ? java.io.IOException: Call to ...
    Jay VyasJay Vyas
    Oct 30, 2011 at 11:48 pm
    Oct 31, 2011 at 5:37 am
  • Hi All, I was looking into FAQ, but well still have questions. Datanodes in my production are running low in the space of one of dfs.data.dir /dev/sda5 -- 355G 322G 33G 91% /hadoop1 <---- /dev/sdb1 ...
    Patai SangbutsarakumPatai Sangbutsarakum
    Oct 24, 2011 at 7:10 pm
    Oct 26, 2011 at 10:34 pm
  • Hi guys, I'm running a 1-node Hadoop 0.20.2 pseudo-distributed node with RedHat 6.1 on Amazon EC2 and while my node is healthy, I can't seem to get to the JobTracker GUI working. Running 'curl ...
    Sameer FarooquiSameer Farooqui
    Oct 24, 2011 at 6:03 pm
    Oct 25, 2011 at 9:16 am
  • Hi All, i'm newbie on hadoop, if i installed hadoop on 2 node, where is hdfs running ? on master or slave node ? and then if i running sqoop for export dbms to hive, is it give effect on speed up ...
    Oct 21, 2011 at 12:04 am
    Oct 25, 2011 at 7:13 am
  • Hi all, I'm executing one job to convert logs into hive tables. The times are very good once we have added a proper number of nodes but the reduce phase spends always more time in one of the ...
    Raimon BoschRaimon Bosch
    Oct 23, 2011 at 1:02 am
    Oct 24, 2011 at 4:19 pm
  • Hello, I am trying to write my very first MapReduce code. When I try to run the jar, I get this error: 11/10/15 17:17:30 INFO mapred.JobClient: Task Id : attempt_201110151636_0003_m_000001_2, Status ...
    Keith ThompsonKeith Thompson
    Oct 15, 2011 at 9:27 pm
    Oct 17, 2011 at 2:13 am
  • Hi all, I am trying a simple extension of WordCount example in Hadoop. I want to get a frequency of wordcounts in descending order. To that I employ a linear chain of MR jobs. The first MR job (MR-1) ...
    Oct 15, 2011 at 12:32 am
    Oct 15, 2011 at 7:09 pm
  • I am trying to use distcp to copy a file from one HDFS to another. But while copying I am getting the following exception : hadoop distcp hdfs://ub13:54310/user/hadoop/weblog ...
    Praveenesh kumarPraveenesh kumar
    Oct 5, 2011 at 5:16 am
    Oct 11, 2011 at 5:35 am
  • Hi, I'm running a cluster on amazon and sometimes I'm getting this exception: 2011-10-07 10:36:28,014 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: org.apache.hadoop.ipc.RemoteException: ...
    Raimon BoschRaimon Bosch
    Oct 7, 2011 at 10:47 am
    Oct 7, 2011 at 2:36 pm
  • Hey all! I am having an issue with hadoop's daily datanode log growing to + 1.8 GB. I have 3 Nodes in my hdfs cluster, all sharing the same configuration (including same log4j.properties). While ...
    Ronen ItkinRonen Itkin
    Oct 30, 2011 at 9:03 am
    Nov 1, 2011 at 12:51 pm
  • Hi, Is there any meetup group around Bangalore area? Would be very useful to have one in this part of the world.. -- Regards, R.V.
    Real great..Real great..
    Oct 30, 2011 at 2:11 am
    Oct 30, 2011 at 7:51 am
  • Hi, I have used Hadoop with Linux successfully and am now trying to use it on a Windows machine running Cygwin. I ran a job and am now trying to use bin/hadoop fs -get <source <destination to ...
    Keith ThompsonKeith Thompson
    Oct 28, 2011 at 4:04 pm
    Oct 29, 2011 at 12:56 am
  • hi, We managed to lost data when 1 datanode broke down in a cluster of 6 datanodes with replication factor 3. As far as I know, that shouldn't happen, since each blocks should have 1 copy in 3 ...
    Oct 21, 2011 at 9:26 am
    Oct 22, 2011 at 10:32 am
  • Is there such a thing somewhere? I have the basic nPath, lucene-like search processing but looking for ETL like transformations, typical weblog processor or clickstream. Anything beyond "wordcount" ...
    Alex GauthierAlex Gauthier
    Oct 20, 2011 at 3:07 am
    Oct 20, 2011 at 7:00 am
  • Hi guys, noob questions; What do I need to install a new node soon to be added to a cluster and how do I add it? I'm using CDH3 distribution. Thank you!! Alex Gauthier Engineering Manager Teradata ...
    Gauthier, AlexanderGauthier, Alexander
    Oct 18, 2011 at 12:41 am
    Oct 18, 2011 at 4:55 pm
  • Hi, I'm downloading mainframe files using FTP in binary mode on to local file system. These files are now seen as EBCDIC. The information about these files are (a) fixed in length ( each field in ...
    Oct 15, 2011 at 8:51 am
    Oct 15, 2011 at 5:47 pm
  • Hi all, I was wondering if there are any (technical) issues with running two secondary namenodes on two separate servers rather than running just one. Since basically everything falls or stands with ...
    Jorn Argelo - EphorusJorn Argelo - Ephorus
    Oct 12, 2011 at 7:49 am
    Oct 12, 2011 at 10:51 am
  • I m trying to run attached program. My input directory structure is /user/hadoop-user/input/cite65_77.txt file. But it doesn't do anything. It doesn't read the file and not creates output directory.
    Jignesh PatelJignesh Patel
    Oct 10, 2011 at 10:05 pm
    Oct 12, 2011 at 4:19 am
  • I plan to deploy a HDFS cluster which will be shared by multiple MapReduce clusters. I wonder whether this is possible. Will it incur any conflicts among MapReduce (e.g. different MapReduce clusters ...
    Zhenhua (Gerald) GuoZhenhua (Gerald) Guo
    Oct 7, 2011 at 5:10 pm
    Oct 11, 2011 at 7:23 pm
  • Hi, Following this instructions at http://wiki.apache.org/hadoop/HowManyMapsAndReduces I've read that the best amount of reducers for one process is 0.95 or 1.75 * (nodes * ...
    Raimon BoschRaimon Bosch
    Oct 11, 2011 at 12:18 pm
    Oct 11, 2011 at 6:26 pm
  • I see that the JobConf class used in the WordCount tutorial is deprecated for the Configuration class. I am wanting to change the file input format (to the StreamInputFormat for XML as in Hadoop: The ...
    Keith ThompsonKeith Thompson
    Oct 11, 2011 at 4:19 pm
    Oct 11, 2011 at 6:02 pm
  • Hello, Correct me if I'm wrong, but when a program opens n-files at the same time to read from, and start reading from each file at a time 1 line at a time. Isn't hadoop actually fetching ...
    Mark questionMark question
    Oct 5, 2011 at 6:42 am
    Oct 11, 2011 at 5:13 am
Group Navigation
period‹ prev | Oct 2011 | next ›
Group Overview
groupcommon-user @

168 users for October 2011

Jignesh Patel: 47 posts Harsh J: 41 posts Uma Maheswara Rao G 72686: 24 posts Bejoy KS: 20 posts Patrick sang: 15 posts Jay Vyas: 13 posts Raimon Bosch: 13 posts Shevek: 13 posts Masoud: 9 posts Raj Vishwanathan: 9 posts Alexander C.H. Lorenz: 8 posts Ramya Sunil: 8 posts Brock Noland: 7 posts Joey Echeverria: 7 posts Meng Mao: 7 posts Peng, Wei: 7 posts Ivan Novick: 6 posts Keith Thompson: 6 posts Mark question: 6 posts Mathias Herberts: 6 posts
show more