FAQ

Search Discussions

77 discussions - 249 posts

  • Moving to mapreduce-user@, bcc common-user@. Can you see any errors in the logs? Typically this happens when you have no NodeManagers. Check the 'nodes' link and then RM logs. Arun
    Arun C MurthyArun C Murthy
    Dec 9, 2011 at 7:15 am
    Dec 9, 2011 at 7:51 am
  • For Parsing job history logs in H23, One way I see is to figure out the history file path (JobHistoryUtils.getConfiguredHistoryServerDoneDirPrefix(conf)/YYYY/MM/DD/<job serial number /<job_id ...
    RDRD
    Dec 3, 2011 at 8:37 pm
    Dec 5, 2011 at 8:07 am
  • Hi guys ! I have set up a single node cluster as per below link http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/#run-the-mapreduce-job I have tried to run ...
    Arun kArun k
    Dec 3, 2011 at 3:16 am
    Dec 3, 2011 at 5:38 am
  • Pig script allocates 15 nodes to run a pig script *(how to configure that?)*but what happens on a 10 nodes cluster only? UI shows that 15 reducers are running, and at the same time max number of ...
    Keren OuaknineKeren Ouaknine
    Dec 1, 2011 at 1:06 am
    Dec 1, 2011 at 2:59 am
  • Hi, I have a 7-node setup (1 - Namenode/JobTracker, 6 - Datanodes/TaskTrackers) running Hadoop version 0.20.203. I performed the following test: Initially cluster is running smoothly. Just before ...
    Rajat GoelRajat Goel
    Dec 27, 2011 at 1:09 pm
    Dec 27, 2011 at 1:09 pm
  • Hi, In the past few weeks we evaluated and partially migrated from Hadoop 0.20.203.0 to 0.22.0. Most stuff works fine locally and simple jobs do well on the cluster. However, the most essential part ...
    Markus JelsmaMarkus Jelsma
    Dec 23, 2011 at 2:19 pm
    Dec 23, 2011 at 2:19 pm
  • Hi. I'm testing Apache Nutch on Hadoop 0.22.0 and migrated from 0.20.203. Many more tasks fail for unknown reasons (they timeout) while they didn't on the other cluster that was much less high-end. I ...
    Markus JelsmaMarkus Jelsma
    Dec 22, 2011 at 2:05 pm
    Dec 22, 2011 at 2:05 pm
  • I want to try hadoop 0.22. Its possible to downgrade it later without loosing HDFS content (namenode, datanodes)?
    Radim KolarRadim Kolar
    Dec 22, 2011 at 6:57 am
    Dec 22, 2011 at 6:57 am
  • Hi guys ! If we neglect the shuffle part, can reduce phase be CPU/IO bound ? Can anyone suggest some benchmark or example where we can see this ? Arun
    Arun kArun k
    Dec 22, 2011 at 6:16 am
    Dec 22, 2011 at 6:16 am
  • Hi, On 0.22.0 we sometimes see a shuffle phase being stuck to a point where the framework does not kill it because of lack of progress. The reducer's tasktracker log keeps filling up with two ...
    Markus JelsmaMarkus Jelsma
    Dec 20, 2011 at 7:18 am
    Dec 20, 2011 at 7:18 am
  • Moving it to mapreduce-list. Sophie, This could just be a bug a 0.23. 0.23 does not have jobtrackers/tasktrackers. Could you see if you can recreate this? If yes, please do file a jira on this. ...
    Mahadev KonarMahadev Konar
    Dec 19, 2011 at 11:44 pm
    Dec 19, 2011 at 11:44 pm
  • Hi, In the hadoop MapReduce, I've executed the webdatascan example, and the reduce output is in a SequeceFile. The result is shows here ( http://paste.lisp.org/display/126572). What's the trash ...
    Pedro CostaPedro Costa
    Dec 19, 2011 at 1:55 pm
    Dec 19, 2011 at 1:55 pm
  • Hi, I'm migrating Apache jobs to the new MapReduce API. I came across too many issues but there's one i can't seem to figure out: SequenceFile.Reader[] readers = ...
    Markus JelsmaMarkus Jelsma
    Dec 16, 2011 at 11:28 am
    Dec 16, 2011 at 11:28 am
  • Is there a way to pass a service to the output format? I have an object which I would like to initialize/configure outside and then pass in (since it must also be used elsewhere). So far I have been ...
    Adam PortleyAdam Portley
    Dec 15, 2011 at 12:45 am
    Dec 15, 2011 at 12:45 am
  • Hi, is it possible use side-effect file using streaming (python)? If it is, how can i do it? Thanks.
    Kadu canGica EduardoKadu canGica Eduardo
    Dec 14, 2011 at 5:24 pm
    Dec 14, 2011 at 5:24 pm
  • Moving to mapreduce-user@, bcc common-user@. Please use project specific lists. Take a look at JobTracker.heartbeat - *Scheduler.assignTasks. After the scheduler 'assigns' tasks, the JT sends the ...
    Arun C MurthyArun C Murthy
    Dec 13, 2011 at 7:12 pm
    Dec 13, 2011 at 7:12 pm
  • HI guys ! I have a single node set up as per http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/ 1 I have put some sysout statements in Jobtracker and wordcount ...
    Arun kArun k
    Dec 13, 2011 at 2:37 pm
    Dec 13, 2011 at 2:37 pm
  • Hi, I receive the following error while starting datanode in secure mode of hadoop 0.23 2011-12-14 14:35:48,468 INFO http.HttpServer (HttpServer.java:addGlobalFilter(476)) - Added global filter ...
    Sri ramSri ram
    Dec 13, 2011 at 9:27 am
    Dec 13, 2011 at 9:27 am
  • Hi guys ! I have set up a single node cluster as per below link http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/#run-the-mapreduce-job I have tried to run ...
    Arun kArun k
    Dec 13, 2011 at 6:53 am
    Dec 13, 2011 at 6:53 am
  • Hi guys ! I have set up cluster according to http://ankitasblogger.blogspot.com/2011/01/hadoop-cluster-setup.html I have set topology.script.file.name in core-site.xml I tried this simple test ...
    Arun kArun k
    Dec 13, 2011 at 6:40 am
    Dec 13, 2011 at 6:40 am
  • HI, I TRIED INSTALLING HADOOP SECURE MODE IN 0.23 VERSION.I STUCK UP WITH THE ERROR OF java.lang.RuntimeException: Cannot start secure cluster without privileged resources. at ...
    Sri ramSri ram
    Dec 12, 2011 at 8:56 am
    Dec 12, 2011 at 8:56 am
  • Hi guys ! I want to see the behavior of a single node of Hadoop cluster when IO intensive / CPU intensive workload and mix of both is submitted to the single node alone. These workloads must stress ...
    Arun kArun k
    Dec 9, 2011 at 6:27 am
    Dec 9, 2011 at 6:27 am
  • Harsh, I had a doubt regarding task runtimes displayed in Web GUI b'coz the Web GUI shows only task run times in seconds and not in milliseconds. Can i make it display nanotime or atleast ...
    Arun kArun k
    Dec 4, 2011 at 4:54 am
    Dec 4, 2011 at 4:54 am
  • Arun, Rumen is a tool which converts Hadoop MapReduce logs into a standard format. The Rumen trace and the jobhistory files are identical in some sense (information content wise). It would be helpful ...
    Amar KamatAmar Kamat
    Dec 3, 2011 at 8:05 am
    Dec 3, 2011 at 8:05 am
  • I have some test code that would kill performance (and not work properly on a cluster but can accumulate test data in stand alone mode. I want a way to programatically when the code is running in ...
    Steve LewisSteve Lewis
    Dec 2, 2011 at 12:11 am
    Dec 2, 2011 at 12:11 am
  • Hi guys ! Apart from generating the job traces from RUMEN , can i get logs or job traces of varied sizes from some organizations. How can i make sure that the rumen generates only say 25 jobs,50 jobs ...
    Arun kArun k
    Dec 1, 2011 at 3:17 am
    Dec 1, 2011 at 3:17 am
  • Hi, I m defining custom counters in mapper that I want to access in reducer in new API. Does anyone know how to do this ? Thanks, JJ Sent from my iPhone
    Mapred LearnMapred Learn
    Dec 1, 2011 at 2:15 am
    Dec 1, 2011 at 2:15 am
Group Navigation
period‹ prev | Dec 2011 | next ›
Group Overview
groupmapreduce-user @
categorieshadoop
discussions77
posts249
users66
websitehadoop.apache.org...
irc#hadoop

66 users for December 2011

Harsh J: 23 posts Markus Jelsma: 21 posts Arun C Murthy: 20 posts Arun k: 19 posts Praveen Sripati: 10 posts Robert Evans: 9 posts Eyal Golan: 8 posts Bejoy Ks: 7 posts Kevin Burton: 6 posts Mahadev Konar: 6 posts Raghavendhra rahul: 6 posts Avery Ching: 5 posts John Miller: 5 posts Nitin Khandelwal: 5 posts Ann Pal: 4 posts Costin Leau: 4 posts Hadoop anis: 4 posts Keren Ouaknine: 4 posts Sri ram: 4 posts Bing Jiang: 3 posts
show more