17 discussions - 46 posts

  • Hi, I have a MapReduce program embedded in a Java application, and I am trying to load additional jar files as add-ons for the execution of my MapReduce job. My program works like this: JobConf conf = ...
    Eric Yang
    Dec 31, 2009 at 11:22 pm
    Jan 20, 2010 at 5:54 pm
  • Can someone explain how to override the "FileInputFormat" and "RecordReader" in order to be able to read multiple lines of text from input files in a single map task? Here the key will be the offset ...
    Kunal Gupta
    Dec 1, 2009 at 6:17 am
    Dec 2, 2009 at 1:42 pm
  • Hi folks, I have a question regarding a map-reduce job that gets killed during the copy phase due to a timeout. I don't really understand the output completely, so I am first looking for some help to ...
    Simon Willnauer
    Dec 9, 2009 at 6:31 pm
    Dec 9, 2009 at 7:02 pm
  • Hi all, I'd like to configure FairScheduler on Hadoop, but it does not seem to work. The following is what I did: 1. add fairscheduler.jar to lib 2. add the following property to mapred-site.xml ...
    Jeff Zhang
    Dec 11, 2009 at 3:13 am
    Dec 11, 2009 at 11:27 pm
  • I am writing a custom InputFormat to read N lines per map task. For this I have extended the FileInputFormat and RecordReader classes. In my RecordReader I am using a LineRecordReader object ...
    Kunal Gupta
    Dec 2, 2009 at 1:46 pm
    Dec 9, 2009 at 5:12 am
  • Hello! I'm trying to rewrite an image resizing program in terms of map/reduce. The problem I see is that the job is not broken up into small enough tasks. If I only have 1 input file with 10,000 ...
    Daniel Garcia
    Dec 3, 2009 at 4:26 pm
    Dec 5, 2009 at 2:06 am
  • Please direct me to a different forum if I am in the wrong place. I am just trying to compile Hadoop out of the box ... I first download hadoop-0.20.1 and untar it. Next, I run $ANT_HOME/bin/ant, ...
    Calvin
    Dec 10, 2009 at 11:37 pm
    Dec 10, 2009 at 11:48 pm
  • I am outputting a message to stdout and stderr from inside my map function, but the stdout and stderr files in the log folder are empty. I tried looking at these log files from the web UI as well. They are ...
    Kunal Gupta
    Dec 8, 2009 at 12:14 pm
    Dec 8, 2009 at 6:45 pm
  • Hi, unit testing a mapper with the old mapred API is easy. However, for the new mapreduce API, I struggle to create the Context object in an easy and elegant way. How do I best mock the Context for ...
    Bernd Fondermann
    Dec 1, 2009 at 8:44 am
    Dec 1, 2009 at 7:00 pm
  • Hello! I was wondering if there was any convenient way to sort the Reducer output? Specifically, in WordCount is there a way to sort the results by frequency? Thank you very much, I'm sorry if this ...
    Mark Vigeant
    Dec 14, 2009 at 8:14 pm
    Dec 14, 2009 at 8:14 pm
  • Hello communities, I'm happy to announce that we've developed a new computing model called BSP (Bulk Synchronous Parallel) on top of Hadoop. Here are the slides for the topic "Apache HAMA: An ...
    Edward J. Yoon
    Dec 11, 2009 at 3:32 pm
    Dec 11, 2009 at 3:32 pm
  • As an exercise while learning MapReduce, I developed an algorithm for matrix multiplication and wrote it up on my web site. If you're interested, it's at: ...
    John Norstad
    Dec 8, 2009 at 4:45 pm
    Dec 8, 2009 at 4:45 pm
  • Hi, I want to write files to HDFS using Hadoop Pipes. Can anyone tell me how to do that? I'm using an external library that writes its output to disk, so I probably have to read that output and ...
    Cyk33
    Dec 8, 2009 at 11:30 am
    Dec 8, 2009 at 11:30 am
  • Hi, I want to write files to HDFS using Hadoop Pipes. Can anyone tell me how to do that? I'm using an external library that writes its output to disk, so I probably have to read that output and ...
    Cyk33
    Dec 8, 2009 at 11:21 am
    Dec 8, 2009 at 11:21 am
  • Hello, I am running a specific Map/Reduce task on a Hadoop 0.19.2 cluster. The job was split into 509 maps; 507 maps ran quickly enough, 1-2 minutes each; cluster capacity: 9 maps, 3 reduces. The problem ...
    Fuad Efendi
    Dec 8, 2009 at 12:16 am
    Dec 8, 2009 at 12:16 am
  • I hope y'all can offer me some insight. I've only started looking hard at Hadoop in the last couple of days, but I've yet to see a good example that I can hack for a style of problem I tend to ...
    Nathan Edwards
    Dec 4, 2009 at 7:57 pm
    Dec 4, 2009 at 7:57 pm
  • Hi all, I am processing a large tab file to format it for loading into a database with a predefined schema. I have a tab file with a column that I need to normalize out to another table and ...
    Tim Robertson
    Dec 1, 2009 at 9:06 am
    Dec 1, 2009 at 9:06 am
Group Overview
group: mapreduce-user @
categories: hadoop
discussions: 17
posts: 46
users: 25
website: hadoop.apache.org...
irc: #hadoop

25 users for December 2009

  • Kunal Gupta: 8 posts
  • Aaron Kimball: 4 posts
  • Geoffry Roberts: 4 posts
  • Amogh Vasekar: 3 posts
  • Fuad Efendi: 3 posts
  • Cyk33: 2 posts
  • Guillaume Viland: 2 posts
  • Sean Owen: 2 posts
  • Simon Willnauer: 2 posts
  • Bernd Fondermann: 1 post
  • Calvin: 1 post
  • Chris Douglas: 1 post
  • Daniel Garcia: 1 post
  • Ed Mazur: 1 post
  • Edward J. Yoon: 1 post
  • Eric Yang: 1 post
  • Jeff Zhang: 1 post
  • John Norstad: 1 post
  • Mark Vigeant: 1 post
  • Nathan Edwards: 1 post