87 discussions - 448 posts

  • I am struggling to control the behavior of the framework. The first problem is simple: I want to run many simultaneous mapper tasks on each node. I've scoured the forums, done the obvious, and I ...
    Lance Amundsen
    Oct 17, 2007 at 6:27 am
    Oct 25, 2007 at 5:40 pm
  • -- Bin YANG Department of Computer Science and Engineering Fudan University Shanghai, P. R. China EMail: yangbinisme82@gmail.com
    Bin YANG
    Oct 18, 2007 at 10:14 am
    Oct 22, 2007 at 9:37 pm
  • Hi, Can I recursively launch a job within the map() method of a mapper? The task I am facing involves extracting metadata for filenames and catalog names from hierarchical catalogs, and for each ...
    Jim the Standing Bear
    Oct 29, 2007 at 8:29 pm
    Oct 30, 2007 at 7:20 pm
  • A very basic question: where to store my personal global variables such that the map and/or reduce functions can see it? Thanks, James
    James Yu
    Oct 12, 2007 at 2:17 am
    Oct 13, 2007 at 9:50 pm
  • Hi All, Does anyone have comments about how HBase will perform in a 4 node cluster compared to an equivalent MySQL configuration? Thanks, Rafael
    Rafael Turk
    Oct 11, 2007 at 10:36 pm
    Oct 13, 2007 at 5:07 am
  • Hi all, I'm getting started with Hadoop and with Yahoo Pig (http://research.yahoo.com/project/pig). I decided to take a look at the code and made some small changes; I announce them here if anyone is ...
    Juan Manuel Caicedo
    Oct 25, 2007 at 4:19 am
    Nov 1, 2007 at 3:30 am
  • Hello Hadoopers! I have just recently started using Hadoop and I have a question that has puzzled me for a couple of days now. I have already browsed the mailing list and found some relevant posts, ...
    Daniel Wressle
    Oct 8, 2007 at 8:23 am
    Oct 10, 2007 at 4:54 pm
  • I am just getting started with HBase. Thinking about using it for future Nutch development. Have successfully built with ant scripts. In the src I see conf and bin directories similar to Hadoop. But ...
    Dennis Kubes
    Oct 18, 2007 at 6:16 pm
    Oct 23, 2007 at 8:45 pm
  • I would like someone to compare and contrast CIFS and HDFS? Or...if that is not a valid comparison...please explain to me why it's not a valid comparison. Thanks, Trevor . This message and any ...
    Oct 15, 2007 at 7:24 pm
    Oct 17, 2007 at 4:33 pm
  • Hi, I was writing a test mapreduce program and noticed that the input file was always broken down into separate lines and fed to the mapper. However, in my case I need to process the whole file in ...
    Ming Yang
    Oct 15, 2007 at 1:59 pm
    Oct 15, 2007 at 9:39 pm
  • Does anybody know if there is a jdk6 available for Mac? I checked the apple developer site, and there doesn't seem to be one available, despite blogs from last year claiming apple was distributing ...
    Michael Bieniosek
    Oct 12, 2007 at 5:35 pm
    Oct 13, 2007 at 12:36 am
  • Hello all Is there a best practice for using my own classes as keys and values? My first attempt at doing this was successful - I built a BigIntegerWritable class using IntWritable as a template. It ...
    Steve Schlosser
    Oct 10, 2007 at 3:03 pm
    Oct 11, 2007 at 1:29 am
  • Hi, I am new to Hadoop, and tried to set up a distributed Hadoop system. But when I tried to run the example job, it stack dumped with the following exception: org.apache.hadoop.ipc.RemoteException: ...
    Jim the Standing Bear
    Oct 8, 2007 at 6:35 pm
    Jan 8, 2013 at 12:13 pm
  • Maybe this question should be moved to the developer list, but .. it is not that I currently intend to contribute to the project.. I just wanted to see if I could get the code to compile. This is what ...
    Peter Thygesen
    Oct 15, 2007 at 1:53 pm
    Nov 26, 2007 at 6:02 pm
  • I index with NutchWax.jar and have the directories CrawlDb, LinkDb, index, indexes, segments. After that I search with Nutch, but the Tomcat logs show the error java.lang.IllegalStateException at ...
    Oct 16, 2007 at 8:52 am
    Oct 24, 2007 at 10:32 am
  • Hi, In the original MapReduce paper from Google, it is mentioned that healthy workers can take over failed tasks from other workers. Does Hadoop have the same failure recovery strategy? Also the other ...
    Ming Yang
    Oct 19, 2007 at 3:05 am
    Oct 20, 2007 at 12:55 pm
  • I had a somewhat difficult time figuring out how to get hbase started. In the end, it was pretty simple. Here are the steps: 1. Download hadoop from svn, untar to directory say ~/hadooptrunk and ...
    Dennis Kubes
    Oct 22, 2007 at 4:10 am
    Oct 30, 2007 at 5:26 pm
  • Hi, I have been facing some problems in Hadoop. I find the NameNode to be in safe mode. May I know why the NameNode is in safe mode and what happens when it is in safe mode? Thanks in advance Preethi.C ...
    Preethi Chockalingam
    Oct 17, 2007 at 5:06 am
    Oct 17, 2007 at 6:08 pm
  • Hello all, I am new to Hadoop. I am trying to write a file to a single cluster and am getting this exception when I try to close the output stream: java.io.IOException: CreateProcess: df -k ...
    Oct 13, 2007 at 3:06 pm
    Oct 13, 2007 at 10:21 pm
  • Hi All: I am tearing out my hair trying to get a simple valueaggregator to run. This seems like an easy enough thing, but I am consistently getting hit with ClassNotFoundException which makes no ...
    C G
    Oct 12, 2007 at 9:43 pm
    Oct 13, 2007 at 4:02 am
  • Hi all, I am facing a problem with aggregations where reduce groups are extremely large. It's a very common usage scenario - for example someone might want the equivalent of 'count (distinct.field2) ...
    Joydeep Sen Sarma
    Oct 11, 2007 at 8:15 pm
    Oct 11, 2007 at 10:32 pm
  • Hello all. Due to limited space in current datacenter, I am trying to move my Hadoop cluster to a new datacenter. In the new datacenter, each machine will keep its hostname, but each will be assigned ...
    Taeho Kang
    Oct 4, 2007 at 1:28 am
    Oct 5, 2007 at 4:33 pm
  • Hi there. I did a search of the mailing list archives looking for something similar to this, but I didn't find anything so apologies if this has been discussed before. I'm investigating using Hadoop ...
    Robert Jessop
    Oct 16, 2007 at 3:39 pm
    Oct 18, 2007 at 10:24 am
  • Hi, I have a Hadoop cluster of three machines. When a wordcount example was submitted it was working fine. Nowadays when I submit a job I get a socket error and the job does not work. Any reason as to why ...
    Oct 10, 2007 at 3:21 am
    Oct 15, 2007 at 7:27 am
  • Hey, where's Hadoop? I've never seen an open-source version of Bigtable. ... "The centers will run an open-source version of Google’s data center software, and I.B.M. is contributing open-source ...
    Jonathan Hendler
    Oct 9, 2007 at 11:48 am
    Oct 10, 2007 at 2:17 am
  • I know in Hadoop we can implement multi-threaded, asynchronous
    Nguyen Manh Tien
    Oct 4, 2007 at 1:44 am
    Oct 4, 2007 at 6:43 pm
  • hello, I have several questions on the physical storage of the HBase: 1. Does HBase store each table in A format: "com.cnn.www", t6, "<html ...", "com.cnn.www", t5, "<html ...", "com.cnn.www", t3, ...
    Bin YANG
    Oct 31, 2007 at 8:03 am
    Oct 31, 2007 at 3:44 pm
  • Hello @all! I am using Hadoop (version 0.14.3) and I tried to execute Hadoop-Streaming with C. Firstly, I compiled and linked my C-Files, then specified it as mapper and reducer to hadoop-streaming. ...
    Christian Kremnitzer
    Oct 30, 2007 at 3:49 pm
    Oct 31, 2007 at 8:44 am
  • Hi All: Environment: 4 node grid running hadoop-0.14.1. With the system shutdown I wiped out the old HDFS directory structure and created an empty directory. Did a namenode format, and then brought ...
    C G
    Oct 30, 2007 at 6:06 pm
    Oct 30, 2007 at 8:36 pm
  • Hi All, I am adding two input dirs to a job. Both input dirs have the same <Key.class, Value.class>. Inside the map method I want to know which pair <key, value> has come from which input dir. How ...
    Shailendra Mudgal
    Oct 12, 2007 at 12:52 pm
    Oct 15, 2007 at 4:52 am
  • Hello, Is it necessary to run the -upgrade operation to take a cluster from 0.14.1 to 0.14.2? None of the release pages say... Thanks, Stu Hood Webmail.us "You manage your business. We'll manage your ...
    Stu Hood
    Oct 11, 2007 at 12:10 am
    Oct 12, 2007 at 5:14 pm
  • Hi, I'm trying to get Hadoop running on Windows, and I've found that it isn't exactly a simple process (yes, I'm using Cygwin). I've had a quick look though the mailing list, and I see the occasional ...
    Nick Lothian
    Oct 8, 2007 at 5:44 am
    Oct 11, 2007 at 5:28 am
  • Hi, Is there anywhere that I can find sample Hadoop programming examples? -- Best Regards, S.Mehdi Sheikhalishahi, Web: http://www.cse.shirazu.ac.ir/~alishahi/ Bye.
    Mehdi Sheikhalishahi
    Oct 8, 2007 at 11:01 am
    Oct 11, 2007 at 12:07 am
  • Hello! Quick simple question, hopefully someone out there could answer. Does the hadoop dfs support putting multiple files at once? The documentation says -put only works on one file. What's the best ...
    Chris Fellows
    Oct 31, 2007 at 8:50 pm
    Oct 31, 2007 at 10:01 pm
  • I have a corpus of 300,000 raw HTML files that I want to read in and parse using Hadoop. What is the best input file format to use in this case? I want to have access to each page's raw HTML in the ...
    David Balatero
    Oct 25, 2007 at 12:09 am
    Oct 25, 2007 at 6:41 am
  • Hi, I'm using the Hadoop 0.14.1 AMI as my master node [ami-64f6130d], and I've followed up to "test your cluster" bit of the tutorial listed here: http://wiki.apache.org/lucene-hadoop/AmazonEC2 ...
    Tiger Uppercut
    Oct 17, 2007 at 7:55 pm
    Oct 17, 2007 at 11:45 pm
  • Hi, when I use the "hadoop dfs -cat" command, I keep getting the problem that says "Could not obtain block 0 from any node: java.io.IOException: No live nodes contain current block" The block does exist ...
    Open Study
    Oct 16, 2007 at 4:41 pm
    Oct 16, 2007 at 7:28 pm
  • Just a gentle reminder... Hope to see you all tonight at Gordon Biersch, 6pm: http://upcoming.yahoo.com/event/271501 ;) Erich
    Erich Nachbar
    Oct 4, 2007 at 4:50 pm
    Oct 8, 2007 at 4:25 pm
  • Hi, we have a 4 machine cluster (dual core CPU 3.20GHz, 2GB RAM, 400GB disk). We use Nutch 0.9 and Hadoop 0.13.1. We try to crawl the web (60K sites) to depth 5. When we came to the 4th segment parse it gave ...
    Uygar BAYAR
    Oct 3, 2007 at 3:04 pm
    Oct 4, 2007 at 2:29 pm
  • I set up Hadoop on my laptop. When I try start-all.sh the following error happens. I checked the Hadoop site and didn't find the reason. http://wiki.apache.org/lucene-hadoop Do I need to modify some of the ...
    Oct 29, 2007 at 1:44 pm
    Oct 30, 2007 at 6:33 am
  • Hi, I was trying HBase from the 0.15 branch. And was doing: HScannerInterface s = table.obtainScanner(...) while(s.next(key, val)) { .... } And encounter the following exception whenever I spent too ...
    Cedric Ho
    Oct 26, 2007 at 2:58 am
    Oct 30, 2007 at 1:41 am
  • I was hoping to use -inputformat SequenceFileAsTextInputFormat to process compressed sequencefiles in streaming jobs. However, using a python mapper that just echoes out each line as it gets, and ...
    Joydeep Sen Sarma
    Oct 26, 2007 at 7:30 am
    Oct 26, 2007 at 5:20 pm
  • The input for a M/R job consists of multiple files that are less than a block size and the number of maps is the number of files. I would like to be able to control the number of maps in a way that I ...
    Alejandro Abdelnur
    Oct 15, 2007 at 6:23 am
    Oct 24, 2007 at 2:40 pm
  • Hi, I have been studying map reduce and hadoop for the past few weeks, and found it a very new concept. While I have a grasp of the map reduce process as well as being able to follow some of the ...
    Jim the Standing Bear
    Oct 20, 2007 at 8:53 pm
    Oct 21, 2007 at 8:11 pm
  • Hello my company is considering using a DFS for a project we're currently working on. Since we don't have much experience in the field I've compiled a list of questions that I hope can guide us to ...
    Yoav Steinberg
    Oct 21, 2007 at 11:14 am
    Oct 21, 2007 at 5:27 pm
  • Hi! I am PHP Developer & Linux Administrator. I am very new at hadoop & I have some questions. 1. Is hadoop only configuration based Administration like Linux Administration with scripting ...
    Sukalyan Banga
    Oct 20, 2007 at 10:48 am
    Oct 20, 2007 at 3:59 pm
  • Hi, As a beginner of Hadoop, I wonder how to send output key-value pairs of the reducers back to the input of mappers for iterative processing. What's hadoop streaming? Can I pipe the output stream ...
    Ken Pu
    Oct 7, 2007 at 3:55 am
    Oct 15, 2007 at 5:43 am
  • I'm a rank beginner with clusters, but am determined to move into them, starting with Hadoop. I have a habuntu machine under VMware on my MacBook Pro for starters (got it on a DVD when visiting at ...
    Bob Futrelle
    Oct 12, 2007 at 3:29 am
    Oct 12, 2007 at 5:08 pm
  • Hi, Is there any lock service in hadoop to sync the access to file (such as chubby in GFS)? Thanks -- Xie Gang
    Oct 10, 2007 at 2:36 am
    Oct 10, 2007 at 6:24 pm
  • Hi, I try to debug Hadoop DFS with Eclipse. When I try to launch namenode.class as the main class, it fails to createSocketAddr. I find that there are no host properties in the configuration. It seems ...
    Oct 9, 2007 at 1:10 am
    Oct 9, 2007 at 4:25 pm
Group Overview
group: common-user @

116 users for October 2007

  • Ted Dunning: 48 posts
  • Doug Cutting: 17 posts
  • Owen O'Malley: 16 posts
  • Lance Amundsen: 15 posts
  • Edward yoon: 13 posts
  • Michael Stack: 13 posts
  • Dennis Kubes: 12 posts
  • Jim the Standing Bear: 12 posts
  • Ming Yang: 11 posts
  • Joydeep Sen Sarma: 9 posts
  • Bin YANG: 8 posts
  • Jim Kellerman: 8 posts
  • Stu Hood: 8 posts
  • Arun C Murthy: 7 posts
  • Chris Dyer: 7 posts
  • Preethi Chockalingam: 7 posts
  • 贺齐: 7 posts
  • C G: 6 posts
  • Enis Soztutar: 6 posts
  • James Yu: 6 posts