Search Discussions

34 discussions - 105 posts

  • 18


    I'm trying to use the s3 filesystem that was recently added to hadoop TRUNK. If I set fs.default.name to be s3://AWS_IDENTIFIER:AWS_SECRET@MY_BUCKET/ so I can run mapreduce jobs that get and set ...
    Michael StackMichael Stack
    Jan 3, 2007 at 5:45 am
    Jan 14, 2007 at 10:42 am
  • I'm new to lucene and Hadoop but what I can't seem to find in the docs, internet... is how (and if possible?) to use Hadoop as the underlying FS for Lucene? Could anyone explain me how these can be ...
    Jan 15, 2007 at 12:49 pm
    Jan 16, 2007 at 9:18 am
  • I'm trying to figure out how to override configuration settings on the command line. Specifically, when I run say, "hadoop-daemon.sh start datanode", I need to be able to give it the name of the ...
    Andrew McNabbAndrew McNabb
    Jan 11, 2007 at 8:23 pm
    Jan 12, 2007 at 6:47 pm
  • There aren't any published papers about Hadoop or its implementation, are there? I was just curious if there's a paper I could cite that would give people information about its design and ...
    Andrew McNabbAndrew McNabb
    Jan 30, 2007 at 5:18 pm
    Feb 4, 2007 at 1:12 am
  • Hi. Currently some of my map reduce jobs need quick access to additional data to check some input values in the map phase. This data is currently held in memory in a hashmap. It's very quick but as ...
    Johan OskarssonJohan Oskarsson
    Jan 25, 2007 at 4:22 am
    Jan 25, 2007 at 8:05 pm
  • Hi, It looks like name node failover is not yet implemented after seeing some emails over this alias. I tested name node failover scenario and it does not work When is it expected to get implemented? ...
    Sarvesh SinghSarvesh Singh
    Jan 8, 2007 at 2:45 pm
    Jan 10, 2007 at 4:42 pm
  • Hi, When I kill job tracker process in a Hadoop cluster of 3 node instances, everything collapsed. I was solving a problem by map/reduce and program bombed as job tracker is dead? Is there any ...
    Sarvesh SinghSarvesh Singh
    Jan 8, 2007 at 2:37 pm
    Jan 10, 2007 at 1:24 pm
  • Hi, Task tracker failover happens but alive slave instance firstly does copy the blocks locally and then recover that. Therefore due to copy operation it takes 7-10 minutes for my test case. To me it ...
    Sarvesh SinghSarvesh Singh
    Jan 8, 2007 at 2:50 pm
    Jan 10, 2007 at 1:23 pm
  • A few things that aren't really clear to me yet ...hadoop is deployed and I want to schedule a new job. Let's say it is written in java. Will hadoop distribute the classes so the job is available on ...
    Torsten CurdtTorsten Curdt
    Jan 5, 2007 at 11:58 am
    Jan 8, 2007 at 6:17 am
  • Hi- I am a big fan of Lucene and sub projects and got a great taste of what Hadoop has to offer for Java developers at my last position. Is there any future work being considered for porting Hadoop ...
    Jared DunneJared Dunne
    Jan 30, 2007 at 1:22 am
    Jan 30, 2007 at 4:26 pm
  • Hi, Is there any way to have some nodes in a cluster to have different mapred.tasktracker.tasks.maximum setting than others? I can't seem to get this to work by changing conf/hadoop-site.xml on some ...
    Espen Amble KolstadEspen Amble Kolstad
    Jan 25, 2007 at 1:14 pm
    Jan 25, 2007 at 6:14 pm
  • Dear all, How can I use in some case MultithreadedMapRunner, and in some case MapRunner for different jobs? Do I have to use one hadoop-site.xml for one job? But I want to all jobs in one jobs.jar? ...
    Jan 20, 2007 at 3:50 pm
    Jan 25, 2007 at 5:33 pm
  • Hi all, I was contemplating working on an EC2 instance manager for use with Hadoop. The idea is it would handle instance creation on namenode/jobtracker startup, perhaps adding/removing instances ...
    Jan 23, 2007 at 12:22 am
    Jan 23, 2007 at 8:05 am
  • Hi, I want loadbalancing to happen when map/reduce tasks get distributed over hadoop cluster. I have different configuration machines and want to utilize the cpu well. How can I do that? Thanks ...
    Sarvesh SinghSarvesh Singh
    Jan 8, 2007 at 2:52 pm
    Jan 10, 2007 at 7:57 pm
  • Hi, I have a hadoop cluster of 3 instances, when I kill data node process on one of the slave machine, failover does not seem to work. Another slave machine does the copy of DFS block for 7-10 ...
    Sarvesh SinghSarvesh Singh
    Jan 8, 2007 at 2:43 pm
    Jan 10, 2007 at 1:16 pm
  • I'm using Hadoop on Ubuntu 6.10. I ran into: $ start-all.sh starting namenode, logging to /usr/local/hadoop-install/hadoop/bin/../logs/hadoop-jj-namenode-jjinuxland.out ...
    Shannon -jj BehrensShannon -jj Behrens
    Jan 2, 2007 at 8:40 pm
    Jan 5, 2007 at 2:35 am
  • I had a hard time getting the Jython WordCount.py example to work. The first problem was caused by changes to Java: cd ~/Desktop/hadoop-0.9.2/src/examples/python bash compile 1 ...
    Shannon -jj BehrensShannon -jj Behrens
    Jan 2, 2007 at 9:01 pm
    Jan 5, 2007 at 2:31 am
  • There's no link to http://wiki.apache.org/lucene-hadoop/HadoopStreaming on http://wiki.apache.org/lucene-hadoop/. It would be really nice if there were one. Best Regards, -jj -- ...
    Shannon -jj BehrensShannon -jj Behrens
    Jan 2, 2007 at 9:01 pm
    Jan 5, 2007 at 2:19 am
  • Hi: During a load test using Jmeter I get this error: 2007-01-31 09:27:47 StandardWrapperValve[jsp]: Servlet.service() para servlet jsp lanzó excepción java.lang.NullPointerException at ...
    Alvaro CabrerizoAlvaro Cabrerizo
    Jan 31, 2007 at 11:04 am
    Feb 5, 2007 at 4:13 pm
  • All, I am getting this error when trying to run the Nutch inject with the Hadoop trunk. ---------------------------------------------------------------------- 2007-01-30 15:39:53,237 INFO ...
    Dennis KubesDennis Kubes
    Jan 30, 2007 at 10:57 pm
    Jan 30, 2007 at 11:15 pm
  • Hi, Does anyone know whether Hadoop is able to run on IA64? Thks.
    Jan 26, 2007 at 9:15 am
    Jan 26, 2007 at 1:57 pm
  • Dear all, I tried to run the PI example program like this: $ bin/hadoop jar hadoop-0.10.1-examples.jar pi 100 100 But the result turned out to be 0.0. Is it correct? Or there must be something wrong ...
    Liqi GaoLiqi Gao
    Jan 22, 2007 at 9:54 am
    Jan 22, 2007 at 11:49 am
  • Hi All: I’ve updated my hadoop source code to version 0.10.1 and find that the new directory “native” has been added under directory “src”. What’s the new functions or features “native” supports? ...
    Jan 17, 2007 at 9:57 am
    Jan 19, 2007 at 8:37 am
  • In reading through some entries on the dev and commit lists I keep seeing talk about spills to disk? Can someone explain what that is? Dennis Kubes
    Dennis KubesDennis Kubes
    Jan 11, 2007 at 9:06 pm
    Jan 11, 2007 at 11:43 pm
  • I'm running from the trunk. I'm getting the following error in both local and server mode (in this case the scheme is hdfs) when trying to access DFS: Exception in thread "main" ...
    Alejandro AbdelnurAlejandro Abdelnur
    Jan 10, 2007 at 2:23 am
    Jan 10, 2007 at 5:46 am
  • The default JAVA_HOME in hadoop-env.sh is /usr/bin/java. This is confusing because /usr/bin/java is a binary, not a directory. On my system, this resulted in: $ hadoop namenode -format ...
    Shannon -jj BehrensShannon -jj Behrens
    Jan 2, 2007 at 8:35 pm
    Jan 3, 2007 at 5:39 pm
  • Do you know how should i fix it? i set the the classpath into hadoop-env.sh export CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar:${HADOOP_HOME}/nutch-nightly.jar Thanks. [hadoop@xy05 ~]$ ...
    James ZhengJames Zheng
    Jan 30, 2007 at 11:48 am
    Jan 30, 2007 at 11:48 am
  • Can someone confirm for me that the TaskLogs are only for errors produced by the child tasks? Essentially making it easier to see the errors as opposed to including them and having to search for them ...
    Dennis KubesDennis Kubes
    Jan 29, 2007 at 9:43 pm
    Jan 29, 2007 at 9:43 pm
  • Hello, I've written an extension to the Internet Archive's open source "Heritrix" crawler that extends it to write into HDFS in SequenceFile format. The key is the URL and the value is the HTTP ...
    Doug JuddDoug Judd
    Jan 26, 2007 at 1:24 am
    Jan 26, 2007 at 1:24 am
  • A post by Nat Torkington (http://radar.oreilly.com/archives/2007/01/threads_conside.html) mentions MapReduce (and Hadoop) as a way of avoiding the problem of nondeterminism in threads. The paper by ...
    Tom WhiteTom White
    Jan 23, 2007 at 1:39 pm
    Jan 23, 2007 at 1:39 pm
  • Hi, friends, I am new to Hadoop. I've setup Hadoop on a Linux server as the namenode. I want to make up use of my laptop (WindowsXP + cygwin). When I tried to run bin/start-all.sh from namenode, ...
    Liqi GaoLiqi Gao
    Jan 16, 2007 at 9:49 am
    Jan 16, 2007 at 9:49 am
  • Hi, Can somebody send me some article/paper/doc suggesting how to write/design map/reduce program over hadoop so that we can minimize I/O, less block transfer and use Hadoop effectively. Thanks ...
    Sarvesh SinghSarvesh Singh
    Jan 8, 2007 at 2:55 pm
    Jan 8, 2007 at 2:55 pm
  • Hi, While starting hadoop process we are getting the following error in logs tasktracker in datanode is not able to connect back to jobtracker (but jobtracker on the other machine started ...
    Jan 5, 2007 at 6:24 am
    Jan 5, 2007 at 6:24 am
  • I wrote a simple Hadoop example that shells out to Python. It worked well, and I was pleased. Here's my blog entry about the whole experience: ...
    Shannon -jj BehrensShannon -jj Behrens
    Jan 2, 2007 at 10:46 pm
    Jan 2, 2007 at 10:46 pm
Group Navigation
period‹ prev | Jan 2007 | next ›
Group Overview
groupcommon-user @

33 users for January 2007

Doug Cutting: 13 posts Sarvesh Singh: 13 posts Tom White: 10 posts Shannon -jj Behrens: 8 posts Andrew McNabb: 5 posts Bryan A. P. Pendleton: 5 posts Dennis Kubes: 5 posts Gautam Kowshik: 4 posts Michael Stack: 4 posts Owen O'Malley: 4 posts Maarten: 3 posts Andrzej Bialecki: 3 posts Lee: 3 posts 张茂森: 3 posts Dhruba Borthakur: 2 posts Liqi Gao: 2 posts Torsten Curdt: 2 posts AaRon: 1 post Alejandro Abdelnur: 1 post Alvaro Cabrerizo: 1 post
show more