Search Discussions

18 discussions - 57 posts

  • Hi, I am running a cluster of 21 nodes. while running any task I observed that reduce tasks are getting scheduled much before all the map tasks are finished. As a result, reduce tasks are waiting for ...
    Kalbande, ManishKalbande, Manish
    Jul 20, 2006 at 7:54 pm
    Jul 26, 2006 at 8:22 pm
  • Hi everyone, Are you planning on coming to SIGIR in Seattle? If enough people are, perhaps we can have a Nutch/Hadoop get-together. I'm in Seattle, so if enough people say yes I'll look for a place ...
    Michael CafarellaMichael Cafarella
    Jul 25, 2006 at 4:10 pm
    Aug 8, 2006 at 2:15 pm
  • Hi, I would like to know today why it is not possible to append datas into an existing file (Path) or why the FSDataOutputStream must be closed before the file is written to the DFS. In fact, my ...
    Thomas FRIOLThomas FRIOL
    Jul 13, 2006 at 3:17 pm
    Jul 17, 2006 at 7:27 am
  • I will be running a cluster with 100-200 nodes, most of which will be shut down at night. For the sake of example lets say that 4 'reliable slaves' will remain turned on continuously, and let me call ...
    Mikkel Kamstrup ErlandsenMikkel Kamstrup Erlandsen
    Jul 24, 2006 at 7:28 am
    Jul 24, 2006 at 11:07 am
  • In MPI, getRank() gives an unique id to identify a process. Is there an equivalent in Hadoop that uniquely identify each map process? Thanks, VJ
    Vijay MurthiVijay Murthi
    Jul 19, 2006 at 8:18 pm
    Jul 20, 2006 at 6:54 am
  • Hi, I've always wondered if a lack of overwrite / random-write op means that updates are much faster than convention filesystems.. The fact that both (dfs, gfs) support delete op, does it mean that ...
    Jul 13, 2006 at 6:23 am
    Jul 14, 2006 at 3:00 pm
  • Hi, Here's a scenario I have faced a couple of times recently: <scenario I have a list of URIs (either http:// or just dfs file-list) which represent input to a Map-Reduce task where each map gets 1 ...
    Arun C MurthyArun C Murthy
    Jul 6, 2006 at 9:20 am
    Jul 7, 2006 at 7:59 am
  • I have heard a rumor about the existence of an indexed SequenceFile that is basically a normal SequenceFile with an associated small index file with list of offsets to a subset of the keys in the ...
    Benjamin ReedBenjamin Reed
    Jul 28, 2006 at 5:14 pm
    Jul 28, 2006 at 9:56 pm
  • Hi All, I was in the lookout for an open source DFS and my search ended in Hadoop. So far from what I have read, I strongly believe this is a really good file system and also provides support for ...
    Jul 25, 2006 at 9:38 am
    Jul 26, 2006 at 6:34 am
  • Moved this to hadoop-user... My 2 cents: A datanode is identified by its storageID. If you have multiple storage devices in your box (visible to the user as different directories) then you can have ...
    Devaraj DasDevaraj Das
    Jul 25, 2006 at 11:18 am
    Jul 26, 2006 at 3:24 am
  • Hi all, I am a new hadoop user and I am now writting my own map reduce operations but it is hard for me to find out where comes from the problem when the job fails. So my question is : What is the ...
    Thomas FRIOLThomas FRIOL
    Jul 17, 2006 at 8:37 am
    Jul 17, 2006 at 4:38 pm
  • Hi, Sorry for the nature of the question, but can anyone estimate when Hadoop will be "stable and production-ready"? I know you've run it on clusters with 600+ nodes, but you guys (hadoop-dev) know ...
    Otis GospodneticOtis Gospodnetic
    Jul 7, 2006 at 5:00 pm
    Jul 8, 2006 at 2:08 am
  • Hi everyone, If you're in Seattle for SIGIR, come to this meeting of FOHLNs (Friends Of Hadoop, Lucene and Nutch). We'll talk about search and get something to eat and drink. Please RSVP via the ...
    Michael CafarellaMichael Cafarella
    Jul 31, 2006 at 5:32 pm
    Jul 31, 2006 at 5:32 pm
  • Hello, I'm evaluating Hadoop for a large GIS application. When running the wordcount example, I experience an issue where my master node cannot open a socket to port 50010 of my remove slave node. ...
    Jul 31, 2006 at 3:26 pm
    Jul 31, 2006 at 3:26 pm
  • It would be a good idea to have Mapper and Reducer expose a getLogger () method. It could be extending a seperate interface like Loggable. The logger is initialized when the Map and Reduce tasks are ...
    Sanjay DahiyaSanjay Dahiya
    Jul 26, 2006 at 8:38 am
    Jul 26, 2006 at 8:38 am
  • Hello, I have been looking at Hadoop for awhile now and have been trying to get 0.4.0 to work with Nutch to do a small distributed crawl. Problem is, whenever the task (fetching) nears completion, ...
    Sellek, GregSellek, Greg
    Jul 25, 2006 at 6:53 pm
    Jul 25, 2006 at 6:53 pm
  • Hi, I'm working on a web interface for accessing various data stored in a Hadoop cluster, and I'd like to display tracker info in this interface. The admin interface in Hadoop just calls ...
    Vetle RoeimVetle Roeim
    Jul 25, 2006 at 9:06 am
    Jul 25, 2006 at 9:06 am
  • All, I should have posted earlier, but the mainline svn is broken. You either need stay before svn revision 421837 (hadoop-354) or apply (hadoop-364 and hadoop-365). -- Owen
    Owen O'MalleyOwen O'Malley
    Jul 17, 2006 at 4:38 pm
    Jul 17, 2006 at 4:38 pm
Group Navigation
period‹ prev | Jul 2006 | next ›
Group Overview
groupcommon-user @

27 users for July 2006

Doug Cutting: 8 posts Paul Sutter: 7 posts Benjamin Reed: 3 posts Michael Cafarella: 3 posts Mikkel Kamstrup Erlandsen: 3 posts Owen O'Malley: 3 posts Stefan Groschupf: 3 posts Thomas FRIOL: 3 posts Yoram Arnon: 3 posts Drwho: 2 posts Eric Baldeschwieler: 2 posts Ken Krugler: 2 posts Arun C Murthy: 1 post Bryan A. Pendleton: 1 post Devaraj Das: 1 post Jagadeesh: 1 post Kalbande, Manish: 1 post Konstantin Shvachko: 1 post Michael Stack: 1 post Otis Gospodnetic: 1 post
show more