Search Discussions

26 discussions - 100 posts

  • hi, when i try [nutch@master ~/search]$ bin/nutch crawl urls -dir crawled -depth 3 i have this error in log file master server == hadoop-nutch-tasktracker-master.log <== 2007-04-30 08:53:52,318 WARN ...
    Apr 30, 2007 at 3:03 pm
    Aug 30, 2007 at 9:52 am
  • Hi, Inspired by http://www.mail-archive.com/nutch-user@lucene.apache.org/ msg02394.html I'm trying to run Hadoop on multiple CPU's, but without using HDFS. In my hadoop-site.xml I have the following ...
    Eelco LempsinkEelco Lempsink
    Apr 16, 2007 at 1:42 pm
    Apr 18, 2007 at 4:24 pm
  • Hi hadooping people... I'm having trouble running the wordcount example with hadoop... i ran it ok with only one host but when i add another machine to the cluster... it falls apart! :( I read in the ...
    Pedro GuedesPedro Guedes
    Apr 2, 2007 at 2:01 pm
    Apr 3, 2007 at 4:25 pm
  • FYI http://research.yahoo.com/project/pig Doug
    Doug CuttingDoug Cutting
    Apr 26, 2007 at 8:11 pm
    May 1, 2007 at 6:38 pm
  • Hi hadoopers, I'm working on an enterprise search engine that works on an hadoop cluster but is controlled form the outside. I managed to implement a simple crawler much like Nutch's... Now i have a ...
    Pedro GuedesPedro Guedes
    Apr 18, 2007 at 10:26 am
    Apr 26, 2007 at 11:15 am
  • Currently I'm working on a search application that uses Lucene. Many of the fields I index in Lucene are stored fields, because I need to retrieve the actual text and metadata of each document, and ...
    Andy LiuAndy Liu
    Apr 10, 2007 at 9:42 pm
    Apr 15, 2007 at 7:09 pm
  • Hi, We have some troubles with the reduce phase of our job. Is it possible to re-execute the reduce tasks without the need to do all map tasks again? Thanks! Mathijs Homminga
    Mathijs HommingaMathijs Homminga
    Apr 3, 2007 at 9:18 am
    Apr 4, 2007 at 2:40 pm
  • Hello all I'm a new Hadoop user and I'm looking at using Hadoop for a distributed machine learning application. For my application (and probably many machine learning applications), one would ...
    Albert StrasheimAlbert Strasheim
    Apr 8, 2007 at 9:55 am
    Apr 9, 2007 at 4:41 pm
  • Hi. I'm trying to decommission 10 datanodes of 35 in our cluster. The process have been running for a couple of days but only one node have finished. Perhaps I should have tried to decommission one ...
    Johan OskarssonJohan Oskarsson
    Apr 29, 2007 at 5:01 pm
    Apr 30, 2007 at 5:01 pm
  • Hi all, I'm a bit confused by the way logging works on Hadoop. In short, my question is: where does the log from my Nutch plugins end up when running on Hadoop? I'm running Nutch 0.9 on Hadoop ...
    Mathijs HommingaMathijs Homminga
    Apr 20, 2007 at 7:02 pm
    Apr 23, 2007 at 9:06 am
  • Hi all, I have had some troubles with 2 nodes on one of our clusters. While most nodes finished their map tasks successfully in about 2 secs, two were not responding well. On their Task Trackers the ...
    Mathijs HommingaMathijs Homminga
    Apr 23, 2007 at 10:15 am
    Apr 24, 2007 at 8:29 am
  • Hello all I've got a small hadoop cluster running (5 nodes today, going to 15+ soon), and I'd like to do some benchmarking. My question to the group is - what is the first benchmark you run on a new ...
    Steve SchlosserSteve Schlosser
    Apr 23, 2007 at 2:39 pm
    Apr 23, 2007 at 6:00 pm
  • hi, when i exec bin/start-all.sh at first time after bin/hadoop namenode -format datanode is started. then i exec bin/stop-all.sh stopping jobtracker stopping tasktracker ...
    Apr 30, 2007 at 5:45 pm
    Apr 30, 2007 at 6:40 pm
  • Hi, I've read http://wiki.apache.org/lucene-hadoop/Hbase/HbaseArchitecture and it sounds mostly wonderful! However, I am wondering about this: "Since the death of the HMaster means the death of the ...
    Otis GospodneticOtis Gospodnetic
    Apr 30, 2007 at 3:02 am
    Apr 30, 2007 at 6:24 am
  • Hi, We're interested in using Hadoop for our application for purposes of replication and distribution of query execution. But I have some questions as to whether it's a good fit. We have essentially ...
    Vinaya ShastrakarVinaya Shastrakar
    Apr 18, 2007 at 8:01 am
    Apr 18, 2007 at 8:05 am
  • Hi all! I'm new to hadoop, and I didn't find any method which allows appending to a file. Can anyone give an example of how to do this? Thanks, Einav
    Enav ItamarEnav Itamar
    Apr 17, 2007 at 10:19 am
    Apr 17, 2007 at 10:46 am
  • One of my task is to calculate some statistics from a very large amount of log files for our customers. We are trying out hadoop to solve this problem. The mapper and reducer code are very straight ...
    Dongsheng WangDongsheng Wang
    Apr 27, 2007 at 7:48 pm
    Apr 27, 2007 at 7:48 pm
  • Hi, we operate a big webbased online community with a lot of linux boxes. We would like to store the videos and the pictures of our member on HDFS. We are only interested in clustered filesystem of ...
    Oezcan AcarOezcan Acar
    Apr 23, 2007 at 8:36 am
    Apr 23, 2007 at 8:36 am
  • What is the ratio of checksum errors that everyone else is seeing while running large jobs? I am trying to determine what an average number of checksum errors is vs. what should be occurring. Dennis ...
    Dennis KubesDennis Kubes
    Apr 22, 2007 at 11:32 pm
    Apr 22, 2007 at 11:32 pm
  • With HADOOP-1216, the framework will support reduce=none feature by setting numReduceTasks=0. If a map/reduce job set numReduceTasks=0, it will not create any reducer tasks. The mappers will not ...
    Runping QiRunping Qi
    Apr 20, 2007 at 11:25 pm
    Apr 20, 2007 at 11:25 pm
  • The in current framework, each mapper task will create one combiner object per partition per spill. This is very costly, since each time a combiner is created, a new process is actually created to ...
    Runping QiRunping Qi
    Apr 20, 2007 at 11:09 pm
    Apr 20, 2007 at 11:09 pm
  • Hi, This is my first post to the Hadoop list and have not yet written a program using the framework. I'm querying several large Lucene indexes, and generating about 30 text files 1-3MB each. These ...
    Peter W.Peter W.
    Apr 12, 2007 at 7:42 pm
    Apr 12, 2007 at 7:42 pm
  • Hello, We have been experimenting with Hadoop on a largish, but shared cluster. That means we can allocate various nodes, but would also like to let others use nodes (so not having a node permanently ...
    Timothy ChklovskiTimothy Chklovski
    Apr 5, 2007 at 9:58 pm
    Apr 5, 2007 at 9:58 pm
  • Hi, I am in the process of cleaning up Hadoop streaming. I noticed there are some half baked stuffs, and not sure whether they have ever been used/tested. Your feedbacks will help a lot. Thanks a lot ...
    Runping QiRunping Qi
    Apr 5, 2007 at 8:14 pm
    Apr 5, 2007 at 8:14 pm
  • Hi, I'm trying to create a MapFile.Reader in my MapReduce Driver. In order to get the FileSystem, I call conf.getFs(), but that gives me the following NullPointerException: Exception in thread "main" ...
    Andrew HitchcockAndrew Hitchcock
    Apr 4, 2007 at 9:58 pm
    Apr 4, 2007 at 9:58 pm
  • when I use nutch-nightly0.9 ,I got this: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable And I echo $JAVA_LIBRARY_PATH,then I got: ...
    Apr 4, 2007 at 3:15 am
    Apr 4, 2007 at 3:15 am
Group Navigation
period‹ prev | Apr 2007 | next ›
Group Overview
groupcommon-user @

40 users for April 2007

Derevo: 9 posts Doug Cutting: 9 posts Mathijs Homminga: 7 posts Arun C Murthy: 6 posts Pedro Guedes: 6 posts Wangxu: 5 posts Briggs: 4 posts Eelco Lempsink: 4 posts Albert Strasheim: 3 posts Andy Liu: 3 posts Owen O'Malley: 3 posts Runping Qi: 3 posts Andrew Hitchcock: 2 posts Dennis Kubes: 2 posts Jafarim: 2 posts Jim Kellerman: 2 posts Johan Oskarsson: 2 posts Ken Krugler: 2 posts Michael Bieniosek: 2 posts Otis Gospodnetic: 2 posts
show more