101 discussions - 445 posts

  • I'm trying to run the HBase PerformanceEvaluation program on a cluster of 5 hadoop nodes (on virtual machines). hadoop07 is the DFS master and HBase master; hadoop08-12 are HBase region servers. I start ...
    Kareem Dana
    Nov 15, 2007 at 11:31 pm
    Nov 26, 2007 at 6:30 pm
  • Hi, I am new to hadoop. We are evaluating HDFS for use as a reliable, distributed file system. running on the name node m/c) I have run so far: 1.The writes are very fast. 2.The read is very slow ...
    Nov 8, 2007 at 9:02 pm
    Nov 19, 2007 at 1:58 pm
  • Hi Hadoopers, Many of the computations that I am performing with MapReduce require several chains of MapReduce operations where the output of one or more previous reduce steps is the input to a ...
    Chris Dyer
    Nov 6, 2007 at 9:38 pm
    Nov 9, 2007 at 8:20 pm
  • Hi, I checked out Hadoop (including HBase) from its Subversion repository today, built it successfully (on Cygwin) and started HBase in "local" mode. Then I took your little example program from the ...
    Holger Stenzhorn
    Nov 1, 2007 at 9:49 pm
    Nov 7, 2007 at 6:24 pm
  • Hi everyone, we are experiencing a very weak map-red performance on the following mapred cluster setup: - Hadoop - nightly build from 2007-10-25_17-03-53 - 5 tasktracker-/datanodes + 1 ...
    André Martin
    Nov 1, 2007 at 9:50 am
    Nov 30, 2007 at 5:34 pm
  • I am using map/reduce with hadoop-0.15.0-streaming.jar to process data with PHP scripts. I have written code to process the data; below is an example of word counts from the input. bin/hadoop jar ...
    Nov 10, 2007 at 8:54 pm
    Nov 16, 2007 at 6:36 am
  • I set up a little benchmark on a 39 node cluster to sort 40gb of random text data (generated by RandomTextWriter using key length: 1-10 words and value length: 0-200 words, data uncompressed). The ...
    Owen O'Malley
    Nov 9, 2007 at 1:03 am
    Nov 9, 2007 at 8:15 am
  • Hello! We have a web site currently built on linux/apache/mysql/php. Most pages do some mysql queries and then stuff the results into php/html templates. We've been hitting the limits of what our ...
    Mike Perkowitz
    Nov 30, 2007 at 5:47 pm
    Dec 4, 2007 at 8:16 pm
  • Google has a very interesting tech-talk up about Dryad: Microsoft's distributed execution framework. There has been a paper out about it for a while, but the video has some more information about the ...
    Stu Hood
    Nov 9, 2007 at 7:01 am
    Nov 13, 2007 at 3:01 am
  • I use nutchwax-0.10.0 search, but it shows only "Search took 0.032 seconds. Hits 0-0 (out of about 0 total matching pages): ". It does not show title, URL, or content. When I look at the log file in catalina.out ...
    Nov 27, 2007 at 10:01 am
    Dec 13, 2007 at 7:36 pm
  • Hi everyone, I'm a new member joining the lists. I saw that a new append feature (HADOOP-1700) will be added to HDFS. When will the feature be implemented? Thanks, Xiang
    Xiangna Li
    Nov 14, 2007 at 3:21 am
    Dec 11, 2007 at 6:48 am
  • Hello gentlemen! We would like to integrate our hadoop-based application into Tomcat. The WEB part will be used to manage job submission and control jobs, and initially all jobs will reside within ...
    Eugeny N Dzhurinsky
    Nov 22, 2007 at 10:32 am
    Nov 27, 2007 at 4:48 am
  • Hello, Much of the hadoop documentation speaks to large clusters of commodity machines. There is a debate on our end about which would be better: a small number of high performance machines (2 boxes ...
    Chris Fellows
    Nov 7, 2007 at 7:57 pm
    Nov 7, 2007 at 9:36 pm
  • Is the expected behavior of concurrent changes in the filesystem documented anywhere? As an example: Hdfs client C1 is slowly writing to "/path/a/file". Now another hdfs client C2 renames ...
    Torsten Curdt
    Nov 5, 2007 at 10:05 am
    Nov 6, 2007 at 12:25 am
  • Hi, I'm trying to evaluate hadoop/hbase for a project I'm on that requires filtering massive amounts of RSS data. I've been trying to follow the simple tutorials, but I can't seem to get anything to ...
    Jonathan doklovic
    Nov 1, 2007 at 8:03 pm
    Nov 5, 2007 at 3:06 pm
  • Hello, I am in need of some clarifications on how to run a hadoop job locally. The cluster was originally set up to have two nodes, where one of them also acts as the master node and job tracker. ...
    Jim the Standing Bear
    Nov 1, 2007 at 7:38 pm
    Nov 1, 2007 at 9:39 pm
  • Hi, I can communicate with the file system via shell commands, and it works correctly. But when I try to write a program to write a file to the file system, it fails. public class HadoopDFSFileReadWrite ...
    Ryan Wang
    Nov 30, 2007 at 2:49 pm
    Dec 1, 2007 at 8:24 am
  • Hi, Based on the documentation I have read, there is one instance of a NameNode. Are there recommended approaches on making the NameNode HA: 1.Have a backup which takes over. Data between primary and ...
    Nov 20, 2007 at 8:47 pm
    Nov 26, 2007 at 8:56 pm
  • I had a table that was doing fine; then it split and started getting EOF exceptions, so I deleted the database and started over, and now it has happened again. The region server GUI shows this ...
    Nov 20, 2007 at 9:13 pm
    Nov 23, 2007 at 7:41 pm
  • Hi all: This page http://wiki.apache.org/lucene-hadoop/Hbase/HbaseShell introduces the ‘Algebraic Query Commands’, but I can’t find them in my HBASE shell. Hbase Shell, 0.0.2 version. Copyright (c) ...
    Nov 21, 2007 at 2:28 am
    Nov 22, 2007 at 1:44 am
  • Hey guys, Just noticed some surprising behavior for select statements in HBase 0.15: a select command without a num_versions = 1 clause takes 2 orders of magnitude longer to run than a bare select. ...
    Stu Hood
    Nov 7, 2007 at 7:23 am
    Nov 7, 2007 at 11:28 pm
  • Hello, I am a student doing an independent study project investigating the possibility of teaching large scale computing on a small scale budget. My thought is to use available Open Source ( ...
    Edward Bruce Williams
    Nov 16, 2007 at 2:34 pm
    Apr 2, 2008 at 10:20 pm
  • Can we get someone who has written Java apps that work with hbase to update the example on this page http://wiki.apache.org/lucene-hadoop/Hbase/FAQ I think there are some bugs in there that need to be ...
    Nov 23, 2007 at 9:23 pm
    Nov 30, 2007 at 4:19 am
  • Guys, It has been almost 2 months and I'd like to propose another Bay Area Get Together. I thought we could try and hit Gordon Biersch in Palo Alto again around 5pm next Fri. (Nov. 30th). Ted Dunning ...
    Erich Nachbar
    Nov 21, 2007 at 6:43 am
    Nov 23, 2007 at 4:52 am
  • Hello, gentlemen! I would like to implement a custom data provider which will create records to start map jobs with. For example I would like to create a thread which will extract some data ...
    Eugeny N Dzhurinsky
    Nov 19, 2007 at 4:44 pm
    Nov 19, 2007 at 10:16 pm
  • Is anyone at ApacheCon this week? I'm out and would like to meet any users or developers who are out in Atlanta. Hadoop is listed on the search roundtable on Thursday night, so I'll go to that. -- Owen
    Owen O'Malley
    Nov 14, 2007 at 2:31 pm
    Nov 14, 2007 at 9:17 pm
  • I have a very simple map task that gets a filename, reads and decompresses that file, and stores the decompressed file away. As it is reading and writing it does Report.incrCounter w/ the number of ...
    Derek Gottfrid
    Nov 9, 2007 at 8:40 pm
    Nov 11, 2007 at 6:10 am
  • Hi, Well, this is really a minor point... I am using Hadoop under Cygwin with the default settings. Hence the "hadoop.tmp.dir" is set to "/tmp/hadoop-${user.name}" via the "hadoop-default.xml". ...
    Holger Stenzhorn
    Nov 2, 2007 at 8:03 pm
    Nov 2, 2007 at 10:20 pm
  • Hi, It is getting closer to Friday and I wanted to remind everyone that we will be meeting at Gordon Biersch in Palo Alto at 5pm this Fri (11/30): http://upcoming.yahoo.com/event/324051/ No formal ...
    Erich Nachbar
    Nov 28, 2007 at 11:30 pm
    Dec 6, 2007 at 11:27 pm
  • I'm working on a 4 node grid at the moment (physical iron, not virtual), Hadoop 0.15.0 to test out a prototype system before deployment onto a larger grid. I've noticed a few odd behaviors within ...
    C G
    Nov 24, 2007 at 5:47 am
    Nov 26, 2007 at 6:42 am
  • When I use this command "bin/nutch generate /user/root/crawld /user/root/crawld/segments" the output is: Generator: Selecting best-scoring urls due for fetch. Generator: starting Generator: segment: ...
    Nov 21, 2007 at 6:21 am
    Nov 23, 2007 at 12:46 am
  • Hi folks, I searched around JIRA and didn't find anything that resembled this. Is this something on the roadmap? For normal aggregations, this is never an issue. But in some cases (typically joins) - ...
    Joydeep Sen Sarma
    Nov 19, 2007 at 8:30 pm
    Nov 22, 2007 at 6:51 am
  • Can anyone explain why "testTextToBytes" doesn't assert and "testStringToBytes" does? import org.apache.hadoop.hbase.io.ImmutableBytesWritable; import org.apache.hadoop.io.Text; import ...
    Jason Grey
    Nov 21, 2007 at 4:28 pm
    Nov 21, 2007 at 5:18 pm
  • Are there configuration suggestions for 1k nodes ? I was seeing tons of timeouts trying to run 1k nodes. Are there network settings that I need to make? Out of the box stuff seemed to work up to a ...
    Derek Gottfrid
    Nov 7, 2007 at 4:03 am
    Nov 7, 2007 at 8:41 pm
  • Hi, I am confused about something in HBase. 1. All data is stored in HDFS. Data is served to clients by HRegionServers. Is it allowed that the tablet T is on machine A, and served by a HRegionServers ...
    Bin YANG
    Nov 1, 2007 at 10:06 am
    Nov 2, 2007 at 4:34 am
  • I'm upgrading from 0.14.2 to 0.15.1. I followed the upgrade guide, but when I started dfs (start-dfs.sh -upgrade) I got the following error: 2007-11-30 13:07:32,515 INFO ...
    Peter Thygesen
    Nov 30, 2007 at 12:36 pm
    Nov 30, 2007 at 2:59 pm
  • Anyone have insight on the following message from a near-TRUNK namenode log? 2007-11-26 01:16:23,282 WARN dfs.StateChange - DIR* NameSystem.startFile: failed to create file ...
    Nov 27, 2007 at 5:58 am
    Nov 29, 2007 at 7:25 pm
  • Greetings; I followed the excellent tutorials on the wiki, everything worked fine for the single node version, but for the multi-node setup (four nodes, including master), I had to use ip addresses ...
    Khalil Honsali
    Nov 23, 2007 at 9:32 am
    Nov 28, 2007 at 7:05 pm
  • Hello, gentlemen! We are trying to adapt hadoop to suit our application (or mostly adapt our application to fit Map/Reduce and hadoop;) ), and I have several questions: 1) when doing mapping part of ...
    Eugeny N Dzhurinsky
    Nov 20, 2007 at 4:09 pm
    Nov 20, 2007 at 5:18 pm
  • Hi, I am currently working on a system design and I am interested in hearing some ideas how hadoop/hbase can be used to solve a couple of tricky issues. Basically I have a data set consisting of ...
    Nov 19, 2007 at 4:23 pm
    Nov 20, 2007 at 4:14 pm
  • Hello there! Could somebody please explain whether it is possible to get statistics for a particular job? For instance, how many data tuples have been processed so far, and how many tuples ...
    Eugeny N Dzhurinsky
    Nov 19, 2007 at 10:04 am
    Nov 19, 2007 at 11:59 am
  • Hi, I have a cluster made of only 2 PCs. The master acts also as a slave. The cluster seems to start properly. It is functional (I can access the dfs, monitor it with the web interfaces, no errors in ...
    Sebastien Rainville
    Nov 10, 2007 at 7:18 pm
    Nov 11, 2007 at 2:19 pm
  • Hi, I'm new to Hadoop and confused by the storage procedures. I found a zlib implementation in the package org.apache.hadoop.io.compression, so I wonder whether the file stored in Hadoop is ...
    Nov 27, 2007 at 1:16 am
    Dec 7, 2007 at 12:55 am
  • We have several 8 processor machines in our cluster, and for most of our mapper tasks we would like to spawn 8 per machine. We have 1 mapper task that is extremely resource intensive and we can only ...
    Jason Venner
    Nov 30, 2007 at 8:01 am
    Nov 30, 2007 at 6:33 pm
  • Hello, I have a C++ pipes application that I would like to pass command-line parameters to. Is it possible to do that with hadoop pipes? Thanks.
    Nov 27, 2007 at 7:42 pm
    Nov 29, 2007 at 7:17 pm
  • Just to make sure my head is on straight: Each node in the grid reads its own configuration file (hadoop-site.xml, hadoop-default.xml) and configures itself appropriately, correct? I am asking ...
    C G
    Nov 26, 2007 at 9:48 pm
    Nov 27, 2007 at 7:24 pm
  • We are new to hadoop - 1 week and counting :) We have a number of tasks that we want to accomplish with hadoop, and would like to keep each of the hadoop steps very simple. To our current limited ...
    Jason Venner
    Nov 26, 2007 at 7:08 pm
    Nov 27, 2007 at 2:41 pm
  • When I use this command "${HADOOP_HOME}/bin/hadoop jar ${NUTCHWAX_HOME}/nutchwax.jar all /tmp/inputs /tmp/outputs test" I get an error: - LinkDb: done - indexing [Lorg.apache.hadoop.fs.Path;@66e64686 ...
    Nov 23, 2007 at 8:11 am
    Nov 26, 2007 at 2:09 am
  • Hello there, we would like to make some tests with hadoop. For the tests we would like to have a hadoop filesystem up and configured, so using stubs and some mocks of core interfaces we can test the ...
    Eugeny N Dzhurinsky
    Nov 21, 2007 at 3:22 pm
    Nov 22, 2007 at 7:03 pm
  • Reduce jobs must wait for all maps to be done before doing any work. Why are they started before the maps are done? Example of the problem: if I am running a job and it's taking up all the reduce tasks for ...
    Nov 21, 2007 at 10:52 pm
    Nov 22, 2007 at 8:49 am
Group: common-user

97 users for November 2007

  • Stack: 39 posts
  • Ted Dunning: 32 posts
  • Holger Stenzhorn: 25 posts
  • Billy: 21 posts
  • Doug Cutting: 17 posts
  • Owen O'Malley: 16 posts
  • dhruba Borthakur: 15 posts
  • Arun C Murthy: 13 posts
  • Eugeny N Dzhurinsky: 13 posts
  • J2eeiscool: 12 posts
  • Jonathan doklovic: 11 posts
  • Raghu Angadi: 11 posts
  • Edward yoon: 10 posts
  • Jim Kellerman: 10 posts
  • Jibjoice: 9 posts
  • Joydeep Sen Sarma: 9 posts
  • Stu Hood: 9 posts
  • Kareem Dana: 8 posts
  • Devaraj Das: 7 posts
  • Aaron Kimball: 6 posts