Search Discussions
-
I have a cluster of boxes with 3 reducers per node. I want to limit a particular job to only run 1 reducer per node. This job is network IO bound, gathering images from a set of webservers. My job ...
David Parks
Feb 9, 2013 at 3:55 am
Feb 11, 2013 at 6:30 am -
Hello! I'm trying to install Hadoop 1.1.2.21 on CentOS 6.3. I've configured dfs.name.dir in /etc/hadoop/conf/hdfs-site.xml file <name dfs.name.dir</name <value /mnt/ext/hadoop/hdfs/namenode</value ...
Andrey V. Romanchev
Feb 6, 2013 at 3:07 pm
Feb 6, 2013 at 6:07 pm -
Hi, I have a quick question regarding RAID0 performances vs multiple dfs.data.dir entries. Let's say I have 2 x 2TB drives. I can configure them as 2 separate drives mounted on 2 folders and assignes ...
Jean-Marc Spaggiari
Feb 11, 2013 at 1:58 am
Feb 11, 2013 at 4:03 pm -
We're trying to use HFileOutputFormat for bulk hbase loading. When using HFileOutputFormat's setOutputPath or configureIncrementalLoad, the job is unable to run. The error I see in the jobtracker ...
Sean McNamara
Feb 6, 2013 at 12:32 am
Feb 6, 2013 at 9:49 pm -
Hi, I am trying to do a name sorting using secondary sort. I have a working example, which I am taking as a reference. But I am getting a null pointer error in the MapTask class. I am not able to ...
Ravi Chandran
Feb 7, 2013 at 6:25 pm
Feb 8, 2013 at 5:05 pm -
Hello, I am trying to write MapReduce jobs to read data from JSON files and load it into HBase tables. Please suggest me an efficient way to do it. I am trying to do it using Spring Data Hbase ...
Panshul Whisper
Feb 7, 2013 at 11:22 am
Feb 7, 2013 at 2:25 pm -
I am looking for a better solution for this. 1 way to do this would be to find top N values from each mappers and then find out the top N out of them in 1 reducer. I am afraid that this won't work ...
Praveenesh kumar
Feb 2, 2013 at 5:06 am
Feb 2, 2013 at 6:06 pm -
Hello, I am new to Hadoop. I am doing a project in cloud in which I have to use hadoop for Map-reduce. It is such that I am going to collect logs from 2-3 machines having different locations. The ...
Mayur Patil
Feb 5, 2013 at 9:32 pm
Feb 6, 2013 at 1:05 pm -
I’m try to use vaidya to check my mr job, but always get the error info like the below what's the home here? Need I setting any things ./vaidya_new.sh -jobconf ...
Jun zhang
Feb 4, 2013 at 7:19 am
Feb 6, 2013 at 1:38 am -
Hello, I am interested in using hadoop to execute map reduce based CUDA code, for which i need to setup a hadoop cluster + CUDA. I have come across JCUDA, PYCUDA and CUDA C with pipes for hadoop ...
Anil kumar
Feb 3, 2013 at 8:35 pm
Feb 4, 2013 at 10:53 am -
I hope my mail find you well I 'm very interested in learning hadoop , so i need your help to begin , from where can I start, and if there is any courses , how can I attend ,,, *thanks in advance ,,,*
Akram khalil
Feb 2, 2013 at 2:48 pm
Feb 2, 2013 at 8:08 pm -
HI, Currently I am researching about options of encrypting the data in the MapReduce, as we plan to use the Amazon EMR or EC2 services for our data. I am thinking that the compression codec is good ...
Java8964 java8964
Feb 9, 2013 at 8:50 pm
Feb 11, 2013 at 6:09 am -
Hello everyone, I am setting up Hadoop for the first time, so please bear with me while I ask all these beginner questions :) I followed the instructions to create a hodrc, but looks like I cannot ...
Mehmet Belgin
Feb 5, 2013 at 9:42 pm
Feb 8, 2013 at 3:11 pm -
Hi, I am Sharath Chandra, an undergraduate student at BITS-Pilani, India. I would like to get the following clarifications regarding cloudera hadoop distribution. I am using a CDH4 Demo VM for now ...
Sharath Chandra Guntuku
Feb 5, 2013 at 10:59 am
Feb 5, 2013 at 6:12 pm -
Hi I have got the latest source from Git. when I perform mvn install -DskipTests. it was stuck when it needed to download some dependences from ...
YouPeng Yang
Feb 1, 2013 at 4:53 pm
Feb 2, 2013 at 2:34 pm -
Sorry.. Moving 'hbase' mailing list to BCC 'cause this is not related to HBase. Adding 'hadoop' user group.
Something Something
Feb 11, 2013 at 6:25 pm
Feb 11, 2013 at 9:00 pm -
Hi, I am thinking to write some mapper to do conversion of mainframe files to ascii format and contribute back. And before even i do something i wanted to confirm from you guys the following - Do we ...
Jagat Singh
Feb 9, 2013 at 3:24 am
Feb 11, 2013 at 6:45 pm -
Hi all, Has anyone ever used some kind of a "generic output key" for a mapreduce job ? I have a job running multiple tasks and I want them to be able to use both Text and IntWritable as output key ...
Amit Sela
Feb 10, 2013 at 12:01 pm
Feb 11, 2013 at 7:22 am -
I'm a little confused about splitting and readers. The data in my application is stored in files of google protocol buffers. There are multiple protocol buffers per file. There have been a number of ...
Christopher Piggott
Feb 10, 2013 at 3:36 pm
Feb 11, 2013 at 4:27 am -
Hi All, I could not see the hive meta store DB under Mysql database Under mysql user hadoop. Example: $ mysql –u root -p $ Add hadoop user (CREATE USER ‘hadoop'@'localhost' IDENTIFIED BY ‘hadoop ';) ...
Samir das mohapatra
Feb 7, 2013 at 10:47 am
Feb 8, 2013 at 6:21 am -
Hi (I am using Yarn Hadoop-3.0.0.SNAPSHOT, revision 1437315M) I have a question regarding my assumptions on the Yarn-MR design, specially the InputSplit processing. Can someone confirm or point out ...
Blah blah
Feb 1, 2013 at 2:24 pm
Feb 4, 2013 at 1:37 pm -
Hi Guys, I am new to MapR distribution. please share you guidance. we previously using cloudera manger as set limitation. More than 50 nodes not support. please give idea, we planing to move ...
Dhanasekaran Anbalagan
Feb 11, 2013 at 12:46 pm
Feb 11, 2013 at 2:54 pm -
Is there a good reason why the OldCombinerRunner passes Reporter.NULL to the combiner instead of the actual TaskReporter? The NewCombinerRunner does use the TaskReporter when creating the context. If ...
Jim Donofrio
Feb 7, 2013 at 1:41 pm
Feb 11, 2013 at 7:41 am -
Hi, I am trying to view HADOOP SOURCE CODE. I am using HADOOP 1.0.3. In HADOOP distribution, only jar files are there. Give me some instruction to view source code... please I have seen "contribute ...
Dibyendu Karmakar
Feb 11, 2013 at 3:38 am
Feb 11, 2013 at 5:46 am -
Hello All, I am confused over how MapReduce tasks select data blocks for processing user requests ? As data block replication replicates single data block over multiple datanodes, during job ...
Mehal Patel
Feb 9, 2013 at 12:41 am
Feb 9, 2013 at 5:13 am -
Hi, I have data stored in an object that I want to pass into my Mapper. I see from Configuration that there are setters and getters for primitives, but is there a way of doing this with ...
Peter Cogan
Feb 8, 2013 at 3:15 pm
Feb 8, 2013 at 7:51 pm -
2
xcievers
Hello Hadoopers, How's your cluster behave today ?? hope they run well and strong. In the past or some bad days i saw 'Too many fetch-failure'; it was fixed by adjusting dfs.datanode.max.xcievers to ...Patai Sangbutsarakum
Feb 7, 2013 at 7:33 pm
Feb 8, 2013 at 6:58 pm -
Hi, I'm wondering what's the best way to install FUSE with Hadoop 1.0.3? I'm trying to follow all the steps described here: http://wiki.apache.org/hadoop/MountableHDFS but it's failing on each one, ...
Jean-Marc Spaggiari
Feb 8, 2013 at 4:31 pm
Feb 8, 2013 at 5:17 pm -
Hi, I was trying to build hbase 0.94 against hadoop 0.23 but kept on getting compilation errors. It complained about the KerberosUtil class which it required but for some reason that class is missing ...
Viral Bajaria
Feb 4, 2013 at 6:27 am
Feb 7, 2013 at 9:02 pm -
Hi hadoop users, I am trying to use the streaming interface to use my python script mapper to create some files but am running into difficulties actually creating files on the hdfs. I have a python ...
Julian Bui
Feb 7, 2013 at 1:14 am
Feb 7, 2013 at 3:40 pm -
Is it possible to pass unmolested binary data through a map-only streaming job from the command line? I.e., is there a way to avoid extra tabs and newlines in the output? I don't need input splits or ...
Jay Hacker
Feb 6, 2013 at 9:30 pm
Feb 7, 2013 at 3:20 pm -
Hi, I wish to profile my mapper, so I've set the properties mapred.task.profileand mapred.task.profile.maps in mapred-site.xml. At the end of the job I'm getting a profile.out file, however I think ...
Yaron Gonen
Feb 6, 2013 at 9:50 pm
Feb 7, 2013 at 4:48 am -
Lately, jobtracker in one of our production cluster fall into hang state. The load 5,10,15min is like 1 ish; with top command, jobtracker has 100% cpu all the time. So, i went ahead to try top -H -p ...
Patai Sangbutsarakum
Feb 4, 2013 at 11:21 pm
Feb 7, 2013 at 4:24 am -
Hi Guys, We have done moving local file to HDFS hadoop fs -copyFromLocal we have verified some of the file missing in the HDFS, We want validate source to destination. We have already have source ...
Dhanasekaran Anbalagan
Feb 6, 2013 at 10:27 am
Feb 6, 2013 at 7:04 pm -
When setting up passwordless ssh on a cluster, its clear that the namenode needs to be able to ssh into task trackers to start/stop nodes and restart the cluster. What else is passwordless SSH used ...
Jay Vyas
Feb 5, 2013 at 11:06 pm
Feb 5, 2013 at 11:56 pm -
Hi Guys, I have configured HDFS with replication factor 3. We have 1TB for data How to file the particular block will available in 3 machine How to find same block of data will available in 3 machine ...
Dhanasekaran Anbalagan
Feb 5, 2013 at 3:01 pm
Feb 5, 2013 at 3:37 pm -
Hi All I use YARN and MR.I am confused by the relationship between mapred.map.child.java.opts and Container resource allocation. if I setup mapred.map.child.java.opts = -Xmx2048m ,on nodemanager how ...
YouPeng Yang
Feb 4, 2013 at 11:39 am
Feb 5, 2013 at 12:44 am -
how to retrieve the videos files from hdfs and play videos on the browser using mapreducing
Gopi lokavarapu
Feb 1, 2013 at 8:35 am
Feb 2, 2013 at 8:45 am -
Hi All, I am getting Too many fetch-failures exception. What might be the reason for this exception, For same size of data i dint face this error earlier and there is change in code. How to avoid ...
Manoj Babu
Feb 1, 2013 at 3:10 pm
Feb 2, 2013 at 2:18 am -
Hi all, Is anyone aware of any survey/paper/report showing the relationship between a replication factor and its penalty/benefit on write/read operations? BR, George -- ---------------------------
George Kousiouris
Feb 11, 2013 at 4:43 pm
Feb 12, 2013 at 1:37 am -
Are there any rules against writing results to Reducer.Context while in the cleanup() method? I’ve got a reducer that is downloading a few 10’s of millions of images from a set of URLs feed to it. To ...
David Parks
Feb 11, 2013 at 6:03 am
Feb 11, 2013 at 6:44 pm -
Hi I found that my job runs with such parameters: mapred.tasktracker.map.tasks.maximum 4 mapred.tasktracker.reduce.tasks.maximum 2 I try to change these parameters from my java code Properties ...
Oleg Ruchovets
Feb 11, 2013 at 11:45 am
Feb 11, 2013 at 11:55 am -
Hi am fresher in Hadoop technologies, I want to take part in any(hive, pig) related projects( I used to be informatica developer) and start off my career . All enterprises need experienced ...
Monkey2Code
Feb 11, 2013 at 5:02 am
Feb 11, 2013 at 5:19 am -
We have a use case that requires us to have the ability to: * delete all of a customers data as it sits in hdfs on a whims notice * Re-mapreduce all of a particular accounts data, going way back in ...
Sean McNamara
Feb 8, 2013 at 9:53 pm
Feb 9, 2013 at 9:31 pm -
Hi, I want to work on release 1.0.4 source code. As per Hadoop wiki HowToContribute, I can download source code from trunk or from release 1.0.4 tag. 1. Source code from hadoop/common/trunk with ...
Trupti Gaikwad
Feb 9, 2013 at 2:45 pm
Feb 9, 2013 at 3:07 pm -
Thanks for the reply Alejandro. Using a temp output directory was my first guess as well. What's the best way to proceed? I've come across FileSystem.rename but it's consistently returning false for ...
Tony Burton
Feb 1, 2013 at 3:13 pm
Feb 8, 2013 at 7:07 pm -
Hi, I'm trying to install FUSE with Hadoop 1.0.3 and I'm facing some issues. I'm following the steps I have there: http://wiki.apache.org/hadoop/MountableHDFS I have extracted 1.0.3 code using svn ...
Jean-Marc Spaggiari
Feb 8, 2013 at 2:24 pm
Feb 8, 2013 at 2:53 pm -
hive-0.9.0-cdh4.1.2)" MIME-Version: 1.0 Content-Type: multipart/alternative; boundary=bcaec502d692f132a604d5202931 --bcaec502d692f132a604d5202931 Content-Type: text/plain; charset="utf-8" ...
Viral Bajaria
Feb 7, 2013 at 11:30 am
Feb 7, 2013 at 11:09 pm -
Hi, I'm facing a problem with hadoop's secondary sort such that it is displaying the following error message. The code I have used has been used by me previously and had not given any issues for a ...
Aseem Anand
Feb 5, 2013 at 1:11 pm
Feb 7, 2013 at 5:04 pm -
Hi All, Can any one list me the mandatory system level check (ulimit,firewall,selinux...) before starting a hadoop cluster. Regards Sathish
Sara raji
Feb 7, 2013 at 3:07 pm
Feb 7, 2013 at 4:07 pm
Group Overview
group | mapreduce-user |
categories | hadoop |
discussions | 80 |
posts | 251 |
users | 106 |
website | hadoop.apache.org... |
irc | #hadoop |
106 users for February 2013
Archives
- February 2013 (251)
- January 2013 (868)
- December 2012 (621)
- November 2012 (742)
- October 2012 (868)
- September 2012 (733)
- August 2012 (1,082)
- July 2012 (226)
- June 2012 (135)
- May 2012 (102)
- April 2012 (180)
- March 2012 (164)
- February 2012 (167)
- January 2012 (284)
- December 2011 (249)
- November 2011 (201)
- October 2011 (130)
- September 2011 (310)
- August 2011 (168)
- July 2011 (207)
- June 2011 (241)
- May 2011 (225)
- April 2011 (157)
- March 2011 (146)
- February 2011 (174)
- January 2011 (226)
- December 2010 (166)
- November 2010 (135)
- October 2010 (126)
- September 2010 (145)
- August 2010 (128)
- July 2010 (121)
- June 2010 (136)
- May 2010 (82)
- April 2010 (108)
- March 2010 (62)
- February 2010 (59)
- January 2010 (95)
- December 2009 (46)
- November 2009 (45)
- October 2009 (75)
- September 2009 (24)
- August 2009 (30)
- July 2009 (15)