Search Discussions

110 discussions - 408 posts

  • Hello. I'm facing a issue when trying to configure my SecondaryNameNode on a different machine than my NameNode. When both are on the same machine everything works fine but after moving the secondary ...
    Ramon PinRamon Pin
    Jun 4, 2012 at 11:08 am
    Jun 6, 2012 at 1:56 pm
  • Hi all, I have a csv file with 46 columns but i'm getting error when i do some analysis on that data type. For simplification i have taken 3 columns and now my csv is like c,zxy,xyz d,"abc,def",abcd ...
    Sandeep Reddy PSandeep Reddy P
    Jun 26, 2012 at 7:08 pm
    Jun 27, 2012 at 12:40 pm
  • Hello, we're testing application on 8 nodes, where each node has 20GB of local storage available. What we are trying to achieve is to get more than 20GB to be processed on this cluster. Is there a ...
    Ondřej KlimperaOndřej Klimpera
    Jun 14, 2012 at 7:56 am
    Jun 14, 2012 at 2:49 pm
  • My local environment: single ubuntu 11.10 desktop version, oracle jdk 7.0_04, MIT kerberos 5, apache hadoop-1.0.2. I am able to get kerberos working, here is my key ...
    Allan YanAllan Yan
    Jun 4, 2012 at 5:37 pm
    Jun 6, 2012 at 2:16 am
  • Hi all, This is my first email to the list, so feel free to be candid in your complaints if I'm doing something canonically uncouth in my requests for assistance. I'm using Hadoop 0.23 on 50 ...
    Ellis H. Wilson IIIEllis H. Wilson III
    Jun 19, 2012 at 2:28 pm
    Jun 29, 2012 at 12:31 am
  • Hi, While reading the book "HADOOP: a definitive guide", 6th chapter{ How does MapReduce work? }, what I understood was that tasktrackers send heartbeat messages indicating free slots where tasks may ...
    Subramanian GanapathySubramanian Ganapathy
    Jun 25, 2012 at 6:55 am
    Jun 27, 2012 at 3:04 am
  • Hello, I have very small input size (kB), but processing to produce some output takes several minutes. Is there a way how to say, file has 100 lines, i need 10 mappers, where each mapper node has to ...
    Ondřej KlimperaOndřej Klimpera
    Jun 16, 2012 at 9:02 am
    Jun 21, 2012 at 5:06 am
  • Hi, I want to use Gridmix3 benchmark with hadoop version 1.0.0. I am following this link. http://hadoop.apache.org/mapreduce/docs/current/gridmix.html Here, it is mentioned that, to run Gridmix3, I ...
    Jun 13, 2012 at 3:28 pm
    Jun 19, 2012 at 8:00 pm
  • Hi, Is it possible to implement transpose operation of rows into columns and vice versa... i.e. col1 col2 col3 col4 col5 col6 col7 col8 col9 col10 col11 col12 can this be converted to col1 col4 col7 ...
    Subir SSubir S
    Jun 21, 2012 at 8:17 am
    Jun 25, 2012 at 7:56 pm
  • Hi I am getting the following error while trying to execute any example (wordcount, terasort etc): 12/06/21 09:52:39 INFO mapreduce.Job: Running job: job_1340251923324_0001 12/06/21 09:52:45 INFO ...
    Prajakta KalmeghPrajakta Kalmegh
    Jun 21, 2012 at 4:33 am
    Jun 22, 2012 at 11:20 am
  • Hi, HiBench, a Hadoop benchmark suite constructed by Intel, is used intensively for Hadoop benchmarking, tuning & optimizations both inside Intel and by our customers/partners. It consists of a set ...
    Dai, JasonDai, Jason
    Jun 14, 2012 at 4:27 am
    Jun 20, 2012 at 2:32 am
  • We have continuous flow of data into the sequence file. I am wondering what would be the ideal file size before file gets rolled over. I know too many small files are not good but could someone tell ...
    Mohit AnchliaMohit Anchlia
    Jun 6, 2012 at 2:01 pm
    Jun 7, 2012 at 11:34 pm
  • "I was always wondering after mapping, how each reduce task get its input. It is said in google's paper and hadoop's documentation that a sort is done to aggregate the same key of the map output. But ...
    Barry, Sean FBarry, Sean F
    Jun 5, 2012 at 10:47 pm
    Jun 6, 2012 at 5:20 pm
  • Hi All, We are running a mapreduce job in a fully distributed cluster.The output of the job is writing to HBase. While running this job we are getting an error: *Caused by ...
    Manu SManu S
    Jun 6, 2012 at 2:25 pm
    Jul 3, 2012 at 6:46 am
  • HI all, I ran word count examples in hadoop and it's very good starting point for hadoop.But i am looking for more programs with advanced concept . If you have any programs or suggestion, please send ...
    Saravanan NagarajanSaravanan Nagarajan
    Jun 29, 2012 at 4:47 pm
    Jun 30, 2012 at 4:14 pm
  • Hi, I tried to decommission a datanode, and then found that the active nodes don't have enough space to store the replicated blocks on the decommissioning node. So can I remove a big folder when the ...
    Adrian LiuAdrian Liu
    Jun 26, 2012 at 2:07 pm
    Jun 26, 2012 at 5:17 pm
  • Hi, all I'm using gridmix2 to test my cluster, while in its README file, there are statements like the following: +1) Three stage map/reduce job + Input: 500GB compressed (2TB uncompressed) ...
    Nan ZhuNan Zhu
    Jun 14, 2012 at 6:58 am
    Jun 25, 2012 at 9:46 am
  • Hi, it may be a stupid question, but in my application I could do without sort by keys. If only reducers could be told to start their work on the first maps that they see, my processing would begin ...
    Mark KerznerMark Kerzner
    Jun 18, 2012 at 2:20 pm
    Jun 18, 2012 at 3:13 pm
  • Hi, I've got a question regarding hadoop configuration. Is it possible to pass configuration parameters on job start up? Something like that: hadoop -HADOOP_HEAPSIZE=4G jar some.jar ...
    Matthias ZenglerMatthias Zengler
    Jun 13, 2012 at 12:21 pm
    Jun 14, 2012 at 1:20 pm
  • Dear All. I've been configuring 3 server using Hadoop 1.0.x , Single Node, how to assembly them into 1 Multi Node Cluster? Because when I search for documentation, i've just got configuration for ...
    Alpha Bagus SunggonoAlpha Bagus Sunggono
    Jun 14, 2012 at 10:01 am
    Jun 14, 2012 at 1:17 pm
  • Hi All I am using cdh4 for running a HBase cluster on CentOs6.0. I have 5 nodes in my cluster(2 Admin Node and 3 DN). My resourcemanager is up and running and showing that all three DN are running ...
    Anil guptaAnil gupta
    Jun 13, 2012 at 7:25 pm
    Jun 14, 2012 at 5:42 am
  • Why is there no documentation on how to run a datanode in secure mode? Or have I just missed it. I've searched and found bits and pieces, but shouldn't this information be in the security ...
    Tony DeanTony Dean
    Jun 4, 2012 at 1:02 am
    Jun 6, 2012 at 11:58 am
  • Hi, I have a small application where I have only mapper class defined(no reducer, no combiner). Within the mapper class, I have an if condition according to which I decide If I want to put something ...
    Murat migdisogluMurat migdisoglu
    Jun 4, 2012 at 12:22 pm
    Jun 5, 2012 at 10:19 am
  • I am encountering this error Exception in thread "main" java.io.IOException: Cannot open filename /data/input.txt When trying to run this command on a single hadoop cluster running hadoop .20.2 ...
    Jun 18, 2012 at 1:27 am
    Jul 25, 2012 at 4:58 am
  • Hi, I have setup a hadoop cluster with three nodes. when I start the cluster, all daemons are running in Master and Slave as well. while monitor from UI(using port 50030), its showing only one node ...
    Habeeb RazaHabeeb Raza
    Jun 18, 2012 at 9:52 am
    Jul 25, 2012 at 3:34 am
  • Hi I am trying to get the ApplicationMaster info using the <http://<proxy http address:port /proxy/{appid}/ws/v1/mapreduce/info link as described on the < ...
    Prajakta KalmeghPrajakta Kalmegh
    Jun 28, 2012 at 12:44 pm
    Jul 4, 2012 at 12:26 pm
  • First, I'd like to thank the community for the time and effort they put into sharing their knowledge... A few weeks back I was able to configure a secure hadoop/hbase cluster (MIT 1.6.1 Kerberos on ...
    Tony DeanTony Dean
    Jun 29, 2012 at 8:51 pm
    Jun 29, 2012 at 10:13 pm
  • Hi all, sorry to bother, I have a simple hadoop job. It was running well both in local mode and in real hadoop cluster. Recently I try to run it again in single node cluster, and I got the following ...
    Sheng GuoSheng Guo
    Jun 23, 2012 at 7:41 am
    Jun 24, 2012 at 2:50 pm
  • Hi all, I'm using textbook example (page 56) to move data from local file system to HDFS. But there is an error in the line FileSystem fs = FileSystem.get(URI.create(dst), conf); Error is the method ...
    Sandeep Reddy PSandeep Reddy P
    Jun 22, 2012 at 3:05 pm
    Jun 22, 2012 at 3:42 pm
  • Hey there, I am a student at the Department of Foreign Languages and at the same time a volunteer at an organization named “Translation for Education”. I love surfing on the Internet and being ...
    Vseslava KavchichVseslava Kavchich
    Jun 19, 2012 at 4:23 pm
    Jun 19, 2012 at 7:15 pm
  • I am wondering the role of sync in replication of data to other nodes. Say client writes a line to a file in Hadoop, at this point file handle is open and sync has not been called. In this scenario ...
    Mohit AnchliaMohit Anchlia
    Jun 8, 2012 at 11:29 pm
    Jun 10, 2012 at 7:17 pm
  • Hello, I'm in the trying to figure out how to decommission data nodes. Here's what I do: In hdfs-site.xml I have: <property <name dfs.hosts.exclude</name <value ...
    Chris GrierChris Grier
    Jun 8, 2012 at 6:47 pm
    Jun 8, 2012 at 7:57 pm
  • Hi I have done MapReduce programming using Eclipse before but now I need to learn the Hadoop code internals for one of my projects. I have forked Hadoop from github ...
    Prajakta KalmeghPrajakta Kalmegh
    Jun 8, 2012 at 8:04 am
    Jun 8, 2012 at 2:33 pm
  • how can i integrate hadood and nutch ..anyone please brief me .
    Abhishek tiwariAbhishek tiwari
    Jun 8, 2012 at 7:59 am
    Jun 8, 2012 at 9:32 am
  • Hi All, In pseudo distributed node HBaseMaster is stopping automatically when we starts HbaseRegion. I have changed all the configuration files of Hadoop,Hbase & Zookeeper to set the exact hostname ...
    Manu SManu S
    Jun 7, 2012 at 9:19 am
    Jun 7, 2012 at 4:59 pm
  • with kerberos enabled a mapreduce job "runs as" the user that submitted it. does this mean the user that submitted the job needs to have linux accounts on all machines on the cluster? how does ...
    Koert KuipersKoert Kuipers
    Jun 7, 2012 at 1:21 pm
    Jun 7, 2012 at 3:44 pm
  • Running Hadoop 0.22 and I need to restart the namenode so my new rack configuration will be set into place. I am thinking of doing a quick stop and start of the namenode but what will happen to the ...
    Jun 7, 2012 at 10:30 am
    Jul 9, 2012 at 6:11 pm
  • Hi, We have a situation where all files that we have are 64 MB block size. I want to change these files (output of a map job mainly) to 128 MB blocks. What would be good way to do this migration from ...
    Anurag TangriAnurag Tangri
    Jun 26, 2012 at 2:07 pm
    Jun 26, 2012 at 2:54 pm
  • .. [INFO] Apache Hadoop Archives ............................ SUCCESS [1.767s] [INFO] Apache Hadoop HDFS Raid ........................... FAILURE [1.321s] . . . [ERROR] Failed to execute goal ...
    Jun 20, 2012 at 10:10 pm
    Jun 21, 2012 at 4:26 pm
  • I have a textfile which doesn't have any newline characters. The records are separated by a special character ( e.g. $ ). if I push a single file of 5 GB to hdfs, how will it identify the boundaries ...
    Prasenjit mukherjeePrasenjit mukherjee
    Jun 14, 2012 at 1:12 am
    Jun 21, 2012 at 6:08 am
  • Hello, I'm tring to use MapFile (stored on HDFS) in my reduce task, which processes some text data. When I try to initialize MapFile.Reader in reducer configure() method, app throws ...
    Ondřej KlimperaOndřej Klimpera
    Jun 19, 2012 at 12:34 pm
    Jun 19, 2012 at 7:31 pm
  • when i am trying to start the namenode there is an error in the start up and when i try to check the hadoop-env.sh file then i get error message as JAVA_HOME not set properly resourcemanager running ...
    Soham sardarSoham sardar
    Jun 19, 2012 at 4:24 pm
    Jun 19, 2012 at 7:24 pm
  • All hadoop contributors/experts, I am trying to simulate split brain in our installation. There are a few things we want to know 1. Does data corruption happen? 2. If Yes in #1, how to recover from ...
    Hdev mlHdev ml
    Jun 19, 2012 at 1:30 am
    Jun 19, 2012 at 3:27 pm
  • Hi all I run a simple sort program, however, I meet such error like below. 12/06/15 01:13:17 WARN mapred.JobClient: Error reading task outputServer returned HTTP response code: 403 for URL ...
    Yongwei XingYongwei Xing
    Jun 14, 2012 at 5:15 pm
    Jun 16, 2012 at 1:54 pm
  • Hello, why when running Hadoop, there is always HADOOP_HOME shell variable being told to be deprecated. How to set installation directory on cluster nodes, which variable is correct. Thanks Ondrej ...
    Ondřej KlimperaOndřej Klimpera
    Jun 14, 2012 at 5:42 am
    Jun 14, 2012 at 1:28 pm
  • Hi, I need to know difference between two hardware configurations below for 24TB of data. (slave machines only for hadoop,hive and pig) TYPE A: 2 quad core, 32 GB memory, 6 x 1TB drives(6TB / ...
    Sandeep Reddy PSandeep Reddy P
    Jun 13, 2012 at 2:37 pm
    Jun 13, 2012 at 7:29 pm
  • Hello everyone.. I have a cluster of 5 VMs, 1 as master/slave the rest are slaves. I run bin/start-all.sh everything seems ok i get no errors.. I check with jps in all server they run: master: 22418 ...
    Pierre Antoine DuBoDeNaPierre Antoine DuBoDeNa
    Jun 9, 2012 at 7:52 pm
    Jun 9, 2012 at 8:32 pm
  • If I type 'http://localhost:50070' or 'http://localhost:9000' to see the nodes,my browser shows me nothing I think it can't connect to the server. I tested my hadoop with this command: hadoop jar ...
    Babak BastanBabak Bastan
    Jun 6, 2012 at 4:36 pm
    Jun 6, 2012 at 5:04 pm
  • Hello all, I am looking forward to build a 5 node hadoop cluster with the following configurations per machine. -- 1. Intel Xeon E5-2609 (2.40GHz/4-core) 2. 32 GB RAM (8GB 1Rx4 PC3) 3. 5 x 900GB 6G ...
    Praveenesh kumarPraveenesh kumar
    Jun 4, 2012 at 10:27 am
    Jun 5, 2012 at 4:54 am
  • Hi, I am new to hadoop and hbase, but have spent the last few weeks learning as much as I can... I am attempting to create an hbase table during a hadoop job by simply doing puts to a table from each ...
    Jonathan BishopJonathan Bishop
    Jun 3, 2012 at 1:26 am
    Jun 4, 2012 at 12:27 am
Group Navigation
period‹ prev | Jun 2012 | next ›
Group Overview
groupcommon-user @

143 users for June 2012

Harsh J: 43 posts Jagat Singh: 11 posts Ondřej Klimpera: 11 posts Prajakta Kalmegh: 11 posts Praveenesh kumar: 11 posts Michel Segel: 10 posts Shashwat shriparv: 10 posts Ramon Pin: 9 posts Sandeep Reddy P: 9 posts Tony Dean: 9 posts Nitin Pawar: 7 posts Anil Gupta: 6 posts Madhu phatak: 6 posts Manu S: 6 posts Mohit Anchlia: 6 posts Barry, Sean F: 5 posts Bejoy KS: 5 posts Devaraj k: 5 posts Ellis H. Wilson III: 5 posts Ravi Prakash: 5 posts
show more