FAQ

Search Discussions

238 discussions - 1,201 posts

  • Hi, I recently used a backup tool to back up all my HDFS data to S3. The data is on S3 in multiparts. I need to test the restore now. Could you please give me some pointers on how to test this. 1) Do ...
    Prem yadavPrem yadav
    Aug 8, 2012 at 1:21 pm
    Aug 18, 2012 at 5:36 am
  • I'm facing the exact same issue on 0.20.2-cdh3u0. Does anybody have an idea? Tnx. Best, Christoph -------------------------- Subject Re: pending clean up step Date Tue, 14 Feb 2012 13:17:04 GMT ...
    ListenbruderListenbruder
    Aug 28, 2012 at 11:59 am
    Feb 8, 2013 at 7:07 pm
  • hi all, how can i unsubscribe form this mailing list? thanks, Eyal Golan <span class="m_body_email_addr" title="b04a676714fc56e594d86d1078f8d8ca" egolan74@gmail.com</span Visit ...
    Eyal GolanEyal Golan
    Aug 8, 2012 at 10:56 am
    Aug 28, 2012 at 7:15 pm
  • Hi, Is there any HBASE JDBC/ODBC something similar to HIVE JDBC?ODBC drivers? -- Thanks, sandeep
    Sandeep Reddy PSandeep Reddy P
    Aug 10, 2012 at 2:56 pm
    Sep 11, 2012 at 3:43 pm
  • Hi, I have a WAR which is deployed on tomcat server the WAR contains some java classes which uploads files, will i be able to upload directly in to hadoop iam using the below code in one of my java ...
    Visioner SadakVisioner Sadak
    Aug 30, 2012 at 8:32 am
    Sep 6, 2012 at 2:48 pm
  • Hi, i am just learning the Hadoop and i am setting the development environment with CDH3 pseudo distributed mode without any ssh cofiguration in CentOS 6.2 . i can run the sample programs as usual ...
    Anand sharmaAnand sharma
    Aug 9, 2012 at 10:16 am
    Aug 11, 2012 at 3:55 pm
  • Hi, i have a question concerning the execution of reducers. To use effectively the data locality of blocks in my use case i want to control on which node a reducer will be executed. In my scenario i ...
    Eduard SkaleyEduard Skaley
    Aug 27, 2012 at 5:12 pm
    Sep 5, 2012 at 1:53 pm
  • Hi, i am currently trying to run my hadoop program on a cluster. Sadly though my datanodes and tasktrackers seem to have difficulties with their communication as their logs say: * Some datanodes and ...
    Björn-Elmar MacekBjörn-Elmar Macek
    Aug 13, 2012 at 12:39 pm
    Aug 20, 2012 at 10:16 am
  • Hi folks! I have just read about the HDFS RAID feature that was added to Hadoop 0.21 or 0.22. and I am quite curious to know if people use it, what kind of use they have and what they think about ...
    Sourygna LuangsaySourygna Luangsay
    Aug 8, 2012 at 4:46 pm
    Aug 9, 2012 at 10:56 am
  • Hi, Moving this to the <span class="m_body_email_addr" title="858a0c8e479a78c1038b7355244ec07c" user@hadoop.apache.org</span lists. The general@ lists is only for project level discussions, not ...
    Harsh JHarsh J
    Aug 14, 2012 at 3:11 am
    Aug 16, 2012 at 2:10 pm
  • Hi, We have a hadoop cluster of version 0.20.2 in production. Now we have another new Hadoop cluster using cloudera's CDH3U4. We like to run distcp to copy files between the two clusters. Since the ...
    Jian FangJian Fang
    Aug 9, 2012 at 8:17 pm
    Aug 15, 2012 at 5:25 pm
  • I am very new to Hadoop. I am considering setting up a Hadoop cluster consisting of 5 nodes where each node has 3 internal hard drives. I understand HDFS has a configurable redundancy feature but ...
    Aji JanisAji Janis
    Aug 10, 2012 at 6:38 pm
    Aug 14, 2012 at 12:16 am
  • Dear list, Lets say i have a file, like this: id \t at,tlng <-- structure 1\t40.123,-50.432 2\t41.431,-43.32 ... ... lets call it: 'points.txt' I'm trying to build a map-reduce job that runs over ...
    Dexter morganDexter morgan
    Aug 27, 2012 at 8:46 pm
    Sep 10, 2012 at 1:31 pm
  • Hi I am trying to use the Hadoop filesystem abstraction with S3 but in my tinkering I am not having a great deal of success. I am particularly interested in the ability to mimic a directory structure ...
    Chris CollinsChris Collins
    Aug 28, 2012 at 5:06 pm
    Aug 30, 2012 at 11:49 am
  • hi, I have doc files in msword doc and docx format. These have entries which are seperated by an empty line. Is it possible for me to read these lines separated from empty lines at a time. Also which ...
    Siddharth TiwariSiddharth Tiwari
    Aug 24, 2012 at 5:52 am
    Aug 25, 2012 at 6:18 pm
  • Hello, I'm thinking about building a hadoop cluster to analyze all the unsubscribe mails that people mistakenly send to this address. How many PB of storage will I need? - Ryan
    Hennig, RyanHennig, Ryan
    Aug 8, 2012 at 11:12 pm
    Aug 9, 2012 at 3:25 am
  • Hello, I had a running hadoop cluster. I restarted it and after that namenode is unable to start. I am getting error saying that it's not formatted. :( Is it possible to recover the data on HDFS? ...
    Abhay RatnaparkhiAbhay Ratnaparkhi
    Aug 24, 2012 at 7:29 am
    Aug 27, 2012 at 5:35 pm
  • Hi users, I am working on a CDH3 cluster of 12 nodes (Task Trackers running on all the 12 nodes and 1 node running the Job Tracker). In order to perform a WordCount benchmark test, I did the ...
    Gaurav DasguptaGaurav Dasgupta
    Aug 16, 2012 at 2:13 pm
    Aug 17, 2012 at 12:53 pm
  • Hi, I have installed and configured Hadoop(hadoop-1.0.3) and HBase(hbase-0.94.1) in a single Fedora linux box, where HBase as pseudo cluster setup. I am able to connect the HBase suing shell from ...
    Jilani ShaikJilani Shaik
    Aug 30, 2012 at 1:31 pm
    Aug 30, 2012 at 10:57 pm
  • Hi, I'm doing some research that involves pulling data stored in a mysql cluster directly for a map reduce job, without storing the data in HDFS. I'd like to run hadoop task tracker nodes directly on ...
    Tharindu MathewTharindu Mathew
    Aug 21, 2012 at 9:07 am
    Aug 22, 2012 at 6:31 am
  • How hard would it be to block **ALL** messages with "unsubscribe" in the title? -- Mike Lyon 408-621-4826 <span class="m_body_email_addr" title="3fcb611458345a1452cbca7bcd70f6f8" ...
    Mike LyonMike Lyon
    Aug 9, 2012 at 5:10 am
    Aug 9, 2012 at 12:02 pm
  • Hi, I am currently trying to tune a CDH 4.0.1 cluster running HDFS, YARN, and HBase managed by Cloudera Manager 4.0.3 (Free Edition). In CM, there are a number of options for setting mapreduce.* ...
    MgMg
    Aug 16, 2012 at 4:23 pm
    Aug 19, 2012 at 12:34 pm
  • I am trying to setup a small cluster using hadoop 2.0.0 and using PI example to validate the setup. When I have 1 master and 1 slave the example works fine. I am getting exceptions with the PI ...
    Arjun ReddyArjun Reddy
    Aug 8, 2012 at 9:40 pm
    Aug 11, 2012 at 4:51 am
  • A AshwinA Ashwin
    Aug 8, 2012 at 3:10 pm
    Aug 11, 2012 at 4:12 am
  • Hi Users, How to open HDFS zip file(.gz) file in hadoop.? example: bin/hadoop fs -ls /user/hive/warehouse/sample -rw-r--r-- 4 root supergroup 465141227 2012-08-14 17:02 ...
    prabhu Kprabhu K
    Aug 17, 2012 at 6:48 am
    Aug 18, 2012 at 3:18 pm
  • Hi Users, Is it possible from HDFS file to local unix system, is there any command? as anyone knows please reply. Thanks, Prabhu.
    prabhu Kprabhu K
    Aug 10, 2012 at 2:21 pm
    Aug 11, 2012 at 4:09 am
  • Hello I have a very basic question - There are various flavors of hadoop by Apache, Cloudera, MapR, HortonWorks(may be more I am not aware of). I would like to learn what are the differences between ...
    Harit HimanshuHarit Himanshu
    Aug 8, 2012 at 3:54 pm
    Aug 9, 2012 at 4:48 am
  • Hello folks, I am new to hadoop, I just want to get information that how hadoop framework is usefull for real time service.?can any one explain me..? Thanks.
    Mahout userMahout user
    Aug 19, 2012 at 3:44 pm
    Aug 22, 2012 at 6:22 pm
  • All We are getting the following show in when we talk to hadoop 1.0.3 Seems it relates to these lines in Configuration.java public Configuration(boolean loadDefaults) { 225 this.loadDefaults = ...
    Ben CuthbertBen Cuthbert
    Aug 17, 2012 at 3:59 pm
    Aug 22, 2012 at 3:05 pm
  • Hi, I was going through the Apache Hadoop's distribution dependencies (jars in lib folder) and I could not find avro-1.x.x.jar. I though hadoop internally uses avro as its serialization mechanism for ...
    Rahul BhattacharjeeRahul Bhattacharjee
    Aug 22, 2012 at 5:40 am
    Aug 22, 2012 at 7:47 am
  • I am a bit confused about the different options for namenode high availability (or something along those lines) in CDH4 (hadoop-2.0.0). I understand that the secondary namenode is deprecated, and ...
    Jan Van BesienJan Van Besien
    Aug 16, 2012 at 8:12 am
    Aug 16, 2012 at 9:08 pm
  • nothing has confused me as much in hadoop as FileSystem.close(). any decent java programmer that sees that an object implements Closable writes code like this: Final FileSystem fs = ...
    Koert KuipersKoert Kuipers
    Aug 4, 2012 at 5:54 pm
    Aug 7, 2012 at 3:54 pm
  • Hi, I want to read file paragraph wise that is until it encounters an empty line it must take the content and pass out to mapper. Please guide me on how can I achieve it. Some example would be of ...
    Siddharth TiwariSiddharth Tiwari
    Aug 23, 2012 at 9:57 am
    Sep 6, 2012 at 11:53 pm
  • Hi folks, Replying to this thread is not going to get you unsubscribed and will just annoy everyone else who's subscribed. To unsubscribe please send an email to <span class="m_body_email_addr" ...
    Andy IsaacsonAndy Isaacson
    Aug 29, 2012 at 9:55 pm
    Aug 30, 2012 at 5:35 am
  • Epic Ryan!!! Sent from my Windows Phone ------------------------------ Da: Hennig, Ryan Inviato: 28/08/2012 21:14 A: <span class="m_body_email_addr" title="858a0c8e479a78c1038b7355244ec07c" ...
    Fabio PitzoluFabio Pitzolu
    Aug 28, 2012 at 9:50 pm
    Aug 29, 2012 at 6:00 pm
  • Hi Users. We have flat files on mainframes with around a billion records. We need to sort them and then use them with different jobs on mainframe for report generation. I was wondering was there any ...
    Siddharth TiwariSiddharth Tiwari
    Aug 28, 2012 at 4:24 pm
    Aug 29, 2012 at 10:38 am
  • Hi, I am trying to install sqoop. i am different commands to install in my Ubuntu but nothing is working. Can someone help me on the same these the commands i have tried sudo yum -y install ...
    Rahul pRahul p
    Aug 21, 2012 at 10:44 am
    Aug 28, 2012 at 7:15 pm
  • We have smaller nodes (4 to 6 disks), and we used to write logs to the same disk as where the OS is. So if that disks goes then i don't really care about tasktrackers failing. Also, the fact that ...
    Koert KuipersKoert Kuipers
    Aug 26, 2012 at 5:32 pm
    Aug 26, 2012 at 7:04 pm
  • Hey there, I have a doubt about reduce tasks and block writes. Do a reduce task always first write to hdfs in the node where they it is placed? (and then these blocks would be replicated to other ...
    Marc SturleseMarc Sturlese
    Aug 24, 2012 at 8:10 pm
    Aug 26, 2012 at 5:14 pm
  • I configure a job in hadoop ,set the number of map tasks in the code to 8. Then I run the job and it gets 152 map tasks. Can't get why its being overriden and whhere it get 152 from. The ...
    Nutch buddyNutch buddy
    Aug 21, 2012 at 12:20 pm
    Aug 23, 2012 at 11:31 am
  • Hi I have two mappers MAP1 and MAP2, which collect data from two different files, In reducer I want to traverse all keys and values of MAP2 for each key and value of MAP1. How can I achieve it in one ...
    Siddharth TiwariSiddharth Tiwari
    Aug 20, 2012 at 7:54 pm
    Aug 21, 2012 at 8:03 am
  • We have an application or a series of applications that listen to incoming feeds they then distribute this data in XML form to a number of queues. Another set of processes listen to these queues and ...
    Robert NicholsonRobert Nicholson
    Aug 19, 2012 at 4:47 pm
    Aug 20, 2012 at 2:14 am
  • Are there any utilities available to help parse jobtracker log files? Hank Cohen <span class="m_body_email_addr" title="a2c96842b14ad23483355b001088ac32" hank.cohen@altior.com</span ...
    Hank CohenHank Cohen
    Aug 17, 2012 at 4:30 am
    Aug 18, 2012 at 12:28 am
  • Hello all, I'm using CDH3u3. If I want to process one File, set to non splitable hadoop starts one Mapper and no Reducer (thats ok for this test scenario). The Mapper goes through a configuration ...
    Matthias KrickeMatthias Kricke
    Aug 13, 2012 at 1:17 pm
    Aug 13, 2012 at 4:08 pm
  • Hello I use "Hadoop Crypto Compressor" from this site" https://github.com/geisbruch/HadoopCryptoCompressor" for encryption hdfs files. I've downloaded the complete code & create the jar file,Change ...
    Farrokh ShahriariFarrokh Shahriari
    Aug 7, 2012 at 7:41 am
    Aug 10, 2012 at 10:11 am
  • I'm having some trouble with permissions on HDFS. I'm trying to create a file in a directory where the user belongs to a group that has write permissions, but it doesn't seem to be working. First, ...
    John ArmstrongJohn Armstrong
    Aug 9, 2012 at 11:56 am
    Aug 9, 2012 at 6:08 pm
  • Hi, I am trying to add a file to HDFS programmatically. In my code, I am adding hdfs-site.xml and other xml to Hadoop Configuration object as follows Configuration configuration = null ...
    Chandra Mohan, Ananda Vel MuruganChandra Mohan, Ananda Vel Murugan
    Aug 9, 2012 at 7:08 am
    Aug 9, 2012 at 8:43 am
  • Hi, I tried decommissioning a node in my Hadoop cluster. I am running Apache Hadoop 1.0.2 and ours is a four node cluster. I also have HBase installed in my cluster. I have shut down region server in ...
    Chandra Mohan, Ananda Vel MuruganChandra Mohan, Ananda Vel Murugan
    Aug 7, 2012 at 12:59 am
    Aug 8, 2012 at 3:42 pm
  • Hi Hadoopers, We have a plan to migrate Hadoop cluster to a different datacenter where we can triple the size of the cluster. Currently, our 0.20.2 cluster have around 1PB of data. We use only ...
    Patai SangbutsarakumPatai Sangbutsarakum
    Aug 3, 2012 at 6:50 pm
    Aug 7, 2012 at 3:58 pm
  • Hi All, I executed the "MRBench" program from "hadoop-test.jar" in my 12 node CDH3 cluster. After executing, I had some strange observations regarding the number of Maps it ran. First I ran the ...
    Gaurav DasguptaGaurav Dasgupta
    Aug 28, 2012 at 10:32 am
    Aug 29, 2012 at 5:12 pm
Group Navigation
period‹ prev | Aug 2012 | next ›
Group Overview
groupcommon-user @
categorieshadoop
discussions238
posts1,201
users377
websitehadoop.apache.org...
irc#hadoop

377 users for August 2012

Harsh J: 104 posts Mohammad Tariq: 41 posts Bertrand Dechoux: 33 posts Bejoy KS: 29 posts Michel Segel: 27 posts Rahul p: 24 posts Anil Gupta: 23 posts Mohit Anchlia: 21 posts Sathyavageeswaran: 21 posts Siddharth Tiwari: 21 posts Steve Loughran: 17 posts Chandra Mohan, Ananda Vel Murugan: 15 posts Gaurav Dasgupta: 15 posts Håvard Wahl Kongsgård: 14 posts Koert Kuipers: 13 posts Björn-Elmar Macek: 11 posts Anand sharma: 10 posts Ted Dunning: 10 posts Arun C Murthy: 9 posts Eduard Skaley: 9 posts
show more