Hi Clouderians,

I am a beginner with Hadoop. After a long struggle, I have just set up a small
cluster of three nodes (VirtualBox machines). I tried to run a simple MapReduce
program. As any beginner would, I started with the wordcount Hadoop Tutorial
<https://ccp.cloudera.com/display/DOC/Hadoop+Tutorial> on the Cloudera
documentation site. Unfortunately, while reading through the tutorial I felt
that some important steps are missing, and on top of that the program doesn't
run at all (it reports some errors). Can you guide me on what to learn first to
become familiar with Hadoop and Cloudera Manager? What are the most important
things to learn first to make my experience with Hadoop simpler and to become
productive in the shortest possible time?

Please help. Thanks


  • Adam Smieszny at Dec 20, 2012 at 2:05 pm
    Hi Nafaa,

    The following article might be useful to you:
    http://www.ibm.com/developerworks/data/library/techarticle/dm-1209hadoopbigdata/

    Thanks,
    Adam

    On Thu, Dec 20, 2012 at 2:51 AM, Nafaa Boutefer wrote:

    Hi Clouderians,

    I am a beginner with Hadoop. After a long struggle, I have just set up a
    small cluster of three nodes (VirtualBox machines). I tried to run a simple
    MapReduce program. As any beginner would, I started with the wordcount
    Hadoop Tutorial <https://ccp.cloudera.com/display/DOC/Hadoop+Tutorial> on
    the Cloudera documentation site. Unfortunately, while reading through the
    tutorial I felt that some important steps are missing, and on top of that
    the program doesn't run at all (it reports some errors). Can you guide me
    on what to learn first to become familiar with Hadoop and Cloudera Manager?
    What are the most important things to learn first to make my experience
    with Hadoop simpler and to become productive in the shortest possible time?

    Please help. Thanks

    --
    Adam Smieszny
    Cloudera | Systems Engineer | http://www.linkedin.com/in/adamsmieszny
    917.830.4156
  • Jonathan Natkins at Dec 20, 2012 at 5:11 pm
    Hi Nafaa,

    It might be worthwhile to take a look at the Analyzing Twitter with Hadoop
    <http://blog.cloudera.com/blog/2012/09/analyzing-twitter-data-with-hadoop/>
    blog post series. There is a GitHub repo associated with it
    (https://github.com/cloudera/cdh-twitter-example), which has
    step-by-step instructions on how to set up the application. It doesn't use
    MapReduce, but it does use other very common CDH components, like Flume and
    Hive. I'd also suggest looking at the recent How-To: Run a MapReduce Job in
    CDH4 <http://blog.cloudera.com/blog/2012/12/how-to-run-a-mapreduce-job-in-cdh4/>,
    which has a lot of information on writing, building, and executing MR jobs
    in CDH4.

    Thanks,
    Natty
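
For orientation before working through the posts linked above, the shape of a word-count job can be sketched without a cluster. The following is a plain-Python illustration of the map, shuffle, and reduce phases, not Hadoop API code; the function names (map_phase, shuffle_phase, reduce_phase, word_count) are made up for this sketch, and a real job would be written and submitted as the linked How-To describes.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every whitespace-separated word in a line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group values by key, roughly what the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Sum the counts collected for a single word."""
    return word, sum(counts)


def word_count(lines):
    """Run the three phases in memory over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, standing in for files stored in HDFS.
    sample = ["Hello Hadoop", "hello MapReduce"]
    print(word_count(sample))
```

On a cluster the same three roles are split across machines: mappers run on the nodes holding the input blocks, the framework performs the grouping, and reducers write the totals back to HDFS.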


Discussion Overview
groups: cm-users
categories: hadoop
posted: Dec 20, '12 at 7:51a
active: Dec 20, '12 at 5:11p
posts: 3
users: 3
website: cloudera.com
irc: #hadoop
