FAQ
Hi All
Am a student trying to integrate PIG and Hadoop technologies to build a
custom application as a part of my MS project.
Am trying out a simple scenario where I have setup a single node hadoop
cluster and trying to execute the pig script "script1-*hadoop*.*pig" *mentioned
in the pig tutorial.

Am hitting several issues like "Failed to create data storage "etc.
Had posted same to the groups already.
http://www.nabble.com/Integration-of-pig-and-hadoop-fails-with-%22Failed-to-create-DataStorage%22-error.-td18931962.html

Could you please suggest me the proper steps to integrate pig and hadoop.
Right now, am following the below ones.

1) Have downloded latest source for hadoop and PIG
2) Compiled hadoop and started single node cluster
3) Compiled PIG and replaced the hadoop class files with the new ones from
step 2 in the pig.jar
4) executing the pig script by setting HADOOPSITEPATH

Please let me if the above steps needs are incorrect (or) should i use any
specific pig and hadoop versions? We are stuck up with the errors. Request
you to pls help in resolving the same.

Thankyou
Srilatha

Search Discussions

  • Alan Gates at Sep 22, 2008 at 3:22 pm
    Which version of hadoop do you have? The latest hadoop at this moment
    is 0.18.1 I believe. The pig patch
    (https://issues.apache.org/jira/browse/PIG-253) to integrate with hadoop
    0.18 was only checked in Sept 18. If you downloaded pig 0.1.0, it works
    with hadoop 0.17.x. Or, if you directly downloaded the source code
    before 9/18, it will work with hadoop 0.17.x.

    Alan.

    Latha wrote:
    Hi All
    Am a student trying to integrate PIG and Hadoop technologies to build a
    custom application as a part of my MS project.
    Am trying out a simple scenario where I have setup a single node hadoop
    cluster and trying to execute the pig script "script1-*hadoop*.*pig" *mentioned
    in the pig tutorial.

    Am hitting several issues like "Failed to create data storage "etc.
    Had posted same to the groups already.
    http://www.nabble.com/Integration-of-pig-and-hadoop-fails-with-%22Failed-to-create-DataStorage%22-error.-td18931962.html

    Could you please suggest me the proper steps to integrate pig and hadoop.
    Right now, am following the below ones.

    1) Have downloded latest source for hadoop and PIG
    2) Compiled hadoop and started single node cluster
    3) Compiled PIG and replaced the hadoop class files with the new ones from
    step 2 in the pig.jar
    4) executing the pig script by setting HADOOPSITEPATH

    Please let me if the above steps needs are incorrect (or) should i use any
    specific pig and hadoop versions? We are stuck up with the errors. Request
    you to pls help in resolving the same.

    Thankyou
    Srilatha
  • Latha at Sep 23, 2008 at 8:09 pm
    Hi All,

    Thankyou for your input. Was using 0.18 version of hadoop with pig
    0.1.0.which caused the errors.
    Now pig 0.1.0 is working well with branch 0.17 .And the script1-hadoop.pig
    example is getting executed properly.

    Thankyou so much
    Srilatha
    On Mon, Sep 22, 2008 at 8:51 PM, Alan Gates wrote:

    Which version of hadoop do you have? The latest hadoop at this moment is
    0.18.1 I believe. The pig patch (
    https://issues.apache.org/jira/browse/PIG-253) to integrate with hadoop
    0.18 was only checked in Sept 18. If you downloaded pig 0.1.0, it works
    with hadoop 0.17.x. Or, if you directly downloaded the source code before
    9/18, it will work with hadoop 0.17.x.

    Alan.


    Latha wrote:
    Hi All
    Am a student trying to integrate PIG and Hadoop technologies to build a
    custom application as a part of my MS project.
    Am trying out a simple scenario where I have setup a single node hadoop
    cluster and trying to execute the pig script "script1-*hadoop*.*pig"
    *mentioned
    in the pig tutorial.

    Am hitting several issues like "Failed to create data storage "etc.
    Had posted same to the groups already.

    http://www.nabble.com/Integration-of-pig-and-hadoop-fails-with-%22Failed-to-create-DataStorage%22-error.-td18931962.html

    Could you please suggest me the proper steps to integrate pig and hadoop.
    Right now, am following the below ones.

    1) Have downloded latest source for hadoop and PIG
    2) Compiled hadoop and started single node cluster
    3) Compiled PIG and replaced the hadoop class files with the new ones from
    step 2 in the pig.jar
    4) executing the pig script by setting HADOOPSITEPATH

    Please let me if the above steps needs are incorrect (or) should i use any
    specific pig and hadoop versions? We are stuck up with the errors. Request
    you to pls help in resolving the same.

    Thankyou
    Srilatha

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupdev @
categoriespig, hadoop
postedSep 22, '08 at 5:15a
activeSep 23, '08 at 8:09p
posts3
users2
websitepig.apache.org

2 users in discussion

Latha: 2 posts Alan Gates: 1 post

People

Translate

site design / logo © 2022 Grokbase