Grokbase Groups Pig dev August 2010
FAQ
better error handling in casting relations to scalars
-----------------------------------------------------

Key: PIG-1550
URL: https://issues.apache.org/jira/browse/PIG-1550
Project: Pig
Issue Type: Bug
Reporter: Olga Natkovich
Assignee: Thejas M Nair
Fix For: 0.8.0


I ran the following script:

Input data:

joe 100
sam 20
bob 134

Script:

A = load 'user_clicks' as (user: chararray, clicks: int);
B = group A by user;
C = foreach A generate group, SUM(A.clicks);
D = foreach A generate clicks/(double)C.$1;
dump C;

Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
However at the end of processing, I saw a misleading error:

210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage


--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Search Discussions

  • Thejas M Nair (JIRA) at Sep 2, 2010 at 5:18 pm
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Thejas M Nair updated PIG-1550:
    -------------------------------

    Attachment: PIG-1550.1.patch

    PIG-1550.1.patch
    test-patch has succeeded . unit tests are still running.
    [exec] +1 overall.
    [exec]
    [exec] +1 @author. The patch does not contain any @author tags.
    [exec]
    [exec] +1 tests included. The patch appears to include 3 new or modified tests.
    [exec]
    [exec] +1 javadoc. The javadoc tool did not generate any warning messages.
    [exec]
    [exec] +1 javac. The applied patch does not increase the total number of javac compiler warnings.
    [exec]
    [exec] +1 findbugs. The patch does not introduce any new Findbugs warnings.
    [exec]
    [exec] +1 release audit. The applied patch does not increase the total number of release audit warnings.

    better error handling in casting relations to scalars
    -----------------------------------------------------

    Key: PIG-1550
    URL: https://issues.apache.org/jira/browse/PIG-1550
    Project: Pig
    Issue Type: Bug
    Reporter: Olga Natkovich
    Assignee: Thejas M Nair
    Fix For: 0.8.0

    Attachments: PIG-1550.1.patch


    I ran the following script:
    Input data:
    joe 100
    sam 20
    bob 134
    Script:
    A = load 'user_clicks' as (user: chararray, clicks: int);
    B = group A by user;
    C = foreach A generate group, SUM(A.clicks);
    D = foreach A generate clicks/(double)C.$1;
    dump C;
    Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
    However at the end of processing, I saw a misleading error:
    210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Thejas M Nair (JIRA) at Sep 2, 2010 at 5:18 pm
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Thejas M Nair updated PIG-1550:
    -------------------------------

    Status: Patch Available (was: Open)
    better error handling in casting relations to scalars
    -----------------------------------------------------

    Key: PIG-1550
    URL: https://issues.apache.org/jira/browse/PIG-1550
    Project: Pig
    Issue Type: Bug
    Reporter: Olga Natkovich
    Assignee: Thejas M Nair
    Fix For: 0.8.0

    Attachments: PIG-1550.1.patch


    I ran the following script:
    Input data:
    joe 100
    sam 20
    bob 134
    Script:
    A = load 'user_clicks' as (user: chararray, clicks: int);
    B = group A by user;
    C = foreach A generate group, SUM(A.clicks);
    D = foreach A generate clicks/(double)C.$1;
    dump C;
    Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
    However at the end of processing, I saw a misleading error:
    210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Thejas M Nair (JIRA) at Sep 2, 2010 at 10:49 pm
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905715#action_12905715 ]

    Thejas M Nair commented on PIG-1550:
    ------------------------------------

    Unit tests have succeeded. Patch is ready for review.

    better error handling in casting relations to scalars
    -----------------------------------------------------

    Key: PIG-1550
    URL: https://issues.apache.org/jira/browse/PIG-1550
    Project: Pig
    Issue Type: Bug
    Reporter: Olga Natkovich
    Assignee: Thejas M Nair
    Fix For: 0.8.0

    Attachments: PIG-1550.1.patch


    I ran the following script:
    Input data:
    joe 100
    sam 20
    bob 134
    Script:
    A = load 'user_clicks' as (user: chararray, clicks: int);
    B = group A by user;
    C = foreach A generate group, SUM(A.clicks);
    D = foreach A generate clicks/(double)C.$1;
    dump C;
    Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
    However at the end of processing, I saw a misleading error:
    210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Olga Natkovich (JIRA) at Sep 2, 2010 at 10:50 pm
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905717#action_12905717 ]

    Olga Natkovich commented on PIG-1550:
    -------------------------------------

    I will review the patch

    better error handling in casting relations to scalars
    -----------------------------------------------------

    Key: PIG-1550
    URL: https://issues.apache.org/jira/browse/PIG-1550
    Project: Pig
    Issue Type: Bug
    Reporter: Olga Natkovich
    Assignee: Thejas M Nair
    Fix For: 0.8.0

    Attachments: PIG-1550.1.patch


    I ran the following script:
    Input data:
    joe 100
    sam 20
    bob 134
    Script:
    A = load 'user_clicks' as (user: chararray, clicks: int);
    B = group A by user;
    C = foreach A generate group, SUM(A.clicks);
    D = foreach A generate clicks/(double)C.$1;
    dump C;
    Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
    However at the end of processing, I saw a misleading error:
    210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Olga Natkovich (JIRA) at Sep 2, 2010 at 11:23 pm
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905731#action_12905731 ]

    Olga Natkovich commented on PIG-1550:
    -------------------------------------

    +1, looks good
    better error handling in casting relations to scalars
    -----------------------------------------------------

    Key: PIG-1550
    URL: https://issues.apache.org/jira/browse/PIG-1550
    Project: Pig
    Issue Type: Bug
    Reporter: Olga Natkovich
    Assignee: Thejas M Nair
    Fix For: 0.8.0

    Attachments: PIG-1550.1.patch


    I ran the following script:
    Input data:
    joe 100
    sam 20
    bob 134
    Script:
    A = load 'user_clicks' as (user: chararray, clicks: int);
    B = group A by user;
    C = foreach A generate group, SUM(A.clicks);
    D = foreach A generate clicks/(double)C.$1;
    dump C;
    Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
    However at the end of processing, I saw a misleading error:
    210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Thejas M Nair (JIRA) at Sep 2, 2010 at 11:30 pm
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Thejas M Nair updated PIG-1550:
    -------------------------------

    Status: Resolved (was: Patch Available)
    Resolution: Fixed

    Patch committed to trunk and 0.8 branch.

    better error handling in casting relations to scalars
    -----------------------------------------------------

    Key: PIG-1550
    URL: https://issues.apache.org/jira/browse/PIG-1550
    Project: Pig
    Issue Type: Bug
    Reporter: Olga Natkovich
    Assignee: Thejas M Nair
    Fix For: 0.8.0

    Attachments: PIG-1550.1.patch


    I ran the following script:
    Input data:
    joe 100
    sam 20
    bob 134
    Script:
    A = load 'user_clicks' as (user: chararray, clicks: int);
    B = group A by user;
    C = foreach A generate group, SUM(A.clicks);
    D = foreach A generate clicks/(double)C.$1;
    dump C;
    Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
    However at the end of processing, I saw a misleading error:
    210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupdev @
categoriespig, hadoop
postedAug 20, '10 at 12:30a
activeSep 2, '10 at 11:30p
posts7
users1
websitepig.apache.org

1 user in discussion

Thejas M Nair (JIRA): 7 posts

People

Translate

site design / logo © 2022 Grokbase