Grokbase Groups Pig dev March 2010
FAQ
Document in Load statement syntax that Pig and underlying M/R does not handle concatenated bz2 and gz files correctly
----------------------------------------------------------------------------------------------------------------------

Key: PIG-1305
URL: https://issues.apache.org/jira/browse/PIG-1305
Project: Pig
Issue Type: Bug
Components: documentation
Reporter: Viraj Bhat
Fix For: 0.7.0


The Pig Reference Manual needs to be updated:

Relational Operators

Syntax:

LOAD 'data' [USING function] [AS schema];

'data'

Please note:
Pig reads in both bz2 and gz formats correctly as long as they are not concatenated gzip or bz2 generated in this manner. cat *.bz2 > text/concat.bz2. Your M/R jobs may succeed but the results will not be accurate.

Viraj

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Search Discussions

  • Olga Natkovich (JIRA) at Mar 17, 2010 at 8:39 pm
    [ https://issues.apache.org/jira/browse/PIG-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Olga Natkovich reassigned PIG-1305:
    -----------------------------------

    Assignee: Corinne Chandel
    Document in Load statement syntax that Pig and underlying M/R does not handle concatenated bz2 and gz files correctly
    ----------------------------------------------------------------------------------------------------------------------

    Key: PIG-1305
    URL: https://issues.apache.org/jira/browse/PIG-1305
    Project: Pig
    Issue Type: Bug
    Components: documentation
    Reporter: Viraj Bhat
    Assignee: Corinne Chandel
    Fix For: 0.7.0


    The Pig Reference Manual needs to be updated:
    Relational Operators
    Syntax:
    LOAD 'data' [USING function] [AS schema];
    'data'
    Please note:
    Pig reads in both bz2 and gz formats correctly as long as they are not concatenated gzip or bz2 generated in this manner. cat *.bz2 > text/concat.bz2. Your M/R jobs may succeed but the results will not be accurate.
    Viraj
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Corinne Chandel (JIRA) at Mar 22, 2010 at 10:39 pm
    [ https://issues.apache.org/jira/browse/PIG-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Corinne Chandel resolved PIG-1305.
    ----------------------------------

    Resolution: Fixed
    Release Note:
    Documentation updated (Pig Latin Ref Manual 2, Load/Store Statements).
    Fix will be committed as part of PIG-1320.

    Thanks/C
    Document in Load statement syntax that Pig and underlying M/R does not handle concatenated bz2 and gz files correctly
    ----------------------------------------------------------------------------------------------------------------------

    Key: PIG-1305
    URL: https://issues.apache.org/jira/browse/PIG-1305
    Project: Pig
    Issue Type: Bug
    Components: documentation
    Reporter: Viraj Bhat
    Assignee: Corinne Chandel
    Fix For: 0.7.0


    The Pig Reference Manual needs to be updated:
    Relational Operators
    Syntax:
    LOAD 'data' [USING function] [AS schema];
    'data'
    Please note:
    Pig reads in both bz2 and gz formats correctly as long as they are not concatenated gzip or bz2 generated in this manner. cat *.bz2 > text/concat.bz2. Your M/R jobs may succeed but the results will not be accurate.
    Viraj
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Daniel Dai (JIRA) at May 14, 2010 at 6:48 am
    [ https://issues.apache.org/jira/browse/PIG-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Daniel Dai closed PIG-1305.
    ---------------------------

    Document in Load statement syntax that Pig and underlying M/R does not handle concatenated bz2 and gz files correctly
    ----------------------------------------------------------------------------------------------------------------------

    Key: PIG-1305
    URL: https://issues.apache.org/jira/browse/PIG-1305
    Project: Pig
    Issue Type: Bug
    Components: documentation
    Reporter: Viraj Bhat
    Assignee: Corinne Chandel
    Fix For: 0.7.0


    The Pig Reference Manual needs to be updated:
    Relational Operators
    Syntax:
    LOAD 'data' [USING function] [AS schema];
    'data'
    Please note:
    Pig reads in both bz2 and gz formats correctly as long as they are not concatenated gzip or bz2 generated in this manner. cat *.bz2 > text/concat.bz2. Your M/R jobs may succeed but the results will not be accurate.
    Viraj
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Viraj Bhat (JIRA) at Nov 5, 2010 at 10:26 pm
    [ https://issues.apache.org/jira/browse/PIG-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Viraj Bhat updated PIG-1305:
    ----------------------------

    Fix Version/s: (was: 0.7.0)
    0.9.0
    Document in Load statement syntax that Pig and underlying M/R does not handle concatenated bz2 and gz files correctly
    ----------------------------------------------------------------------------------------------------------------------

    Key: PIG-1305
    URL: https://issues.apache.org/jira/browse/PIG-1305
    Project: Pig
    Issue Type: Bug
    Components: documentation
    Reporter: Viraj Bhat
    Assignee: Corinne Chandel
    Fix For: 0.9.0


    The Pig Reference Manual needs to be updated:
    Relational Operators
    Syntax:
    LOAD 'data' [USING function] [AS schema];
    'data'
    Please note:
    Pig reads in both bz2 and gz formats correctly as long as they are not concatenated gzip or bz2 generated in this manner. cat *.bz2 > text/concat.bz2. Your M/R jobs may succeed but the results will not be accurate.
    Viraj
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupdev @
categoriespig, hadoop
postedMar 17, '10 at 7:03p
activeNov 5, '10 at 10:26p
posts5
users1
websitepig.apache.org

1 user in discussion

Viraj Bhat (JIRA): 5 posts

People

Translate

site design / logo © 2022 Grokbase