FAQ
My text file works with one line per record with numerical data. How do I
convert this text file to sequence file?

--
Flávio Dias
flaviodiasps@gmail.com

Search Discussions

  • Bejoy KS at Aug 8, 2012 at 3:13 pm
    If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.

    Regards
    Bejoy KS

    Sent from handheld, please excuse typos.

    -----Original Message-----
    From: Flavio Dias <flaviodiasps@gmail.com>
    Date: Wed, 8 Aug 2012 09:43:26
    To: <user@hadoop.apache.org>
    Reply-To: user@hadoop.apache.org
    Subject: text file to sequence file

    My text file works with one line per record with numerical data. How do I
    convert this text file to sequence file?

    --
    Flávio Dias
    flaviodiasps@gmail.com
  • Harit Himanshu at Aug 8, 2012 at 3:50 pm
    quick question, what is sequence file?
    On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

    If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
    Regards
    Bejoy KS

    Sent from handheld, please excuse typos.
    From: Flavio Dias <flaviodiasps@gmail.com>
    Date: Wed, 8 Aug 2012 09:43:26 -0300
    To: <user@hadoop.apache.org>
    ReplyTo: user@hadoop.apache.org
    Subject: text file to sequence file

    My text file works with one line per record with numerical data. How do I convert this text file to sequence file?

    --
    Flávio Dias
    flaviodiasps@gmail.com
  • Mohammad Tariq at Aug 8, 2012 at 3:53 pm
    Hello Harit,

    SequenceFile is a flat file consisting of binary key/value pairs.
    Since, our data is already is in key/value format it is highly
    efficient to run MapReduce jobs on these files. You can get complete
    info here - http://wiki.apache.org/hadoop/SequenceFile/

    Regards,
    Mohammad Tariq


    On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
    wrote:
    quick question, what is sequence file?

    On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

    If you have large number of files and using MapReduce to do the conversion
    to Sequence Files, set the output format of the MR job as
    SequenceFileOutputFormat.
    Regards
    Bejoy KS

    Sent from handheld, please excuse typos.
    ________________________________
    From: Flavio Dias <flaviodiasps@gmail.com>
    Date: Wed, 8 Aug 2012 09:43:26 -0300
    To: <user@hadoop.apache.org>
    ReplyTo: user@hadoop.apache.org
    Subject: text file to sequence file

    My text file works with one line per record with numerical data. How do I
    convert this text file to sequence file?

    --
    Flávio Dias
    flaviodiasps@gmail.com
  • Harit Himanshu at Aug 8, 2012 at 3:55 pm
    cool, thanks Tariq
    On Aug 8, 2012, at 8:52 AM, Mohammad Tariq wrote:

    Hello Harit,

    SequenceFile is a flat file consisting of binary key/value pairs.
    Since, our data is already is in key/value format it is highly
    efficient to run MapReduce jobs on these files. You can get complete
    info here - http://wiki.apache.org/hadoop/SequenceFile/

    Regards,
    Mohammad Tariq


    On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
    wrote:
    quick question, what is sequence file?

    On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

    If you have large number of files and using MapReduce to do the conversion
    to Sequence Files, set the output format of the MR job as
    SequenceFileOutputFormat.
    Regards
    Bejoy KS

    Sent from handheld, please excuse typos.
    ________________________________
    From: Flavio Dias <flaviodiasps@gmail.com>
    Date: Wed, 8 Aug 2012 09:43:26 -0300
    To: <user@hadoop.apache.org>
    ReplyTo: user@hadoop.apache.org
    Subject: text file to sequence file

    My text file works with one line per record with numerical data. How do I
    convert this text file to sequence file?

    --
    Flávio Dias
    flaviodiasps@gmail.com

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupuser @
categorieshadoop
postedAug 8, '12 at 3:10p
activeAug 8, '12 at 3:55p
posts5
users4
websitehadoop.apache.org
irc#hadoop

People

Translate

site design / logo © 2021 Grokbase