FAQ
Hello Folks:
I wrote a map reduce program for analyzing text files. I would like to use a
large data set with text files to test the performance of the program. Are
there any available text data set which can be used to
test programs on Hadoop? If you know, please let me know.
Thanks.
Richard

Search Discussions

  • Tim robertson at Jun 28, 2008 at 7:34 am
    Perhaps something like a RandomTextWriter to generate a file for input?
    http://hadoop.apache.org/core/docs/r0.17.0/api/org/apache/hadoop/examples/RandomTextWriter.html

    Cheers

    Tim


    On Sat, Jun 28, 2008 at 4:42 AM, Richard Zhang wrote:

    Hello Folks:
    I wrote a map reduce program for analyzing text files. I would like to use
    a
    large data set with text files to test the performance of the program. Are
    there any available text data set which can be used to
    test programs on Hadoop? If you know, please let me know.
    Thanks.
    Richard

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-user @
categorieshadoop
postedJun 28, '08 at 2:43a
activeJun 28, '08 at 7:34a
posts2
users2
websitehadoop.apache.org...
irc#hadoop

2 users in discussion

Richard Zhang: 1 post Tim robertson: 1 post

People

Translate

site design / logo © 2023 Grokbase