FAQ
TextInputFormat should not create input splits for 0 byte files
---------------------------------------------------------------

Key: HADOOP-2952
URL: https://issues.apache.org/jira/browse/HADOOP-2952
Project: Hadoop Core
Issue Type: Improvement
Components: mapred
Reporter: Owen O'Malley


As part of HADOOP-2027, I discovered that we create input splits for 0 byte files. (In theory this is for both sequence file and text files, but in practice sequence files can't be 0 bytes.) I think 0 byte files can and should be dropped, since they have no input to process.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-dev @
categorieshadoop
postedMar 6, '08 at 5:32p
activeMar 6, '08 at 5:32p
posts1
users1
websitehadoop.apache.org...
irc#hadoop

1 user in discussion

Owen O'Malley (JIRA): 1 post

People

Translate

site design / logo © 2022 Grokbase