fs -getmerge isn't guaranteed to work well over non-HDFS filesystems

Key: HADOOP-7659
URL: https://issues.apache.org/jira/browse/HADOOP-7659
Project: Hadoop Common
Issue Type: Bug
Components: fs
Affects Versions:
Reporter: Harsh J
Priority: Minor
Fix For: 0.24.0

When you use {{fs -getmerge}} with HDFS, you are guaranteed file list sorting (part-00000, part-00001, onwards). When you use the same with other FSes we bundle, the ordering of listing is not guaranteed at all. This is cause of http://download.oracle.com/javase/6/docs/api/java/io/File.html#list() which we use internally for native file listing.

This should either be documented as a known issue on -getmerge help pages/mans, or a consistent ordering (similar to HDFS) must be applied atop the listing. I suspect the latter only makes it worthy for what we include - while other FSes out there still have to deal with this issue. Perhaps we need a recommendation doc note added to our API?

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-dev @
postedSep 20, '11 at 7:06a
activeSep 20, '11 at 7:06a

1 user in discussion

Harsh J (JIRA): 1 post



site design / logo © 2022 Grokbase