Doug Cutting commented on HADOOP-2187:
--------------------------------------
+1 This sounds like a good change.
I might instead call the class BlockLocations, and the methods getBlockLocations.
When we deprecate the existing method ideally we can upgrade all existing implementations with a single back-compatibility implementation on the base class.
Also, should we refer to hosts by hostname or IP here?
FileSystem should return location information with byte ranges
--------------------------------------------------------------
Key: HADOOP-2187
URL: https://issues.apache.org/jira/browse/HADOOP-2187
Project: Hadoop
Issue Type: Improvement
Components: fs
Reporter: Owen O'Malley
Fix For: 0.16.0
The FileSystem interface should provide location information with byte ranges rather than a String[][] of locations. I suggest that we deprecate FileSystem.getFileCacheHints and replace it with:
{code}
abstract public class FileSystem {
...
public static class BlockInformation implements Writable {
public BlockInformation(long start, String[] locations) {...}
public String[] getHosts() {...}
public long getStartingOffset() {...}
}
BlockInformation[] getFileLocations(Path f, long start, long length) { ... }
}
{code}
This will allow us to fix the FileInputFormat in map/reduce to make just one call per a file to the name node instead of one per a block.
----------------------------------------------------------------
Key: HADOOP-2187
URL: https://issues.apache.org/jira/browse/HADOOP-2187
Project: Hadoop
Issue Type: Improvement
Components: fs
Reporter: Owen O'Malley
Fix For: 0.16.0
The FileSystem interface should provide location information with byte ranges rather than a String[][] of locations. I suggest that we deprecate FileSystem.getFileCacheHints and replace it with:
{code}
abstract public class FileSystem {
...
public static class BlockInformation implements Writable {
public BlockInformation(long start, String[] locations) {...}
public String[] getHosts() {...}
public long getStartingOffset() {...}
}
BlockInformation[] getFileLocations(Path f, long start, long length) { ... }
}
{code}
This will allow us to fix the FileInputFormat in map/reduce to make just one call per a file to the name node instead of one per a block.
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.