|| at Mar 2, 2011 at 9:05 pm
Thanks Mike :)
I was also wondering what if:
hdfs.CopyToLocal( src-file, dst-file) ; // is executed on node N
and there exists a copy of src-file from the replication process in that same node(N) local file system ?
Will hdfs recognize that there is already a copy in there and hence just move that copy to dst-file path ?
Will hdfs go ahead with the copy and hence node N will have two copies of the src-file? (ie. one on HDFS namespace and another in the local file system)
On Mar 2, 2011, at 12:38 PM, Michael Segel wrote:
Run is local to your edge machine where you launched your job.
It then connects to the cluster / job tracker ...
Subject: ToolRunner run function
Date: Wed, 2 Mar 2011 12:10:05 -0800
Assuming my program implements the ToolRunner, my question is where does the "run" function execute? ie. which daemon (DataNode/TT) ? or is it on the local machine where it is run?