I'm using hadoop-0.20.2 to try out some simple tasks. I asked a question
about FileInputFormat a few days ago and get some prompt replys from
this forum and it helped a lot. Thanks again! Now I have another
question. I'm trying to invoke a C++ process from my mapper for each
hdfs file in the input directory to achieve some parallel processing.
But how do I pass the file to the program? I would want to do something
like the following in my mapper:
Process lChldProc = Runtime.getRuntime().exec("myprocess -file
How do I pass the hdfs filesystem to an outside process like that? Is
HadoopStreaming the direction I should go?
Thanks very much for any reply in advance.