In PigMapReduce.run(RecordReader input, OutputCollector output, Reporter
reporter), as I can see, Pig does create its own OutputCollector and write
output to its own files (using PigWriter).
How does the shuffle process work if the files aren't created from the
outputCollector supplied in run(RecordReader input, OutputCollector output,
Reporter reporter)? Do we just put the output files to the location where
shuffle expects?
Thanks for explanation in advance,
Pi