loading/reading json for Pig processing sounds like a common useful
however, I have not found any implementation for such.
(and yes, I know of Elephant Bird, which reads LZO-compressed json (but not
but I did see a reference in the "Hadoop Training: Introduction to Pig" (
within the downloadable IntroToPig.pdf, where there is a mention of
however, there is no such UDF within the piggybank source of
the cloudera distributed vm, or within any other piggybank jar out there
that I have seen.
so I wonder, where can I find a pig json reader/loader that can accomplish
the equivalent of: A = LOAD ‘data.json’ USING PigJsonLoader();
any pointeres would be greatly appreciated ...