Grokbase Groups Pig user October 2010
The easiest way to do this might be to have your loader return a
single tuple that contains bag, with all of the tuples you want to
return in that bag. Then your next statement can be a foreach with a
flatten to turn each of those into its own record.

A = load 'foo' as (b:bag{});
B = foreach A generate flatten(b);

This way you do not have to modify Pig's internal code to make your
use case work.

On Oct 27, 2010, at 3:00 PM, John Hui wrote:

Hi Pig Users,

I am currently writing a UDF loader. In one of my use case, one
line in the
input stream results in multiple tuples. Has anyone encounter or
solve this
issue on their end.

The current structure of the code getNext method only return tuple
but I
want it to return a List<tuple>. Let me know if there's use case
out there
like mine, I am coding it up to return List<tuple> which is more more
flexible than return only one tuple.



Search Discussions

Discussion Posts


Follow ups

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 2 of 7 | next ›
Discussion Overview
groupuser @
categoriespig, hadoop
postedOct 27, '10 at 10:39p
activeOct 28, '10 at 3:52p



site design / logo © 2021 Grokbase