I need to sort the DataBags that are passed to my UDF after a COGROUP.
I am currently sorting them in memory, but that is not going to scale
in the long term.
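
For context, here is a minimal sketch of the kind of in-memory sort described above. It is an illustration only, not the actual UDF: the class name, the Row type, and its fields are hypothetical stand-ins for Pig's DataBag and Tuple so that the example runs without Pig on the classpath. The point it shows is that every row of the bag has to be copied into a list before sorting, which is why the approach stops scaling once a bag no longer fits in memory.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: plain Java stand-ins for a Pig bag and its tuples.
class InMemoryBagSort {

    // Stand-in for one tuple in the bag (assumed fields: a key and a sort field).
    static final class Row {
        final String key;
        final long ts;

        Row(String key, long ts) {
            this.key = key;
            this.ts = ts;
        }
    }

    // Copies the whole bag into memory, then sorts it by the ts field.
    // Memory use grows linearly with the size of the bag.
    static List<Row> sortInMemory(Iterable<Row> bag) {
        List<Row> rows = new ArrayList<>();
        for (Row r : bag) {
            rows.add(r);
        }
        rows.sort(Comparator.comparingLong((Row r) -> r.ts));
        return rows;
    }

    public static void main(String[] args) {
        List<Row> bag = Arrays.asList(
                new Row("b", 3L), new Row("a", 1L), new Row("c", 2L));
        for (Row r : sortInMemory(bag)) {
            System.out.println(r.key + "\t" + r.ts);
        }
    }
}
```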

Is there a way to control how Pig sorts the bags before passing them
in (e.g. as you can with a WritableComparable in raw map/reduce), so
that I don't have to re-spill them to disk?

Thanks for any info,

Discussion Overview
group: user @
categories: pig, hadoop
posted: Aug 17, '10 at 7:00p
active: Aug 17, '10 at 11:19p


