Jun 15, 2010 at 12:01 am
When I need reports, I do:
printf "My\tcolumn\tnames\n" > report.tsv
hadoop fs -cat my/pig/output/* >> report.tsv
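The two commands above can be wrapped in a small helper. This is only a sketch: build_report and its arguments are hypothetical names, and it reads the data from stdin so it works with any command that prints the Pig output. It uses printf for the header because echo's handling of \t and \n differs between shells.

```shell
#!/bin/sh
# Sketch: write a tab-separated header row, then append data from stdin.
# build_report is a hypothetical helper name, not part of Pig or Hadoop.
#
# Usage: build_report OUTPUT_FILE HEADER_COL... < data
build_report() {
    out=$1
    shift
    # Join the header columns with tabs. printf repeats the format for
    # each argument, which leaves one trailing tab.
    header=$(printf '%s\t' "$@")
    # Strip the trailing tab and write the header, replacing any old report.
    printf '%s\n' "${header%?}" > "$out"
    # Append the data rows unchanged.
    cat >> "$out"
}
```

Assuming the hadoop client is on your PATH, it could be called as:
hadoop fs -cat 'my/pig/output/*' | build_report report.tsv My column names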
If you need something more elaborate, you could use something like
http://search.cpan.org/dist/Spreadsheet-WriteExcel/ or simply load your TSV
into a database with a script after your Pig job finishes, and use any of
the database reporting tools.
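As a rough illustration of the "load your TSV into a database with a script" step, here is a sketch that uses the sqlite3 command-line shell. SQLite is only a stand-in chosen so the example is self-contained; it is an assumption, not the database recommended in this thread. load_tsv and its arguments are hypothetical names. It relies on sqlite3 creating the table from the TSV header row when the table does not exist yet.

```shell
#!/bin/sh
# Sketch: import a TSV file (with a header row) into a SQLite table.
# load_tsv is a hypothetical helper; SQLite is an assumed stand-in database.
#
# Usage: load_tsv DB_FILE TSV_FILE TABLE_NAME
load_tsv() {
    db=$1
    tsv=$2
    table=$3
    # ".mode tabs" sets the field separator to a tab for the import.
    # If the table does not exist, ".import" creates it and takes the
    # column names from the first row of the file.
    sqlite3 "$db" <<EOF
.mode tabs
.import "$tsv" "$table"
EOF
}
```

After a call such as: load_tsv report.db report.tsv report
the rows can be queried with ordinary SQL, or the same idea can be adapted to whichever database your reporting tool reads from.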
MySQL (with the Infobright engine if you have bigger data output) and
something like Pentaho would work: http://www.pentaho.com/
Tableau is really nice, and can load smaller TSV files directly, but it is
Windows-only and a bit pricey. http://www.tableausoftware.com/
On Mon, Jun 14, 2010 at 4:24 PM, elein wrote:
Are there any reporting tools that can run on top of
Pig or use Pig? Or does everyone load TSV results into some type of Excel?
I will need to create reports with labels and sequential Pig queries,
plus any fancy display stuff, that I can send out by email.