Grokbase Groups Pig user January 2009

Search Discussions

15 discussions - 66 posts

  • I need to add a column, to a data file, with unique integer value for each record. In simplest case it could be a record number in a dataset. For example: (A) (B) (C) should become (1,A) (2,B) (3,C) ...
    Vadim ZalivaVadim Zaliva
    Jan 24, 2009 at 12:05 am
    Jan 26, 2009 at 10:14 am
  • Hi Folks, I have a case where-in I need to do top-K on nested fields in my tuple. For e.g. Consider the following tuples (format is [url, query]) (, A) (, A) (, C) (, B) ...
    Goel, AnkurGoel, Ankur
    Jan 8, 2009 at 9:33 am
    Jan 13, 2009 at 4:43 pm
  • Hi all Entity Attribute Value model< (aka EAV model) is a simple relational model. in my use case I have a schema which looks like this: Table ...
    Yonatan mamanYonatan maman
    Jan 7, 2009 at 10:18 am
    Jan 16, 2009 at 10:47 pm
  • Hi, As many of you know, for more than nine month now we have been doing most of our development work on the types branch. The code on the types branch is almost a complete rewrite of the system with ...
    Olga NatkovichOlga Natkovich
    Jan 8, 2009 at 9:13 pm
    Jan 12, 2009 at 10:41 pm
  • Hi, I'm trying to filter on the number of columns in a relation as suggested in the FAQ, but I get the following error. This is in the types branch. Has the syntax changed or does this look like a ...
    Tom WhiteTom White
    Jan 5, 2009 at 9:53 pm
    Jan 5, 2009 at 11:34 pm
  • I often meet errors with chararray bag items. It seems that a bag item can be casted to some other type rather than specified chararray type. May be it's just before becoming a true chararray value. ...
    Jan 10, 2009 at 5:28 pm
    Jan 12, 2009 at 7:43 pm
  • Hi, This may have already been asked, but I couldn't find anything in old mails ... I did find an old bug report PIG-66 about this but it got closed with no pointer to what the outcome was. My ...
    Chris OlstonChris Olston
    Jan 9, 2009 at 6:02 pm
    Jan 9, 2009 at 6:42 pm
  • SUM can fail on a column declared as long. It seems to occur when actual values in the column are small enough for int. They are serialized and probably passed to SUM as ints. Then the LongSum() code ...
    Jan 6, 2009 at 4:40 pm
    Jan 9, 2009 at 12:52 am
  • All, I'm broadcasting this to all of the Hadoop dev and users lists, however, in the future I'll only send cross-subproject announcements to Please subscribe over there ...
    Owen O'MalleyOwen O'Malley
    Jan 27, 2009 at 8:28 pm
    Jan 29, 2009 at 2:25 am
  • The next Bay Area Hadoop User Group meeting is scheduled for Wednesday, January 21st at Yahoo! 2821 Mission College Blvd, Santa Clara, Building 1, Training Rooms 3 & 4 from 6:00-7:30 pm. Agenda: ...
    Ajay AnandAjay Anand
    Jan 8, 2009 at 8:10 pm
    Jan 21, 2009 at 10:44 pm
  • Hi All, I have a custom loader that returns a set of fields after reading a log line. One of the fields returned is of type DataType.Map. My question is how can I set the data types for this map's ...
    Jan 15, 2009 at 2:32 pm
    Jan 15, 2009 at 6:34 pm
  • Another bags(?) related issue. The code below generally join three data files into one. targetWords = load 'targetWords' as ( word: chararray, phrases: bag{t: tuple(id: chararray)}); historyWords = ...
    Jan 10, 2009 at 1:44 pm
    Jan 10, 2009 at 1:44 pm
  • Hi, This is to announce that as of now Pig requires Java 1.6 to build and run the system. This allows Pig to take advantage of the new features available in 1.6 and any performance improvements. This ...
    Olga NatkovichOlga Natkovich
    Jan 9, 2009 at 7:14 pm
    Jan 9, 2009 at 7:14 pm
  • Live in San Diego/Southern California and want to get together and talk Hadoop/HBase/Mahout/Zookeeper/Thrift/Cloud/Pig/HBase/etc? If so, please join us for the first San Diego Hadoop Users Group ...
    George PorterGeorge Porter
    Jan 8, 2009 at 8:31 pm
    Jan 8, 2009 at 8:31 pm
  • Using a small set of my data, I can process it, write it to BinStorage and load it again. When I use a larger set, I get an error " One schema is null [One schema is null]" when ...
    Sean TimmSean Timm
    Jan 5, 2009 at 5:32 pm
    Jan 5, 2009 at 5:32 pm
Group Navigation
period‹ prev | Jan 2009 | next ›
Group Overview
groupuser @
categoriespig, hadoop

26 users for January 2009

Ted Dunning: 10 posts Vadim Zaliva: 9 posts Olga Natkovich: 7 posts Daga: 5 posts Goel, Ankur: 5 posts Alan Gates: 4 posts Yonatan maman: 3 posts Ajay Anand: 2 posts Benjamin Reed: 2 posts Chris Olston: 2 posts Santhosh Srinivasan: 2 posts Baldo Faieta: 1 post Christophe Bisciglia: 1 post Craig Macdonald: 1 post Dmitriy Ryaboy: 1 post George Porter: 1 post Jeff Hammerbacher: 1 post Kevin Weil: 1 post Mridul Muralidharan: 1 post Owen O'Malley: 1 post
show more