Grokbase Groups Pig user October 2009

Search Discussions

24 discussions - 105 posts

  • Hi All, So I'm running into an issue in trying to use a UDF I wrote to do GeoIP location on IP addresses in tuples. I thought I could simply pack the source/class files along with the resource file ...
    Zaki rahamanZaki rahaman
    Oct 1, 2009 at 3:20 pm
    Oct 1, 2009 at 9:54 pm
  • Sorry if this double sends, just subscribed from this account. I am running the following query, and am having trouble getting what I want. I have two data sources, and I am JOINING them, then ...
    Russell JurneyRussell Jurney
    Oct 13, 2009 at 11:35 pm
    Oct 14, 2009 at 8:16 pm
  • Hi there. As some of you may have read on this mailing list previously, I'm studying various interfaces with Hadoop, one of those being Pig. I have three further questions. I am now beginning to ...
    Rob StewartRob Stewart
    Oct 30, 2009 at 12:06 pm
    Nov 9, 2009 at 9:11 pm
  • Hello Pig user group ! OK, here's two things about me: 1. I'm new to Pig and Hadoop 2. I'm studying for a Masters in Software Engineering in the UK. 3. I'm looking to do a comparitive study on ...
    Rob StewartRob Stewart
    Oct 7, 2009 at 1:19 pm
    Oct 8, 2009 at 6:49 am
  • Hello, I'm new to PIG, and I have a bunch of statements that process the same input, which is actually the result of a JOIN between two very big data set (millions of entries). I wonder if it is ...
    Vincent BARATVincent BARAT
    Oct 7, 2009 at 2:55 pm
    Oct 12, 2009 at 6:51 pm
  • I woudl like to create 2 separate tables in a single pass ( i.e. in a single foreach statement). For example : r0 = load 'foo'; tab1 = foreach r0 generate f1 tab2 = foreach r0 generate f2,f3,f6 The ...
    Prasenjit mukherjeePrasenjit mukherjee
    Oct 12, 2009 at 11:38 am
    Oct 13, 2009 at 2:45 am
  • Hello, I'm using PIG from Java and I store my results using the regular call:, outputFilePath); Now, I need to read the file produced (in order to store it to a MySQL table). ...
    Vincent BaratVincent Barat
    Oct 28, 2009 at 4:21 pm
    Oct 29, 2009 at 11:33 am
  • hi guys! i have a list of numbers that i was to rescale to 0.0 - 1.0 eg for (6,4,8) i want to convert to (0.5, 0.0, 1.0) i can find the min/max... grunt numbers = load 'numbers' as (n:int); grunt ...
    Mat KelceyMat Kelcey
    Oct 17, 2009 at 10:41 am
    Oct 19, 2009 at 9:27 am
  • Hi, I have a couple questions about using ILLUSTRATE with a custom load function 1. This seems to require I manually make sure my code is in the local classpath (doing REGISTER in the pig script only ...
    Sam RashSam Rash
    Oct 12, 2009 at 6:55 pm
    Oct 14, 2009 at 4:24 pm
  • Hi, I'm facing a strange assertion (using the current PIG trunk) when trying to perform a simple filter using comparison operator: pigServer.registerQuery("sessions = FILTER sessions BY end - start < ...
    Vincent BaratVincent Barat
    Oct 30, 2009 at 9:40 am
    Oct 30, 2009 at 2:22 pm
  • Hello, Sorry for the naive question, but is there an easy way (operator or similar) to know if an element belongs to a bag? I couldn't find it in the documentation. Best,jfcg ¿Estás fuera de ...
    Juan Francisco Contreras GaitanJuan Francisco Contreras Gaitan
    Oct 16, 2009 at 12:18 am
    Oct 28, 2009 at 10:18 am
  • Hello, Quick question: is there a set of ready to use PIG UDFs functions ? I'm looking to TOLOWERCASE function... Cheers,
    Vincent BaratVincent Barat
    Oct 21, 2009 at 3:33 pm
    Oct 21, 2009 at 6:27 pm
  • Hi, I have some data that I'm trying to join, but since the join isn't a straight match it requires pushing the key into a UDF to find a match. I'm just wondering what the best way to do this is. I ...
    Miles ScruggsMiles Scruggs
    Oct 6, 2009 at 11:49 pm
    Oct 7, 2009 at 11:07 pm
  • Hello, I'm not sure if it's a bug, but the handling of NULL fields seems not to work correctly: My data (events): 0,,jawi ,0,juug ,,lfou 0,0,caro My script: events = load 'events' using ...
    Vincent BARATVincent BARAT
    Oct 15, 2009 at 12:51 pm
    Oct 15, 2009 at 1:41 pm
  • new to pig, I want to do an out join using pig, but cannot the result I want. did I do something wrong? --1.txt a 1 b 2 c 3 ---2.txt a aa c cc A = LOAD '1.txt' USING PigStorage('\t') as (a1,a2); B = ...
    Yonggang QiaoYonggang Qiao
    Oct 14, 2009 at 6:48 pm
    Oct 14, 2009 at 7:29 pm
  • Hie, I was implementing LoadFunc Interface , in that getNext() returns a tuple ..... I have a bag of tuples which I have to return from getNext() so I made a tuple of bag , when I print it is fine it ...
    Miryala vigneshMiryala vignesh
    Oct 2, 2009 at 3:38 am
    Oct 2, 2009 at 4:13 pm
  • Greetings, (You're receiving this e-mail because you're on a DL or I think you'd be interested) It's time for another Hadoop/Lucene/Apache "Cloud" stack meetup! This month it'll be on Wednesday, the ...
    Bradford StephensBradford Stephens
    Oct 19, 2009 at 12:11 am
    Oct 27, 2009 at 11:06 pm
  • Hello to all of you, I have some PIG code I run from Java that store a file on Hadoop:"session_count_and_length", "session_count_and_length"); An then just after I try to ...
    Vincent BaratVincent Barat
    Oct 19, 2009 at 3:03 pm
    Oct 19, 2009 at 3:11 pm
  • Hi All, I have a query regarding the execution of the Map tasks in Pig/Hadoop. Suppose I have a query with 2 JOINS JOIN 1 - between sets A and B JOIN 2 - between sets C and D (We took both as ...
    Padmashree RavindraPadmashree Ravindra
    Oct 15, 2009 at 7:01 pm
    Oct 15, 2009 at 9:13 pm
  • I'm setting up a pig job that needs to stream a grouped set of data to an instance of a perl script. I need to ensure that a full group is run through a single instance of the perl script (don't want ...
    Paul BPaul B
    Oct 13, 2009 at 11:05 pm
    Oct 13, 2009 at 11:24 pm
  • Hi, The unit tests are failing in the trunk. Could the contributor check what is happening and cleanup code/tests as needed. Thanks, Olga
    Olga NatkovichOlga Natkovich
    Oct 9, 2009 at 6:09 pm
    Oct 12, 2009 at 3:26 am
  • Greetings everyone, If you happen to be in Pittsburgh on Nov 3, please consider dropping by the Carnegie Mellon campus for the Hadoop Users Group meeting. Ashutosh Chauhan and I will talk about Pig, ...
    Dmitriy RyaboyDmitriy Ryaboy
    Oct 28, 2009 at 3:15 pm
    Oct 28, 2009 at 3:15 pm
  • Forwarding this to pig-user, as many pig users may want to give feedback on this issue. Alan. Begin forwarded message:
    Alan GatesAlan Gates
    Oct 26, 2009 at 10:22 pm
    Oct 26, 2009 at 10:22 pm
  • Pig Team is happy to announce Pig 0.4.0 release! Pig is a Hadoop subproject that provides high-level data-flow language and an execution framework for parallel computation on a Hadoop cluster. More ...
    Olga NatkovichOlga Natkovich
    Oct 8, 2009 at 8:16 pm
    Oct 8, 2009 at 8:16 pm
Group Navigation
period‹ prev | Oct 2009 | next ›
Group Overview
groupuser @
categoriespig, hadoop

31 users for October 2009

Dmitriy Ryaboy: 15 posts Vincent BARAT: 14 posts Alan Gates: 9 posts Zaki rahaman: 8 posts Rob Stewart: 6 posts Russell Jurney: 6 posts Ashutosh Chauhan: 5 posts Kevin Weil: 3 posts Nikhil Gupta: 3 posts Prasenjit mukherjee: 3 posts Sam Rash: 3 posts Santhosh Srinivasan: 3 posts Bradford Stephens: 2 posts Jeff Hammerbacher: 2 posts Mat Kelcey: 2 posts Miryala vignesh: 2 posts Olga Natkovich: 2 posts Tamir Kamara: 2 posts Thejas Nair: 2 posts Yonggang Qiao: 2 posts
show more