Grokbase Groups Pig user August 2009

Search Discussions

31 discussions - 108 posts

  • Hello, I upgraded to Hadoop 0.20.0 (it's stable and working) and Pig no longer connects to the HDFS. I tried rebuilding and applying patch PIG660. I have a script that I run that exports these ...
    Turner KunkelTurner Kunkel
    Aug 18, 2009 at 5:37 pm
    Sep 5, 2009 at 12:54 am
  • Hi guys, I have recently started using pig and I have a doubt regarding the LOAD statement. Does the LOAD statement load data from the local file system or from HDFS? I am asking this question since ...
    Nipun SaggarNipun Saggar
    Aug 10, 2009 at 7:05 pm
    Aug 17, 2009 at 6:44 pm
  • Hello there, Is it possible to do something like this by using one join? Thanks! select * from tbl_a, tbl_b, tbl_c where tbl_a.b = tbl_b.b and tbl_a.c= tbl_c.c
    Yonggang QiaoYonggang Qiao
    Aug 7, 2009 at 6:49 pm
    Aug 7, 2009 at 9:00 pm
  • Hi all, I am working no building a analytics kind of engine which takes daily server logs, crunches the data using Pig scripts and (for now) outputs data to HDFS. Later, this data is to be stored on ...
    Nikhil GuptaNikhil Gupta
    Aug 19, 2009 at 7:49 pm
    Sep 21, 2009 at 8:13 pm
  • Hello Everyone, I am trying to write Pig scripts for my project. Problem I ma facing is I want to load different files to same variable .Can it be possible to do without modifying the Loader. I read ...
    Pankil DoshiPankil Doshi
    Aug 26, 2009 at 5:22 pm
    Sep 3, 2009 at 2:03 pm
  • Hi, I haven't seen this before but nightly jobs failed over the weekend because due to memory issues. The weird part is the jobs failed during the map phase (at about ~98% complete). The task tracker ...
    Shrikrishna ShrinShrikrishna Shrin
    Aug 10, 2009 at 8:00 pm
    Aug 11, 2009 at 4:04 pm
  • Hi Everybody, I have this error PIG-766 ( I wonder if somebody fix this issue or is there some recommendations related to this issue. Xavier
    Xavier QuintunaXavier Quintuna
    Aug 5, 2009 at 6:45 pm
    Aug 5, 2009 at 10:27 pm
  • Hi all, I'd like to set property in Configuration to customize my UDF. But it looks like I can not access the Configuration object in UDF. Does pig have a plan to support this feature ? Thank you. ...
    Zhang jianfengZhang jianfeng
    Aug 3, 2009 at 7:40 am
    Aug 4, 2009 at 3:32 pm
  • Hi, What is the best way to get the COUNT(*) for a empty relation? s6 = FILTER r1 BY f1 == 'x1'; -- dump s6; b6 = GROUP s6 ALL; c6 = FOREACH b6 GENERATE 'c6', FLATTEN((IsEmpty(s6) ? 0 : ...
    Irfan MohammedIrfan Mohammed
    Aug 19, 2009 at 3:12 pm
    Sep 10, 2009 at 1:52 pm
  • Time : Sunday, November 15, 2009 City: Beijing, China Sponsored by Yahoo!, Cloudera Organized by Website: ...
    He YongqiangHe Yongqiang
    Aug 21, 2009 at 4:22 pm
    Sep 3, 2009 at 2:03 pm
  • Hi all, I had a question about running Pig jobs on Amazon's cloud services. Specifically, how do you go about adding UDF jar files and what, if any, modifications to make to a script to make sure it ...
    Zaki rahamanZaki rahaman
    Aug 28, 2009 at 6:39 pm
    Sep 3, 2009 at 1:15 am
  • I'm still very new to Pig and still trying to get a good grasp of Pig Latin. I had two main questions (I would have split this into two threads, but in the interest of not spamming people's inboxes, ...
    Zaki rahamanZaki rahaman
    Aug 26, 2009 at 3:12 pm
    Aug 26, 2009 at 6:49 pm
  • Hi all, In one of my pig scripts, I am using GROUP on a few fields. I observed that after there were duplicates entries of the fields on which I have grouped in the output of the GROUP statement. For ...
    Uppuluri, RohiniUppuluri, Rohini
    Aug 5, 2009 at 3:39 am
    Aug 17, 2009 at 3:40 pm
  • I saw a thread on the list-serv about doing distinct count in a nested foreach. I'm not sure I followed exactly what was meant, but below is my script. Any suggestions on optimizations (it's my first ...
    Zaki rahamanZaki rahaman
    Aug 27, 2009 at 9:28 pm
    Aug 28, 2009 at 2:36 am
  • Is there a best practice on generating run-time log entries based on data values (perhaps in the form of adding a row to another relation in the script)? answer = FOREACH foo GENERATE id, bar; -- If ...
    Greg HarmanGreg Harman
    Aug 26, 2009 at 5:00 am
    Aug 26, 2009 at 1:54 pm
  • Hi, Thanks for the excellent tool. I want to traverse a tuple of tuples in pig. 1<tab ((t1,1),(t2,2),(t3,3)) 2<tab ((t3,3),(t4,4)) I defined the load schema as follows r1 = LOAD ...
    Irfan MohammedIrfan Mohammed
    Aug 16, 2009 at 2:42 pm
    Aug 16, 2009 at 3:43 pm
  • Hi, This is a follow up question to the thread "Tuple ordering after a group-by". Would this - {suppose A has the schema [date, id, some_value]} B = GROUP A BY id; C = FOREACH B { A1 = ORDER A BY ...
    Aug 13, 2009 at 5:55 pm
    Aug 13, 2009 at 7:30 pm
  • Hi I am using hadoop version 0.18 and pig 2.0 and nutch-1.0.but they dont have common hadoop version so it is not working; what is the hadoop version that is used both in pig and nutch can any one ...
    Venkata ramanaiah anneboinaVenkata ramanaiah anneboina
    Aug 11, 2009 at 3:55 pm
    Aug 11, 2009 at 4:43 pm
  • Context: I am trying to group data like so: grunt cat test.dat 1 2 3 1 2 4 1 2 5 2 3 0 2 3 8 A = load 'test.dat' as (f1:int, f2:int, f3:int); B = group A by (f1, f2); C = foreach B generate group, ...
    Leo AlekseyevLeo Alekseyev
    Aug 31, 2009 at 8:19 am
    Aug 31, 2009 at 8:36 am
  • Hey there, Apologies for this not going out sooner -- apparently it was sitting as a draft in my inbox. A few of you have pinged me, so thanks for your vigilance. It's time for another ...
    Bradford StephensBradford Stephens
    Aug 25, 2009 at 11:21 pm
    Aug 27, 2009 at 1:17 am
  • Would someone mind clarifying something that confuses me in the Pig Latin manual? Each alias within a cogroup can be assigned a keyword of INNER or OUTER. If I have two aliases being cogrouped, and ...
    Greg HarmanGreg Harman
    Aug 25, 2009 at 7:10 pm
    Aug 25, 2009 at 9:13 pm
  • Hi all, Since SQL and Hive both have Date type, I'd like to know whether Pig also have plan to add this type ? Thank you. Jeff Zhang
    Zhang jianfengZhang jianfeng
    Aug 5, 2009 at 5:41 am
    Aug 5, 2009 at 6:18 pm
  • We have some data produced by other hadoop jobs and stored with custom readers/writers. this seems to be a pretty standard process, yet I'm a little unclear on how to access this data with pig. I was ...
    Lance RiedelLance Riedel
    Aug 3, 2009 at 8:44 pm
    Aug 3, 2009 at 9:10 pm
  • Since it's not really spelled out in the UDF manuals*, I wanted to make sure I properly understand what is the input into the Initial, Intermediate, and Final steps, in a case like: X = FOREACH Y ...
    Richard RussoRichard Russo
    Aug 1, 2009 at 1:31 am
    Aug 3, 2009 at 4:25 pm
  • This is exactly what null handling works. It is introduced since Pig 0.2. ----- Original Message ----- From: "charles du" < To: < Sent: Friday, August ...
    Daniel DaiDaniel Dai
    Aug 25, 2009 at 10:09 am
    Aug 25, 2009 at 10:09 am
  • I would like to announce the September-2009 Hadoop Get Together in newthinking store Berlin. When: 29. September 2009 at 5:00pm Where: newthinking store, Tucholskystr. 48, Berlin, Germany As always ...
    Isabel DrostIsabel Drost
    Aug 24, 2009 at 11:18 pm
    Aug 24, 2009 at 11:18 pm
  • As mentioned in the was following the steps but unfortunately when i do svn checkout ...
    Miryala vigneshMiryala vignesh
    Aug 14, 2009 at 7:51 am
    Aug 14, 2009 at 7:51 am
  • Hadoop Fans, please pardon the short notice, but we wanted to let you know that we are offering a 3 day training program at the end of the month in San Francisco. There is a $300 discount for those ...
    Christophe BiscigliaChristophe Bisciglia
    Aug 14, 2009 at 2:03 am
    Aug 14, 2009 at 2:03 am
  • Hi I am using pig 2.0 and nutch 1.0; but it dont have common hadoop verion. what is common hadoop verion for both pig and hadoop; GIVE the pig version, nutch version and hadoo please can any one help ...
    Venkata ramanaiah anneboinaVenkata ramanaiah anneboina
    Aug 12, 2009 at 6:25 am
    Aug 12, 2009 at 6:25 am
  • Hey all, I just wanted to send a link to a presentation I made on how my company is building its entire core BI infrastructure around Hadoop, HBase, Lucene, and more. It features a decent amount of ...
    Bradford StephensBradford Stephens
    Aug 5, 2009 at 3:50 am
    Aug 5, 2009 at 3:50 am
  • Hadoop Fans, it looks like most of you prefer to have this on Thursday (November 5th), so that's what we'll plan for. Anyone is welcome to come to this meetup, even if you don't attend ApacheCon. ...
    Christophe BiscigliaChristophe Bisciglia
    Aug 5, 2009 at 2:31 am
    Aug 5, 2009 at 2:31 am
Group Navigation
period‹ prev | Aug 2009 | next ›
Group Overview
groupuser @
categoriespig, hadoop

33 users for August 2009

Dmitriy Ryaboy: 17 posts Turner Kunkel: 10 posts Nipun Saggar: 7 posts Zjffdu: 7 posts Nikhil Gupta: 6 posts Mridul Muralidharan: 5 posts Irfan Mohammed: 4 posts Zaki rahaman: 4 posts Xavier Quintuna: 3 posts Alan Gates: 3 posts Bradford Stephens: 3 posts Chris Olston: 3 posts Greg Harman: 3 posts Olga Natkovich: 3 posts Yonggang Qiao: 3 posts Christophe Bisciglia: 2 posts Daniel Dai: 2 posts He Yongqiang: 2 posts Pankil Doshi: 2 posts Prasenjit mukherjee: 2 posts
show more