34 discussions - 169 posts

  • Hi all, today I tried Pig 0.5, but it cannot connect to HDFS 0.18.3. From the release announcement, it seems Pig 0.5 already supports Hadoop 0.20. But does that mean it no longer supports 0.18.3? ...
    Jeff Zhang
    Nov 2, 2009 at 5:58 am
    Nov 20, 2009 at 10:44 am
  • Hi, I am back with a question again :). This time I can explain better because I have explored a little more than I did last time :). I have three fields in my table. And they are name, id, ...
    Dhaval deshpande
    Nov 25, 2009 at 9:54 pm
    Nov 26, 2009 at 5:51 am
  • Hi all, I often run one script on different data sets, sometimes small and sometimes large, and different data set sizes require different numbers of reducers. I know that the ...
    Jeff Zhang
    Nov 12, 2009 at 8:12 am
    Nov 28, 2009 at 4:43 am
  • Hi, I wanted to create a custom group function in Pig. I was not sure where to start. I checked some documentation online but couldn't figure it out. I also checked the wiki and it says I need to ...
    Dhaval deshpande
    Nov 24, 2009 at 8:56 am
    Nov 25, 2009 at 8:01 pm
  • Hi all, New to pig. The simplest "load" and then "dump" does not work under Mapreduce mode :-( Here is the error information I get. I am wondering what "Queue 'default' does not exist" means. ...
    Haiyi Zhu
    Nov 23, 2009 at 3:01 pm
    Dec 2, 2009 at 4:46 pm
  • Is there a way to pass a string as an argument to the STORE function? For example: STORE A in 'somefile' USING PigStorage("<STRING>") Thanks, Satish
    V Satish Kumar
    Nov 27, 2009 at 5:02 am
    Nov 30, 2009 at 5:28 pm
  • Hi All, I have the following mini-script running as part of a larger set of scripts/workflow... however it seems like pig is dropping records as when I tried running the same thing as a simple grep | ...
    Zaki rahaman
    Nov 19, 2009 at 7:03 pm
    Nov 21, 2009 at 6:17 pm
  • Hi, I'm struggling to get the tokens out of a bag of tuples created by the TOKENIZE UDF and could use some help. I want to tokenize and then be able to reference the tokens by their position. Is this ...
    Bill Graham
    Nov 18, 2009 at 8:03 pm
    Nov 18, 2009 at 9:30 pm
  • All, I would like to welcome Jeff Zhang as our newest Pig committer. Jeff has been contributing to Pig for about nine months now. He's been active on the mailing lists, in contributing patches, and ...
    Alan Gates
    Nov 19, 2009 at 10:49 pm
    Nov 30, 2009 at 3:33 pm
  • Hi, I seem to be having an odd problem with pig. At least I haven't found any documentation on it. I've been using hadoop 20.1 to do some parsing of my data, and I thought pig might be a good tool to ...
    James R. Leek
    Nov 24, 2009 at 7:48 am
    Nov 24, 2009 at 9:38 pm
  • Hi all, I'm new to Hadoop. Found Pig very quick and easy to learn, made my own simple scripts and my first own UDF. It's a loader UDF based on the piggybank samples found in the 0.5.0 folders, it ...
    Matteo Nasi
    Nov 17, 2009 at 3:48 pm
    Nov 17, 2009 at 5:11 pm
  • Hi all, I have the following data file (1L,2L,3L) (4L,2L,1L) (8L,3L,4L) I am trying to write a UDF (like sum) that would add the fields in a Tuple. This works -- public class SumAll extends ...
    Kelvin Moss
    Nov 5, 2009 at 10:24 am
    Nov 8, 2009 at 2:26 pm
  • Hi there, I'm missing something obvious. Looking at the Pig DataGenerator Wiki page, it refers to using the DataGenerator class, found at ClassPath: org.apache.pig.test.utils.datagen.DataGenerator I ...
    Rob Stewart
    Nov 2, 2009 at 4:48 pm
    Nov 2, 2009 at 5:32 pm
  • Hi all, I try to load data from HBase into pig with HBaseStorage. Something is going wrong because no data from HBase (test table) shows up in Pig; only errors. I configured the Hadoop and HBase in ...
    Morris Swertz
    Nov 19, 2009 at 4:20 pm
    Nov 20, 2009 at 6:14 pm
  • I would like the output from my pig task to be stored in a mysql database. Is there a way of storing this pig output directly into a mysql database? Thanks, Satish
    V Satish Kumar
    Nov 17, 2009 at 6:18 am
    Nov 17, 2009 at 7:08 am
  • We presented on Pig tonight at the Pittsburgh HUG. Here are the slides: http://squarecog.wordpress.com/2009/11/03/apache-pig-apittsburgh-hadoop-user-group/ The presentation takes a brief romp through ...
    Dmitriy Ryaboy
    Nov 4, 2009 at 3:36 am
    Nov 4, 2009 at 5:44 pm
  • Hi all. I'm trying to use a map with a tuple as the value. From the documentation it looks like it would be possible. But I just can't get it to work. Look at this small example. When it tries to ...
    Kimmo Björnsson
    Nov 19, 2009 at 2:29 pm
    Nov 21, 2009 at 1:58 am
  • I am using Pig and this is what I am trying to do: 1) Sort a relation A into B by a field x. The smallest value of x is first. Just use SORT. 2) Label each tuple in B with a number denoting its ...
    Desai Dharmendra
    Nov 20, 2009 at 8:03 pm
    Nov 20, 2009 at 9:15 pm
  • In the param file, I'd like to be able to do something like UDF_PATH='$LIB_DIR'; I know I can set it on the command line, but I'd prefer to have it read in via the file if possible. Is there a way to ...
    Sean Timm
    Nov 13, 2009 at 8:24 pm
    Nov 13, 2009 at 9:32 pm
  • I downloaded Hadoop 0.20.1 and Pig 0.5.0. I can't find the configuration directory for Pig. The download has no $PIG_HOME/conf directory and I cannot find any pig-env.sh or properties file of any ...
    John Hayward
    Nov 13, 2009 at 5:01 pm
    Nov 13, 2009 at 6:31 pm
  • Hi all, I'd like to know where the name Zebra comes from. Does it convey the idea that the columnar storage format of this metadata system is like the stripes on a zebra's skin? Thank you Jeff ...
    Jeff Zhang
    Nov 26, 2009 at 3:40 pm
    Nov 30, 2009 at 5:26 pm
  • All, Yahoo has a number of Hadoop development positions open. There are engineering, architect, management, and QA positions all open. See ...
    Alan Gates
    Nov 21, 2009 at 1:03 am
    Nov 23, 2009 at 5:25 pm
  • We are planning to hold the first Hadoop India user group meetup on 28th November 2009 in Noida. We would be talking about our experiences with Apache Hadoop/HBase/Hive/Pig/Nutch/etc. The agenda would ...
    Sanjay Sharma
    Nov 10, 2009 at 6:20 am
    Nov 22, 2009 at 4:42 pm
  • Hello all, I'm encountering an error when I call illustrate - it blows up with a null pointer and the following exception: java.lang.NullPointerException at ...
    James Kebinger
    Nov 15, 2009 at 7:38 pm
    Nov 16, 2009 at 6:43 pm
  • Hi, I am writing a storage converter (implementing ReversibleLoadStoreFunc) for pig 0.4. The schema for this data is stored in an external file and I need this schema to correctly serialize and ...
    Shane Evans
    Nov 16, 2009 at 4:49 pm
    Nov 16, 2009 at 5:05 pm
  • Hi, I have a file whose contents look like {(1L),(2L),(3L)} {(4L),(2L),(1L)} {(8L),(3L),(4L)} In short I want a bag with 3 tuples. I do the following to accomplish it grunt> A = LOAD 'data1' as ...
    Kelvin Moss
    Nov 4, 2009 at 2:17 pm
    Nov 4, 2009 at 3:29 pm
  • I have a tuple X = "contents of html file" like X=(file:chararray) X = (<html><body><h2>hie</h2><h2>hie</h2><h2>hie</h2><h2>hie</h2>djfkdj<p>jhsdaj</p><h2>hie</h2></body></html>) in Y I have indices ...
    Miryala vignesh
    Nov 1, 2009 at 1:28 pm
    Nov 3, 2009 at 7:51 pm
  • Hi all, New to pig. The simplest sentences of "load" and then "dump" does not work in Mapreduce mode :-( Here is the error information I get. I am wondering what "Queue 'default' does not exist" ...
    Haiyi Zhu
    Nov 23, 2009 at 5:24 pm
    Nov 23, 2009 at 5:24 pm
  • As announced at ApacheCon US, the next Apache Hadoop Get Together Berlin is scheduled for December 2009. When: Wednesday December 16, 2009 at 5:00pm Where: newthinking store, Tucholskystr. 48, Berlin ...
    Isabel Drost
    Nov 11, 2009 at 1:02 am
    Nov 11, 2009 at 1:02 am
  • Hadoop Fans, we're growing again, and wanted to let the Hadoop community know. If you enjoy working with Hadoop, are excited by doing cool things with interesting data, and have experience working ...
    Christophe Bisciglia
    Nov 7, 2009 at 8:16 pm
    Nov 7, 2009 at 8:16 pm
  • Hi, I have a file whose contents look like {(1L),(2L),(3L)} {(4L),(2L),(1L)} {(8L),(3L),(4L)} In short I want a bag with 3 tuples. I do the following to accomplish it grunt> A = LOAD 'data1' as ...
    Kelvin Moss
    Nov 4, 2009 at 9:45 am
    Nov 4, 2009 at 9:45 am
  • Dmitriy's post about his talk reminded me that I forgot to send my talk on Hadoop and Pig at Twitter <http://www.slideshare.net/kevinweil/hadoop-pig-and-twitter-nosql-east-2009 ...
    Kevin Weil
    Nov 4, 2009 at 9:15 am
    Nov 4, 2009 at 9:15 am
  • Pig Team is happy to announce Pig 0.5.0 release! Pig is a Hadoop subproject that provides high-level data-flow language and an execution framework for parallel computation on a Hadoop cluster. More ...
    Olga Natkovich
    Nov 4, 2009 at 1:14 am
    Nov 4, 2009 at 1:14 am
  • Mohan Agarwal
    Nov 2, 2009 at 7:01 pm
    Nov 2, 2009 at 7:01 pm
Group Overview
group: user @
categories: pig, hadoop

46 users for November 2009

  • Alan Gates: 20 posts
  • Dmitriy Ryaboy: 20 posts
  • Zjffdu: 14 posts
  • Bennie Schut: 10 posts
  • Zaki Rahaman: 10 posts
  • Ashutosh Chauhan: 8 posts
  • Dhaval deshpande: 7 posts
  • James R. Leek: 6 posts
  • V Satish Kumar: 6 posts
  • Olga Natkovich: 5 posts
  • Thejas Nair: 5 posts
  • Haiyi Zhu: 4 posts
  • Kelvin Moss: 4 posts
  • Rekha Joshi: 4 posts
  • Bill Graham: 3 posts
  • Rob Stewart: 3 posts
  • Santhosh Srinivasan: 3 posts
  • Vincent Barat: 3 posts
  • Matteo Nasi: 2 posts
  • Mridul Muralidharan: 2 posts