Grokbase Groups Pig user July 2008
FAQ

Search Discussions

12 discussions - 46 posts

  • Hi, I am trying to start pig for the first time, so here is a beginner's question. How do I tell the bin/pig shell script where the cluster can be found? I used the conf/pic.properties as follows: # ...
    Gert PfeiferGert Pfeifer
    Jul 1, 2008 at 1:27 pm
    Jul 3, 2008 at 4:13 am
  • Hi All, (Thanks for the help with the tutorial!) I'm looking at using pig to chew through some apache server log files. Has anyone on the list done this? Any best practices or UDF's floating around ...
    Mark SnowMark Snow
    Jul 2, 2008 at 8:57 pm
    Sep 27, 2008 at 6:09 am
  • Folks, Is there a way to do something akin to map (of map/reduce) over a tuple? The input file is lines like this: category word1 word2 ... So simplest is to read it as a tuple (PigStorage ' '), but ...
    Handerson, Steven K.Handerson, Steven K.
    Jul 28, 2008 at 7:35 pm
    Jul 29, 2008 at 7:13 pm
  • Hi Can you use Pig with XML data files? If so, does anyone have any examples? I want to do something that would equate to an XPath query against the XML. Thanks.
    Kayla JayKayla Jay
    Jul 1, 2008 at 6:25 pm
    Jul 2, 2008 at 8:46 pm
  • Hi: Just start learning hadoop and pig latin. How can I get the number of elements in a data bag? For example, a data bag like follow has four elements. B= {1, 2, 3, 5} I tried C = COUNT(B), it did ...
    Charles duCharles du
    Jul 18, 2008 at 7:23 pm
    Aug 7, 2008 at 11:39 pm
  • Hello, I'm attempting to run a Pig job on a Hadoop cluster with a 5GB/35 million row input. When run on sample data of 100k rows, I get the correct results, but when I run it on the whole dataset, ...
    Brandon DimcheffBrandon Dimcheff
    Jul 24, 2008 at 5:53 pm
    Jul 25, 2008 at 5:49 pm
  • Hi, I'm trying to build piggybank, but is failing... Any ideas anyone? Thank you in advance, Andre -- andre-philippis-macbook:java andrephilippi$ pwd /Users/andrephilippi/pig/piggybank/java ...
    Andre PhilippiAndre Philippi
    Jul 14, 2008 at 8:31 pm
    Jul 17, 2008 at 6:32 pm
  • Hi, I started learning and working with Pig 3 days ago and I have been able to make it work with my EC2/HDFS environment and parse some files. Yay! :) Now, after reading all of the docs on the wiki, ...
    Andre PhilippiAndre Philippi
    Jul 14, 2008 at 8:47 pm
    Jul 15, 2008 at 1:07 am
  • Hi, I wrote a small pig script with a couple of functions and it works fine in the local mode. However, when I run it on a hadoop cluster on a 4Gig file (apache access log). The job is submitted ...
    Raghu RajagopalanRaghu Rajagopalan
    Jul 1, 2008 at 7:13 pm
    Jul 2, 2008 at 8:05 pm
  • A reminder that the next user group meeting is scheduled for July 22nd from 6 - 7:30 pm at Yahoo! Mission College, Building 1, Training Rooms 3 and 4. Agenda: Cascading - Chris Wensel Performance ...
    Ajay AnandAjay Anand
    Jul 21, 2008 at 5:08 pm
    Jul 21, 2008 at 5:08 pm
  • Charles, The right forum for Pig is pig-user@incubator.apache.org, I'm redirecting you there... good luck! Arun
    Arun C MurthyArun C Murthy
    Jul 18, 2008 at 11:41 pm
    Jul 18, 2008 at 11:41 pm
  • The next Hadoop User Group meeting is scheduled for July 22nd from 6 - 7:30 pm at Yahoo! Mission College, Building 1, Training Rooms 3 and 4. Agenda: Cascading - Chris Wenzel Performance Benchmarking ...
    Ajay AnandAjay Anand
    Jul 8, 2008 at 6:02 pm
    Jul 8, 2008 at 6:02 pm
Group Navigation
period‹ prev | Jul 2008 | next ›
Group Overview
groupuser @
categoriespig, hadoop
discussions12
posts46
users18
websitepig.apache.org