Grokbase Groups Pig user October 2008

29 discussions - 102 posts

  • Hi there, am I right that the LIMIT function is in the types branch only? Are there any schedules for merging this with trunk / the 0.2.0 release? Thanks, Johannes ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 101tec GmbH ...
    Johannes Zillmann
    Oct 9, 2008 at 3:10 pm
    Oct 10, 2008 at 4:33 pm
  • Hi all, I'd like to remind everyone that the Hadoop Camp & ApacheCon US is coming up in New Orleans next month. It will be the largest gathering of Hadoop developers ...
    Owen O'Malley
    Oct 2, 2008 at 4:13 pm
    Nov 18, 2008 at 11:01 pm
  • Dear Users, The requirements document for error handling in Pig is now published at: Please take a look and feel free to provide feedback. Thanks, Santhosh
    Santhosh Srinivasan
    Oct 20, 2008 at 10:29 pm
    Nov 6, 2008 at 6:11 pm
  • My latest stuff looks at apache logs, aggregates to txt files, then I have a simple perl script that +='s into mysql tables. A few thoughts * Would sure be nice if I could just STORE my aggregations ...
    Earl Cahill
    Oct 18, 2008 at 5:18 am
    Oct 21, 2008 at 7:24 am
  • Greetings! My requirement is to search for an input string in a given file and output all lines of the file that contain the string. I am writing the following searchString.pig script (Have integrated ...
    Oct 5, 2008 at 1:34 am
    Oct 5, 2008 at 4:32 am
  • Hi, With the new typing and schema work, will there be (or is there already) a way to introspect the schema of a given tuple, for example in a UDF? Since we specify schemas and field names on load, ...
    Kevin Weil
    Oct 29, 2008 at 6:15 pm
    Oct 31, 2008 at 11:34 am
  • Hi all: I want to derive multiple records from one record through pig latin. What is the right syntax to do it? For example, I have a record with two fields f1 and f2. I would like to generate two ...
    Charles du
    Oct 30, 2008 at 8:53 pm
    Oct 31, 2008 at 11:21 am
  • Maybe I am way off, but I sure can't seem to load from a directory. I made a directory (/tmp/pig_test) with three dumb files each containing lines that look like bob\t3 alice\t2 , using the perl code ...
    Earl Cahill
    Oct 25, 2008 at 11:36 pm
    Oct 30, 2008 at 4:48 pm
  • Dear Users and Developers, As you all know, Pig has graduated from the Apache Incubator and is joining Hadoop as a subproject. We are in the process of migrating the project, which means several things ...
    Olga Natkovich
    Oct 24, 2008 at 9:16 pm
    Oct 28, 2008 at 10:58 pm
  • I am having issues with a custom load function that reads protocol buffers. It worked with pig 0.1, and now after the refactoring to support 0.2/types, I can't get it to do anything past the line ...
    Kevin Weil
    Oct 12, 2008 at 10:26 am
    Oct 14, 2008 at 4:41 pm
  • So I have been watching (and using) Earl's latest patches with great joy, but I was wondering if there was a less cumbersome way of specifying the functions, for example searchTerms = FOREACH row GENERATE ...
    Ian Holsman
    Oct 10, 2008 at 4:09 pm
    Oct 13, 2008 at 7:46 pm
  • Howdy, I was well on my way to writing a class that would load logs built from apache's common log format, as has been discussed on this list. Then it hit me that really, I was just loading based on ...
    Earl Cahill
    Oct 5, 2008 at 8:06 am
    Oct 7, 2008 at 4:48 pm
  • Greetings! Hi, When I load a directory (from hdfs) into an alias and try to dump it, I find all the lines of the various files in that directory appearing one after another. However, not able to figure ...
    Oct 5, 2008 at 5:36 pm
    Oct 6, 2008 at 9:50 pm
  • There are 3 plans in Pig: LogicalPlan, PhysicalPlan and MROperPlan. Here is my understanding of these plans. 1. LogicalPlan: implemented by Pig. It is a logical graph for the query's plan. Every ...
    Oct 24, 2008 at 9:51 am
    Oct 26, 2008 at 12:07 pm
  • Hi all, This RDF proposal is a good long time ago. Now we'd like to settle down to research again. I attached our proposal, We'd love to hear your feedback & stories!! Thanks. -- Best regards, Edward ...
    Edward J. Yoon
    Oct 21, 2008 at 1:02 am
    Oct 22, 2008 at 5:36 am
  • Hi, I'm trying to analyze a dataset that looks like (string, number, bag { string, number }). (in the pig-types branch.) In my load function, what should the AS clause for my bag look like? I'm doing ...
    Kevin Weil
    Oct 20, 2008 at 5:37 am
    Oct 21, 2008 at 9:54 pm
  • Greetings! Hi All, A = load 'file' using PigStorage(' ') as (key , value); B = filter A by key matches '*Database*' matches is not working for me. ... at ...
    Oct 19, 2008 at 6:11 pm
    Oct 20, 2008 at 3:43 pm
  • All, As part of my talk at ApacheCon this year, I'd like to be able to give a list of companies, universities, research labs, etc. that are using pig. If you use pig and can share that with the ...
    Alan Gates
    Oct 18, 2008 at 12:17 am
    Oct 20, 2008 at 1:08 pm
  • I want to compute statistics (like SUM/COUNT) of data; maybe I will also use the SUM result to compute the next value. So I used Pig Latin like this: urls = LOAD 'logs' AS (url, ip,time, ...
    Oct 17, 2008 at 1:50 pm
    Oct 20, 2008 at 1:38 am
  • All, As you have probably noticed if you've been watching the mailing list, much work has gone into an almost complete rework of pig over the last six months. This work has been done on the types ...
    Alan Gates
    Oct 10, 2008 at 4:25 pm
    Oct 17, 2008 at 4:22 pm
  • I downloaded types-stable-1 and built it with hadoop 0.18 using ant. Also, I copied the hadoop-site.xml from hadoop0.18's conf, and I annotated the hod's parameters in the conf/, like ...
    Oct 16, 2008 at 1:59 pm
    Oct 17, 2008 at 3:42 am
  • Hi, Say that I write out a tuple with three fields using BinStorage. And then in a couple months, I add a parameter, so now I write out a tuple with a new fourth field. If I'm loading a directory ...
    Kevin Weil
    Oct 8, 2008 at 7:00 pm
    Oct 8, 2008 at 8:42 pm
  • I have now collected a few weeks of data, and I tried to run a simple pig script against it: rawdata = load '/serves' using PigStorage('\u0001') as (remote_ip, user_agent, timestamp, served_time, ...
    Emmett Shear
    Oct 31, 2008 at 11:56 pm
    Nov 10, 2008 at 5:22 pm
  • Hi all: I implemented a pig latin function that takes all fields in a record as parameters. Because my record has 30 fields, I do not want to list them all when I call this function, like myfunc(f0, ...
    Charles du
    Oct 31, 2008 at 10:07 pm
    Oct 31, 2008 at 10:12 pm
  • code: term2url_orig = LOAD 'term2url_orig' AS (term, termscore:double, url:chararray, total:double); set 'same term'; term2url_termqueryscore_group = GROUP term2url_orig BY term ...
    Oct 22, 2008 at 3:55 pm
    Oct 23, 2008 at 9:32 pm
  • I found that the structure of LogicalPlan described in the wiki: is different from the code in types-stable-1. If I want to know more about the ...
    Oct 21, 2008 at 1:56 am
    Oct 21, 2008 at 5:50 pm
  • Hi There is an open issue for supporting HBase in Pig. I was wondering whether anybody in the community has worked up some techniques/patches/workarounds for using HBase from Pig. Thanks Bob
    Robert Goodman
    Oct 6, 2008 at 6:24 pm
    Oct 7, 2008 at 4:50 pm
  • All, I wanted to let you know that pig will be part of ApacheCon US 2008, coming up in a few weeks. I will be giving a talk on pig as part of the Hadoop Camp ...
    Alan Gates
    Oct 18, 2008 at 12:11 am
    Oct 18, 2008 at 12:11 am
  • I propose that pig develop a standard set of benchmark queries that can be run from release to release to measure pig's (hopefully improving) performance over time. This would be similar in nature to ...
    Alan Gates
    Oct 2, 2008 at 7:59 pm
    Oct 2, 2008 at 7:59 pm
Group Overview
group: user @
categories: pig, hadoop

25 users for October 2008

Alan Gates: 18 posts
Olga Natkovich: 13 posts
Kevin Weil: 11 posts
Earl Cahill: 7 posts
Latha: 7 posts
Santhosh Srinivasan: 7 posts
Paradisehit: 6 posts
Ian Holsman: 4 posts
Daniel Dai: 3 posts
Edward J. Yoon: 3 posts
Johannes Zillmann: 3 posts
Pi song: 3 posts
Ted Dunning: 3 posts
Charles du: 2 posts
Pradeep Kamath: 2 posts
Ajay Anand: 1 post
Emmett Shear: 1 post
Ian Holsman: 1 post
Ian Theocharis Athanasakis: 1 post
Jim Kellerman (POWERSET): 1 post