-
Hi, this was probably discussed before on this list, but I couldn't find it. We are implementing log analysis tools for some high-traffic web sites. We have millions of logs of a web site in a ...
Gökhan Çapan
Dec 24, 2009 at 9:17 am
Dec 25, 2009 at 1:17 pm -
13
CROSS issue
I am not sure if this is a bug, or something more subtle, but here is the problem that I am having. When I LOAD a dataset, change it with an ORDER, LIMIT it, then CROSS it with itself, the results ...
Corry Haines
Dec 15, 2009 at 12:03 am
Dec 24, 2009 at 6:28 pm -
Hi: We tried to get the top N results after a group-by and sort, and got different results with or without storing the full sorted results. Here is a skeleton of our Pig script. raw_data = Load ...
Chuang Liu
Dec 18, 2009 at 12:46 am
Dec 22, 2009 at 6:37 pm -
I followed the instructions on the Pig Setup page but I can't seem to attach to my HDFS cluster. Is there a configuration file or an environment variable that I'm missing?
Aryeh Berkowitz
Dec 23, 2009 at 4:42 pm
Dec 23, 2009 at 6:15 pm -
Just a quick question out there before I go doing this myself but has anyone written a StoreFunc (or even better a reversible one that does both load/store) for JSON... basically I have a relation ...
Zaki Rahaman
Dec 4, 2009 at 6:04 pm
Dec 12, 2009 at 10:19 pm -
I am trying to figure out a way to identify the potential bottlenecks in my Pig script by putting some timestamps in the log before storing my output. Thinking of using the following: --debug tmp = ...
Prasenjit Mukherjee
Dec 18, 2009 at 9:48 am
Jan 12, 2010 at 4:05 pm -
Hi, I'm pretty sure the answer to my question is no, but I have to ask. Is it possible within Pig to store different groups of data into different output files where the grouping is dynamic (i.e. not ...
Bill Graham
Dec 15, 2009 at 8:00 pm
Dec 28, 2009 at 2:19 am -
Hi all, Thanks for sharing PigMix with us. I do have a related question to this thread. The generate_data.sh references test.jar in datagenjar=$PIG_HOME/build/test/classes/test.jar. This jar is never ...
Iman E
Dec 11, 2009 at 6:57 pm
Dec 15, 2009 at 5:38 am -
Hi, I'm trying to use FileLocalizer in a UDF to check if a path passed in as a parameter is a file or a directory. I saw in some of the Pig internal code something like this: PigContext pc = ...
Tamir Kamara
Dec 3, 2009 at 6:50 am
Dec 3, 2009 at 1:25 pm -
pigServer.setJobName() does not work for me. Now it always shows something like Job5377610087230523458.jar for Hadoop jobs. Is there any way to give them more meaningful names? Thanks, Yonggang
Yonggang Qiao
Dec 17, 2009 at 10:42 pm
Dec 18, 2009 at 12:17 am -
Hi all, I realized a week or two ago that PigStorage(',') wasn't adequate to parse files that had commas embedded in properly CSV quoted fields. I went ahead and built a CSV parser for pig 0.3 that ...
James Kebinger
Dec 8, 2009 at 11:12 pm
Dec 9, 2009 at 7:51 am -
Please forgive my ignorance, but is there a comment character in Pig scripts? It occurs to me I've never seen an example with a comment in it, and leading # or ; characters don't appear to work as ...
James Kebinger
Dec 4, 2009 at 11:57 pm
Dec 5, 2009 at 12:48 am -
I found this edge case issue: the ORDER statement assumes non-empty partitions to operate on. A simplified example below. in = LOAD 'a.gz' AS (label:int); sel = DISTINCT in PARALLEL <X ; ord = ORDER ...
Skepticus Smith
Dec 24, 2009 at 7:37 pm
Jan 13, 2010 at 4:00 am -
Hi everyone, is there any way I can set up job-specific properties without changing pig.properties every time? Can Pig grunt "set" do that? Or do I need to wrap it into Java and use the setProperty() to ...
Xiaomeng Wan
Dec 18, 2009 at 9:22 pm
Dec 19, 2009 at 2:51 am -
Is it possible to provide params to pig, on the command line or in properties, that get passed through to the hadoop jobs pig runs? Specifically, -D args to get picked up by the options parser on the ...
Derek Brown
Dec 9, 2009 at 3:07 am
Dec 10, 2009 at 8:54 pm -
Thanks guys! When I think about it, it may be good enough to do this at the CLI level as that is probably the most common use case for this (in most of the other "API" style modes the apps can ...
Vijay
Dec 30, 2009 at 8:04 pm
Jan 4, 2010 at 3:06 pm -
It looks like the way to use multi-query from Java is as follows: 1. pigServer.setBatchOn(); 2. register your queries with pigServer 3. List<ExecJob> jobs = pigServer.executeBatch(); 4. for (ExecJob ...
Dmitriy Ryaboy
Dec 15, 2009 at 8:45 pm
Dec 15, 2009 at 9:40 pm -
Are there set operations on bags beyond the DIFF operator? I'd like to compare bags to find elements in both of them (intersection). I can imagine union and set addition and subtraction being useful ...
James Kebinger
Dec 13, 2009 at 1:11 am
Dec 14, 2009 at 5:15 pm -
Greetings, Due to the holiday season, the Hadoop/HBase/Etc. Meetup is not going to happen. If anyone wants to get together for casual coffee or drinks, though, let me know! We'll be back on schedule ...
Bradford Stephens
Dec 29, 2009 at 11:30 pm
Dec 29, 2009 at 11:30 pm -
Hadoop Fans, it's been a few weeks since we've hosted public training sessions, and now we're happy to announce three sessions in three cities over the next three months. These sessions are all ...
Christophe Bisciglia
Dec 21, 2009 at 9:07 pm
Dec 21, 2009 at 9:07 pm
Group Overview
group | user |
categories | pig, hadoop |
discussions | 21 |
posts | 106 |
users | 32 |
website | pig.apache.org |
32 users for December 2009