Search Discussions
-
I've come across a very basic problem—unions simply do not work in Hadoop mode. data files: $ cat ~/tmp/data 1 1 2 1 3 10 $ cat ~/tmp/data-2 4 20 5 20 pig script: data = load '/Users/arthur/tmp/data' ...
Arthur Zwiegincew
Sep 30, 2008 at 1:30 am
Oct 3, 2008 at 4:31 pm -
A = (a, b, c) I just want add a column 0 into A, and the A will be like this: A = (a, b, c, 0) How can I? I use cross, but when I use PARALLEL BIGGER( =300), it occurs that: ERROR ...
Paradisehit
Sep 8, 2008 at 10:27 am
Sep 26, 2008 at 7:43 am -
Hi, How are the GROUP and COGROUP functions implemented? What's its efficiency? Thanks, Chuck
Chuck Lan
Sep 2, 2008 at 9:16 pm
Sep 19, 2008 at 5:46 pm -
I would like to parse a standard access log and get named variables back. Thinking I need to read in all the lines, then send them through my parsing function. Perhaps the two steps can be combined, ...
Earl Cahill
Sep 25, 2008 at 7:01 am
Sep 26, 2008 at 12:31 am -
Is it possible to implement these using an object composed of List, Commons bag, Integer, Double, String, Boolean and Byte? (I will also have type information, so I don't need to endure the cost of ...
Pete Wyckoff
Sep 20, 2008 at 12:36 am
Sep 25, 2008 at 3:45 pm -
Hi, If I ran the query below (and this is based on actual user query): -- Note that data1 has more than 1 column but as only declares a single one A = load 'data1' as (x); B = load 'data2' as (x, y, ...
Olga Natkovich
Sep 17, 2008 at 8:00 pm
Sep 19, 2008 at 1:44 am -
Hi, If I want to see what's happening on reduce from running a pig script, what's the best place to look at to start debugging? Thanks, Chuck
Chuck Lan
Sep 29, 2008 at 8:53 pm
Oct 2, 2008 at 4:18 pm -
The next Hadoop User Group (Bay Area) meeting is scheduled for Wednesday, Sept 17th from 6 - 7:30 pm at Yahoo! Mission College, Santa Clara, CA, Building 2, Training Rooms 3&4. Agenda: Cloud ...
Ajay Anand
Sep 9, 2008 at 12:05 am
Sep 23, 2008 at 5:37 am -
Hello, I would like to write a Load Function to load data in Pig directly from tables in a database. The definition of the function would be LoadFromDb(selectQuery, connectionParameters). In order to ...
Iván de Prado
Sep 5, 2008 at 8:53 am
Sep 8, 2008 at 5:04 pm -
Hi All Am a student trying to integrate PIG and Hadoop technologies to build a custom application as a part of my MS project. Am trying out a simple scenario where I have setup a single node hadoop ...
Latha
Sep 21, 2008 at 7:03 pm
Nov 24, 2008 at 7:29 pm -
I use the script like this: querys = GROUP clear_log ALL PARALLEL 4; TOTAL = FOREACH querys GENERATE FLATTEN(clear_log.($1, $2)), COUNT($1); STORE TOTAL INTO 'total'; AND I see the monitor page in ...
Paradisehit
Sep 26, 2008 at 4:28 am
Sep 30, 2008 at 4:06 pm -
howdy, Just starting to dive into pig, and have had a hard time finding examples. I would like to put up some examples (on the wiki?) of what I hope to be simple scripts that could help find the ...
Earl Cahill
Sep 23, 2008 at 7:46 am
Sep 29, 2008 at 4:49 pm -
Pretty sure that the svn pig link on this page http://wiki.apache.org/pig/PigDeveloperCookbook should be http://svn.apache.org/repos/asf/incubator/pig/ But there are no trunk/pig or src-gen ...
Earl Cahill
Sep 26, 2008 at 6:19 am
Sep 27, 2008 at 7:06 am -
Hello, I am having a problem when giving parameters to a load function that loads data from a database. It receives a SQL query as parameter. Something like that: A = LOAD 'dbparams.cfg' USING ...
Iván de Prado
Sep 19, 2008 at 11:08 am
Sep 22, 2008 at 8:28 am -
I have a log file I'm writing explicitly to process in pig. The data for each line is a set of key-value pairs, which seems pretty much perfect as a fit for Pig, however I'm having some trouble. It's ...
Emmett Shear
Sep 16, 2008 at 11:01 pm
Sep 17, 2008 at 6:11 pm -
I split a relation to two relations based on a condition, and then join these two relations. How can I access the fields in two relations when I output join results? My code is like: A = Load ... AS ...
Charles du
Sep 9, 2008 at 7:04 pm
Sep 9, 2008 at 10:58 pm -
Versions: pig from trunk (v 694861), hadoop v 17.2 java -cp pig.jar:$HADOOPSITEPATH org.apache.pig.Main works fine; it gives me a grunt shell connected to hdfs://localhost:9000/ bin/pig doesn't seem ...
Emmett Shear
Sep 13, 2008 at 1:06 am
Sep 16, 2008 at 11:01 pm -
Hi, Pig release 0.1.0 is now available. This is the first Pig release from the incubator! For release details and downloads, visit: http://incubator.apache.org/pig/releases.html ...
Olga Natkovich
Sep 11, 2008 at 9:03 pm
Sep 12, 2008 at 6:58 pm -
Hi All, I seem to be seeing a problem with the DISTINCT operator. I have a script that looks like this: raw_tran_hdr = load 'tran_hdr/tran_header' using PigStorage( '|' ) as ( ... many fields ... ); ...
Paul O'Leary
Sep 24, 2008 at 11:17 pm
Sep 24, 2008 at 11:59 pm -
Hi All, Finally have the opportunity to use Pig in my day job. Things are going well; doing some pretty real stuff! Quick type massage question, a minor issue really. I have an expression that looks ...
Paul O'Leary
Sep 23, 2008 at 8:21 pm
Sep 23, 2008 at 8:48 pm -
howdy, I am hoping to parse referers to find the refering search engine and terms. Wondering if there is an analog to http://search.cpan.org/~sden/URI-ParseSearchString-2.5/ for java / pig. Tonight I ...
Earl Cahill
Sep 23, 2008 at 7:35 am
Sep 23, 2008 at 8:09 am -
group a by $0, b by $0, c by $0 VS AA = group a by $0, b by $0 d = FOREACH AA generate $0, flatten($1) group d by $0, c by $0; Does them have the same efficiency, which is faster?
Paradisehit
Sep 5, 2008 at 5:56 am
Sep 5, 2008 at 6:28 am -
I use a hadoop 0.15, and it didn't have the hod contribution, I didn't wanna change my hadoop distribution. If the hod is not necessary, which files would be modified? If the hod should be used, i ...
Paradisehit
Sep 4, 2008 at 4:05 am
Sep 4, 2008 at 4:30 am -
I use GROUP can manage the data that has the same "value" into one bag. For example: character, num (b, 1) (a, 1) (b, 2) group (a, {(a, 2)}) {(a, 1)} = { (a, 1) } (a, 2) (b, {(b, 1)}) (a, 1) (b, 2) ...
Paradisehit
Sep 2, 2008 at 9:45 am
Sep 2, 2008 at 7:47 pm -
I can use LOAD or FOREACH to create a data type as a Tuple, or use GROUP/COGROUP to create a data type bag. But when does the map type occurs? Or it may be used in UDFs?
Paradisehit
Sep 1, 2008 at 2:27 am
Sep 2, 2008 at 7:42 pm -
The next Hadoop User Group (Bay Area) meeting is scheduled for Wednesday, Sept 17th from 6 - 7:30 pm at Yahoo! Mission College, Santa Clara, CA, Building 1, Training Rooms 3&4. Agenda: Update on ...
Ajay Anand
Sep 16, 2008 at 10:30 pm
Sep 16, 2008 at 10:30 pm -
pig-*-core.jar the classes from the libraries (junit-4.1.jar, ${hadoop.jarfile}, jsch-0.1.33.jar). As was discussed in a previous mail ...
Iván de Prado
Sep 3, 2008 at 4:11 pm
Sep 3, 2008 at 4:11 pm
Group Overview
group | user |
categories | pig, hadoop |
discussions | 27 |
posts | 112 |
users | 23 |
website | pig.apache.org |
23 users for September 2008
Archives
- May 2013 (92)
- April 2013 (226)
- March 2013 (362)
- February 2013 (192)
- January 2013 (166)
- December 2012 (115)
- November 2012 (223)
- October 2012 (249)
- September 2012 (275)
- August 2012 (249)
- July 2012 (219)
- June 2012 (371)
- May 2012 (281)
- April 2012 (377)
- March 2012 (341)
- February 2012 (323)
- January 2012 (364)
- December 2011 (266)
- November 2011 (234)
- October 2011 (207)
- September 2011 (321)
- August 2011 (271)
- July 2011 (253)
- June 2011 (249)
- May 2011 (239)
- April 2011 (341)
- March 2011 (321)
- February 2011 (276)
- January 2011 (320)
- December 2010 (244)
- November 2010 (136)
- October 2010 (251)
- September 2010 (161)
- August 2010 (201)
- July 2010 (198)
- June 2010 (171)
- May 2010 (205)
- April 2010 (192)
- March 2010 (237)
- February 2010 (192)
- January 2010 (182)
- December 2009 (106)
- November 2009 (169)
- October 2009 (105)
- September 2009 (134)
- August 2009 (108)
- July 2009 (140)
- June 2009 (151)
- May 2009 (150)
- April 2009 (133)
- March 2009 (124)
- February 2009 (119)
- January 2009 (66)
- December 2008 (45)
- November 2008 (80)
- October 2008 (102)
- September 2008 (112)
- August 2008 (32)
- July 2008 (46)
- June 2008 (78)
- May 2008 (79)
- April 2008 (26)
- March 2008 (42)
- February 2008 (30)
- January 2008 (15)
- December 2007 (31)
- November 2007 (13)
- October 2007 (9)