FAQ

Search Discussions

27 discussions - 112 posts

  • I've come across a very basic problem—unions simply do not work in Hadoop mode. data files: $ cat ~/tmp/data 1 1 2 1 3 10 $ cat ~/tmp/data-2 4 20 5 20 pig script: data = load '/Users/arthur/tmp/data' ...
    Arthur ZwiegincewArthur Zwiegincew
    Sep 30, 2008 at 1:30 am
    Oct 3, 2008 at 4:31 pm
  • A = (a, b, c) I just want add a column 0 into A, and the A will be like this: A = (a, b, c, 0) How can I? I use cross, but when I use PARALLEL BIGGER( =300), it occurs that: ERROR ...
    ParadisehitParadisehit
    Sep 8, 2008 at 10:27 am
    Sep 26, 2008 at 7:43 am
  • Hi, How are the GROUP and COGROUP functions implemented? What's its efficiency? Thanks, Chuck
    Chuck LanChuck Lan
    Sep 2, 2008 at 9:16 pm
    Sep 19, 2008 at 5:46 pm
  • I would like to parse a standard access log and get named variables back. Thinking I need to read in all the lines, then send them through my parsing function. Perhaps the two steps can be combined, ...
    Earl CahillEarl Cahill
    Sep 25, 2008 at 7:01 am
    Sep 26, 2008 at 12:31 am
  • Is it possible to implement these using an object composed of List, Commons bag, Integer, Double, String, Boolean and Byte? (I will also have type information, so I don't need to endure the cost of ...
    Pete WyckoffPete Wyckoff
    Sep 20, 2008 at 12:36 am
    Sep 25, 2008 at 3:45 pm
  • Hi, If I ran the query below (and this is based on actual user query): -- Note that data1 has more than 1 column but as only declares a single one A = load 'data1' as (x); B = load 'data2' as (x, y, ...
    Olga NatkovichOlga Natkovich
    Sep 17, 2008 at 8:00 pm
    Sep 19, 2008 at 1:44 am
  • Hi, If I want to see what's happening on reduce from running a pig script, what's the best place to look at to start debugging? Thanks, Chuck
    Chuck LanChuck Lan
    Sep 29, 2008 at 8:53 pm
    Oct 2, 2008 at 4:18 pm
  • The next Hadoop User Group (Bay Area) meeting is scheduled for Wednesday, Sept 17th from 6 - 7:30 pm at Yahoo! Mission College, Santa Clara, CA, Building 2, Training Rooms 3&4. Agenda: Cloud ...
    Ajay AnandAjay Anand
    Sep 9, 2008 at 12:05 am
    Sep 23, 2008 at 5:37 am
  • Hello, I would like to write a Load Function to load data in Pig directly from tables in a database. The definition of the function would be LoadFromDb(selectQuery, connectionParameters). In order to ...
    Iván de PradoIván de Prado
    Sep 5, 2008 at 8:53 am
    Sep 8, 2008 at 5:04 pm
  • Hi All Am a student trying to integrate PIG and Hadoop technologies to build a custom application as a part of my MS project. Am trying out a simple scenario where I have setup a single node hadoop ...
    LathaLatha
    Sep 21, 2008 at 7:03 pm
    Nov 24, 2008 at 7:29 pm
  • I use the script like this: querys = GROUP clear_log ALL PARALLEL 4; TOTAL = FOREACH querys GENERATE FLATTEN(clear_log.($1, $2)), COUNT($1); STORE TOTAL INTO 'total'; AND I see the monitor page in ...
    ParadisehitParadisehit
    Sep 26, 2008 at 4:28 am
    Sep 30, 2008 at 4:06 pm
  • howdy, Just starting to dive into pig, and have had a hard time finding examples. I would like to put up some examples (on the wiki?) of what I hope to be simple scripts that could help find the ...
    Earl CahillEarl Cahill
    Sep 23, 2008 at 7:46 am
    Sep 29, 2008 at 4:49 pm
  • Pretty sure that the svn pig link on this page http://wiki.apache.org/pig/PigDeveloperCookbook should be http://svn.apache.org/repos/asf/incubator/pig/ But there are no trunk/pig or src-gen ...
    Earl CahillEarl Cahill
    Sep 26, 2008 at 6:19 am
    Sep 27, 2008 at 7:06 am
  • Hello, I am having a problem when giving parameters to a load function that loads data from a database. It receives a SQL query as parameter. Something like that: A = LOAD 'dbparams.cfg' USING ...
    Iván de PradoIván de Prado
    Sep 19, 2008 at 11:08 am
    Sep 22, 2008 at 8:28 am
  • I have a log file I'm writing explicitly to process in pig. The data for each line is a set of key-value pairs, which seems pretty much perfect as a fit for Pig, however I'm having some trouble. It's ...
    Emmett ShearEmmett Shear
    Sep 16, 2008 at 11:01 pm
    Sep 17, 2008 at 6:11 pm
  • I split a relation to two relations based on a condition, and then join these two relations. How can I access the fields in two relations when I output join results? My code is like: A = Load ... AS ...
    Charles duCharles du
    Sep 9, 2008 at 7:04 pm
    Sep 9, 2008 at 10:58 pm
  • Versions: pig from trunk (v 694861), hadoop v 17.2 java -cp pig.jar:$HADOOPSITEPATH org.apache.pig.Main works fine; it gives me a grunt shell connected to hdfs://localhost:9000/ bin/pig doesn't seem ...
    Emmett ShearEmmett Shear
    Sep 13, 2008 at 1:06 am
    Sep 16, 2008 at 11:01 pm
  • Hi, Pig release 0.1.0 is now available. This is the first Pig release from the incubator! For release details and downloads, visit: http://incubator.apache.org/pig/releases.html ...
    Olga NatkovichOlga Natkovich
    Sep 11, 2008 at 9:03 pm
    Sep 12, 2008 at 6:58 pm
  • Hi All, I seem to be seeing a problem with the DISTINCT operator. I have a script that looks like this: raw_tran_hdr = load 'tran_hdr/tran_header' using PigStorage( '|' ) as ( ... many fields ... ); ...
    Paul O'LearyPaul O'Leary
    Sep 24, 2008 at 11:17 pm
    Sep 24, 2008 at 11:59 pm
  • Hi All, Finally have the opportunity to use Pig in my day job. Things are going well; doing some pretty real stuff! Quick type massage question, a minor issue really. I have an expression that looks ...
    Paul O'LearyPaul O'Leary
    Sep 23, 2008 at 8:21 pm
    Sep 23, 2008 at 8:48 pm
  • howdy, I am hoping to parse referers to find the refering search engine and terms. Wondering if there is an analog to http://search.cpan.org/~sden/URI-ParseSearchString-2.5/ for java / pig. Tonight I ...
    Earl CahillEarl Cahill
    Sep 23, 2008 at 7:35 am
    Sep 23, 2008 at 8:09 am
  • group a by $0, b by $0, c by $0 VS AA = group a by $0, b by $0 d = FOREACH AA generate $0, flatten($1) group d by $0, c by $0; Does them have the same efficiency, which is faster?
    ParadisehitParadisehit
    Sep 5, 2008 at 5:56 am
    Sep 5, 2008 at 6:28 am
  • I use a hadoop 0.15, and it didn't have the hod contribution, I didn't wanna change my hadoop distribution. If the hod is not necessary, which files would be modified? If the hod should be used, i ...
    ParadisehitParadisehit
    Sep 4, 2008 at 4:05 am
    Sep 4, 2008 at 4:30 am
  • I use GROUP can manage the data that has the same "value" into one bag. For example: character, num (b, 1) (a, 1) (b, 2) group (a, {(a, 2)}) {(a, 1)} = { (a, 1) } (a, 2) (b, {(b, 1)}) (a, 1) (b, 2) ...
    ParadisehitParadisehit
    Sep 2, 2008 at 9:45 am
    Sep 2, 2008 at 7:47 pm
  • I can use LOAD or FOREACH to create a data type as a Tuple, or use GROUP/COGROUP to create a data type bag. But when does the map type occurs? Or it may be used in UDFs?
    ParadisehitParadisehit
    Sep 1, 2008 at 2:27 am
    Sep 2, 2008 at 7:42 pm
  • The next Hadoop User Group (Bay Area) meeting is scheduled for Wednesday, Sept 17th from 6 - 7:30 pm at Yahoo! Mission College, Santa Clara, CA, Building 1, Training Rooms 3&4. Agenda: Update on ...
    Ajay AnandAjay Anand
    Sep 16, 2008 at 10:30 pm
    Sep 16, 2008 at 10:30 pm
  • pig-*-core.jar the classes from the libraries (junit-4.1.jar, ${hadoop.jarfile}, jsch-0.1.33.jar). As was discussed in a previous mail ...
    Iván de PradoIván de Prado
    Sep 3, 2008 at 4:11 pm
    Sep 3, 2008 at 4:11 pm
Group Navigation
period‹ prev | Sep 2008 | next ›
Group Overview
groupuser @
categoriespig, hadoop
discussions27
posts112
users23
websitepig.apache.org

23 users for September 2008

Alan Gates: 19 posts Olga Natkovich: 14 posts Earl Cahill: 12 posts Paradisehit: 12 posts Chuck Lan: 7 posts Iván de Prado: 6 posts Mridul Muralidharan: 6 posts Prashanth Pappu: 6 posts Bertrand Delacretaz: 4 posts Emmett Shear: 4 posts Arthur Zwiegincew: 3 posts Pete Wyckoff: 3 posts Ajay Anand: 2 posts Charles du: 2 posts Chris Olston: 2 posts Corinne Chandel: 2 posts Paul O'Leary: 2 posts Arun C Murthy: 1 post Chris K Wensel: 1 post Daniel: 1 post
show more