Grokbase Groups Pig user May 2013
FAQ

Search Discussions

28 discussions - 92 posts

  • All, Please join me in welcoming Prashant Kommireddi as our newest Pig committer. He's been contributing to Pig for a while now. We look forward to him being a part of the project. Julien
    Julien Le DemJulien Le Dem
    May 2, 2013 at 7:59 pm
    May 3, 2013 at 5:45 am
  • Hello, In a Pig script I want to store the results in 2 different MySql tables (using DBStorage) and a file on HDFS. This means 3 different STORE statements. Right now when I do that, it does give ...
    Shahab YunusShahab Yunus
    May 8, 2013 at 2:11 pm
    May 10, 2013 at 1:21 am
  • Hi there, I have a huge input on an HDFS and I would like to use Pig to calculate several unique metrics. To help explain the problem more easily, I assume the input file has the following schema ...
    Thomas EdisonThomas Edison
    May 6, 2013 at 3:11 am
    May 9, 2013 at 3:50 am
  • Dear all, I wonder if someone can tell me if the current version of pig support loop and branching? regards! Yong
    YonghuYonghu
    May 7, 2013 at 11:14 am
    May 7, 2013 at 3:16 pm
  • Any fix for parsing string array in the near future? https://issues.apache.org/jira/browse/PIG-2949 -- Wayne Zhu
    Zhu WayneZhu Wayne
    May 7, 2013 at 10:39 pm
    May 8, 2013 at 7:32 pm
  • Hey all, If I need to load a Hbase table with Hex values into Pig, does that require a specific UDF? IS there any inbuilt function in Pig? I searched the documentation but cannot find anything that ...
    John MeekJohn Meek
    May 6, 2013 at 3:16 am
    May 6, 2013 at 9:14 pm
  • Hi all, In my script a = load 'data' using PigStorage(); b = foreach a generate 342 as col1, substring(x,0,4) as col2, ; I want to use col2 later in foreach statement. derived col2 should be used ...
    AbhishekAbhishek
    May 7, 2013 at 2:52 am
    May 9, 2013 at 10:24 pm
  • Hi, I have 2 differents behaviour for the same Pig version (with hadoop 1.1.2) on different servers. If anyone can tell me why with the same versions and the same parameters I don't have the same ...
    Cscetbon ExtCscetbon Ext
    May 3, 2013 at 9:45 am
    May 7, 2013 at 3:06 pm
  • PIG 0.11 Query : I register the below string String query = "A = LOAD '" + BENCHMARK_PARQUET_MR_DATA_TEXTINPUT + "' using PigStorage() as (" + schemaString + ");"; with ...
    ÐΞ€ρ@Ҝ (๏̯͡๏)ÐΞ€ρ@Ҝ (๏̯͡๏)
    May 4, 2013 at 5:53 am
    May 6, 2013 at 5:26 am
  • Hi, I have a dataset with two three columns, group_id, position, and name. I need for each group to generate a concatenated string of all names ordered by their position. I can do this by sorting all ...
    Ahmed EldawyAhmed Eldawy
    May 13, 2013 at 5:35 pm
    May 13, 2013 at 7:49 pm
  • Hi, I am using PigTest in order to verify a script reading and storing data in avro format. However, at the moment, the script fails due to the optimisation rule ColumnMapKeyPrune. I known I can ...
    Bertrand DechouxBertrand Dechoux
    May 13, 2013 at 9:25 am
    May 13, 2013 at 5:50 pm
  • Greetings! Did someone encounter the same issue? Well-formated XML for <Sellers </Sellers is fine: grunt register /usr/lib/pig/piggybank.jar; grunt a = load 'sample.xml' using ...
    Zhu WayneZhu Wayne
    May 6, 2013 at 5:05 pm
    May 7, 2013 at 4:25 pm
  • Thought I understood how to output to a single file but It doesn't seem to be working. Anything I'm missing here? -- Dedupe and store rows = LOAD '$input'; unique = DISTINCT rows PARELLEL 1; STORE ...
    MarkMark
    May 1, 2013 at 4:52 pm
    May 1, 2013 at 5:21 pm
  • Hi, I have a very weird issue with my PIG script. Following is the content of my script *REGISTER /home/hadoopuser/Workspace/lib/piggybank.jar* *REGISTER /home/hadoopuser/Workspace/lib/datafu.jar;* ...
    Praveen BysaniPraveen Bysani
    May 14, 2013 at 3:10 am
    May 14, 2013 at 10:29 pm
  • I want to something like this B = FOREACH A GENERATE a1, *if a2 = 0: a2=a2+1 else a2*, a3) how to do " if a2 = 0: a2=a2+1 else a2" in PIG (or it could be "if a2 matches < some regex : a2+0 else a2") ...
    Ashish GuptaAshish Gupta
    May 14, 2013 at 4:11 pm
    May 14, 2013 at 4:20 pm
  • I'm trying to set useMatches=false in REGEX_EXTRACT_ALL as per the javadoc: http://pig.apache.org/docs/r0.11.0/api/org/apache/pig/builtin/REGEX_EXTRACT_ALL.html (and yes, I'm using pig 0.11). But it ...
    William ObermanWilliam Oberman
    May 8, 2013 at 5:21 pm
    May 8, 2013 at 5:31 pm
  • 1

    NVL

    I'd like to write a java UDF that functions more or less the same as a SQL NVL command. I've been stymied on writing a general function by the fact that I want it to work on all data types--in ...
    Catherine MillerCatherine Miller
    May 8, 2013 at 2:15 pm
    May 8, 2013 at 2:55 pm
  • Hi, How to add days to the current date in PIG? Is there any built in fucntion? Regards Soniya
    soniya Bsoniya B
    May 7, 2013 at 12:55 am
    May 7, 2013 at 1:17 am
  • I'm new to pig and I'm getting a ClassCastException when I try to run the following script in pig 0.11.1: A = LOAD 'test.log' AS (timestamp:long, pk_id:int, array_field:chararray, fk_id:int); B = ...
    Peter ConnollyPeter Connolly
    May 3, 2013 at 7:01 pm
    May 5, 2013 at 6:57 am
  • Hello, What is the bets way to get the count of records in an HDFS file generated by a PIG script. Thanks
    Mix NinMix Nin
    May 13, 2013 at 5:52 pm
    May 13, 2013 at 5:52 pm
  • Hey all, One of my scripts is giving the below error. The script works fine when I run it in Grunt but I get the "Error to read counters into Rank operation counterSize 0". ?? I see this ...
    John MeekJohn Meek
    May 13, 2013 at 5:48 pm
    May 13, 2013 at 5:48 pm
  • Has anyone built the Piggybank jar with the DSE-Cassandra distribution of Pig? I'm using Pig 0.9.2 on DSE 3.0, and would ultimately just like to use CSVExcelStorage UDF from the Piggybank ...
    Anita MehrotraAnita Mehrotra
    May 9, 2013 at 3:53 pm
    May 9, 2013 at 3:53 pm
  • Hi, I'm working on https://issues.apache.org/jira/browse/PIG-3297 and I ran into something strange. The issue is that Pig crashes with a Java Exception on a specific feature in the Avro format (I ...
    Niels BasjesNiels Basjes
    May 8, 2013 at 2:52 pm
    May 8, 2013 at 2:52 pm
  • Hi, I've got some objects originally loaded, using the JSON loader from elephantbird, into nested maps, and subsequently stored using LZOPigStorage after various stages of processing. When I ...
    Kris CowardKris Coward
    May 7, 2013 at 5:06 pm
    May 7, 2013 at 5:06 pm
  • it will help me to debug the pig script. Thanks, Jack
    Jinyuan ZhouJinyuan Zhou
    May 3, 2013 at 9:39 pm
    May 3, 2013 at 9:39 pm
  • I'm using pig 0.11.1 to load about 100 rows into mysql using DBStorage. If I run my script from grunt everything works fine and the rows are committed. If I run pig in batch mode the script says it ...
    Peter ConnollyPeter Connolly
    May 3, 2013 at 1:19 pm
    May 3, 2013 at 1:19 pm
  • I was wondering if there is a way to compute the product of all the values in a bag much like the built in function SUM does currently. For reference, I am currently implementing a multinomial naive ...
    Sergey GoderSergey Goder
    May 2, 2013 at 11:42 pm
    May 2, 2013 at 11:42 pm
  • Hi, Anyone can help me to generate system generated DateTime in PIG? I have tried it and didn't get any clue. Regards Soniya
    soniya Bsoniya B
    May 2, 2013 at 7:18 pm
    May 2, 2013 at 7:18 pm
Group Navigation
period‹ prev | May 2013 | next ›
Group Overview
groupuser @
categoriespig, hadoop
discussions28
posts92
users45
websitepig.apache.org

45 users for May 2013

Cheolsoo Park: 7 posts Jonathan Coveney: 6 posts Shahab Yunus: 6 posts Niels Basjes: 4 posts Zhu Wayne: 4 posts Cscetbon Ext: 3 posts Alan Gates: 3 posts ÐΞ€ρ@Ҝ (๏̯͡๏): 3 posts John Meek: 3 posts Johnny Zhang: 3 posts Ruslan Al-Fakikh: 3 posts Thomas Edison: 3 posts Abhishek: 2 posts Ahmed Eldawy: 2 posts Bertrand Dechoux: 2 posts Mark: 2 posts Mike Sukmanowsky: 2 posts Peter Connolly: 2 posts Prasanth J: 2 posts Prashant Kommireddi: 2 posts
show more