37 discussions - 161 posts

  • As directed in our vote to become a TLP, we (Pig's PMC) need to set out bylaws for the project. I have put up a first proposal for these bylaws at http://wiki.apache.org/pig/ProposedByLaws. Please ...
    Alan Gates
    Sep 28, 2010 at 1:18 am
    Oct 5, 2010 at 9:39 pm
  • Hello, After getting all the errors to go away with LZO libraries not being found and missing jar files for elephant-bird I've run into a new problem when using the elephant-bird branch for pig 0.7 ...
    Sep 23, 2010 at 1:50 pm
    Oct 25, 2010 at 12:55 pm
  • Hello, I have a small cluster up and running with LZO compressed files in it. I'm using the lzo compression libraries available at http://github.com/kevinweil/hadoop-lzo (thank you for maintaining ...
    Sep 21, 2010 at 9:23 pm
    Sep 23, 2010 at 12:42 pm
  • Hi folks! I'm brand new to this list, so apologies if this is an inappropriate newbie question, or is otherwise incorrect, but here goes. I'm working with a bunch of pig scripts, and we're adding new ...
    Eric Wadsworth
    Sep 29, 2010 at 5:01 pm
    Sep 30, 2010 at 8:31 pm
  • Guys, I'm seeing this one 2998 Unexpected internal error. Can we be more specific or dump a stack trace when this happens?
    Hc busy
    Sep 30, 2010 at 3:10 am
    Sep 30, 2010 at 11:13 pm
  • How do you filter a relation by a field NOT matching a regex? You would think this would work, but it does not: B = FILTER A BY field_foo NOT matches 'test' Russ
    Russell Jurney
    Sep 26, 2010 at 5:50 pm
    Sep 27, 2010 at 7:08 pm
  • Hi, Is there a good way to access nested properties that are multilevel deep from Json objects loaded in Pig ? For example, if my json is like: {"keyA":{"pA":"vA"}} and I need to access "pA". Thanks, ...
    Rakesh kothari
    Sep 28, 2010 at 8:12 pm
    Oct 7, 2010 at 5:56 pm
  • Hi, I am using pig 0.7.0 in hadoop mapreduce mode. The problem I have is that I simply can't use STORE INTO alias USING PigStorage(); I can load dataset in, write UDFs to manipulate the dataset, but ...
    Alex Wang
    Sep 21, 2010 at 8:50 pm
    Sep 22, 2010 at 10:54 pm
  • Hi, I am trying to write pig script that is quite complex so I am testing it against very small data subset in local mode. However it might take up to 2 _minutes_ to finish. Or 30 seconds if I ...
    Konstantin Ignatyev
    Sep 30, 2010 at 8:09 pm
    Apr 21, 2011 at 7:36 am
  • Say I have a bunch of tuples that is a result of a GROUP, how can I just store the values.. not the key? As a side note, how can I output bags to be separated by tabs instead of commas? How can I ...
    Sep 16, 2010 at 12:05 am
    Sep 16, 2010 at 1:10 am
  • It seems that pig generates some folders/files under "/tmp" in HDFS for pig jobs. I remember that hadoop saves such intermediate results (map output, etc.) in non-hdfs folders, which are specified in ...
    Jiang licht
    Sep 13, 2010 at 9:23 pm
    Sep 13, 2010 at 10:22 pm
  • Hello, I'm running very big MR job with Pig, and sometimes some maps fail, but I would like this job to finish anyway. I know that option "mapred.max.map.failures.percent" is what I need, but how to ...
    Wojciech Langiewicz
    Sep 28, 2010 at 8:50 am
    Oct 6, 2010 at 10:02 pm
  • Hi everyone! After reading Ed's email, I got really intrigued about Pig using indexes, I thought those were just plans lol But as commented in here https://issues.apache.org/jira/browse/PIG-209, we ...
    Renato Marroquín Mogrovejo
    Sep 22, 2010 at 1:33 am
    Oct 5, 2010 at 3:25 pm
  • Hey folks, Not sure if this has been discussed already or if this is due to some limitation in pig, hadoop, or java - but is there a particular reason the PiggyBank SequenceFileLoader doesn't support ...
    Zach Bailey
    Sep 27, 2010 at 8:30 pm
    Sep 28, 2010 at 12:11 am
  • Hi guys, how u doing? I got a problem with my pig's script and I really appreciate if someone could give me a tip. Here's the problem: if I run this command everything goes ok result_logs = FOREACH ...
    Marcos Pinto
    Sep 10, 2010 at 4:28 pm
    Sep 11, 2010 at 2:39 am
  • Hi , I am getting the below error while running a pig script : "ERROR 6017: org.apache.pig.backend.executionengine.ExecException: ERROR 2118: Unable to create input splits for: hdfs://localhost:9000" ...
    Saurav Datta
    Sep 1, 2010 at 7:41 pm
    Sep 1, 2010 at 9:08 pm
  • loading/reading json for Pig processing sounds like a common useful functionality. however, I have not found any implementation for such. (and yes, I know of Elephant Bird, which reads LZO-compressed ...
    Benny Sadeh
    Sep 28, 2010 at 4:00 pm
    Sep 29, 2010 at 3:53 am
  • Hi guys, I wanted to check if anybody has fixed this error and recall how to fix it? 2010-09-21 14:43:46,288 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2160: Error during fixing ...
    Hc busy
    Sep 21, 2010 at 9:53 pm
    Sep 27, 2010 at 4:22 pm
  • Hi, I am trying to use SUBSTRING like this: ......generate SUBSTRING($19, 1, 13) as name I get an error complaining that it cannot resolve substring using imports....Error 1070 Do I have to register ...
    Ravi Fernando
    Sep 21, 2010 at 11:15 pm
    Sep 21, 2010 at 11:37 pm
  • Hi, I have two files, loaded as two relations A and B as follows: File1.txt -------------- ramana krishna siva venkat File2.txt --------------- krishna venkat kishore basha these two files are loaded ...
    Ramana Venkata
    Sep 9, 2010 at 4:21 pm
    Sep 13, 2010 at 5:51 am
  • Hi, I've just deployed some new Pig jobs live (Pig version 0.7.0) and I'm getting the error shown below. Has anyone seen this before? What's strange is that I have a tier of 4 load-balanced machines ...
    Bill Graham
    Sep 7, 2010 at 5:01 pm
    Sep 7, 2010 at 6:40 pm
  • Hi I have 2 files, each file contains one column of data I want to combine the two files into single file with two columns ex: file1.txt raju krishan siva venkat file2.txt CSE IT MECH CIVIL the ...
    Ramana Venkata
    Sep 2, 2010 at 3:15 pm
    Sep 6, 2010 at 2:20 am
  • Hello, I am trying to filter tuples in bag which is generated by sequence of operation in pig. My data looks like this. (0,{(0,8),(0,1),(0,6),(0,7),(0,4)}) (1,{(1,6),(1,7),(1,8),(1,4)}) ...
    Dhaval deshpande
    Sep 5, 2010 at 8:13 pm
    Sep 5, 2010 at 11:19 pm
  • Kindly give a set of project on the above, for a degree course
    Sep 28, 2010 at 8:02 pm
    Sep 29, 2010 at 8:52 pm
  • I have a Pig script--currently running in local mode--that processes a huge file containing a list of categories: /root/level1/level2/level3 /root/level1/level2/level3/level4 ... I need to insert ...
    Rob Wilkerson
    Sep 29, 2010 at 12:16 pm
    Sep 29, 2010 at 4:30 pm
  • It's been some while since I started using Cassandra in combination with Pig, but I still haven't figured out the best way to work with the data. I wrote some Index Readers based on the format that ...
    Christian Decker
    Sep 26, 2010 at 3:47 pm
    Sep 27, 2010 at 1:40 pm
  • I just dropped the plugin in (Eclipse Galileo on Suse Linux), and tried to open a pig doc and get this error: Could not open the editor: The editor class could not be instantiated. This usually ...
    Matt Tanquary
    Sep 24, 2010 at 9:46 pm
    Sep 24, 2010 at 9:55 pm
  • Hi all, once again I can't wrap my head around how to approach a problem in Pig. I'm trying to count a number of elements in a timespan if they are the first that match a criterion. So let's say I ...
    Christian Decker
    Sep 21, 2010 at 10:22 am
    Sep 21, 2010 at 3:53 pm
  • Hi, Thanks for your great work on pig. I've been trying to use the code from pig 0.7.0, and the pig 0.8.0 branch to submit jobs to a hadoop 0.21.0 cluster. Submissions don't seem to work due to API ...
    Aditya Muralidharan
    Sep 8, 2010 at 2:40 pm
    Sep 8, 2010 at 5:02 pm
  • I'm not a committer, but I'd like to suggest the following patch to handle loading hbase rows containing null cell values (since hbase is all about sparsely populated data rows): ...
    George Stathis
    Sep 2, 2010 at 1:55 am
    Sep 2, 2010 at 5:09 am
  • BTW, please ask pig-related question in the user mail list -- Best Regards Jeff Zhang
    Jeff Zhang
    Sep 28, 2010 at 2:43 am
    Sep 28, 2010 at 2:43 am
  • Pig Users, ------------------------------------------------------------------------ The 11th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid 2011) May 23-26, 2011 - ...
    Viraj Bhat
    Sep 26, 2010 at 10:25 pm
    Sep 26, 2010 at 10:25 pm
  • Dear Pig Users and Developers, ASF board just voted for Pig to become TLP. Please, see board notes below. Over the next several weeks we will be moving our infrastructure out of Hadoop. You can keep ...
    Olga Natkovich
    Sep 22, 2010 at 11:51 pm
    Sep 22, 2010 at 11:51 pm
  • Confirmed
    Sep 16, 2010 at 12:01 am
    Sep 16, 2010 at 12:01 am
  • ROOM CHANGE TO 211 (one floor up from usual) Hello Fellow Hadoopists, We are meeting at 7:15 pm on September 16th at the University Heights Community Center 5031 University Way NE Seattle WA 98105 ...
    Sean jensen-grey
    Sep 15, 2010 at 1:02 am
    Sep 15, 2010 at 1:02 am
  • Good afternoon, I am using pig on server logs to make statistics on visited pages. For now I am able to do such matches: - one user has visited a given page matching a given aim. - one user has ...
    Sep 6, 2010 at 2:06 pm
    Sep 6, 2010 at 2:06 pm
  • Pardon the cross-post: Does Pig ever re-use FileInputLoadFunc objects? We suspect state is being retained between different stores, but we don't actually know this. Figured I'd ask to verify the ...
    Russell Jurney
    Sep 1, 2010 at 2:28 am
    Sep 1, 2010 at 2:28 am
Group Overview
group: user @
categories: pig, hadoop

52 users for September 2010

  • Dmitriy Ryaboy: 27 posts
  • Pig: 15 posts
  • Hc busy: 10 posts
  • Alan Gates: 9 posts
  • Rohan Rai: 7 posts
  • Thejas M Nair: 7 posts
  • Renato Marroquín Mogrovejo: 6 posts
  • Jeff Zhang: 5 posts
  • Olga Natkovich: 5 posts
  • Christian Decker: 4 posts
  • Kim Vogt: 4 posts
  • Santhosh Srinivasan: 4 posts
  • Saurav Datta: 4 posts
  • Eric Wadsworth: 3 posts
  • Mark: 3 posts
  • Russell Jurney: 3 posts
  • Alex Wang: 2 posts
  • Benjamin Reed: 2 posts
  • Bill Graham: 2 posts
  • Gerrit van Vuuren: 2 posts