Grokbase Groups Pig user June 2009

Search Discussions

41 discussions - 151 posts

  • pig 0.20.0 hadoop 0.18.0 I am running into a an exception when I try to my pig script in mapreduce mode which incidentally works fine in the local mode register my.jar; define scoreEval ScoreEval(); ...
    Parmod MehtaParmod Mehta
    Jun 26, 2009 at 11:39 pm
    Jul 6, 2009 at 9:21 pm
  • Hey all, just a friendly reminder that this is Wednesday! I hope to see everyone there again. Please let me know if there's something interesting you'd like to talk about -- I'll help however I can. ...
    Bradford StephensBradford Stephens
    Jun 23, 2009 at 12:46 am
    Jun 25, 2009 at 9:20 pm
  • Hello Everyone, I had My hadoop cluster running very well with version 0.19.1.. I had pig setup but at that time I was using hadoop 0.18.0.But after my hadoop upgradation I am not able to start up my ...
    Pankil DoshiPankil Doshi
    Jun 26, 2009 at 4:39 pm
    Jun 26, 2009 at 9:22 pm
  • Hi all, First question: I use the one single PigServer to run serveral pig scripts in multiple threads, but some exceptions will be throw, so Pig do not support multi thread, Is it right ? I just ...
    Zhang jianfengZhang jianfeng
    Jun 19, 2009 at 9:07 am
    Jun 22, 2009 at 4:58 pm
  • Hello Mailinglist, I have a project going on and need some help with the following problem I have this schema in a bag: {(word1), (word2)} // I'm only simplifying here, the actual structure is more ...
    Johan UhleJohan Uhle
    Jun 18, 2009 at 2:46 pm
    Jun 22, 2009 at 4:03 pm
  • Hi All, I¹m trying to use flatten for some pig scripts, but FLATTEN is insisting on using the disambiguation clause even when it doesn¹t need to: Is there any way to force FLATTEN to NOT use the ...
    Chris RiccominiChris Riccomini
    Jun 29, 2009 at 3:37 pm
    Jun 29, 2009 at 6:25 pm
  • Hello , When I am trying pig -x mapreduce I get the following error ERROR org.apache.pig.Main - ERROR 2999: Unexpected internal error. Failed to create DataStorage The log file has the following ...
    Jun 18, 2009 at 12:34 pm
    Jun 24, 2009 at 5:10 pm
  • Hi Pig mavens, I'm curious what my options are for more flexible options regarding string-based data comparison. I need to check whether a substring of one field is equivalent to another field. I can ...
    Aaron KimballAaron Kimball
    Jun 3, 2009 at 2:41 am
    Nov 22, 2009 at 5:55 am
  • Pig Team is happy to announce Pig 0.3.0 release! Pig is a Hadoop subproject that provides high-level data-flow language and an execution framework for parallel computation on a Hadoop cluster. More ...
    Olga NatkovichOlga Natkovich
    Jun 26, 2009 at 12:18 am
    Jun 26, 2009 at 8:11 pm
  • Hi Everyone, I'm working on a project where processed data is outputted as import-ready SQL to re-import it to a database. I've written a very basic UDF for that implementing the StoreFunc Interface. ...
    Johan UhleJohan Uhle
    Jun 30, 2009 at 4:01 pm
    Jun 30, 2009 at 8:19 pm
  • Hi all, I am facing problem with Order. It is throwing below error when I tried to execute one of my script which has "order" command in it at the end (Before STORE). The error is: ERROR ...
    Pallavi PalletiPallavi Palleti
    Jun 25, 2009 at 12:21 pm
    Jun 26, 2009 at 5:04 am
  • Hi all, Yuntao Jia, our intern this summer, did a simple performance benchmark for Hadoop, Hive and Pig based on the queries in the SIGMOD 2009 paper: A Comparison of Approaches to Large-Scale Data ...
    Zheng ShaoZheng Shao
    Jun 19, 2009 at 3:17 pm
    Jun 19, 2009 at 6:08 pm
  • Dear pig users, When I run my script on PigPen, it can connect to my hadoop, and it can run on hadoop and generate result successfully, However, the PigPen example generator doesn't work, which is ...
    George PangGeorge Pang
    Jun 6, 2009 at 9:43 pm
    Jun 16, 2009 at 9:37 pm
  • Hi pig users, I tried to copyToLocal my stored result from pig queries to my local workspace. My lines of code in Java are: ........"B","output"); ...
    George PangGeorge Pang
    Jun 11, 2009 at 6:46 pm
    Jun 16, 2009 at 5:27 pm
  • Hi - I'm doing some log crunching with Pig and am trying to achieve a certain goal for geolocation. I have two sets of data, the log and the ip to geo database: log: {shorturl: bytearray,date: ...
    Daniel HengeveldDaniel Hengeveld
    Jun 10, 2009 at 9:26 pm
    Jun 11, 2009 at 11:12 pm
  • I believe that my question/problem primarily extends from my inability to access fields within a bag_of_tokenTuples. Here's an example: I don't know quite why there's only one column. I guess that ...
    Marco NicosiaMarco Nicosia
    Jun 11, 2009 at 9:09 am
    Jun 11, 2009 at 6:39 pm
  • Hi all, How can I get the exception message in mapreduce mode, my scripts failed, maybe in the Load Func or some UDF. But it works in local, failed in mapreduce mode. So is there any way I get the ...
    Zhang jianfengZhang jianfeng
    Jun 6, 2009 at 11:19 am
    Jun 9, 2009 at 5:33 pm
  • Hi all, I am running a pig script using a cron job. During this, once in a while, my pig script is failing with the error message as "No Space Left On Device". I can see enough space in the machine ...
    Pallavi PalletiPallavi Palleti
    Jun 9, 2009 at 3:41 am
    Jun 9, 2009 at 5:43 am
  • Hi, Yet to find decent documentation on "flatten". Seems that the behavior is different from depending on the argument. For example if it is DataBag you get a different output, compared to when its ...
    Prasenjit mukherjeePrasenjit mukherjee
    Jun 18, 2009 at 4:12 pm
    Aug 9, 2009 at 3:47 pm
  • Hello Pig fans, I've implemented a collaborative filtering job in Pig using CROSS and FOREACH with a UDF. It works great until my dataset grows to a certain size, at which point I start to get Pig ...
    Bill GrahamBill Graham
    Jun 25, 2009 at 4:28 pm
    Jun 26, 2009 at 1:10 am
  • Bay Area Hadoop Fans, We're excited to hold our first Hadoop User Group at Cloudera's office in Burlingame (just south of SFO). We pushed the start time back 30 minutes to allow a little extra time ...
    Christophe BiscigliaChristophe Bisciglia
    Jun 25, 2009 at 12:04 am
    Jun 25, 2009 at 5:45 am
  • Hi all, Because of my input file format, the first line of file is the definition of each field, and then lines of records. So I did not found one good method of using customer slicer. So I'd like to ...
    Zhang jianfengZhang jianfeng
    Jun 22, 2009 at 5:48 am
    Jun 23, 2009 at 1:00 am
  • Hi all, I'm new to this list. Pardon me if this is a FAQ. Pardon me also that the only version I have available, for some reason, is pig 0.3.0. Anyways, has something changed regarding loading ...
    Marco NicosiaMarco Nicosia
    Jun 11, 2009 at 1:48 am
    Jun 11, 2009 at 5:16 am
  • Hi all , Can anyone tellme whether we can use PIG for retrieving and processing data for Hbase . If yes please tell me the link. Thanks in advance,
    Bharath vissapragadaBharath vissapragada
    Jun 10, 2009 at 5:25 am
    Jun 10, 2009 at 1:19 pm
  • Hi pig users, I try to make pig and hadoop part of my web application. My plan is to place pig on the back end, taking input from my JSP, and read the output file on the HDFS in my servlet. By doing ...
    George PangGeorge Pang
    Jun 9, 2009 at 7:51 pm
    Jun 10, 2009 at 12:56 am
  • Hi users, With pig latin, If I want to do cross product, I think I can do 1) a CROSS b 2) a JOIN b , but may I also do this in a "COGROUP"'s style? Thank you! George
    George PangGeorge Pang
    Jun 5, 2009 at 4:43 am
    Jun 6, 2009 at 8:59 pm
  • Hi all, I have a requirement where I need to pass an argument to pig script which has space in it. I found that pig script is failing to parse such kind of parameters as it is breaking at the space. ...
    Palleti, PallaviPalleti, Pallavi
    Jun 1, 2009 at 10:00 am
    Jun 2, 2009 at 4:23 am
  • I noticed that the RegExLoader class and family disappeared from release 0.2. Is that intentional or an accident due to merging the types branch? I am referring to pig-472, pig-473, pig-474, pig-476, ...
    Dmitriy RyaboyDmitriy Ryaboy
    Jun 1, 2009 at 5:16 pm
    Jun 2, 2009 at 12:15 am
  • In trying to use the Top() function from the piggybank (PIG-732), I consistently get the error "could not infer matching function". This happens even when I write a simple script that just goes ...
    Dmitriy RyaboyDmitriy Ryaboy
    Jun 16, 2009 at 2:20 pm
    Jun 18, 2009 at 8:31 pm
  • Hi all, I run a pig script in local mode successfully, but it fails in mapreduce mode. This is the error message I found in the jobtracker: Type mismatch in key from map: ...
    Zhang jianfengZhang jianfeng
    Jun 9, 2009 at 7:14 am
    Jun 10, 2009 at 7:06 am
  • Hi all, Can we use PIG syntax in java files . I mean , is a JAVA interface available for PIG that we can perform the processing part more efficiently .. This is somewhat similar to a JDBC driver ...
    Bharath vissapragadaBharath vissapragada
    Jun 10, 2009 at 4:47 am
    Jun 10, 2009 at 5:13 am
  • We are using pig trunk. Is it possible for pig to run without hod? $ bin/pig 2009-06-04 21:35:34,515 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to HOD... ...
    Zheng ShaoZheng Shao
    Jun 5, 2009 at 4:04 pm
    Jun 5, 2009 at 4:14 pm
  • Hi all, Is there a way to print parameters(similar to echo in bash script) that are passed to pig script for debugging purposes? Especially the ones that we declare in the beginning of the script ...
    Pallavi PalletiPallavi Palleti
    Jun 2, 2009 at 4:45 am
    Jun 2, 2009 at 6:38 pm
  • All, I've posted the slides from the Pig talk I gave at the 2009 Hadoop Summit, One clarification I want to give with the slides. We have had questions on ...
    Alan GatesAlan Gates
    Jun 19, 2009 at 3:31 pm
    Jun 19, 2009 at 3:31 pm
  • I've added a page to Pig's wiki on tools available for pig users. If you know of other tools that people have built or you've built and are willing to share, ...
    Alan GatesAlan Gates
    Jun 16, 2009 at 7:46 pm
    Jun 16, 2009 at 7:46 pm
  • Greetings, On the heels of our smashing success last month, we're going to be convening the Pacific Northwest (Oregon and Washington) Hadoop/HBase/Lucene/etc. meetup on the last Wednesday of June, ...
    Bradford StephensBradford Stephens
    Jun 16, 2009 at 4:52 am
    Jun 16, 2009 at 4:52 am
  • Hey folks, Pig newbie here. I was wondering whats the purpose of -jar option in org.apache.pig.Main class (0.2.0) I was expecting this should add the jar up in PigContext Path so that I don't have to ...
    Bhupesh BansalBhupesh Bansal
    Jun 15, 2009 at 11:04 pm
    Jun 15, 2009 at 11:04 pm
  • Hi Pig Users, Is there JSP tag library supporting Pig or Hadoop? Thanks George
    George PangGeorge Pang
    Jun 11, 2009 at 9:12 am
    Jun 11, 2009 at 9:12 am
  • Hi Pig Users, I restate the question here: I run my Pig embedded program and get a strange outcome. This is the message from my console: 09/06/10 02:34:13 INFO executionengine.HExecutionEngine: ...
    George PangGeorge Pang
    Jun 11, 2009 at 3:15 am
    Jun 11, 2009 at 3:15 am
  • I would like to announce maintenance release 1.1 of hamake ( It mostly includes bug fixes, optimizations and code cleanup. There was minor syntax changes in ...
    Vadim ZalivaVadim Zaliva
    Jun 10, 2009 at 9:38 pm
    Jun 10, 2009 at 9:38 pm
  • Hadoop Fans, I'm happy to announce a new tool from the Cloudera team. We often found our customers wanting to import data from RDBMSs so they could conduct deeper analysis. To facilitate this, we ...
    Christophe BiscigliaChristophe Bisciglia
    Jun 1, 2009 at 5:10 pm
    Jun 1, 2009 at 5:10 pm
Group Navigation
period‹ prev | Jun 2009 | next ›
Group Overview
groupuser @
categoriespig, hadoop

39 users for June 2009

George Pang: 22 posts Alan Gates: 20 posts Dmitriy V Ryaboy: 11 posts Zjffdu: 11 posts Bradford Stephens: 7 posts Palleti, Pallavi: 7 posts Pradeep Kamath: 6 posts Santhosh Srinivasan: 6 posts Johan Uhle: 5 posts baburaj.S: 4 posts Marco Nicosia: 4 posts Olga Natkovich: 4 posts Pankil Doshi: 4 posts Chris Riccomini: 3 posts Parmod Mehta: 3 posts Zhangjiayin: 3 posts Zheng Shao: 3 posts Amr Awadallah: 2 posts Benjamin Reed: 2 posts Bill Graham: 2 posts
show more