Grokbase Groups Pig user May 2009

39 discussions - 150 posts

  • Olga, I have looked a bit deeper and it sounds like it is not possible to extract the individual file name from the custom loader UDF. The bindTo just gives me the directory name (not the individual file). In ...
    Ricky Ho
    May 20, 2009 at 5:34 am
    Jun 9, 2009 at 7:02 pm
  • Hi users, When I run pig in hadoop mode, before the grunt shell appears there is something on the command prompt not quite right: "Initializing JVM Metrics with processName=JobTracker, sessionId= ...
    George Pang
    May 25, 2009 at 6:43 pm
    May 29, 2009 at 2:11 am
  • Dear users, I compiled and ran the pig-embedded Java code from the "pig quick start" example on Eclipse. I got the following error: INFO executionengine.HExecutionEngine: Connecting to hadoop file ...
    George Pang
    May 31, 2009 at 2:44 am
    Jun 11, 2009 at 12:56 am
  • Hey guys, What's the roadmap like for getting Pig to run on Hadoop .20? I did a checkout from trunk, and it looks like it's still on .18 :) Cheers, Bradford
    Bradford Stephens
    May 6, 2009 at 9:38 pm
    May 13, 2009 at 12:16 pm
  • There are a bunch of files in a directory. My goal is to process these files to compute their TF/IDF. I am looking for something like the following ... A = LOAD 'input/dir' as (filename, text); DUMP ...
    Ricky Ho
    May 19, 2009 at 4:53 pm
    May 21, 2009 at 5:45 pm
  • Hi, All I did was check out the Pig project from the trunk (http://svn.apache.org/repos/asf/hadoop/pig/trunk), build it, build the tutorial, untar the pigtutorial file, and then run the ...
    N . Ramakrishna
    May 4, 2009 at 11:00 am
    May 6, 2009 at 2:31 am
  • Dear users, Does anyone know how to add the Pig library to the Eclipse IDE? How do you program Java with Pig? Thank you, George
    George Pang
    May 22, 2009 at 2:55 am
    May 22, 2009 at 4:51 am
  • Dear users, The official PIG tutorial doesn't talk much about the settings, I just put the pig download somewhere, then cd to the folder of pig.jar, typing pig. Nothing happened. No Grunt shell. The ...
    George Pang
    May 20, 2009 at 9:32 pm
    May 21, 2009 at 7:07 am
  • Hi, I am experiencing a loss of tuples when running queries on an 8-node cluster using Pig 0.2.0 and Hadoop 0.18.3. For example, something as simple as the below script causes 41,429,443 to be lost: ...
    Dylan Nunley
    May 10, 2009 at 10:37 pm
    May 15, 2009 at 4:20 am
  • Hi all, I'd like to know more about the relationship between MapReduce and physical operators. I noticed there's a class called MRCompiler which will compile the Physical Plan to MROperPlan. So if there is ...
    Zjffdu
    May 5, 2009 at 4:15 pm
    May 6, 2009 at 9:46 pm
  • Hello, I am a new user and I am following the steps in the tutorial "http://hadoop.apache.org/pig/docs/r0.2.0/tutorial.html". I am trying to run Pig in local mode. I managed to do the following ...
    Christine Jardak
    May 28, 2009 at 3:31 pm
    May 29, 2009 at 8:31 am
  • Dear Users, In the "PigPen" Eclipse plugin tutorial ( http://wiki.apache.org/pig/PigPen), it talks about the path configuration: "To start using PigPen you will have to set the path to the directory ...
    George Pang
    May 15, 2009 at 5:25 am
    May 15, 2009 at 12:44 pm
  • Hi all, I have a problem in local mode: with a small data set it works fine, but when I run larger data, Pig hangs, and it's difficult for me to analyze why it hangs and where it ...
    Zhang jianfeng
    May 11, 2009 at 3:00 am
    May 11, 2009 at 10:56 am
  • Hi, Yesterday we committed support for multiquery into Pig trunk. This functionality allows sharing scans and other computations across multiple stores within the same Pig script. The details are ...
    Olga Natkovich
    May 5, 2009 at 4:15 pm
    May 6, 2009 at 6:16 am
  • Hi users, Any advice on how to do a subquery in Pig Latin? Some examples? Thanks, George
    George Pang
    May 29, 2009 at 2:14 am
    May 30, 2009 at 6:26 am
  • Dear users, I run the query in the "Pig Script" window. My query is: log = LOAD 'excite-small.log' AS (user, timestamp, query); grpd = GROUP log BY user; cntd = FOREACH grpd GENERATE group, ...
    George Pang
    May 28, 2009 at 11:29 pm
    May 29, 2009 at 5:46 pm
  • Yuanyuan Tian
    May 28, 2009 at 6:17 pm
    May 28, 2009 at 10:43 pm
  • I am a newcomer and have just started to look into Pig seriously. I am pretty impressed with its language model … Given that Pig is just slightly slower than native Hadoop (I remember Alan mentioning 20% ...
    Ricky Ho
    May 26, 2009 at 4:05 pm
    May 26, 2009 at 6:51 pm
  • Hi All, I'm trying to write a StoreFunc to store data using Voldemort Serialization (http://project-voldemort.com/). This serialization involves storing a json map (describing the schema) in the ...
    Chris Riccomini
    May 26, 2009 at 5:56 pm
    May 26, 2009 at 6:02 pm
  • What I am looking for is actually something like JDBC for Pig. Any ideas? George
    George Pang
    May 22, 2009 at 4:49 am
    May 22, 2009 at 12:28 pm
  • Dear users, Today when I run pig shell, one error message appears: " ERROR 2999: Unexpected internal error. Failed to create DataStorage ........... Caused by: java.net.SocketTimeoutException: timed ...
    George Pang
    May 21, 2009 at 6:49 pm
    May 21, 2009 at 11:48 pm
  • Today, I grabbed the Pig 0.2 release and started working on porting my application to it. In a nutshell, I need to run Pig programmatically (I'm using PigServer as the entry point) and development ...
    Gregory Harman
    May 13, 2009 at 8:04 pm
    May 13, 2009 at 10:27 pm
  • Hi all, I want to generate reports based on Pig scripts, and to ensure the quality of the reports I need some way to validate the Pig scripts. Is there any good way to validate them? And I am ...
    Zhang jianfeng
    May 11, 2009 at 9:37 am
    May 11, 2009 at 7:22 pm
  • Hi, Is there a way we can use PIG to interact with RDBMS? Do we have any API to handle such a scenario? Is there a way we can use hadoop's API ( Hadoop 0.19 DBInputFormat/DBOutputFormat) to interact ...
    Nellai
    May 4, 2009 at 10:14 am
    May 4, 2009 at 10:14 pm
  • Hi users, I install pig on my own system, with a pig.jar. The test run is ok, but, WHERE IS THE PIG.PROPERTIES for that? Thanks a lot! George ps., the tutorial I used for installing PIG is : ...
    George Pang
    May 20, 2009 at 12:43 am
    Jun 9, 2009 at 9:42 pm
  • The next Bay Area Hadoop User Group meeting is scheduled for Wednesday, May 20th at Yahoo! 700 First Avenue, Sunnyvale, Building E, Class room 10 from 6:00-7:30 pm. Please note that the location has ...
    Ajay Anand
    May 13, 2009 at 8:14 pm
    May 20, 2009 at 4:35 pm
  • Thank you, Shubham. Well, another question is, I got the error message : "An error has occurred when activating this view, java.lang.NullPointerException" I can see the Operator Graph is working, but ...
    George Pang
    May 15, 2009 at 4:26 pm
    May 15, 2009 at 4:40 pm
  • Alan Gates
    May 14, 2009 at 4:13 pm
    May 14, 2009 at 5:34 pm
  • I would like to find a way to escape the delimiter character in my data so that it doesn't get interpreted as extra columns. For example, if I'm using comma as a delimiter, and I have a column with ...
    Gregory Harman
    May 12, 2009 at 2:24 pm
    May 12, 2009 at 5:09 pm
  • Hi all, I read the wiki page http://wiki.apache.org/pig/PigExecutionModel. The proposal there is a push model, but after reading the code, it seems that Pig is using a pull model. Each Physical Operator ...
    Zjffdu
    May 5, 2009 at 3:20 pm
    May 5, 2009 at 3:48 pm
  • Dear users, One quick question: What and where should I add into Eclipse to make my Pig-embedded Java programs run "import org.apache.pig.PigServer;" ? Thank you George
    George Pang
    May 28, 2009 at 7:07 pm
    May 28, 2009 at 7:07 pm
  • Hadoop Fans, Lately, we've been spending a lot of time on the East Coast, and one thing is clear: Hadoop is everywhere. Hadoop usage on the East Coast tends to be slightly different. There are still ...
    Christophe Bisciglia
    May 27, 2009 at 11:08 pm
    May 27, 2009 at 11:08 pm
  • Hadoop Fans, just a quick note that we are hosting two days of Hadoop training in Washington DC area (Alexandria, VA) on June 22 and 23. We cover Hadoop, Hive, Pig and more with a focus on hands-on ...
    Christophe Bisciglia
    May 26, 2009 at 11:16 pm
    May 26, 2009 at 11:16 pm
  • Dear users, 1) I got "Unable to store alias null" after running the grunt script. what is it? 2) When I ran grunt ls, it displays the files on my local file, not a hadoop file system. Why? Thanks! ...
    George Pang
    May 22, 2009 at 7:24 pm
    May 22, 2009 at 7:24 pm
  • Dear users, When I try to use pig editor on Eclipse (with PigPen plugin), one error message appears on the console: "*org.apache.hadoop.dfs.DistributedFileSystem cannot be cast to ...
    George Pang
    May 21, 2009 at 6:13 am
    May 21, 2009 at 6:13 am
  • Hello, I'm wondering if it's possible to pass parameters to scripts executed via PigServer? Here is a two-line snippet that better explains what I'm trying to do: PigServer pigServer = new ...
    Stephane
    May 15, 2009 at 9:22 pm
    May 15, 2009 at 9:22 pm
  • Dear Users, I got an error message in Eclipse (with the PigPen plugin) when I tried to save the Pig script: "Save Failed Lexical error at line 1, column 18. Encountered: <EOF> after : "" " The script I ...
    George Pang
    May 15, 2009 at 7:45 am
    May 15, 2009 at 7:45 am
  • Just wanted to follow up on this and let everyone know that Cloudera and Y! are teaming up to offer two day-long training sessions for free on the day after the summit (June 11th). We'll cover Hadoop ...
    Christophe Bisciglia
    May 6, 2009 at 1:26 am
    May 6, 2009 at 1:26 am
  • This year's Hadoop Summit (http://developer.yahoo.com/events/hadoopsummit09/) is confirmed for June 10th at the Santa Clara Marriott, and is now open for registration. We have a packed agenda, with ...
    Ajay Anand
    May 5, 2009 at 9:15 pm
    May 5, 2009 at 9:15 pm
Group Overview
group: user @
categories: pig, hadoop
discussions: 39
posts: 150
users: 35
website: pig.apache.org

35 users for May 2009

George Pang: 33 posts; Alan Gates: 16 posts; Olga Natkovich: 13 posts; Zjffdu: 12 posts; Ricky Ho: 9 posts; Mridul Muralidharan: 5 posts; Nitesh bhatia: 5 posts; Ted Dunning: 5 posts; Gregory Harman: 4 posts; Naber, Chad: 4 posts; Tamir Kamara: 4 posts; Ajay Anand: 3 posts; Bradford Stephens: 3 posts; Christophe Bisciglia: 3 posts; Dylan Nunley: 3 posts; Chris Riccomini: 2 posts; Christine Jardak: 2 posts; Iman Elghandour: 2 posts; Lance Riedel: 2 posts; N . Ramakrishna: 2 posts