Search Discussions

34 discussions - 136 posts

  • Hi! Part of data I have resides in MySQL. Is there a loader that would allow loading directly from it? I can't find anything on the net, but it seems to me this must be a quite common problem. I ...
    Nov 3, 2010 at 7:22 am
    Nov 5, 2010 at 5:25 pm
  • We are trying to use the HBaseStorage LoadFunc in pig 0.8 and getting an exception. org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias raw at ...
    Corbin HoenesCorbin Hoenes
    Nov 19, 2010 at 11:56 pm
    Dec 17, 2010 at 3:12 am
  • Hi all I need some help with PIG. The requirement is to generate the topX records for a group. I can easily do this using PIG script where I can order by DESC and then limit at X. If there are more ...
    Sheeba GeorgeSheeba George
    Nov 26, 2010 at 3:01 am
    Dec 14, 2010 at 7:08 pm
  • I am trying to do what seems like should be a simple task using pig and a UDF I have written but can't seem to figure out the syntax to get it working. At a high level I have a UDF that takes a ...
    Zach BaileyZach Bailey
    Nov 30, 2010 at 7:49 pm
    Dec 2, 2010 at 9:16 am
  • (not sure if this double posted or not... I accidentally sent it to the Hadoop mailing list and not the pig mailing list) I appreciate any help you can give. I've searched around and haven't found ...
    Jonathan CoveneyJonathan Coveney
    Nov 30, 2010 at 4:17 pm
    Nov 30, 2010 at 6:03 pm
  • Hello: I hope this is not double posting. I want to do something simple: I have a data file, mydata.log, formatted like this: a1 | b1 | c=foo&d=bar | e1 a2 | b2 | c=john&d=doe | e2 a3 | b3 | ...
    Yves RoyYves Roy
    Nov 30, 2010 at 5:17 pm
    Nov 30, 2010 at 8:30 pm
  • Hi, I am using Pig 0.7.0. Is there a good way to have Pig assign an informative name to each of the MR Job generated in Pig Physical plan ? Maybe name of the relation itself. Also I am not able to ...
    Rakesh kothariRakesh kothari
    Nov 16, 2010 at 10:53 pm
    Nov 24, 2010 at 3:58 am
  • Hi My name is Cornelio Iñigo and I´m a developer just beginning with this of hadoop and pig. I have a doubt about developing an application on pig, I already have my program on hadoop, this program ...
    Cornelio IñigoCornelio Iñigo
    Nov 15, 2010 at 8:57 pm
    Nov 17, 2010 at 4:01 pm
  • Doing a NOT matches of a pattern? Can you just do a FILTER A BY NOT (matches 'pattern'); Or NOT matches 'pattern'; I have a list of bad words I DO NOT want to match and I am unsure how to do this ...
    Brian AdamsBrian Adams
    Nov 11, 2010 at 9:43 pm
    Nov 11, 2010 at 10:17 pm
  • Hi all, Our dataset consists of multiple files. The name of each file reflects the creation date of the file. (e.g. 20101031.dat, 20101101.dat, etc) We need this date information for all relations ...
    Sangchul SongSangchul Song
    Nov 23, 2010 at 6:20 pm
    Nov 24, 2010 at 2:22 am
  • Is there anything that allows someone to do ad hoc Hadoop / pig job and have the results emailed to a users email account via web interface? The app would ideally allow for passing parameters to pig ...
    Brian AdamsBrian Adams
    Nov 17, 2010 at 6:00 pm
    Nov 18, 2010 at 11:41 pm
  • Hi all, I'm happily using Pig to ORDER BY and LIMIT some large relations quite effectively. However I'm curious about how these are/would be implemented in "raw" MapReduce. Can anyone shed some ...
    Josh DevinsJosh Devins
    Nov 14, 2010 at 2:29 pm
    Nov 18, 2010 at 1:06 pm
  • Hi, While trying to run Pig in local mode using "pig -x local" I get the following error: 10/11/04 07:29:18 INFO pig.Main: Logging error messages to: ...
    Michael SundellMichael Sundell
    Nov 4, 2010 at 3:11 pm
    Nov 17, 2010 at 7:44 pm
  • Hi, I have a file that has the char (254) as a separator. I can force the character into the file, but wanted to p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px Monaco} LOAD 'file.log.gz' USING ...
    Marilson CamposMarilson Campos
    Nov 10, 2010 at 11:34 pm
    Dec 6, 2010 at 7:16 pm
  • Hi all, I was reading this: http://pig.apache.org/docs/r0.7.0/udf.html#Passing+Configurations+to+UDFs It sounded like I can pass some configuration or context to the UDF but I can't figure out how I ...
    Dexin WangDexin Wang
    Nov 24, 2010 at 1:25 am
    Nov 29, 2010 at 10:50 pm
  • Hi all, I have a a simple schema that I want to store as JSON. So I've written a simple JsonStorage class but it requires that the tuple's first field is a map. The problem is in converting a regular ...
    Josh DevinsJosh Devins
    Nov 25, 2010 at 7:53 pm
    Nov 27, 2010 at 10:37 pm
  • Hi all, I have 2 data files. One which contains a number of records, and one which contains a number of prefixes. A = load 'data' AS (id, name) B = load 'prefixes' AS (prefix) I'd like to pull ...
    Joe CiaramitaroJoe Ciaramitaro
    Nov 2, 2010 at 5:19 pm
    Nov 2, 2010 at 8:31 pm
  • I realize this may be a lowly question, but I've searched around and couldn't find anything definitive. I am also quite new to Pig and am trying to get my head around the pig-esque way of doing ...
    Jonathan CoveneyJonathan Coveney
    Nov 29, 2010 at 10:59 pm
    Nov 30, 2010 at 2:33 pm
  • Hi, I'm trying to write a UDF to take a bag of http get arguments (e.g. {(s=556477989), (ts=1265964662)} ) and turn them into a map (e.g. [s#556477989, ts#1265964662] ), and have written a class ...
    Kris CowardKris Coward
    Nov 26, 2010 at 6:27 pm
    Nov 26, 2010 at 10:34 pm
  • Is there any updated on when pig 0.8 will be release? There are some interesting features I would like to use. I know some people have used the branch code. Have you found it stable? There seems only ...
    Robert GoodmanRobert Goodman
    Nov 5, 2010 at 3:52 pm
    Nov 5, 2010 at 5:01 pm
  • Hi all! I have a problem that I can't find solution to... Hope someone can shed some light. :) ----- grunt dump X; (1,a-b-c) (2,d-a) (3,c) ----- (where $1 is a chararray) I would like to generate ...
    Nov 4, 2010 at 9:43 am
    Nov 4, 2010 at 10:08 am
  • Hi I'm starting with this of hadoop and Pig, I have to pass a hadoop MapReduce program that i made to Pig, in the hadoop program I have just a Map function and on it I perform all the process that ...
    Cornelio IñigoCornelio Iñigo
    Nov 30, 2010 at 11:47 pm
    Dec 1, 2010 at 12:16 am
  • I have the following bytearray: -------------------------------- -------------------------------- -------------------------------- I would like to cast it to something like bag{tuple(chararray), ...
    Matt TanquaryMatt Tanquary
    Nov 30, 2010 at 10:23 pm
    Nov 30, 2010 at 10:32 pm
  • I have this problem which I solved easily with M/R but I'm trying to solve through PIG instead: Given the following bags, perform a lookup in a special table to retrieve 4 additional variations of ...
    Matt TanquaryMatt Tanquary
    Nov 29, 2010 at 10:51 pm
    Nov 30, 2010 at 12:07 am
  • We have some aggregate statistics we are gathering in an NLP application , and we have implemented it in Pig (using 0.7.0 on Hadoop 0.20.2 with Java 1.6.0_22). But some of the mapreduce jobs use a ...
    Greg LangmeadGreg Langmead
    Nov 29, 2010 at 10:12 pm
    Nov 29, 2010 at 10:59 pm
  • Hi, I have a table of events of type A for users in the form of (userid: chararray, timestamp. long), and a list of events of type B in the same form. I need to get only events B that happened within ...
    Marko MusnjakMarko Musnjak
    Nov 26, 2010 at 4:21 pm
    Nov 29, 2010 at 6:44 pm
  • What is the standard way to copy up jar dependencies to the cluster with Pig (so that the nodes in the cluster don't get runtime errors with class not found exceptions)?
    Jeremy HannaJeremy Hanna
    Nov 11, 2010 at 12:18 am
    Nov 12, 2010 at 7:05 pm
  • I just ran a Pig job and for the first time noticed the output at the end of the job (and of course a matching counter): Encountered Warning UDF_WARNING_1 108939522 time(s) What exactly does this ...
    Josh DevinsJosh Devins
    Nov 10, 2010 at 3:18 pm
    Nov 10, 2010 at 8:56 pm
  • Hi, I'm still getting the error associated with https://issues.apache.org/jira/browse/CASSANDRA-1700 I have 7 suse nodes running Cassandra0.7 branch (latest as of the morning of Nov 9). I've loaded ...
    Aditya MuralidharanAditya Muralidharan
    Nov 10, 2010 at 4:47 pm
    Nov 10, 2010 at 7:10 pm
  • Hi, I'm building (on windows) a release tar from the HEAD of the Cassandra 0.7 branch. Running a new single node instance of Cassandra gives me the following bootstrap exception: INFO 10:54:14,030 ...
    Aditya MuralidharanAditya Muralidharan
    Nov 10, 2010 at 5:05 pm
    Nov 10, 2010 at 5:41 pm
  • Is it possible to feed a path of the format "hdfs:///path/to/my.jar" to the REGISTER command in Pig? I was recently watching some of the tutorials on Amazon's Elastic MapReduce and their version of ...
    Zach BaileyZach Bailey
    Nov 1, 2010 at 7:54 pm
    Nov 1, 2010 at 8:44 pm
  • Hello: I want to do something simple: 1) I have a data file, mydata.log, formatted like this: a1 | b1 | c=foo&d=bar | e1 a2 | b2 | c=john&d=doe | e2 a3 | b3 | c=foo&d=doe | e3 ... 2) and I want to ...
    Yves RoyYves Roy
    Nov 30, 2010 at 2:45 pm
    Nov 30, 2010 at 2:45 pm
  • Hello Hadoop fans, The Bay Area Hadoop User Group meetups are a long hike for those of us who come from San Francisco. I'd like to gauge interest in SF-centric Hadoop gatherings. In contrast to the ...
    Aaron KimballAaron Kimball
    Nov 4, 2010 at 7:39 pm
    Nov 4, 2010 at 7:39 pm
  • Alan GatesAlan Gates
    Nov 3, 2010 at 6:54 pm
    Nov 3, 2010 at 6:54 pm
Group Navigation
period‹ prev | Nov 2010 | next ›
Group Overview
groupuser @
categoriespig, hadoop

49 users for November 2010

Anze: 11 posts Dmitriy Ryaboy: 11 posts Daniel Dai: 10 posts Thejas M Nair: 8 posts Alan Gates: 6 posts Brian Adams: 5 posts Corbin Hoenes: 5 posts Jonathan Coveney: 5 posts Arvind: 4 posts Rakesh kothari: 4 posts Zach Bailey: 4 posts Aaron Kimball: 3 posts Aditya Muralidharan: 3 posts Josh Devins: 3 posts Yves Roy: 3 posts Ankur C. Goel: 2 posts Cornelio Iñigo: 2 posts Daniel Dai: 2 posts Dexin Wang: 2 posts Gerrit van Vuuren: 2 posts
show more