Grokbase Groups Hive user June 2010

Search Discussions

61 discussions - 262 posts

  • Hi, I am trying to create an external table against already existing data in sequencefile format. However I have written a custom Serde to interpret the data. I am able to create the table fine but ...
    Pradeep KamathPradeep Kamath
    Jun 17, 2010 at 1:50 pm
    Jul 1, 2010 at 4:18 pm
  • Hi, I'm trying to map a Hbase table in Hive that contains large number of columns. Since Hbase is designed to be a wide table, does Hive/Hbase integration have any set limitation on the number of ...
    Ray DuongRay Duong
    Jun 15, 2010 at 6:57 pm
    Jun 17, 2010 at 7:03 pm
  • Hi all, I have a specific version of Hadoop 20, which I need to compile with Hive. I tried the following steps $ant ...
    Viraj BhatViraj Bhat
    Jun 15, 2010 at 5:23 am
    Jun 22, 2010 at 12:52 am
  • To generate smart output from base data we need to copy some base tables from relational database into Hadoop. Some of them are big. To dump the entire table into Hadoop everyday is not an option ...
    Jun 8, 2010 at 6:54 pm
    Mar 1, 2012 at 6:09 am
  • Hi all, I am working on an M/R program to convert Zebra data to Hive RC format. The TableInputFormat (Zebra) returns keys and values in the form of BytesWritable and (Pig) Tuple. In order to convert ...
    Viraj BhatViraj Bhat
    Jun 8, 2010 at 5:52 pm
    Jun 11, 2010 at 9:48 pm
  • hi, I'm using hive svn rev946854. And try to set at hive cli, but seemes it doesn't work, total map tasks still over 300+. Is this a svn version problem?
    Jun 9, 2010 at 3:21 am
    Jun 11, 2010 at 2:26 am
  • Shuja RehmanShuja Rehman
    Jun 5, 2010 at 4:16 pm
    Feb 28, 2011 at 2:24 pm
  • Hi, I'm running the latest version of trunk r953172. I'm doing doing a dynamic partition insert overwrite query which generates a lot of small files in each of the partition. I was hoping this could ...
    Sammy YuSammy Yu
    Jun 13, 2010 at 6:23 am
    Jul 7, 2010 at 1:52 am
  • Hi I want to create a table in hive which should have row formated line terminated other than '\n'. so i can read xml file as single cell in one row and column of table. kindly let me know how to do ...
    Shuja RehmanShuja Rehman
    Jun 11, 2010 at 12:40 am
    Jun 13, 2010 at 3:25 am
  • Hi I have created a table in hive (Suppose table1 with two columns, col1 and col2 ) now i have an xml file for which i have write a python script which read the xml file and transform it in single ...
    Shuja RehmanShuja Rehman
    Jun 9, 2010 at 10:07 pm
    Jun 11, 2010 at 3:44 pm
  • Hi, when I use Multi Table/File Inserts commands, some may be not more effective than run table insert commands separately. For example, from pokes insert overwrite table pokes_count select ...
    Zhou ShuaifengZhou Shuaifeng
    Jun 22, 2010 at 6:56 am
    Jun 23, 2010 at 3:11 am
  • Hi all, I have a table known as "oldtable" which is partitioned by datestamp. The schema of the "oldtable" is: name string age bigint property string region string datestamp string I now need to ...
    Viraj BhatViraj Bhat
    Jun 16, 2010 at 11:10 pm
    Jun 17, 2010 at 1:25 pm
  • Hi, I am having an issue with a large number of 4000 partitions (each being very small <10k files). Any queries that I do which involve these partitions take an extremely long time to complete (10+ ...
    Sammy YuSammy Yu
    Jun 11, 2010 at 8:12 am
    Jun 13, 2010 at 1:12 am
  • Hi all, I am having a question about Hive and the Hive ODBC driver (with latest patch HIVE-1101 applied). I was wondering if anybody can tell me if the ODBC driver is single-threaded or if Hive ...
    Omer, FarahOmer, Farah
    Jun 25, 2010 at 1:47 pm
    Jun 30, 2010 at 6:03 pm
  • Hi all, Yesterday I committed Arvind's patch for HIVE-1176, which includes an upgrade from datanucleus 1.x to 2.x. The patch works fine against a clean checkout, but just now Paul Yang and I noticed ...
    John SichiJohn Sichi
    Jun 25, 2010 at 2:01 am
    Jun 25, 2010 at 4:18 pm
  • Hey guys, We are currently using the JDBC interface to Hive to remotely send Hive queries. The only problem here is that when the statement is executed, it just sits and hangs until the Hive query ...
    Ryan LeCompteRyan LeCompte
    Jun 24, 2010 at 2:29 pm
    Jun 24, 2010 at 8:54 pm
  • Hello, I am a newbie to Hive, but I'm already quite familiar with Hadoop/HBase. I must appreciate the whole project and especially the new integration with HBase, which is what we really need. :) So ...
    Martin FialaMartin Fiala
    Jun 14, 2010 at 4:01 pm
    Jun 15, 2010 at 7:09 am
  • Hi all, by", the sort by of hive is for each reducer. Our team think the globally "order by" is an important feature for users, so wondering is anybody working it ? I am very interested to been ...
    Jeff ZhangJeff Zhang
    Jun 11, 2010 at 9:24 am
    Jun 12, 2010 at 6:09 am
  • Hi When i try to run hive queries like show tables, then the query did not respond, i have checked the hive log file in tmp/root/hive.log. it contains the following information. 2010-06-06 ...
    Shuja RehmanShuja Rehman
    Jun 6, 2010 at 9:19 pm
    Jun 9, 2010 at 10:28 am
  • Hi all, I have an input table which has 3 columns one of which is an array list, bcookie, id, info [Schema: string, string, arraylist<map<string,string ] Here is a sample row 1245, 1, {[myid#id2, ...
    Viraj BhatViraj Bhat
    Jun 29, 2010 at 9:08 pm
    Jun 30, 2010 at 11:39 pm
  • I've run into a problem using the Hive JDBC driver, with tables that contain floats. The following two statements work fine in the hive shell, but break if I execute them through the JDBC driver. I ...
    Marc LimotteMarc Limotte
    Jun 24, 2010 at 2:11 am
    Jun 25, 2010 at 1:08 am
  • How can&nbsp;I skip error record in hive. because when there is some error record in hive table,the hadoop job always fail! Is there some configuration to avoid this? Hope for your reply!
    Jun 24, 2010 at 1:24 am
    Jun 24, 2010 at 2:14 am
  • Hi, Are there any logs created when running the metastore using "bin/hive -service metastore" ? From what I can tell this command generates output on the screen and runs in the foreground. Would the ...
    Pradeep KamathPradeep Kamath
    Jun 9, 2010 at 10:10 pm
    Jun 10, 2010 at 2:13 am
  • Hey guys, We have a very simple JDBC client that uses the Hive JDBC driver to execute queries. We are trying to use it to execute a simple "create temporary function ..." statement, but Hive is ...
    Ryan LeCompteRyan LeCompte
    Jun 2, 2010 at 6:37 pm
    Jun 3, 2010 at 2:18 pm
  • Hi, I am trying to explode or lateral view on a list that is returned by get_jsonb_object, something along these lines: select explode(get_json_object(jsonblob, '$.output_dict.articles.url')) as url ...
    Andraz ToriAndraz Tori
    Jun 30, 2010 at 3:57 pm
    Jun 30, 2010 at 6:17 pm
  • Hi all, I am joining 2 datasets, one is around 1.5TB in size and the other is around 350MB in size. I wanted to do a Map Side join using "id" as the join column between the two tables. I read about ...
    Viraj BhatViraj Bhat
    Jun 24, 2010 at 5:46 pm
    Jun 29, 2010 at 8:45 pm
  • Hi all, I have a lookup function in hive which looks if a certain pattern is present in a large text file. I upload this text file to HDFS. I hope to use this text file in my UDF evaluate() method. ...
    Viraj BhatViraj Bhat
    Jun 21, 2010 at 11:05 pm
    Jun 24, 2010 at 5:35 pm
  • Hi All , I've got some problem in programming hive-hbase client, could someone help me? The code is very simple, select some data from a hbase-based table. public static void main(String[] args) ...
    Zhou ShuaifengZhou Shuaifeng
    Jun 17, 2010 at 1:28 pm
    Jun 17, 2010 at 7:37 pm
  • Hi All I want to load data from xml file to hive tables. Any suggestion how can I achieve this?? -- Regards Shuja-ur-Rehman Baig MS CS - School of Science and Engineering Lahore University of ...
    Shuja RehmanShuja Rehman
    Jun 10, 2010 at 8:39 pm
    Jun 10, 2010 at 9:41 pm
  • Hi All, For an existing mysql database, I have a tool which dumps out the create table statements for all tables, and checks them into SVN on a daily basis, to keep the schema under version control. ...
    Mark TozziMark Tozzi
    Jun 9, 2010 at 6:10 pm
    Jun 9, 2010 at 6:40 pm
  • Hi, Hive uses TextInputFormat by default and which treats '\n' AND '\r' as a line separator. However I don't want '\r' to be treated as a separator. Does Hive provide a way to set custom InputFormat? ...
    Akira KitadaAkira Kitada
    Jun 3, 2010 at 1:45 pm
    Jun 9, 2010 at 4:14 pm
  • Hi guys, When I run hive as a service like this: "hive --service hiveserver" I get these errors after about a day of running a lot of queries: java.lang.OutOfMemoryError: PermGen space I use: trunk ...
    Bennie SchutBennie Schut
    Jun 3, 2010 at 9:21 am
    Jun 4, 2010 at 11:53 am
  • Dear Hive Community, I've been trying to see if Hive (0.5.0) supports data type validation when loading data into it. My initial experiments show me that if it is of wrong data type (vs the table ...
    Jun 3, 2010 at 6:10 am
    Jun 3, 2010 at 7:32 am
  • Hi all, I have taken the generic udtf method in the contrib. examples directory and modified it and added it as a jar in my classpath. hive add jar /homes/viraj/hiveudfs.jar; hive create temporary ...
    Viraj BhatViraj Bhat
    Jun 30, 2010 at 8:02 am
    Jun 30, 2010 at 6:22 pm
  • I would like to control the maximum number of reducers a Hive query has access to. I have seen cases of Hive using up to 999 reducers, which seems inefficient (starting and stopping individual ...
    Scott WhitecrossScott Whitecross
    Jun 28, 2010 at 1:08 pm
    Jun 28, 2010 at 8:12 pm
  • Hi I have a Questions about external tables Suppose I have this kind of directory structure in hdfs. /user/Mydata/Data1/part-000 /user/Mydata/Data1/part-001 /user/Mydata/Data1/part-002 ... 1- I have ...
    Shuja RehmanShuja Rehman
    Jun 26, 2010 at 10:16 am
    Jun 27, 2010 at 10:29 pm
  • Hi, I tried testing my odbc build with isql, but I get the following error: [ISQL]ERROR: Could not SQLAllocEnv I tried, dltest /usr/local/lib/ SQLAllocEnv which succeeds, so I guess the ...
    Amogh VasekarAmogh Vasekar
    Jun 25, 2010 at 11:33 am
    Jun 25, 2010 at 5:57 pm
  • Hi all, Do anyone of you have an idea of what the following error means when I run a Mapside join. java.lang.RuntimeException: Error while reading from task log url at ...
    Viraj BhatViraj Bhat
    Jun 24, 2010 at 6:22 pm
    Jun 24, 2010 at 6:47 pm
  • Hi, I'm referring to , which has Linux 32 and 64 bit thrift libs. I noticed that the 64 bit lib doesn't contain the fb303 module, unlike the 32 bit ...
    Amogh VasekarAmogh Vasekar
    Jun 24, 2010 at 5:12 am
    Jun 24, 2010 at 5:52 am
  • subscribe mail list ! 2010-06-11 dongli.bai
    Jun 11, 2010 at 7:33 am
    Jun 23, 2010 at 8:52 am
  • Hi, I am trying to run a select query with case statements and it worked correctly on a small dataset with a million rows (~150 MB). It however gives me the following error for a 5GB dataset. There ...
    Iyengar, AkshayaIyengar, Akshaya
    Jun 22, 2010 at 6:35 pm
    Jun 23, 2010 at 1:20 am
  • Hi, I have run into the following problem when I am trying to upgrade my current existing Hive to be Hive trunk. Till now, I was using the hive version that came in Cloudera's VM, but now I need to ...
    Omer, FarahOmer, Farah
    Jun 22, 2010 at 8:16 pm
    Jun 22, 2010 at 11:05 pm
  • Hi! I have table tb1 defined by: CREATE TABLE tb1(user int, counter int) PARTITIONED BY (day string) STORED AS TEXTFILE I want to export data from this table into mysql table defined by: CREATE TABLE ...
    Szymon GwóĽdĽSzymon GwóĽdĽ
    Jun 18, 2010 at 12:52 pm
    Jun 18, 2010 at 7:51 pm
  • Hi all, I have some data in Zebra around 9 TB which I converted first to PlainText using the TextOutputFormat in M/R and it resulted in around 43.07TB. [[I think I used no compression here.]] I then ...
    Viraj BhatViraj Bhat
    Jun 11, 2010 at 10:00 pm
    Jun 12, 2010 at 12:43 am
  • Most of my Hadoop data is produced by Java MR jobs that store data as custom Writable pairs in SequenceFiles. I'm excited to bring that data into a Hive table so that I can start building out and ...
    Andrew RothsteinAndrew Rothstein
    Jun 6, 2010 at 6:45 pm
    Jun 6, 2010 at 8:24 pm
  • I was trying to write a transform script in java. I simply used and system.out, but didn't work out. Does anyone know how to write a java script, or some example? Any suggestions will be ...
    Fan YangFan Yang
    Jun 2, 2010 at 3:38 pm
    Jun 2, 2010 at 5:24 pm
  • Hi, I have some problems in saving a union of two tables as compressed file. My Hive query is as follows: # begin of query set mapred.reduce.tasks=40; DROP TABLE f_table; CREATE TABLE f_table ( col1 ...
    Ping ZhuPing Zhu
    Jun 25, 2010 at 8:29 pm
    Jun 25, 2010 at 8:29 pm
  • Hi All, Today I was running a big set of reports using HIVE(trunk version) and I ran into the following problem. The reports start running but after a while they all start failing and I see that the ...
    Omer, FarahOmer, Farah
    Jun 25, 2010 at 6:59 pm
    Jun 25, 2010 at 6:59 pm
  • Hi, I have 2 tables: hive describe extended idtablerc; id string from deserializer Detailed Table Information Table(tableName:idtablerc, dbName:default, owner:viraj, createTime:1277418576, ...
    Viraj BhatViraj Bhat
    Jun 24, 2010 at 11:32 pm
    Jun 24, 2010 at 11:32 pm
  • Hi, The Hypertable Hive storage handler is included in the latest release of Hypertable < . Documentation can be found here ...
    Sanjit JhalaSanjit Jhala
    Jun 24, 2010 at 9:14 pm
    Jun 24, 2010 at 9:14 pm
Group Navigation
period‹ prev | Jun 2010 | next ›
Group Overview
groupuser @
categorieshive, hadoop

59 users for June 2010

John Sichi: 24 posts Edward Capriolo: 23 posts Viraj Bhat: 22 posts Shuja Rehman: 20 posts Pradeep Kamath: 17 posts Paul Yang: 12 posts Carl Steinbach: 9 posts Ning Zhang: 9 posts Zhou Shuaifeng: 8 posts Ray Duong: 7 posts Wd: 7 posts Yongqiang He: 7 posts Ashish Thusoo: 5 posts Omer, Farah: 5 posts Ryan LeCompte: 5 posts Sammy Yu: 5 posts Sanjit Jhala: 5 posts Amr Awadallah: 4 posts Atreju: 4 posts Vinithra Varadharajan: 4 posts
show more