Grokbase Groups Hive user April 2011

46 discussions - 127 posts

  • Hello, We have a situation where the data coming from source systems to hive may contain the common characters and delimiters such as |, tabs, new line characters etc. We may have to use multi ...
    Shantian Purkad
    Apr 27, 2011 at 6:06 am
    May 7, 2011 at 11:51 pm
  • Hi all, the dynamic partition function is amazing, but it only works in the insert clause. Can I use it while loading data into a table? For example: load data LOAD DATA LOCAL INPATH ...
    Erix Yao
    Apr 15, 2011 at 5:53 am
    Apr 15, 2011 at 5:08 pm
  • Hi, How would I set the field separator for Hive output to files? I see that the default is a space (or tab, don't know exactly) but I would like to use another character to facilitate loading of the ...
    Jasper Knulst
    Apr 7, 2011 at 12:39 pm
    Apr 7, 2011 at 6:35 pm
  • As far as I know, all the data exported from Hive uses ASCII \001 as the default field delimiter, and I want to change it. How can I achieve this? Thanks -- haitao.yao@Beijing
    Erix Yao
    Apr 23, 2011 at 5:08 am
    Apr 24, 2011 at 2:28 pm
  • Hi, sometimes the data analysts need to store temp results in temp tables, but they forget to clear the temp tables. As the hive administrator, I suggest make the data analyst work ...
    Erix Yao
    Apr 20, 2011 at 10:32 am
    Apr 21, 2011 at 5:05 am
  • Regards to all. I was reading the guest post on the Cloudera Blog from John Sichi about the ...
    Marcos Ortiz
    Apr 12, 2011 at 3:08 am
    Apr 13, 2011 at 6:50 am
  • Or does it support hadoop 0.21.0 in the near future? Regards, Xiaobo Gu
    Xiaobo Gu
    Apr 4, 2011 at 5:28 am
    Apr 4, 2011 at 1:57 pm
  • Hi, I've tried to load gzip files into hive to save disk space, but failed. hive> load data local inpath 'tmp_b.20110426.gz' into table raw_logs partition ( dt=20110426 ); Copying data from ...
    Apr 28, 2011 at 4:34 am
    Apr 28, 2011 at 10:35 am
  • Hi, What is the difference between external table and managed tables (apart from data being stored outside hive warehouse and not deleted when table is dropped) Are there any drawbacks of external ...
    Shantian Purkad
    Apr 26, 2011 at 4:19 pm
    Apr 26, 2011 at 6:30 pm
  • Hi, Wonder if anyone can help please? I've read that you can use Hive to create HBase tables and map the columns across. When this is used, if the HBase data is changed directly, will Hive be able to ...
    Stuart Scott
    Apr 19, 2011 at 12:57 pm
    Apr 19, 2011 at 1:50 pm
  • Could not find the instructions regarding this to avoid performance issues when too many mappers have to be created for every small file. Thanks!
    Michael Jiang
    Apr 8, 2011 at 6:35 pm
    Apr 8, 2011 at 9:38 pm
  • Hi(ve), I created a table like this; create table testtable (veld1 STRING,veld2 STRING,veld3 STRING) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe' ...
    Jasper Knulst
    Apr 5, 2011 at 10:51 pm
    Apr 6, 2011 at 11:20 pm
  • Is there a way to have hive skip the first line of CSV loading (say, to skip column headers)? Or will this require a second stage with a transform, and a) a hard coded knowledge of what a header row ...
    Daniel Jue
    Apr 13, 2011 at 8:55 pm
    Sep 27, 2011 at 7:19 am
  • I'm doing a project using hive to provide data to a PHP interface. I followed the Hive's Wiki how to when I try to Start the Hiveserver using the terminal as above $ HIVE_PORT=1000 ./hive --service ...
    Alexandre "TAZ" dos Santos Andrade
    Apr 27, 2011 at 5:37 pm
    Apr 27, 2011 at 7:32 pm
  • Hello, I am using the hive/hbase integration (which is a great feature) on a standalone one machine environment. For production I set up a hadoop cluster and now hbase. Here ...
    Apr 10, 2011 at 4:49 pm
    Apr 17, 2011 at 2:31 pm
  • Hello, I have to use Hive 7 for a project, so I wanted to know some details about it. Also, could someone forward me the link to a tutorial so that I can go ahead and implement it? Thanks a lot.
    Soumya mishra
    Apr 12, 2011 at 7:22 pm
    Apr 15, 2011 at 3:43 pm
  • Hi, The Hive documentation describes keyword "external" as following: The EXTERNAL keyword lets you create a table and provide a LOCATION so that Hive does not use a default location for this table. ...
    Prashanth R
    Apr 11, 2011 at 9:10 pm
    Apr 12, 2011 at 4:51 pm
  • When the following query was run with mapred.job.reuse.jvm.num.tasks=20, some of the map tasks failed with "Error: Java heap space", causing the job to fail. After changing to ...
    Steven Wong
    Apr 8, 2011 at 1:53 am
    Apr 8, 2011 at 11:39 pm
  • For some UDFs I'm working on now it feels like it would be handy to be able to pass in parameters during construction. It's an integration with an external reporting API... e.g. -- include last 30 ...
    Larry Ogrodnek
    Apr 5, 2011 at 6:20 pm
    Apr 5, 2011 at 6:29 pm
  • Hi guys. I've got some input data with comment lines starting with '#' and lasting to the end of line, like: #this is a comment .... Is there a way to make hive ignore these comments when importing ...
    Bjørn Remseth
    Apr 4, 2011 at 4:16 pm
    Apr 4, 2011 at 4:22 pm
  • OK, feeling a bit dumb here . so I need the hive user group jolt to the head ... Given a table like: describe hive_map_test col_name data_type comment log_record_type int <null key_pairs ...
    Sunderlin, Mark
    Apr 28, 2011 at 5:57 pm
    Apr 28, 2011 at 6:07 pm
  • We have a hadoop/hive cluster which is using cloudera's distribution. Metastore is stored in mysql and all the relevant drivers are in classpath and in conf files. While running queries on hive I am ...
    Vipul sharma
    Apr 27, 2011 at 9:27 pm
    Apr 27, 2011 at 11:55 pm
  • Hello, I'm the chief editor at, a social linking and blogging community for developers. We've got this weekly 6pg cheat sheet (sorta like spark charts) that we produce every week on a ...
    Mitch Pronschinske
    Apr 27, 2011 at 2:00 pm
    Apr 27, 2011 at 2:07 pm
  • We had a good meetup yesterday, with a lot of discussion topics; here are my notes: JVS
    John Sichi
    Apr 27, 2011 at 12:29 am
    Apr 27, 2011 at 2:51 am
  • here's the configuration file section for default create table restriction, as the wiki said: <property <name</name <value true</value <description enable or ...
    Erix Yao
    Apr 21, 2011 at 8:04 am
    Apr 21, 2011 at 8:13 am
  • Hello, I am encountering the exact issue that a user posted about last month (with no response): ...
    Karl Ostmo
    Apr 20, 2011 at 8:25 pm
    Apr 20, 2011 at 9:11 pm
  • I'm having trouble using the union type introduced in HIVE-537. Consider a table with a column, union1, with a union type uniontype<float,boolean,string>. While it's possible to select this column: ...
    Jakob Homan
    Apr 19, 2011 at 1:15 pm
    Apr 19, 2011 at 1:17 pm
  • hi, I installed the hive-0.7 release for the index feature. Here's my test table schema: create table testforindex (id bigint, type int) row format delimited fields terminated by ',' lines terminated ...
    Erix Yao
    Apr 15, 2011 at 8:34 am
    Apr 15, 2011 at 6:10 pm
  • Hi, I am able to launch the map-reduce job (select userid from user) from hive shell.I am also passing the auxpath parameter to the shell (specifying the Hive/HBase integration related jars). ...
    Ankit Jain
    Apr 15, 2011 at 11:14 am
    Apr 15, 2011 at 3:29 pm
  • Dear Hive users, I need to sync data between two Hive data warehouses. These two Hive data warehouses have their own independent HDFS and metastore. Currently my option is to sync HDFS data and then ...
    Ravi .
    Apr 11, 2011 at 6:37 pm
    Apr 12, 2011 at 4:23 am
  • Well, I'm new here but I can point you to the docs as well as old timers probably. First, I think the developers would prefer that you only direct questions like this to the users list, not to both. ...
    Geoff Howard
    Apr 7, 2011 at 11:19 am
    Apr 8, 2011 at 9:24 am
  • Hello, I'm the chief editor at, a social linking and blogging community for developers. We've got this weekly 6pg cheat sheet (sorta like spark charts) that we produce every week on a ...
    Mitch Pronschinske
    Apr 25, 2011 at 6:00 pm
    Apr 25, 2011 at 6:00 pm
  • I have about 5K input files so running a Hive job creates as many (small) output files. Small-file merging seems to be enabled by default (hive.merge.mapfiles=true) but it doesn't seem to work unless ...
    Igor Tatarinov
    Apr 21, 2011 at 5:56 pm
    Apr 21, 2011 at 5:56 pm
  • Hi All, I am using hive 0.7 with hadoop 0.20.2 . Cluster has 9 data nodes / task tracker . I am running a query on table containing 900 GB data (approx) in 4000 partitions . query construct is like - ...
    Vaibhav negi
    Apr 21, 2011 at 9:31 am
    Apr 21, 2011 at 9:31 am
  • Hello Hadoop fans, This last week we had a very successful meetup of the SF Hadoop User Group, hosted by Twitter. Breakout topics included: * Log analysis * Cluster resource management * FlumeBase * ...
    Aaron Kimball
    Apr 19, 2011 at 1:38 pm
    Apr 19, 2011 at 1:38 pm
  • Hi, Today I had to kill quite a large hive generated MR job. The progress on the mappers was reversed halfway (so actually declining). When I got to the local mapped logs from the TT I saw that there ...
    Jasper Knulst
    Apr 14, 2011 at 9:01 pm
    Apr 14, 2011 at 9:01 pm
  • Our environment is heavy into storing data in hive. I find myself currently working on something that is outside that scope, though. I have a mapreduce written, but it requires a lot of direct user ...
    John Vines
    Apr 14, 2011 at 7:42 pm
    Apr 14, 2011 at 7:42 pm
  • hello good news for you thousands of new original products here < take a look , it is the best place for Chrisama gift . i had bought some from them , and i like much so i tell you ...
    Vasilis Liaskovitis
    Apr 12, 2011 at 7:54 am
    Apr 12, 2011 at 7:54 am
  • Announcing a new Meetup for Hive Contributors Group! *What*: April Hive Contributors Meeting *When*: Monday, April 25, 2011 4:30 PM ...
    Carl Steinbach
    Apr 12, 2011 at 5:35 am
    Apr 12, 2011 at 5:35 am
  • ------ Forwarded Message From: Avik Dey < Reply-To: < Date: Thu, 7 Apr 2011 10:43:23 -0700 To: < Subject: [hadoop] Hadoop Summit ...
    Devaraj Das
    Apr 11, 2011 at 7:59 pm
    Apr 11, 2011 at 7:59 pm
  • I use JDBC to connect to the hive server. When I run an insert clause, for example: insert overwrite table test1 select * from test, how can I get the number of rows affected by the SQL? I use hive0.6 version. Thanks, ...
    Lei liu
    Apr 11, 2011 at 1:47 pm
    Apr 11, 2011 at 1:47 pm
  • Hi, I am trying to load data that is in HDFS to the hive table whose data store is in s3. However, while performing a load operation, i get this error: ...
    Prashanth R
    Apr 10, 2011 at 8:56 pm
    Apr 10, 2011 at 8:56 pm
  • Hello, I have been trying to optimize one of my longer running queries using a MAPJOIN hint. The query is fairly complex and it joins my base table (1+ billion rows) with multiple metadata tables ...
    Viral Bajaria
    Apr 8, 2011 at 3:07 am
    Apr 8, 2011 at 3:07 am
  • Hey folks, I wanted to consult with you on something that has been bothering me for a while... I have declared external tables; these tables are partitioned by dates_hour. I have a batch hadoop process ...
    Guy Doulberg
    Apr 7, 2011 at 6:46 am
    Apr 7, 2011 at 6:46 am
  • Hi, I had one doubt as to whether we can upload data into Hive table which is partitioned using range. For example: LOAD DATA LOCAL INPATH './6.txt' OVERWRITE INTO TABLE GICP PARTITION ( ...
    Soumya mishra
    Apr 6, 2011 at 5:57 pm
    Apr 6, 2011 at 5:57 pm
  • Hi, I am trying to create an external hive table (with data) using the sqoop import from the "--hive-import" command. The requirement is to actually create an "External" hive table instead of it ...
    Sharma, Akash
    Apr 4, 2011 at 2:25 pm
    Apr 4, 2011 at 2:25 pm

58 users for April 2011

  • Erix Yao: 10 posts
  • Edward Capriolo: 9 posts
  • Jasper Knulst: 6 posts
  • Carl Steinbach: 4 posts
  • John Sichi: 4 posts
  • Ning Zhang: 4 posts
  • Bejoy_ks: 3 posts
  • 김영우: 3 posts
  • Ankit Jain: 3 posts
  • Igor Tatarinov: 3 posts
  • Loren Siebert: 3 posts
  • Marcos Ortiz: 3 posts
  • Michael Jiang: 3 posts
  • Rosanna Man: 3 posts
  • Shantian Purkad: 3 posts
  • Steven Wong: 3 posts
  • Wd: 3 posts
  • Alexandre "TAZ" dos Santos Andrade: 2 posts
  • Avram Aelony: 2 posts
  • Christopher, Pat: 2 posts