Grokbase Groups Hive user July 2014

121 discussions - 389 posts

  • Hi, Cannot add a jar to hive classpath. Once I launch HIVE, I type - ADD JAR hdfs://; I get the error, Failed to read external resource ...
    Shouvanik Haldar
    Jul 2, 2014 at 3:51 am
    Jul 2, 2014 at 12:04 pm
  • beeline does not seem to be connecting remotely. It works if I connect using the embedded client. I am using all the default configurations, except I configured my hiveserver2 thrift port to 11000 ...
    Hang Chan
    Jul 2, 2014 at 5:16 pm
    Jul 10, 2014 at 8:42 pm
  • Hello, I am interested in selecting specific data from a source and loading it to a table. For example, if I have 5 columns in my dataset, I want to load 3 columns of it. Is it possible to do it ...
    CHEBARO Abdallah
    Jul 30, 2014 at 8:17 am
    Jul 30, 2014 at 12:00 pm
  • we just moved to hadoop2.0 (HDP2.1 distro). it turns out that the new hive version generates a lot of logs into /tmp/ and is quickly creating the danger of running out of our /tmp/ space. I see these ...
    Jul 17, 2014 at 6:29 pm
    Jul 19, 2014 at 6:38 am
  • We have been struggling to get a reliable system working where we interact with Hive over JDBC a lot. The pattern we see is that everything starts ok but the memory used by the Hive server process ...
    Jul 3, 2014 at 10:36 am
    Jul 9, 2014 at 11:15 am
  • Hello All, Can any one help me to answer to my question posted on Stackoverflow? It is pretty urgent. Please help me. Thanks ...
    Malligarjunan S
    Jul 8, 2014 at 5:54 pm
    Jul 11, 2014 at 2:28 am
  • Hi , i am trying to compute statistics on ORC File but i am unable see any changes in PART_COL_STATS as well on using set hive.compute.query.using.stats=true; set hive.stats.reliable=true; set ...
    Navdeep Agrawal
    Jul 22, 2014 at 5:20 pm
    Jul 24, 2014 at 3:10 pm
  • Hi all, Is there a wiki page somewhere that shows how to turn on Tez for Hive? I found "hive.execution.engine" in hive-default.xml.template. But I'm sure there must be more. Do I have to install Tez ...
    Tim Harsch
    Jul 16, 2014 at 11:12 pm
    Jul 17, 2014 at 11:47 pm
  • Hi, I'm a newbie to Hive. Facing an issue while installing hive stable version (0.13). I downloaded the tar file from the site ( apache-hive-0.13.1-bin.tar.gz ...
    Sarath Chandra
    Jul 8, 2014 at 10:33 am
    Jul 16, 2014 at 12:40 pm
  • Hi, We're running various "triangle" join queries on Hive 0.9.0, and we're wondering if we can get any better performance. Here's the query we're running: SELECT count(*) FROM table r1 JOIN table r2 ...
    Firas Abuzaid
    Jul 31, 2014 at 7:29 pm
    Aug 6, 2014 at 11:42 pm
  • Hello, I am trying to create a new table using an existing table's schema (existing table name in hive: jobs). However, when I do that it doesn't put the new table (new table name in hive: jobs_ex2) ...
    Vidya Sujeet
    Jul 18, 2014 at 6:42 am
    Jul 25, 2014 at 1:11 am
  • Dear all, I want to know whether a table is a partitioned table in Hive, and return the result to the shell. How can I do this?
    Jul 31, 2014 at 7:42 am
    Aug 1, 2014 at 7:39 am
  • Hi, I am using Hive 0.13.1 and Hadoop 2-2.0 on amazon EC2 t2.micro instances. I have 4 instances, master has the namenode and yarn, secondarynode is a separate instance and two slaves are on separate ...
    Sarfraz Ramay
    Jul 22, 2014 at 7:20 am
    Aug 1, 2014 at 3:06 am
  • Hello everyone. I need some assistance. I have a join that fails with return code 3. The query is; SELECT B.CARD_NBR AS CNT FROM TENDER_TABLE A JOIN LOYALTY_CARDS B ON A.CARD_NBR = B.CARD_NBR LIMIT ...
    Clay McDonald
    Jul 18, 2014 at 2:36 pm
    Jul 21, 2014 at 1:20 am
  • This is probably a simple question, but I'm noticing that for queries that run on 1+TB of data, it can take Hive up to 30 minutes to actually start the first map-reduce stage. What is it doing? I ...
    Jul 18, 2014 at 1:37 pm
    Jul 19, 2014 at 1:32 am
  • Hi, I am looking for a scheduling implementation for Hive jobs (e.g. some Hive commands have to be executed every 15 minutes). There are supposed to be some ways to achieve this, but I haven't found a ...
    Cheng Ju Chuang
    Jul 10, 2014 at 10:04 pm
    Jul 11, 2014 at 5:18 pm
  • Dear Hive users, Hive community is considering a user group meeting during Hadoop World that will be held in New York October 15-17th. To make this happen, your support is essential. First, I'm ...
    Xuefu Zhang
    Jul 8, 2014 at 1:01 am
    Sep 10, 2014 at 5:52 pm
  • I am using cdh5 with hive 0.12. We have some hive jobs migrated from hive 0.10 and they are written like below: select /*+ MAPJOIN(sup) */ c1, c2, sup.c from ( select key, c1, c2 from table1 union ...
    Chen Song
    Jul 31, 2014 at 1:04 am
    Aug 12, 2014 at 3:38 pm
  • Hello, I am using Hive and trying to read from a txt file. I have an input like the following: "string";"string";"integer". First, I specified that the row fields are delimited by a semicolon. Is ...
    CHEBARO Abdallah
    Jul 31, 2014 at 10:14 am
    Jul 31, 2014 at 7:21 pm
  • Hi, I want to stop hive commands from generating the hive job file (under /tmp/user/hive_log_job*) for every query, as we run multiple queries in batch and the file is getting really big. (1GB+) what ...
    Gitansh Chadha
    Jul 31, 2014 at 3:40 am
    Jul 31, 2014 at 5:19 pm
  • Hello, I am interested in testing Hive with a huge sample data set. Does Hive read all data types? Should the file be a table? Thank you ...
    CHEBARO Abdallah
    Jul 30, 2014 at 12:25 pm
    Jul 30, 2014 at 2:57 pm
  • Hi, We are connecting ODBC/JDBC tools to hiveserver2 using <ipaddr>:10000 and LDAP authentication and wanted to pass hiveconf variables explicitly through it. Can anybody help me how to pass the ...
    Sai chaitanya tirumerla
    Jul 28, 2014 at 11:29 pm
    Jul 29, 2014 at 8:23 pm
  • Hi All, I have a Windows PC (Windows 7, 64-bit). I tried to install Apache Hadoop in the past but could not succeed with Cygwin. Could you please suggest me the Cloudera software (free software) that I ...
    R J
    Jul 28, 2014 at 12:55 am
    Jul 29, 2014 at 8:16 am
  • Can anyone point me to the source code in hive where the calls to initialize, process and forward in a UDTF are made? Thanks. Doug
    Doug Christie
    Jul 28, 2014 at 7:30 pm
    Jul 28, 2014 at 11:48 pm
  • if I do a join of a table based on txt file and a table based on HBase, and say the latter is very large, is HIVE smart enough to utilize the HBase table's index to do the join, instead of ...
    Jul 24, 2014 at 9:04 pm
    Jul 25, 2014 at 12:30 am
  • Recently I developed a Hive Generic UDF *getad*. It accepts a map type and a string type parameter and outputs a string value. But I found the UDF output really confusing in different conditions ...
    Jul 23, 2014 at 5:35 am
    Jul 24, 2014 at 7:53 am
  • I think the documentation related to exchanging partitions is not accurate when I try it out on hortonworks sandbox 2.1 which runs ...
    Kristof Vanbecelaere
    Jul 20, 2014 at 7:52 pm
    Jul 21, 2014 at 6:22 am
  • Hello Hive Community, I am trying to run the JDBC (from, using HiveServer2. Everything in the Java code (attached above) runs well except for the last query: sql = "select * from " ...
    CHEBARO Abdallah
    Jul 17, 2014 at 12:18 pm
    Jul 18, 2014 at 7:39 am
  • Hi, I have a table name siplogs_partitioned which is partitioned by columns str_date(DATE) and str_hour(INT). I want to rename the partitioned columns to call_date and call_hour. I am using the below ...
    Manish Kothari
    Jul 9, 2014 at 4:20 pm
    Jul 9, 2014 at 8:01 pm
  • Hi, I asked a question on Stack Overflow ( ce-0) which hasn't seemed to get much traction, so I'd like to ask it here as ...
    Tim Harsch
    Jul 8, 2014 at 8:22 pm
    Jul 8, 2014 at 9:49 pm
  • Hi guys, I am trying to identify a DAG in Tez with a different id, based on job name(for e.g. query55.sql from hive-testbench) + input size. So my new identifier should be for example ...
    Grandl Robert
    Jul 10, 2014 at 3:44 am
    Aug 26, 2014 at 1:19 am
  • I am trying to enable Column statistics usage with Parquet tables. This is the query I am executing. However on explain, I see that even though *Basic stats: COMPLETE *is seen *Column stats *is seen ...
    Sandeep Samudrala
    Jul 24, 2014 at 12:14 pm
    Jul 25, 2014 at 7:18 am
  • i am trying to compute statistics on ORC File but i am unable see any changes in PART_COL_STATS as well on using set hive.compute.query.using.stats=true; set hive.stats.reliable=true; set ...
    Navdeep Agrawal
    Jul 22, 2014 at 5:14 pm
    Jul 23, 2014 at 7:47 am
  • adding [email address] for wider audience From: Gajendran, Vishnu Sent: Tuesday, July 22, 2014 10:42 AM To: ...
    Gajendran, Vishnu
    Jul 22, 2014 at 5:51 pm
    Jul 22, 2014 at 6:57 pm
  • Hi everyone, I have the following problem: I have a partitioned managed table (Partition table is a string which represents a date, eg. log-date="2014-07-15"). Unfortunately there is one partition in ...
    Fab wol
    Jul 21, 2014 at 2:02 pm
    Jul 22, 2014 at 1:08 pm
  • Hi, I would like to restrict users doing "select * from table;" when accessed from any jdbc/odbc tools like sql workbench/excel etc.. connecting to hiveserver2 on port 10000. I am able to ...
    Sai chaitanya tirumerla
    Jul 19, 2014 at 8:33 am
    Jul 22, 2014 at 3:00 am
  • Hi all, I'm currently experimenting with using the new HBaseKeyFactory interface (implemented in to do some custom serialization and predicate ...
    Andrew Mains
    Jul 17, 2014 at 12:12 am
    Jul 17, 2014 at 4:24 am
  • Hi, Currently I am submitting multiple hive jobs using hive cli with "hive -f" from different scripts. All these jobs I could see in application tracker and these get processed in parallel. Now I ...
    Bogala, Chandra Reddy
    Jul 11, 2014 at 6:56 am
    Jul 14, 2014 at 5:31 am
  • Does anyone know what *rank() over(distribute by p_mfgr sort by p_name) * does exactly and how it's different from *rank() over(partition by p_mfgr order by p_name)*? Thanks, Eric
    Eric Chu
    Jul 11, 2014 at 8:09 am
    Jul 11, 2014 at 6:20 pm
  • Hey all, I'm on Hive 0.10.0 on one of my clusters. We had a namenode hostname change, so I'm trying to point all of our tables, partitions and databases to the new locations. When i describe database ...
    Jon Bender
    Jul 1, 2014 at 12:15 am
    Jul 1, 2014 at 7:46 am
  • Has anyone had any experience with a multiple-machine HiveServer2 setup? Hive needs to be available at all times for our use-case, so if for some reason, one of our HiveServer2 machines goes down or ...
    Raymond Lau
    Jul 25, 2014 at 9:39 pm
    Sep 12, 2014 at 11:04 pm
  • hi, Currently, if we change orc format hive table using "alter table orc_table change c1 c1 bigint ", it will throw exception from SerDe (" cannot be cast to ...
    Jul 30, 2014 at 8:57 pm
    Aug 22, 2014 at 6:19 am
  • Hi I am using hive queries on structured RC file. Can you please let me know, the key performance parameters that I have tune for better query performance (for Hadoop 2.3/ Yarn and Hive 0.13). Thanks ...
    Natarajan, Prabakaran 1. (NSN - IN/Bangalore)
    Jul 31, 2014 at 12:51 pm
    Aug 1, 2014 at 2:43 am
  • Am using 0.13.0 version of hive with parquet table having 34 columns with the following props while creating the table *CLUSTERED BY (udid) SORTED BY (udid ASC) INTO 256 BUCKETS STORED as PARQUET ...
    Suma Shivaprasad
    Jul 30, 2014 at 1:14 pm
    Jul 30, 2014 at 1:36 pm
  • Hi everyone, I have a TSV file (around 4 GB). I have created a Hive table on it using the following command. It works fine without indexing. However, when I create an index based on 2 columns I get ...
    Sameer Tilak
    Jul 28, 2014 at 8:04 pm
    Jul 29, 2014 at 7:47 pm
  • Hi All, I hope I’m not duplicating a previous question, but I couldn’t find any search functionality for the user list archives. I have written a relatively simple python script that is meant to take ...
    Kevin Weiler
    Jul 24, 2014 at 3:52 pm
    Jul 29, 2014 at 12:53 pm
  • I am trying to create a table in Hive. It's a very long script containing a large number of columns, and it also contains complex fields like STRUCT, ARRAY etc. * Cannot create full table in one shot using ...
    Azaz Rasool
    Jul 25, 2014 at 12:04 am
    Jul 25, 2014 at 12:14 am
  • I am trying to enable Column statistics usage with Parquet tables. This is the query I am executing. However on explain, I see that even though *Basic stats: COMPLETE *is seen *Column stats *is seen ...
    Suma Shivaprasad
    Jul 24, 2014 at 6:43 am
    Jul 24, 2014 at 11:32 am
  • Hi- I'm stuck on Hive .10 right now and I'm trying to figure out how to accomplish the equivalent of a not exists or minus statement: Select x from t1 where x not in ( select x from t2) I know this ...
    Brenden Cobb
    Jul 22, 2014 at 6:02 pm
    Jul 22, 2014 at 9:24 pm
  • While playing with the movielens data set to learn about dynamic partitions I ran from u_data insert overwrite table u_data_p partition (rating) select * This failed with [Error 20004]: Fatal error ...
    Kristof Vanbecelaere
    Jul 22, 2014 at 8:07 pm
    Jul 22, 2014 at 8:24 pm
Group Overview
group: user @
categories: hive, hadoop

131 users for July 2014

  • Lefty Leverenz: 18 posts
  • CHEBARO Abdallah: 16 posts
  • Navis류승우: 16 posts
  • Andre Araujo: 15 posts
  • Nitin Pawar: 13 posts
  • Edward Capriolo: 11 posts
  • Navdeep Agrawal: 10 posts
  • Shouvanik Haldar: 9 posts
  • D K: 9 posts
  • Suma Shivaprasad: 9 posts
  • Yang: 8 posts
  • Hadoop hive: 7 posts
  • Hang Chan: 7 posts
  • Sai chaitanya tirumerla: 6 posts
  • Sarfraz Ramay: 6 posts
  • Tim Harsch: 6 posts
  • Xuefu Zhang: 6 posts
  • Andrew Mains: 5 posts
  • Carlotta Hicks: 5 posts
  • Devopam Mittra: 5 posts