Search Discussions

95 discussions - 323 posts

  • We are attempting to load CSV text files (compressed to bz2) containing newlines in fields using EXTERNAL tables and INSERT/SELECT into ORC format tables. Data volume is ~1TB/day, we are really ...
    Gerber, Bryan WGerber, Bryan W
    Jan 12, 2016 at 5:41 pm
    Jan 15, 2016 at 11:34 pm
  • Hi, We hit an issue when doing Hive testing to rebuild index on Tez. We were told by our Hadoop distro vendor that it's not recommended (or should avoid) using index with Hive. But I don't see an ...
    Ting(Goden) YaoTing(Goden) Yao
    Jan 5, 2016 at 6:17 pm
    Feb 9, 2016 at 2:13 am
  • Hi All, I have enabled bucketing in table. I created 256 buckets on user id. Now when I am querying (select count(*) from table where userid =172839393) that table, map reduce should only use single ...
    Akansha JainAkansha Jain
    Jan 22, 2016 at 9:54 pm
    Jan 27, 2016 at 8:34 pm
  • Hi, Trying to run Hive on TEZ for the first time. Getting the error below 0: jdbc:hive2://rhes564:10010/default set hive.execution.engine=tez; No rows affected (0.001 seconds) 0 ...
    Mich TalebzadehMich Talebzadeh
    Jan 5, 2016 at 12:00 am
    Jan 6, 2016 at 3:49 pm
  • Ok we hope that partitioning improves performance where the predicate is on partitioned columns I have two tables. One a basic table called smallsales defined as below CREATE TABLE `smallsales`( | ...
    Mich TalebzadehMich Talebzadeh
    Jan 7, 2016 at 10:54 pm
    Jan 8, 2016 at 10:08 am
  • Hi, I have read some notes on ORC files in Hive and indexes. The document describes in the indexes but makes reference to statistics Indexes I am confused as it is mixing up indexes with statistics ...
    Ashok KumarAshok Kumar
    Jan 19, 2016 at 3:51 pm
    Jan 19, 2016 at 10:41 pm
  • Hello... Anyone please help me how to delete empty rows from hive table through java? Thanks in advance
    Sateesh KaruturiSateesh Karuturi
    Jan 5, 2016 at 6:58 am
    Jan 5, 2016 at 4:15 pm
  • Hi All, Hope all are enjoying working in hive !!! I am having one question regarding hive and beeline: I am passing parameters to hive script using "-d". Eg: *hive -d table_name -f emp.hql* * ...
    Trainee BingoTrainee Bingo
    Jan 13, 2016 at 12:44 pm
    Jan 14, 2016 at 3:16 pm
  • I' trying to add jars before running a query using hive on spark on cdh 5.4.3. I've tried applying the patch in https://issues.apache.org/jira/browse/HIVE-12045 (manually as the patch is done on a ...
    Ophir EtzionOphir Etzion
    Jan 7, 2016 at 9:03 pm
    Jan 12, 2016 at 8:48 pm
  • All, I have a huge table that I periodically want to do select on some particular value. For example, supposing I have a table for the entire world population. Then I know the id of “1234” is ...
    Frank LuoFrank Luo
    Jan 29, 2016 at 12:47 am
    Feb 3, 2016 at 6:20 pm
  • Hi, What is the easiest method of importing data from an Oracle 11g table to Hive please? This will be a weekly periodic job. The source table has 20 million rows. I am running Hive 1.2.1 regards
    Ashok KumarAshok Kumar
    Jan 31, 2016 at 1:07 pm
    Jan 31, 2016 at 9:48 pm
  • Hi, I did some tests on ORC table by creating a simple ORC table as below CREATE TABLE orctest ( PROD_ID bigint , CUST_ID bigint , TIME_ID timestamp , CHANNEL_ID bigint , PROMO_ID bigint , ...
    Mich TalebzadehMich Talebzadeh
    Jan 20, 2016 at 6:32 pm
    Jan 22, 2016 at 5:35 pm
  • Hive, I am trying out the Hive on Spark with hive 1.2.1 and spark 1.5.2. Could someone help me on this? Thanks! Following are my steps: 1. build spark 1.5.2 without Hive and Hive Thrift Server. At ...
    Jan 11, 2016 at 7:48 am
    Jan 12, 2016 at 9:34 am
  • Hi Experts, I am trying to write a Hive UDF which access https request and based on the response return the result. From Plain Java, the https response is coming but the https accessed from UDF is ...
    Prabhu JosephPrabhu Joseph
    Jan 8, 2016 at 8:51 am
    Jan 12, 2016 at 6:40 am
  • Hi, Thinking loudly. Ideally we should consider a totally columnar storage offering in which each column of table is stored as compressed value (I disregard for now how actually ORC does this but ...
    Mich TalebzadehMich Talebzadeh
    Jan 6, 2016 at 6:24 am
    Jan 6, 2016 at 10:12 pm
  • Hi, When I perform any operation on a data set stored in Parquet format using Hive on Tez, I get an NPE (see bottom for stack trace). The same operation works fine on tables stored as text, Avro, ORC ...
    Adam HuntAdam Hunt
    Jan 4, 2016 at 4:58 pm
    Feb 2, 2016 at 11:22 pm
  • hi,all I tried hive on spark with version hive1.2.1 spark1.5.2. I build spark witout -Phive . And I test spark cluster stand alone with spark-submit and it is ok. but when I use hive , on spark ...
    Jan 26, 2016 at 8:45 am
    Jan 27, 2016 at 1:44 am
  • Hello, Following on from my earlier post concerning syncing Hive data from an on premise cluster to the cloud, I've been experimenting with the IMPORT/EXPORT functionality to move data from an ...
    Elliot WestElliot West
    Jan 7, 2016 at 12:18 pm
    Jan 25, 2016 at 8:09 am
  • Hello, all: As shown in the topic, I am so confused by this onfiguration parameters “hive.compute.splits.in.am”. what is the difference between“hive.compute.splits.in.am=true”and ...
    Jan 19, 2016 at 3:29 am
    Jan 19, 2016 at 7:04 am
  • Hi, How to do convert a column stored as String in Hive into Decimal if possible. The excel looks like this Invoice Number Payment date Net VAT Total 360 10/02/2014 £10,000.00 £2000.00 £12,000.00 And ...
    Mich TalebzadehMich Talebzadeh
    Jan 15, 2016 at 3:11 pm
    Jan 16, 2016 at 9:41 pm
  • Hi All, Is there a way we can write the hive column headers also along with the output when we are overwriting a query's output to an HDFS or local directory ? -- Sreenath S Kamath Bangalore Ph ...
    Jan 13, 2016 at 6:14 am
    Jan 13, 2016 at 10:36 am
  • Hi, I need 3000-4000 concurrent connections to hive.The hive metastore is running on a 256 GB Ram machine. Can anyone tell me what's the maximum number of connections a hive metstaore can support ? ...
    Kashif HussainKashif Hussain
    Jan 12, 2016 at 7:16 am
    Jan 12, 2016 at 8:11 am
  • Dear all We have a Hive query that 'insert overwrites' from one main hive table to another table about 24million rows every day. This query was working fine so long, but lately it has started to hang ...
    Suresh VSuresh V
    Jan 9, 2016 at 11:34 am
    Jan 9, 2016 at 4:34 pm
  • First time posting to this list. Please forgive me if I break etiquette. I'm looking for some help with getting data from hive to hbase. I'm using HDP 2.2.8. I have a compressed (zlib), orc-based ...
    Riesland, ZackRiesland, Zack
    Jan 28, 2016 at 8:05 pm
    Jan 28, 2016 at 9:23 pm
  • Hi all, I’m exporting a table with Hive CLI using hive –f query.hql file.tsv. The resulting tab separated file won’t read in R because it seems that some of my fields contain the \t separator and ...
    Thomas AchacheThomas Achache
    Jan 21, 2016 at 12:17 am
    Jan 21, 2016 at 7:37 pm
  • hi list, we use the HDFS and S3 as the Hive Filesystem at the same time. here has an issue: *scenario* 1: hive command: use default; create table temp.t1 // the database of temp which points to HDFS ...
    Jan 20, 2016 at 3:46 am
    Jan 20, 2016 at 9:27 am
  • Hi, Need tips/guidance to optimize(increase perfomance) billion data rows joins in hive . Any help would be appreciated. Thanks, Divya
    Divya GehlotDivya Gehlot
    Jan 18, 2016 at 8:08 am
    Jan 18, 2016 at 9:56 am
  • Hi, I am importing an excel sheet saved as csv file comma separated and compressed with bzip2 into Hive as external table with bzip2 The excel looks like this Invoice Number Payment date Net VAT ...
    Mich TalebzadehMich Talebzadeh
    Jan 15, 2016 at 10:15 am
    Jan 16, 2016 at 8:23 pm
  • I am trying to query hive table with basic example code: *import pyhs2* *with pyhs2.connect(host='dmet-master05.inetu.net <http://dmet-master05.inetu.net ',* * port=10000,* * authMechanism='PLAIN',* ...
    Karimkhan PathanKarimkhan Pathan
    Jan 15, 2016 at 9:59 am
    Jan 15, 2016 at 10:23 am
  • Hi All, I am facing strange behaviour as explained below. I have tow hive table T1 and T2 , joined with LEFT OUTER JOIN ..I am getting strange value for two columns t2c2 t2c3 of table T2 after join ...
    @Sanjiv Singh@Sanjiv Singh
    Jan 9, 2016 at 10:13 am
    Jan 12, 2016 at 5:45 am
  • hi, what is the equivalent to foreign keys in Hive? Thanks
    Ashok KumarAshok Kumar
    Jan 10, 2016 at 2:48 pm
    Jan 10, 2016 at 10:53 pm
  • Hi, So I am using the AccumuloStorageHandler to allow me to access Accumulo tables from Hive. This works fine. So typically I would use something like this: CREATE EXTERNAL TABLE test_text (rowid ...
    Peter MarronPeter Marron
    Jan 21, 2016 at 3:52 pm
    Feb 16, 2016 at 1:27 pm
  • Hi, * Spark 1.5.2 on Hive 1.2.1 * Hive 1.2.1 on Spark 1.3.1 * Oracle Release * Hadoop 2.6 I am running spark-sql using Hive metastore and I am pleasantly surprised by the speed by which ...
    Mich TalebzadehMich Talebzadeh
    Jan 31, 2016 at 11:07 pm
    Feb 1, 2016 at 7:05 am
  • Hi, I am trying to setup Hortonworks Data Platform. I would want to setup Hive in high availability mode (both metastore and as well as HiveServer2). Along with that, Hortonworks recommendation is to ...
    Greenhorn TechieGreenhorn Techie
    Jan 24, 2016 at 8:49 pm
    Jan 28, 2016 at 9:26 pm
  • I am a developer at Qubole and I want to introduce an open source project - Quark - https://github.com/qubole/quark. If you are using Apache Hive with data warehouses like Vertica, Greenplum or ...
    Rajat VenkateshRajat Venkatesh
    Jan 27, 2016 at 10:57 am
    Jan 27, 2016 at 3:14 pm
  • *I have run a query many times, there will be two results without regular.* *One is 36834699 and other is 18464706.* *The query is * set spark.yarn.queue=soft.high; set hive.execution.engine=spark ...
    Jone ZhangJone Zhang
    Jan 27, 2016 at 7:20 am
    Jan 27, 2016 at 2:39 pm
  • Hi, There are number of questions brought up about Hive Bucketing. As I see - it is another name for hash partitioning (assuming that Hive partitioning is effectively range partitioning). I borrow ...
    Mich TalebzadehMich Talebzadeh
    Jan 26, 2016 at 9:44 pm
    Jan 26, 2016 at 10:28 pm
  • Hi, I'd like to use S3 as the hive warehouse on my emr 4.x cluster. I've set hive.metastore.warehouse.dir=s3n://testbucket/hive_warehouse and ...
    Zsolt TóthZsolt Tóth
    Jan 22, 2016 at 12:52 pm
    Jan 22, 2016 at 1:11 pm
  • Hi all, Apologies for the nature of this question. Someone asked me whether it is possible to perform file search by hashes in Hadoop. I am thinking that he means wildcard searches in HDFS? Anyone ...
    Mich TalebzadehMich Talebzadeh
    Jan 21, 2016 at 3:15 pm
    Jan 21, 2016 at 10:09 pm
  • Hi all, As a reminder, the meeting will be held tomorrow as scheduled. Please refer to the meetup page[1] for details. Looking forward to meeting you all! Thanks, Xuefu [1] ...
    Xuefu ZhangXuefu Zhang
    Jan 20, 2016 at 5:45 pm
    Jan 21, 2016 at 10:05 pm
  • I got json string of the form: {"k1":"v1","k2":"v2,"k3":"v3"} How would I go about converting this to a map<string, string ? Thanks!
    Buntu DevBuntu Dev
    Jan 20, 2016 at 10:04 pm
    Jan 21, 2016 at 2:12 am
  • Hi I am trying to use beeline with hive + kerberos (Hortonworks sandbox 2.3) The problem is that I can use hdfs but not beeline and I do not know what is wrong. Console output: [margusja@sandbox ~]$ ...
    Margus RooMargus Roo
    Jan 9, 2016 at 3:49 pm
    Jan 11, 2016 at 8:30 am
  • Hello, We have a requirement to load data from xml file to Hive tables. The xml tags woud be the columns and values will be the data for those columns. Any pointers will be really helpful. Thanks, ...
    Nitinpathakala .Nitinpathakala .
    Jan 7, 2016 at 12:36 pm
    Jan 11, 2016 at 5:32 am
  • We made a comparison of the number of records between Hive on MapReduce and Hive on Spark.And they are in good agreement. But how to ensure that the record values of Hive on MapReduce and Hive on ...
    Jone ZhangJone Zhang
    Jan 8, 2016 at 3:37 am
    Jan 8, 2016 at 5:14 am
  • Hi Java Gurus, I have written a simple Java program that works fine when I run it on Linux as hduser (the OS owner for Hadoop, Hive etc) When I create a project in Eclipse on Windows and have copied ...
    Mich TalebzadehMich Talebzadeh
    Jan 7, 2016 at 10:37 am
    Jan 7, 2016 at 9:51 pm
  • Hello all With respect to command line hive shell, is the query execution time reported by hive the total time elapsed since the issue of the query or the actual time spent in the query itself? I ask ...
    Awhan PatnaikAwhan Patnaik
    Jan 7, 2016 at 10:41 am
    Jan 7, 2016 at 11:53 am
  • Hi, I am using Hive 0.14, and I am using JDBC to connect the Hive thrift server to do queries things, I encounter two issues- 1. When the query is issued,how can i get the job id(mapreduce that run ...
    Jan 28, 2016 at 2:11 pm
    Jan 28, 2016 at 11:22 pm
  • Hi see this cloudera blog at: http://blog.cloudera.com/blog/2014/08/improving-query-performance-using-partitioning-in-apache-hive/ That mentions "Do not over-partition the data. With too many small ...
    Shubhvardhan ManjayyaShubhvardhan Manjayya
    Jan 27, 2016 at 4:14 am
    Jan 27, 2016 at 5:17 am
  • Hi, We have a table in which the files are created by different users (under the same group). When a user inserts into the table it will finish successfully but after moving the files the user will ...
    Daniel HavivDaniel Haviv
    Jan 20, 2016 at 10:28 am
    Jan 25, 2016 at 8:30 am
  • Hi All, I am trying to execute hive commands on json file using jsonserde's,but I am always getting null values ,but not actual data. I have used serde's provided in ...
    Sri sowjSri sowj
    Jan 14, 2016 at 5:59 pm
    Jan 24, 2016 at 11:05 pm
Group Navigation
period‹ prev | Jan 2016 | next ›
Group Overview
groupuser @
categorieshive, hadoop

89 users for January 2016

Mich Talebzadeh: 76 posts Gopal Vijayaraghavan: 19 posts Jörn Franke: 12 posts Ashok Kumar: 10 posts Elliot West: 8 posts Todd: 8 posts Ophir Etzion: 7 posts Marcin Tustin: 6 posts @Sanjiv Singh: 6 posts Alexander Pivovarov: 5 posts Daniel Haviv: 5 posts Jone Zhang: 5 posts LLBian: 5 posts Prasanth Jayachandran: 5 posts Sofia Panagiotidi: 5 posts Xuefu Zhang: 5 posts 董亚军: 5 posts Gerber, Bryan W: 4 posts Rajesh Balamohan: 4 posts Sergey Shelukhin: 4 posts
show more