Grokbase Groups Hive user May 2012
FAQ

Search Discussions

114 discussions - 458 posts

  • How do I load data from S3 into Hive using Amazon EMR? I've booted a small cluster, and I want to load a 3-column TSV file from Pig into a table like this: create table from_to (from_address string, ...
    Russell JurneyRussell Jurney
    May 29, 2012 at 8:20 pm
    May 30, 2012 at 9:30 pm
  • I am pretty new to hive and was trying to clearly understand the difference between a managed and an external table. As my current understanding stands, a managed table is a table whose data is ...
    Kulkarni SwarnimKulkarni Swarnim
    May 10, 2012 at 9:53 pm
    May 14, 2012 at 1:41 pm
  • Hello all, I have written a Hive JDBC code and created a JAR of it. I am running that JAR on 10 cluster. But the problem as I am using the 10 cluster still the performance is same as that on single ...
    Bhavesh ShahBhavesh Shah
    May 8, 2012 at 4:38 am
    May 9, 2012 at 6:45 am
  • Hello all, My Use Case is: 1) I have a relational database which has a very large data. (MS SQL Server) 2) I want to do analysis on these huge data and want to generate reports on it after analysis ...
    Bhavesh ShahBhavesh Shah
    May 14, 2012 at 7:09 am
    May 15, 2012 at 11:44 am
  • hi, We setup a new hive 0.9 client, Found some sql did not work, for example hive create table t1(a int) partitioned by ( dt int ); OK Time taken: 0.097 seconds hive load data local inpath '/tmp/t' ...
    WdWd
    May 31, 2012 at 7:19 am
    May 31, 2012 at 9:37 am
  • Hello - For those users whose setup is somewhat production, what do you use for job scheduling and dependency management? *thanks, ronak* * * * *
    Ronak BhattRonak Bhatt
    May 26, 2012 at 2:48 pm
    May 30, 2012 at 3:53 am
  • Hello, I've encountered a weird issue with hive and I'm not sure if I'm doing something wrong or if it is a bug. I'm trying to do a multi-group-by select statement on a partitioned table. I wan't ...
    Jan DolinárJan Dolinár
    May 25, 2012 at 10:04 am
    Jun 8, 2012 at 5:42 am
  • I am new to Hive. I have several SQL from RDBMS database that I need to convert to hive. What's the best reference for HIVEQL? For now I am trying to figure out how to do this in hive: Select ...
    Mohit AnchliaMohit Anchlia
    May 24, 2012 at 8:02 pm
    May 24, 2012 at 10:23 pm
  • Hello all, I have one query. It is executing fine on Hive CLI and returning the result. But when I am executing it with the help Hive JDBC I am getting error as: *java.sql.SQLException: Query ...
    Bhavesh ShahBhavesh Shah
    May 10, 2012 at 10:27 am
    May 11, 2012 at 11:34 am
  • Hi I have data warehouse implementation for Click Stream data analysis on RDBMS. Its a start schema (Dimensions and Facts). Now if i want to move to Hive, Do i need to create same data model as ...
    Kuldeep ChitrakarKuldeep Chitrakar
    May 10, 2012 at 1:25 pm
    May 10, 2012 at 7:30 pm
  • Hi All, Does hive support subquery in select statement? Given below data, I need counts and percentage counts per category. The divisor in my case is not the total count, but something that is stored ...
    Shan sShan s
    May 25, 2012 at 3:14 pm
    May 28, 2012 at 9:39 am
  • My use case requires individual accounts for business users groups. Is there a way to mimic MySQL (or other database) to create users with read-only permissions? This avoid business user accidental ...
    Patrick LuoPatrick Luo
    May 18, 2012 at 6:47 pm
    May 23, 2012 at 11:59 pm
  • When i am trying to run a query with index i am getting this ex= ception.My hive version is 0.7.1 java.lang.OutOfMemoryError: GC overhead limit exceeded = at ...
    Debarshi BasakDebarshi Basak
    May 23, 2012 at 8:14 am
    May 23, 2012 at 10:04 am
  • Hi guys, Thanks you very much in advance for your help. My problem in short is getting the date for yesterday in a YYYYMMDD format. As I use this format for partitions, I need this format in quite ...
    Zoltán Tóth-CzifraZoltán Tóth-Czifra
    May 15, 2012 at 1:11 pm
    May 15, 2012 at 3:24 pm
  • Hi, I put mysql-connector-java-5.1.20-bin.jar into $HIVE_HOME/lib, and configurations in hive-site.xml are <property <name javax.jdo.option.ConnectionURL</name <value ...
    Xiaobo GuXiaobo Gu
    May 12, 2012 at 7:38 am
    May 21, 2012 at 9:28 pm
  • Does hive currently support multiple SerDe s to be defined per table name? Looking through the code and documentation, it seems like it doesn't as only one could be specified through the ROW FORMAT ...
    Kulkarni SwarnimKulkarni Swarnim
    May 17, 2012 at 3:30 pm
    May 18, 2012 at 3:21 pm
  • I am currently using hive 0.7.1 and creating indexes based on columns in the where clause. However, when I run the explain plan I do not see the index being leveraged. The syntax that I am using to ...
    Raghunath, RanjithRaghunath, Ranjith
    May 16, 2012 at 8:07 pm
    May 17, 2012 at 2:42 am
  • Hi, I'm setting up a Hadoop cluster and would like to understand how much disk space I should expect to need with joins. Let's assume that I have 2 tables, each of about 500 GB. Since the tables are ...
    Ali Safdar KureishyAli Safdar Kureishy
    May 7, 2012 at 7:32 am
    May 8, 2012 at 6:00 pm
  • How do I set-up a filter constant for TIMESTAMP datatype. In Hive 0.7 since timestamps were represented as strings a query like this would return data select * from LU_day where day_date ='2010-01-01 ...
    Ladda, AnandLadda, Anand
    May 26, 2012 at 1:28 pm
    Jun 5, 2012 at 3:21 am
  • Hi, I build the hive table mapped with hbase table, CREATE TABLE http_access(key string, client_ip string, client_port int, request_method string, event_time timestamp) STORED BY ...
    Peyton PengPeyton Peng
    May 30, 2012 at 8:54 am
    Jun 4, 2012 at 5:12 pm
  • Hi, has anyone managed to get Hive properly set up in IntelliJ? I've tried but so far I've failed to get it running with Ivy and its dependencies. Pointers would be very appreciated. Cheers, Lars
    Lars FranckeLars Francke
    May 31, 2012 at 7:46 am
    May 31, 2012 at 8:46 pm
  • We are using Dynamic partitioning and facing the similar problem. Below is the jobtracker error log. We have a hadoop cluster of 6 nodes, 1.16 TB capacity with over 700GB still free. Caused by ...
    Nimra ChoudharyNimra Choudhary
    May 29, 2012 at 9:33 am
    May 29, 2012 at 10:03 am
  • hi there, I got the following trace stack when startuping hive cli. It worked well last week when i just installed it. Anybody can help? thanks, Dianbau [dzhu@bbdw-194 bin]$ ./hive Logging ...
    Dianbo.zhuDianbo.zhu
    May 18, 2012 at 7:52 am
    May 23, 2012 at 2:36 am
  • All, I'm a relative newcomer to Hadoop/Hive. We have a very standard setup of multiple webapp servers backed by a mySql database. We are evaluating Hive as a high scale solution for our relatively ...
    Jon PalmerJon Palmer
    May 15, 2012 at 12:12 pm
    May 15, 2012 at 7:45 pm
  • I think if I create index for one table When I excute “select c1,c2 from tab where index_col=1”, should not start mapreduce But it was start . So how to use a index without mapreduce? Compact index ...
    Hezhiqiang (Ransom)Hezhiqiang (Ransom)
    May 11, 2012 at 3:34 am
    May 15, 2012 at 5:54 pm
  • Hi, If we set different hive.metastore.warehouse.dir for the different Hive instances, it this ok, does them conflict with other resouces, such as directory paths under /tmp? Regards, Xiaobo Gu
    Xiaobo GuXiaobo Gu
    May 13, 2012 at 12:08 pm
    May 14, 2012 at 5:31 am
  • I can't find it in the release package. Xiaobo Gu
    Xiaobo GuXiaobo Gu
    May 12, 2012 at 6:28 am
    May 13, 2012 at 3:57 am
  • Hi I am using - Hadoop 0.20.2 - Hive 0.8.1 - Sqoop 1.4.1-incubating in my sample project. Currently I am importing data from PostgreSQL to Hive table using Sqoop. My database table in PostgreSQL has ...
    MperformerMperformer
    May 7, 2012 at 12:34 am
    May 9, 2012 at 12:21 am
  • Hi, I have attached a *Sequence* file with the following format: <url:Text <data:CrawlDatum (CrawlDatum is a custom Java type, that contains several fields that would be flattened into several ...
    Ali Safdar KureishyAli Safdar Kureishy
    May 5, 2012 at 8:06 pm
    May 6, 2012 at 2:30 pm
  • Hello, I have a query like SELECT * FROM ( SELECT 1, concat(1_timestamp, ', ', 2_account_id ) FROM table_1 WHERE 2_account_id = 1132576 LIMIT 1000000000 UNION ALL SELECT 2, concat(1_timestamp, ', ', ...
    Alexander GoryunovAlexander Goryunov
    May 4, 2012 at 12:53 pm
    May 4, 2012 at 2:06 pm
  • Hi All, I have changed the namenode from one server to another when there was a crash of hardware. After configuring the Namenode. When i am executing hive query below error is shown.. bin/hive -e ...
    Varun kumarVarun kumar
    May 9, 2012 at 4:15 pm
    Jun 11, 2012 at 5:28 pm
  • Hi, I'm trying to define a table over an external file. My file has 12 fixed columns followed by a varying amount of columns that depends on some of the fixed ones. I tried to define the table as ...
    Ramon PinRamon Pin
    May 30, 2012 at 4:27 pm
    Jun 4, 2012 at 8:59 am
  • Hi there, I'm looking for help with the following error: I keep receiving an error 'Parse Error: line 40:4 cannot recognize input near '-' 'Platform' STRING column type. ' It looks to be a problem ...
    Alex BarreraAlex Barrera
    May 31, 2012 at 3:36 pm
    Jun 1, 2012 at 7:39 pm
  • Hello Hive Users, There is a strange situation I am facing. I have a string column in my Hive table ( its IP address). I am creating a UDF where I am taking this string column and converting it into ...
    Praveenesh kumarPraveenesh kumar
    May 30, 2012 at 2:41 pm
    May 30, 2012 at 4:16 pm
  • Hi Hive Users, Please tell me if i am missing something. I prefer to run hive on a thin client so i connect to a hive server by specifying a (remote) host and port. (eg. $ hive -h <host -p <port ) ...
    Stephen SpragueStephen Sprague
    May 26, 2012 at 2:22 am
    May 28, 2012 at 7:09 pm
  • I have the parameter hive.map.aggr set to true. However, when I look at the counters associated with the map tasks I notice the following "Combine input records 0". I am interpreting this as a ...
    Raghunath, RanjithRaghunath, Ranjith
    May 23, 2012 at 12:43 am
    May 24, 2012 at 12:52 am
  • When I launch simple SQL I see "local hadoop". And when I do hadoop job fs -list in my hadoop cluster I don't see any jobs. Am I doing something wrong here? # hive Hive history ...
    Mohit AnchliaMohit Anchlia
    May 23, 2012 at 11:48 pm
    May 24, 2012 at 12:46 am
  • I have a simple query running on HIVE using Amazon EMR where the reduce operation seems to be stuck. The table is small with just 300,000+ rows (data_analysis). I intend to work on a 30,000,000 row ...
    Balaji RaoBalaji Rao
    May 18, 2012 at 6:57 pm
    May 21, 2012 at 9:24 pm
  • Is that possible? What I am trying to do is create an S3 table using CTAS. Since CTAS doesn't allow specifying a location, I have to create a managed table first: CREATE TABLE T AS SELECT ...; (I ...
    Igor TatarinovIgor Tatarinov
    May 11, 2012 at 9:58 pm
    May 12, 2012 at 6:28 am
  • I'm having a hard time passing a date as a hive environment variable. The setting is this: The table I'm querying is partitioned on a date column, say, local_dt. I wish to query on last two days' ...
    Saurabh SSaurabh S
    May 10, 2012 at 6:06 pm
    May 10, 2012 at 6:58 pm
  • I created a very simple hive table and then ran the following query that should run a M/R job to return the results. hive SELECT COUNT(*) FROM invites; But I am getting the following exception ...
    Kulkarni SwarnimKulkarni Swarnim
    May 7, 2012 at 2:01 pm
    May 7, 2012 at 7:32 pm
  • Hi All, We did a successful setup of hadoop-0.20.203.0 and hive-0.7.1. We also loaded a large number of CSV files into HDFS successfully. We can query through hive CLI. Now when we execute any query ...
    Garg, RinkuGarg, Rinku
    May 2, 2012 at 5:16 am
    May 5, 2012 at 10:15 pm
  • I am trying to write a custom ObjectInspector extending the StructObjectInspector and got a little confused about the use of the getStructFieldData method on the inspector. Looking at the definition ...
    Kulkarni SwarnimKulkarni Swarnim
    May 25, 2012 at 8:59 pm
    Jun 5, 2012 at 3:51 pm
  • I need help with a simple subquery. Given below data, I need counts and percentage counts per category. (Re-phrasing my earlier question ) With the code below I get an error: *FAILED: Parse Error:* ...
    Shan sShan s
    May 28, 2012 at 1:48 pm
    Jun 4, 2012 at 12:40 pm
  • Question from novice. Where I can read table design best practices? I have a measure table with millions of rows and many dimension tables with less than 1000 rows each. I can't find out the way to ...
    Avdeev V. M.Avdeev V. M.
    May 28, 2012 at 5:17 am
    Jun 4, 2012 at 9:30 am
  • Hi, I've got a table vt_new_data which is defined as follows: CREATE TABLE VT_NEW_DATA ( V_ACCOUNT_NUM string ,V_ACCOUNT_MODIFIER_NUM string ,V_DEPOSIT_TYPE_CD string ,V_DEPOSIT_TERM int ...
    Bruce BianBruce Bian
    May 31, 2012 at 10:18 am
    Jun 1, 2012 at 4:27 am
  • Hi, I want to use Hive on some data in the following format: <doc \0x01 field1=val1\0x01 field2=val2\0x01 ... </doc \0x01 the lines between <doc and </doc are a record. How should I define the table? ...
    RichardRichard
    May 21, 2012 at 9:44 am
    May 22, 2012 at 4:52 pm
  • Hello! I'm very new to the world Hadoop and Hive so I cannot solve a problem that I encountered. Hadoop has been deployed on a single-node in a pseudo-distributed mode. I'm trying to copy data from ...
    Avdeev V. M.Avdeev V. M.
    May 22, 2012 at 10:21 am
    May 22, 2012 at 1:08 pm
  • Hi, I'd like to document HIVE-2810 and possibly HIVE-1634 in the wiki. Could I get edit access? My username is lars.francke. I think it'd be great to have a policy to accept patches only with ...
    Lars FranckeLars Francke
    May 15, 2012 at 10:13 am
    May 21, 2012 at 10:23 am
  • Hi, When I want to join two tables, I receive the following error: 12/05/10 12:03:31 WARN conf.HiveConf: hive-site.xml not found on CLASSPATH WARNING: org.apache.hadoop.metrics.jvm.EventCounter is ...
    Mahsa MofidpoorMahsa Mofidpoor
    May 10, 2012 at 4:30 pm
    May 20, 2012 at 4:30 pm
Group Navigation
period‹ prev | May 2012 | next ›
Group Overview
groupuser @
categorieshive, hadoop
discussions114
posts458
users112
websitehive.apache.org

112 users for May 2012

Edward Capriolo: 40 posts Bhavesh Shah: 23 posts Mark Grover: 23 posts Bejoy KS: 19 posts Nitin Pawar: 18 posts Ranjith: 14 posts Kulkarni Swarnim: 12 posts Wd: 11 posts Shashwat shriparv: 10 posts Xiaobo Gu: 10 posts Russell Jurney: 9 posts Mohit Anchlia: 8 posts Ali Safdar Kureishy: 7 posts Alo alt: 7 posts Carl Steinbach: 7 posts Debarshi Basak: 7 posts Hezhiqiang (Ransom): 7 posts Ashutosh Chauhan: 6 posts Ransom.hezhiqiang: 6 posts Shrikanth shankar: 6 posts
show more
Archives