Grokbase Groups Hive user March 2012

Search Discussions

97 discussions - 337 posts

  • I have 2 table, each has 6 million records and clustered into 10 buckets These tables are very simple with 1 key column and 1 value column, all I want is getting the key that exists in both table but ...
    Mar 31, 2012 at 1:16 am
    Apr 12, 2012 at 3:01 pm
  • Hi There, I am trying to get dboutput() UDF to work so that it can write result to a PG DB table. ==This is what I did in hive shell== add jar /location/hive_contrib.jar; add jar ...
    Abhishek ParolkarAbhishek Parolkar
    Mar 29, 2012 at 8:25 am
    Apr 10, 2012 at 3:13 am
  • Is there a way to prevent LOAD DATA LOCAL INPATH from appending _copy_1 to logs that already exist in a partition? If the log is already in hdfs/hive I'd rather it fail and give me an return code or ...
    Sean McNamaraSean McNamara
    Mar 20, 2012 at 1:16 am
    Mar 21, 2012 at 6:17 pm
  • Hello, Is there a reason behind not implementing non-equality joins in Hive? In other words, is there any usage for theta-join, if implemented? Thank you in advance for your response, Mahsa
    Mahsa mofidpoorMahsa mofidpoor
    Mar 13, 2012 at 4:17 pm
    Mar 17, 2012 at 4:40 pm
  • I realize that hive doesn't have a date type for the columns and I realize that hive *does* have various date functions. I just haven't found a concrete example of how these two issues are brought ...
    Keith WileyKeith Wiley
    Mar 13, 2012 at 4:45 pm
    Mar 13, 2012 at 7:57 pm
  • Hi,all when I using hive through jdbc,and execute the code below. Statement stmt = con.createStatement(); stmt.setQueryTimeout(10); hive thrown the exception "Method not support." so how can I set ...
    Mar 26, 2012 at 7:15 am
    Apr 9, 2012 at 1:34 am
  • hi, how can I join two tables A and B so that the result is "In A but not in B"? let's take an example, say, the column to identify record is id. e.g. select A.* from A join B on ( = ...
    Mar 12, 2012 at 3:52 am
    Mar 12, 2012 at 12:44 pm
  • I successfully installed and used Hive to create basic tables (on one of my two machines; another discussion describes the problems I'm having with the other machine). However, basic queries aren't ...
    Keith WileyKeith Wiley
    Mar 9, 2012 at 11:44 pm
    Mar 12, 2012 at 5:26 pm
  • Hello, I have huge gzipped files that I need to drop the header row from before loading to a hive table. Right now, my process is: 1. Gunzip the data (...takes forever) 2. Drop the first row using ...
    Dan YDan Y
    Mar 7, 2012 at 4:01 pm
    Mar 7, 2012 at 4:44 pm
  • Hi users, I have a sequence file produced by mapreduce with TEXT, INTWRITABLE key value pair...I tried to create a external hive table using the file but hive can't read it. Thank you Sent from my ...
    Wei Shung ChungWei Shung Chung
    Mar 6, 2012 at 10:48 pm
    Mar 30, 2012 at 6:46 pm
  • Hi, Does Hive server support multiple concurrent client connections? The following page says it does not support them. Hive Server is currently not thread ...
    Mar 27, 2012 at 9:14 am
    Mar 28, 2012 at 7:11 pm
  • Hi I need to schedule my hive scripts which needs to process incoming weblogs on an hourly basis. Currently, I could process my weblog files by executing my scripts from hive command line interface ...
    LakshmiKanth PLakshmiKanth P
    Mar 19, 2012 at 6:49 pm
    Mar 19, 2012 at 9:42 pm
  • I'm quite comfortable hadoop and the associated lingo, been programming it via Java and via C++ streams for several years. However, I have just started Hive for the first time...and I'm stuck. I was ...
    Keith WileyKeith Wiley
    Mar 9, 2012 at 8:04 pm
    Mar 12, 2012 at 4:50 pm
  • How do I get the following meta information about a table 1. recent users of table, 2. top users of table, 3. recent queries/jobs/reports, 4. number of rows in a table I don't see anything either in ...
    Ladda, AnandLadda, Anand
    Mar 30, 2012 at 10:07 pm
    Apr 3, 2012 at 1:20 pm
  • I see hive-31 supposedly supports this, but when mimicking the syntax in the jira i get errors hive create table dem select demographics_local.* from ...
    Stephen BoeschStephen Boesch
    Mar 29, 2012 at 5:15 pm
    Mar 30, 2012 at 5:52 pm
  • I'm trying to modify a script to allow for more code reuse, by prepending table names with a variable. For example: CREATE TABLE etl_${hiveconf:table}_traffic AS ... The problem I'm running into is ...
    Tucker, MattTucker, Matt
    Mar 30, 2012 at 3:00 pm
    Mar 30, 2012 at 4:11 pm
  • I am trying to do a sqoop export (data from hdfs hadoop to database). The table I am trying to export has 2 million rows. The table has 20 fields. The sqoop command is successful if I did 10 rows ...
    Chalcy RajaChalcy Raja
    Mar 29, 2012 at 1:47 pm
    Mar 30, 2012 at 5:26 am
  • Hi, I'd like to be able to execute a Hive query and for the output to be stored in a path on HDFS (rather than immediately returned by the client). Ultimately I'd like to be able to do this to ...
    Paul InglesPaul Ingles
    Mar 29, 2012 at 11:19 am
    Mar 29, 2012 at 11:56 am
  • Hello, How do I get count from a list of comma separated values? For the lack of better wording, here is an example: Suppose there is a table with two columns, id (integers) and values (string) in ...
    Saurabh SSaurabh S
    Mar 28, 2012 at 6:21 pm
    Mar 28, 2012 at 6:58 pm
  • I'm using Hive distribution from CDH v0.5.0+32 and was able to run a simple query "select * from country;" But when I try to run "select * from country where code = 'US', I get the error below ...
    Nguyen, KhoaNguyen, Khoa
    Mar 27, 2012 at 8:55 pm
    Mar 28, 2012 at 9:58 am
  • Hi All, My raw data looks like this: DateTime,OtherData 01-01-2000-01:00:00,blablabla1 01-01-2000-04:00:00,blablabla2 01-02-2000-02:00:00,blablabla3 I would like to partition on the datepart of ...
    Dan YDan Y
    Mar 21, 2012 at 3:08 pm
    Mar 23, 2012 at 12:48 pm
  • Hi,all I want to track the progress of a query, how can I get the job name including stages of a query?
    Mar 20, 2012 at 4:59 am
    Mar 20, 2012 at 7:49 pm
  • Hi there, when I'm executing the following queries in hive set = true; CREATE TABLE IDAP_ROOT as SELECT a.*,b.acnt_no FROM idap_pi_root a LEFT OUTER JOIN idap_pi_root_acnt b ON ...
    Bruce BianBruce Bian
    Mar 19, 2012 at 9:13 am
    Mar 20, 2012 at 9:02 am
  • if I wang to update a table, e.g, insert overwrite table mytable select lower(col1), col2, col3 from mytable; if mytable has many columns but I only need to update one of them, how can I write the ...
    Mar 16, 2012 at 10:57 am
    Mar 19, 2012 at 2:31 am
  • Hi, Does the MR jobs of a hive query write directly to the destination or are the results of the MR jobs moved to the destination at the end? To be more precise, is it safe to write query in the ...
    Mar 15, 2012 at 11:24 am
    Mar 17, 2012 at 4:57 am
  • Hi , Can we insert data to external hive tables? 1) Create an external table create external table binary_tbl_local(byt TINYINT, bl boolean, it int, lng BIGINT, flt float, dbl double, shrt SMALLINT, ...
    Lu, WeiLu, Wei
    Mar 14, 2012 at 6:34 am
    Mar 14, 2012 at 12:17 pm
  • Um, this is weird. It simply isn't modifying the order of the returned rows at all. I get the same result with no 'order by' clause as with one. Adding a limit or specifying 'asc' has no effect. ...
    Keith WileyKeith Wiley
    Mar 13, 2012 at 8:54 pm
    Mar 13, 2012 at 9:12 pm
  • Hello, Could anybody tell me how can I load data into a Hive table when the flat file is existing on another server and bit locally on Hadoop node. For example, I am trying to load the table ...
    Omer, FarahOmer, Farah
    Mar 1, 2012 at 4:21 pm
    Mar 6, 2012 at 5:57 pm
  • Hi all, I am getting this following error when I am trying to do select ...with group by operation.I am grouping on around 25 columns java.lang.RuntimeException ...
    Praveenesh kumarPraveenesh kumar
    Mar 23, 2012 at 1:05 pm
    Sep 7, 2012 at 5:15 am
  • Hi I am able to run certain hive commands e.g. create table and select.. but not others .. Also my hadoop pseudo disributed cluster is working fine - i can run the examples. Examples of commands that ...
    Stephen BoeschStephen Boesch
    Mar 29, 2012 at 3:23 pm
    Mar 29, 2012 at 5:17 pm
  • I am trying to write a query that will return the first 5% of rows in a table. I've struggled with this for quite a while and can't figure out a command that works in Hive. Has anyone done this? ...
    James NewhavenJames Newhaven
    Mar 28, 2012 at 3:53 pm
    Mar 28, 2012 at 6:40 pm
  • Hi All, I was just going through the implementation scenario of avoiding or deleting Zero byte file in HDFS. I m using Hive partition table where the data in partition come from INSERT OVERWRITE ...
    Abhishek Pratap SinghAbhishek Pratap Singh
    Mar 26, 2012 at 9:21 pm
    Mar 26, 2012 at 9:54 pm
  • HI Folks, i follow all ther steps and build and install snappy and after creating sequencetable when i m insert overwrite the data into this table its throwing this error. Cannot ...
    Hadoop hiveHadoop hive
    Mar 22, 2012 at 11:30 am
    Mar 22, 2012 at 3:12 pm
  • Hiya, I'm using HIVE 0.7.1 with 1) moderate 50GB table, let's call it `temp_view` 2) query: select max(length(get_json_object(json, '$.user_id'))) from temp_view. From my point of view this query is ...
    Alexander ErshovAlexander Ershov
    Mar 20, 2012 at 10:44 am
    Mar 21, 2012 at 1:09 pm
  • Can Hive be configured to work with multiple namenodes(clusters)? I understand we can use command 'SET' to set any hadoop (or hive) configuration variable. But is it possible to handle multiple ...
    Dani RayanDani Rayan
    Mar 16, 2012 at 11:27 pm
    Mar 17, 2012 at 6:17 pm
  • Hi there, when I'm using Hive to doing a query as follows, 6 Map/Reduce jobs are launched, one for each join, and it deals with ~460M data in ~950 seconds, which I think is way toooo slow for a ...
    Bruce BianBruce Bian
    Mar 13, 2012 at 4:24 pm
    Mar 13, 2012 at 5:15 pm
  • Hi, I recently tried Hive-645 feature and save query results directly to Mysql table. The feature can be found here: ...
    Lu, WeiLu, Wei
    Mar 9, 2012 at 2:58 am
    Mar 12, 2012 at 2:19 am
  • Hi, For the query below, I find the five Move Operations (after MapReduce job) are not operated in parallel. from impressions2 insert OVERWRITE LOCAL DIRECTORY '/disk2/iis1' select * where ...
    Lu, WeiLu, Wei
    Mar 7, 2012 at 11:42 am
    Mar 7, 2012 at 5:23 pm
  • Hi all, I am trying to get an idea of what people do for setting up Hive metastore when using Amazon EMR. For those of you using Amazon EMR: 1) Do you have a dedicated RDS instance external to your ...
    Mark GroverMark Grover
    Mar 7, 2012 at 2:54 am
    Mar 7, 2012 at 6:45 am
  • Hi, I tried to do aggregation based on Table impressions2, and then need to save the results to two different local files (or tables). I tried three methods, only the first one succeeded: 1) create a ...
    Lu, WeiLu, Wei
    Mar 6, 2012 at 9:40 am
    Mar 7, 2012 at 2:14 am
  • Hi, I need to load data directly from a ctl A delimiter zipped file from the Linux box directly. Do I need to 1) un-zip the files and then load them to Hive tables, or 2) is there a direct command ...
    Lu, WeiLu, Wei
    Mar 5, 2012 at 2:26 am
    Mar 5, 2012 at 4:44 am
  • Hello, I have a set of URLs which I need to parse. For example, if the url is,, I need to extract, i.e. everything between second and third ...
    Saurabh SSaurabh S
    Mar 1, 2012 at 9:19 pm
    Mar 2, 2012 at 12:18 am
  • If i have a hive table, which is an external table, and have my "log files" being read into it, if a new file is imported into the hdfs and the file has a new column, how can i get hive to handle the ...
    Anson AbrahamAnson Abraham
    Mar 1, 2012 at 8:07 pm
    Mar 1, 2012 at 10:24 pm
  • Hi When I try to create any tables, I receive this message: FAILED: Error in metadata: MetaException(message:Got exception: File file:/user/hive/warehouse/table_name ...
    Mahsa mofidpoorMahsa mofidpoor
    Mar 29, 2012 at 11:00 pm
    Mar 30, 2012 at 6:14 am
  • Hi, My servers are in restricted environment without internet. But I can connect to internet from my PC and copy file to server. How can I install CDH3 on CentOS5.5 server in this situation. Best ...
    Mar 28, 2012 at 10:23 am
    Mar 29, 2012 at 4:02 am
  • <property <name hive.exec.rowoffset</name <value false</value <description Whether to provide the row offset virtual column</description </property I know the others are INPUT__FILE__NAME and ...
    Edward CaprioloEdward Capriolo
    Mar 25, 2012 at 9:30 pm
    Mar 26, 2012 at 12:45 am
  • Hi folks, I have several questions about optimization in Hive, they are mainly related to bucketized/sorted tables. Let say I have a table T bucketized on user_id and sorted by user_id, time. CREATE ...
    Mdefoinplatel ExtMdefoinplatel Ext
    Mar 20, 2012 at 2:20 pm
    Mar 23, 2012 at 10:39 am
  • I haven't had an opportunity to set up a huge Hive database yet because exporting csv files from our SQL database is, in itself, a rather laborious task. I was just curious how I might expect Hive to ...
    Keith WileyKeith Wiley
    Mar 19, 2012 at 10:52 pm
    Mar 20, 2012 at 6:03 am
  • Hi, I am trying to implement a task in Hive like Stored Procedure in SQL. In SQL, when we write cursor, first we execute select query and then fetching the records we perform some actions. Likely I ...
    Bhavesh ShahBhavesh Shah
    Mar 16, 2012 at 5:48 am
    Mar 19, 2012 at 5:01 am
  • Hi all, I need to perform a lot of "point in polygon" checks and want to use Hive (currently I mix Hive, Sqoop and PostGIS in an Oozie workto do this). In an ideal world, I would like to create a ...
    Tim RobertsonTim Robertson
    Mar 16, 2012 at 9:22 am
    Mar 17, 2012 at 1:50 am
Group Navigation
period‹ prev | Mar 2012 | next ›
Group Overview
groupuser @
categorieshive, hadoop

94 users for March 2012

Bejoy KS: 27 posts Keith Wiley: 24 posts Edward Capriolo: 23 posts Tucker, Matt: 12 posts Lu, Wei: 11 posts Mark Grover: 11 posts Abhishek Parolkar: 8 posts Felix.徐: 8 posts Nitin Pawar: 8 posts Gabi D: 7 posts Hadoop hive: 7 posts 王锋: 7 posts Chalcy Raja: 6 posts Mahsa mofidpoor: 6 posts Richard: 6 posts Saurabh S: 6 posts Stephen Boesch: 6 posts Abhishek Pratap Singh: 5 posts Aniket Mokashi: 5 posts Bruce Bian: 5 posts
show more