Search Discussions

57 discussions - 252 posts

  • Hello all, Should I expect to be able to do a Hive JOIN between two tables that have about 10 or 15GB of data each? What I'm noticing (for a simple JOIN) is that all the map tasks complete, but the ...
    Ryan LeCompteRyan LeCompte
    Oct 26, 2009 at 1:40 am
    Oct 26, 2009 at 6:52 pm
  • I access to the http://www.apache.org/dyn/closer.cgi/hadoop/hive/ only Hive 0.3.0 available it works with Hadoop 0.19.0 but mine is $ hadoop version Hadoop 0.20.2-dev Subversion -r Compiled by root ...
    Oct 13, 2009 at 2:47 am
    Oct 14, 2009 at 9:59 pm
  • Guys, I am trying to understand Vertica and how it applies to the Hadoop world. Is this basically a way to store large amounts of data and run SQL-like queries on it that also result in map/red uce ...
    Ryan LeCompteRyan LeCompte
    Oct 17, 2009 at 5:11 am
    Oct 23, 2009 at 5:30 am
  • Hi I'm trying to create a Web Service which will access Hive (0.4.0 release) using JDBC. I used to sample JDBC code from the wiki ...
    Arijit MukherjeeArijit Mukherjee
    Oct 20, 2009 at 8:55 am
    Oct 23, 2009 at 4:23 am
  • Hello all, Very newto Hive (haven't even installed it yet!), but I had a use case that I didn't see demonstrated in any of the tutorial/documentation that I've read thus far. Let's say that I have ...
    Ryan LeCompteRyan LeCompte
    Oct 10, 2009 at 6:43 pm
    Oct 19, 2009 at 6:10 pm
  • Hi guyz, I am trying to build Hive from the trunk - not sure whether I'll be able to do it or not - because every time I tried that, the build process started downloading all versions of hadoop and ...
    Rahul PalRahul Pal
    Oct 23, 2009 at 5:42 am
    Oct 27, 2009 at 9:03 am
  • Hello, If I exect a Hive query in the command-line shell, the results are displayed in a human-readable format in the shell window. However, if I write the query such that it gets redirected to a ...
    Ryan LeCompteRyan LeCompte
    Oct 15, 2009 at 8:27 pm
    Oct 18, 2009 at 2:22 am
  • There were errors in the hive.log 2009-10-01 10:40:53,631 ERROR DataNucleus.Plugin (Log4JLogger.java:error(115)) - Bundle "org.eclipse.jdt.core" requires "org.eclipse.core.resources" but it cannot be ...
    Matt PestrittoMatt Pestritto
    Oct 1, 2009 at 2:54 pm
    Oct 19, 2009 at 7:07 pm
  • It's hive 0.4.0 and Hadoop 0.20.1 yangzhuoluo@ubuntu:~/hive/build/dist$ bin/hive Hive history file=/tmp/yangzhuoluo/hive_job_log_yangzhuoluo_200910161201_337571289.txt hive show tables; FAILED: Error ...
    Clark Yang (杨卓荦)Clark Yang (杨卓荦)
    Oct 16, 2009 at 4:15 am
    Oct 16, 2009 at 10:43 am
  • Hi all , We are trying to work on the Optimizer part of hive . Can anyone point me to a document/link containing how Hive designs it's query plans , what meta data it uses , how it optimizes the ...
    Bharath vissapragadaBharath vissapragada
    Oct 13, 2009 at 11:10 am
    Oct 13, 2009 at 7:25 pm
  • After sitting though some HDFS/BHase presentations yesterday, I started thinking. that the hive model or doing its map/reduce over raw files from HDFS is great, but a dedicated caching/region server ...
    Edward CaprioloEdward Capriolo
    Oct 4, 2009 at 3:43 pm
    Oct 6, 2009 at 2:39 pm
  • Hi there, I want to create a new JSON Field/Column type. I know there exists get_json_object(), but the things is I want to multiple JSON operations in a single select statement and don't want to ...
    Bobby RulloBobby Rullo
    Oct 3, 2009 at 2:37 am
    Oct 5, 2009 at 5:45 pm
  • Hi everyone, I am new to Hive. Here is a problem. I followed the getting started instructions to setup my hive (as well as hadoop). I can create tables, show tables and select without any where ...
    Gang LuoGang Luo
    Oct 22, 2009 at 10:00 pm
    Oct 22, 2009 at 10:31 pm
  • Hi, Has anyone looked into the Microsoft Dryad project? Their basic idea is using DAG(connect computational "vertices" with communication "edges") to model distributed computing flows. And they have ...
    Qing YanQing Yan
    Oct 15, 2009 at 7:31 am
    Oct 17, 2009 at 10:42 am
  • Hello, I am trying to create a table that is bucketed and sorted by various columns. My table is created as a sequence file, and I'm populating it with the LOAD DATA command. However, I just came ...
    Ryan LeCompteRyan LeCompte
    Oct 24, 2009 at 10:01 am
    Oct 28, 2009 at 11:00 pm
  • Hi everyone, I am going to do some interesting things for join in hive. Before I read the source code, could anyone tell me what kinds of join have been implemented in the newest version of hive? ...
    Gang LuoGang Luo
    Oct 25, 2009 at 5:17 pm
    Oct 26, 2009 at 12:50 pm
  • I write a test program to output chinese characters from hive table by java: Test.java: import java.util.Scanner; public class Test { public static void main( String[] args ) { // TODO Auto-generated ...
    Yan xiaohuiYan xiaohui
    Oct 24, 2009 at 2:17 pm
    Oct 25, 2009 at 6:14 am
  • Hello all, Another Hive query question... :) If I have a column in a table of type STRING, and it can take on a comma-delimited set of values (arbitrary, and unknown at query time)... For example: ...
    Ryan LeCompteRyan LeCompte
    Oct 19, 2009 at 8:22 pm
    Oct 19, 2009 at 10:23 pm
  • Hi, I have some basic questions on how hive handles dates and date arithmetic. I apologize if this has already been addressed. Per most samples on this site and elsewhere, I can have an access log ...
    Oct 13, 2009 at 12:04 am
    Oct 16, 2009 at 8:22 am
  • Hi I'm trying to do sort of cube creation on two tables using Hive (0.4.0 release). I have two subqueries and want to do a UNION with them. The subqueries are as follows: 1. SELECT ...
    Arijit MukherjeeArijit Mukherjee
    Oct 15, 2009 at 10:50 am
    Oct 16, 2009 at 4:27 am
  • adding hive-user and hive-dev lists. And removing the common mailing list.. Can you elaborate a bit on the datasize. By default Hive should just be relying on hadoop to give you the number of mappers ...
    Ashish ThusooAshish Thusoo
    Oct 12, 2009 at 6:04 pm
    Oct 13, 2009 at 12:02 pm
  • 1. When I build hive-0.4.0, ivy would try to download hadoop, 0.18.3, 0.19.0 and 0.20.0. But always fail for 2. Then I modified shims/ivy.xml and shims/build.xml to remove ...
    Schubert ZhangSchubert Zhang
    Oct 19, 2009 at 5:03 pm
    Feb 15, 2010 at 10:15 pm
  • hi,I'a a beginner of hive,the problem is when I create a table by cli: $ hive -e "create table abc(a int)"; I use $hive -e "show tables;",which will show this table correctlly. But when I do like ...
    Yan xiaohuiYan xiaohui
    Oct 22, 2009 at 3:27 am
    Oct 24, 2009 at 5:59 am
  • Hi all, I need a small help .. What metadata does hive use for optimizing the query evaluation .. For eg : We can use No of rows in the table etc .. Expecting some response.. Thanks in advance
    Bharath vissapragadaBharath vissapragada
    Oct 16, 2009 at 2:06 pm
    Oct 16, 2009 at 6:50 pm
  • Hello all, I was wondering if there are any performance hits in using a map<string,string column in a Hive schema to represent a line of an apache log. My issue is that frequently new parameters are ...
    Ryan LeCompteRyan LeCompte
    Oct 11, 2009 at 11:20 am
    Oct 13, 2009 at 7:30 am
  • Hi there, It seems that Hive ignores the key when reading hadoop sequence files. Is there a way to make it not do that? If there's no way to do this with a 'stock' Hive build, could someone point me ...
    Bobby RulloBobby Rullo
    Oct 7, 2009 at 1:19 am
    Oct 10, 2009 at 6:53 am
  • Hi, http://wiki.apache.org/hadoop/Hive/LanguageManual/DML How would one insert (more data) into an existing table(with some data already in it). If I am reading the correct documentation, then how do ...
    Vinay guptaVinay gupta
    Oct 6, 2009 at 9:43 pm
    Oct 7, 2009 at 12:16 am
  • Hi, I'm trying to run the unit tests before submitting a patch and I'm getting a test failure. I've tried running the same test on a fresh checkout and it also fails. Below is an excerpt of the ...
    Bill GrahamBill Graham
    Oct 2, 2009 at 5:19 pm
    Oct 2, 2009 at 5:51 pm
  • Hello, I'm trying to convert some Oracle queries to HIVE. I have some queries using LEAD and LAG analytic functions (see: http://ss64.com/ora/syntax-analytic-lead.html) These functions give access to ...
    Bosio AndreaBosio Andrea
    Oct 21, 2009 at 7:59 pm
    Oct 24, 2009 at 5:24 pm
  • Hey guys! Two questions: 1) Is it possible to refer to an aliased column in the GROUP BY of a single query? Hive is complaining if I try to do something like: SELECT if(col1='x',1,0) AS MYALIAS, ...
    Ryan LeCompteRyan LeCompte
    Oct 23, 2009 at 3:12 pm
    Oct 23, 2009 at 5:28 pm
  • Hi there, Is there a way to ask hive what the current partitions are for a given table? I can't simply look in the hive warehouse, because I use "alter table...add partition" so my files don't get ...
    Bobby RulloBobby Rullo
    Oct 19, 2009 at 6:18 pm
    Oct 19, 2009 at 6:37 pm
  • I have already installed hadoop correctly. but what does that mean? ~/hive/build/dist/bin$ ./hive Hive history file=/tmp/yangzhuoluo/hive_job_log_yangzhuoluo_200910151919_995767046.txt hive show ...
    Clark Yang (杨卓荦)Clark Yang (杨卓荦)
    Oct 15, 2009 at 11:50 am
    Oct 15, 2009 at 11:58 am
  • Hi Folks, We have release the rc2 candidate that Namit had generated as Hive 0.4.0. You can find download it from the download page. http://hadoop.apache.org/hive/releases.html#Download Thanks, Ashish
    Ashish ThusooAshish Thusoo
    Oct 14, 2009 at 11:05 pm
    Oct 14, 2009 at 11:33 pm
  • This was actually a previous feature request by someone else: https://issues.apache.org/jira/browse/HIVE-91 The implemented solution was to allow the location to be specified in the "alter table .. ...
    Larry OgrodnekLarry Ogrodnek
    Oct 12, 2009 at 5:33 pm
    Oct 13, 2009 at 11:12 pm
  • Hey there, Is there a way to permanently register your UDF's so that you don't have to do a "create temporary function ..." at beginning of each session? Another alternative would be to have a ...
    Bobby RulloBobby Rullo
    Oct 12, 2009 at 11:29 pm
    Oct 13, 2009 at 12:04 am
  • Hello all, My hive queries are returning back successfully, however if I did in the map/reduce job that's running as a result of running the query, I see the following errors in a lot of map tasks ...
    Ryan LeCompteRyan LeCompte
    Oct 11, 2009 at 11:02 pm
    Oct 12, 2009 at 9:04 pm
  • Hi, I haven't looked at the user manual in detail, so please bear with me if this is a silly question. Does HIVE support the kind of multi-query execution that Pig just added? By multi-query I mean ...
    Utkarsh SrivastavaUtkarsh Srivastava
    Oct 11, 2009 at 5:41 am
    Oct 11, 2009 at 6:24 am
  • Hello all, Is this possible? If I add a file to distributed cache via ADD FILE, could I access it in my Java-based user defined function using the Hadoop APIs or is this not a good idea? Thanks, Ryan
    Ryan LeCompteRyan LeCompte
    Oct 30, 2009 at 9:36 pm
    Nov 2, 2009 at 5:27 pm
  • Hi everyone, I am writing a java program to create a Query Editor to a execute query through hive. I am using hive-jdbc driver for database connection and query execution, but I am facing a problem ...
    Mohan AgarwalMohan Agarwal
    Oct 26, 2009 at 1:22 pm
    Oct 26, 2009 at 8:05 pm
  • Hi, For a table stored as sequence file, I am wondering how the key class is treated/used. I think it is ByteWritable. Does it matter if I write to the table using LongWritable key?? (If its gotta be ...
    Vinay guptaVinay gupta
    Oct 15, 2009 at 10:01 pm
    Oct 24, 2009 at 5:57 am
  • Hey guys, I just noticed there is a BOOLEAN Primitive type. I'm currently using INT for boolean values (0 or 1). Should I expect any performance gains with Hive if I properly use a BOOLEAN or TINYINT ...
    Ryan LeCompteRyan LeCompte
    Oct 23, 2009 at 8:13 pm
    Oct 24, 2009 at 5:49 am
  • Hi all, I'm trying to run this query on two 8gb datasets: SELECT COUNT(UT.UserID) FROM streamtransfers ST JOIN usertracking UT ON (ST.usertrackingid = UT.usertrackingid) WHERE UT.UserID IS NOT NULL ...
    Chris BatesChris Bates
    Oct 20, 2009 at 5:03 pm
    Oct 21, 2009 at 8:51 pm
  • Hi, I'm seeing serious performance issues with regexp_extract. Most simple queries using regexp_extract seem to take 10x longer than comparable requests without using regexp_extract. Of course, there ...
    Oct 18, 2009 at 11:22 pm
    Oct 20, 2009 at 11:31 pm
  • When I use $hive --service hiveserver as a service for JDBC client. Then I use $hive But any use of HiveQL on hive client will not be available? Why does this happen? How can I make them both ...
    Clark Yang (杨卓荦)Clark Yang (杨卓荦)
    Oct 18, 2009 at 11:05 am
    Oct 19, 2009 at 7:51 am
  • If I have a custom udf, is it treated as a singleton? If there is some heavy initialization for the udf that would obviously help. In this case though i'd imagine the udf must be thread-safe. Thanks, ...
    Oct 15, 2009 at 2:02 am
    Oct 15, 2009 at 10:00 am
  • Hi All Has anyone been using the Pentaho report designer with Hive? It's mentioned in the Hive JDBC guide - I've downloaded Pentaho 3.5RC1 - was able to create a hive connection with the url and ...
    Arijit MukherjeeArijit Mukherjee
    Oct 14, 2009 at 9:08 am
    Oct 14, 2009 at 12:22 pm
  • Is there a way to get hive to output the column name before each result returned in a query, like a MySQL query would? Thanks, Ryan
    Ryan LeCompteRyan LeCompte
    Oct 13, 2009 at 5:24 pm
    Oct 13, 2009 at 7:01 pm
  • Hello all, I haven't seen any documentation for this, but can anyone point me in the right direction on how to create my own user-defined functions? Is it possible to create a user-defined function ...
    Ryan LeCompteRyan LeCompte
    Oct 13, 2009 at 11:11 am
    Oct 13, 2009 at 1:45 pm
  • It shows following , I can't download it? $ svn co https://svn.apache.org/viewvc/hadoop/hive/tags/release-0.4.0 hive svn: Repository moved temporarily to '/viewvc/hadoop/hive/tags/release-0.4.0/'; ...
    Clark Yang (杨卓荦)Clark Yang (杨卓荦)
    Oct 13, 2009 at 9:35 am
    Oct 13, 2009 at 9:46 am
  • Is there any built-in support using Hive or any of the underlying output format support to insert Hive query results directly into a mysql database? For example if I want to directly load some ...
    Oct 8, 2009 at 7:16 pm
    Oct 8, 2009 at 7:20 pm
Group Navigation
period‹ prev | Oct 2009 | next ›
Group Overview
groupuser @
categorieshive, hadoop

37 users for October 2009

Ryan LeCompte: 44 posts Zheng Shao: 35 posts Arijit Mukherjee: 19 posts Bobby Rullo: 13 posts Ashish Thusoo: 12 posts Ning Zhang: 11 posts 杨卓荦: 11 posts Edward Capriolo: 10 posts Namit Jain: 10 posts Vijay: 10 posts Matt Pestritto: 7 posts Bill Graham: 6 posts Jeff Hammerbacher: 6 posts Yan xiaohui: 5 posts Bharath v: 4 posts Gang Luo: 4 posts Rahul Pal: 4 posts Touretsky, Gregory: 4 posts David Lerman: 3 posts Larry Ogrodnek: 3 posts
show more