Grokbase Groups Hive user April 2009

29 discussions - 160 posts

  • Hi, I want to check if hive data grows huge in the table (for example to 200GB), does anybody see the mapreduce performance degrade a lot? I did not factor things out, but just want to check first. ...
    Javateck javateck
    Apr 15, 2009 at 8:00 pm
    Apr 15, 2009 at 11:41 pm
  • I seem to be having problems with LOAD DATA with a file on my local system, trying to get it into Hive: li57-125 ~/test: python hive_test.py Connecting to HiveServer.... Opening transport... LOAD DATA ...
    Suhail Doshi
    Apr 4, 2009 at 5:27 am
    Apr 15, 2009 at 7:41 am
  • Hi - I'm having a problem with a query below. When I try to run any aggregate function on a column from the sub-query, the job fails. The queries and output messages are below. Suggestions? thanks in ...
    Matt Pestritto
    Apr 22, 2009 at 10:22 pm
    Apr 23, 2009 at 8:39 am
  • I have two standalone machines running on the same box, each of course running on a different port. And these two instances each has its own table space (by starting hive from different directories), ...
    Javateck javateck
    Apr 18, 2009 at 9:41 am
    Apr 21, 2009 at 1:17 am
  • Hi, I have a query like, select sum(col1) as s from tab1 where s 5 (this will not be working in SQL either), what I need is to filter the result so that I only get results whose sum is 5. Anyway to ...
    Javateck javateck
    Apr 9, 2009 at 2:15 am
    Apr 10, 2009 at 11:21 pm
  • (I realized i posted this to the dev list, probably better on the user list) I made my own UDF, but I am having trouble getting it running . Any hints? Thanks [hadoop@hadoop1 ~]$ /opt/hive/bin/hive ...
    Edward Capriolo
    Apr 30, 2009 at 2:31 pm
    May 1, 2009 at 2:53 am
  • Hello, Is there a way to truncate a table instead of dropping it and then creating it again? I've been looking through the docs and haven't found anything. Suhail -- http://mixpanel.com Blog: ...
    Suhail Doshi
    Apr 25, 2009 at 4:40 pm
    Apr 28, 2009 at 11:57 pm
  • Hi All - Has anyone put any thought behind how to create an application using hive ? I have a certain algorithm that I implemented in hive, but it currently lives in a 600+ line text file where I ...
    Matt Pestritto
    Apr 27, 2009 at 2:12 pm
    Apr 28, 2009 at 7:45 pm
  • I'm looking for a framework that manages automatic initiation of our daily data loading and processing, with knowledge of dependencies between tables and "data ready" status flags. I think some ...
    Jonathan Warden
    Apr 19, 2009 at 8:48 am
    Apr 20, 2009 at 9:35 am
  • Hi, I have one standalone hive server running on one machine, and I'm trying to use jdbc querying from another remote machine, if running in single thread, everything is fine, but when I have ...
    Javateck javateck
    Apr 10, 2009 at 1:10 am
    Apr 13, 2009 at 10:18 pm
  • I need some clearing up with regard to partitioning CREATE TABLE page_view(viewTime INT, userid BIGINT, page_url STRING, referrer_url STRING, ip STRING COMMENT 'IP Address of the User') COMMENT 'This ...
    Suhail Doshi
    Apr 2, 2009 at 6:31 pm
    Apr 2, 2009 at 9:23 pm
  • Hey all, You may all be familiar with geo-ip from maxmind. http://www.maxmind.com/app/api. GNU General Public License (GPL) I am running a process where I have to geo locate IP addresses. I think ...
    Edward Capriolo
    Apr 28, 2009 at 2:43 pm
    Apr 28, 2009 at 7:01 pm
  • Hi, I tried running hive with bin/hive --service hwi but I keep getting the error 09/04/25 10:19:49 WARN servlet.WebApplicationContext: Web application not found ${HIVE_HOME}/lib/hive_hwi.war ...
    Raghu R
    Apr 25, 2009 at 4:59 am
    Apr 25, 2009 at 9:38 pm
  • Hi, I'm struggling with mapred.tasktracker.map.tasks.maximum, I set to 10 in hadoop-site.xml, and I can see the job's configuration is also saying 10 when running the job, but the actual job is ...
    Javateck javateck
    Apr 22, 2009 at 12:49 am
    Apr 24, 2009 at 8:16 pm
  • Hello, When i try to download Hadoop Hive using svn co http://svn.apache.org/repos/asf/hadoop/hive/trunk hive as given in http://wiki.apache.org/hadoop/Hive/GettingStarted I get 400 BAD Request ...
    Raghu R
    Apr 23, 2009 at 9:37 am
    Apr 23, 2009 at 6:20 pm
  • Hi. I'm running into a problem that I can't seem to figure out. I'm running a hive query and the last reduce always fails. Number of Reducers - 1 always complete successfully. If I run with 1 ...
    Matt Pestritto
    Apr 9, 2009 at 1:55 pm
    Apr 9, 2009 at 11:36 pm
  • can I do something like select m1.c1/m2.c2 from ((select count(col1) as c1 from tab where col2=='some') m1 join (select count(col1) as c2 from tab where col2< 'some') m2) I know this syntax is not ...
    Javateck javateck
    Apr 1, 2009 at 7:16 pm
    Apr 1, 2009 at 7:41 pm
  • The first official release of Hive is available For Hadoop release details and downloads, visit: http://hadoop.apache.org/hive/releases.html Congrats everyone and many thanks in making this happen!! ...
    Ashish Thusoo
    Apr 30, 2009 at 5:36 pm
    Apr 30, 2009 at 6:03 pm
  • Hey Hive users, At Cloudera, we're working to integrate Hive with a broad range of business intelligence tools. If you are currently using a business intelligence tool like JasperReports, Pentaho, or ...
    Jeff Hammerbacher
    Apr 17, 2009 at 10:21 pm
    Apr 29, 2009 at 1:20 am
  • Hello, I have successfully played with normal join on hive with : SELECT tbd.col1, COUNT(1) FROM log_test tbd JOIN log_test tbd2 ON ( ..... expression) GROUP BY tbd.col1 But unfortunately i'm trying ...
    Mathias Fryde
    Apr 20, 2009 at 11:55 am
    Apr 21, 2009 at 4:26 pm
  • I am attempting to write a SerDe implementation to load a binary formatted file which consists of the following repeating form: Integer (4 Bytes, length of binary block) Binary block of data of ...
    Bill Craig
    Apr 13, 2009 at 4:07 pm
    Apr 14, 2009 at 12:32 pm
  • Hi, I used one query in my testcase, which was: FROM (FROM src MAP (src.key,src.value) USING 'python ../data/scripts/dumpdata_script.py' AS (key,value) WHERE src.key = 10) subq INSERT OVERWRITE TABLE ...
    He Yongqiang
    Apr 27, 2009 at 10:50 am
    Apr 27, 2009 at 5:13 pm
  • Hi, Is it possible to have a web interface for hive where jobs could be automatically submitted in JSP pages, and output retrieved locally, with the user not even aware the processing is done in ...
    Raghu R
    Apr 25, 2009 at 10:04 am
    Apr 25, 2009 at 11:28 am
  • Hi. I wanted to ask if anyone has seen the following behavior in Hive. When I execute a cross join (join with no ON statement) across multiple reducers, I only get output = 1/<number of reducers>. ...
    Matt Pestritto
    Apr 23, 2009 at 7:44 pm
    Apr 23, 2009 at 9:21 pm
  • I wonder whether there is anybody tried to run Hive on central file system (like nfs), instead of HDFS. We understand that the file system might become the scalability bottleneck in this case. Just ...
    Jonathan Cao
    Apr 18, 2009 at 12:18 am
    Apr 18, 2009 at 3:34 am
  • /////////////////////////////////////// Sorry for cross posting. ////////////////////////////////////// Hi,all Hadoop in China Salon is a free discussion forum on Hadoop related technologies and ...
    He Yongqiang
    Apr 24, 2009 at 12:00 am
    Apr 24, 2009 at 12:00 am
  • when I execute queries, I got some broken pipe errors when executing query, on my application side, I keep on connection pool, do I need to reap the connection once a while? 09/04/14 17:34:33 ERROR ...
    Javateck javateck
    Apr 14, 2009 at 5:47 pm
    Apr 14, 2009 at 5:47 pm
  • Hi, https://issues.apache.org/jira/browse/HIVE-266 As part of HIVE-266, we are looking for changing UDF signature to use Writable instead of Java Primitive Classes. Some preliminary tests showed that ...
    Zheng Shao
    Apr 13, 2009 at 7:35 pm
    Apr 13, 2009 at 7:35 pm
  • I took the liberty of moving all non blocking tickets assigned to version 0.2 and 0.3 to 0.4. There is one blocking ticket left before we can release 0.3: ...
    Johan Oskarsson
    Apr 2, 2009 at 3:08 pm
    Apr 2, 2009 at 3:08 pm
Group Overview
group: user @
categories: hive, hadoop
discussions: 29
posts: 160
users: 26
website: hive.apache.org

26 users for April 2009

  • Javateck javateck: 27 posts
  • Suhail Doshi: 18 posts
  • Zheng Shao: 15 posts
  • Ashish Thusoo: 13 posts
  • Raghu Murthy: 12 posts
  • Edward Capriolo: 11 posts
  • Matt Pestritto: 11 posts
  • Namit Jain: 7 posts
  • Prasad Chakka: 6 posts
  • Frederick Oko: 5 posts
  • Stephen Corona: 5 posts
  • Jeff Hammerbacher: 4 posts
  • Raghu R: 4 posts
  • Aaron Kimball: 3 posts
  • Jonathan Warden: 3 posts
  • Amr Awadallah: 2 posts
  • He Yongqiang: 2 posts
  • Mathias Fryde: 2 posts
  • Min Zhou: 2 posts
  • Suhail Doshi: 2 posts