Grokbase Groups Hive user June 2009

Search Discussions

23 discussions - 106 posts

  • Hi all, we had a query joining two tables, one of which had about 1 billions pieces of records while the other had less than 20k. below is our query: set hive.mapjoin.cache.numrows=20000; select /*+ ...
    Min ZhouMin Zhou
    Jun 15, 2009 at 4:24 am
    Jun 18, 2009 at 1:52 am
  • Hi all, Any helps? thanks, Min -- My research interests are distributed systems, parallel computing and bytecode based virtual machine. My profile: My blog: ...
    Min ZhouMin Zhou
    Jun 4, 2009 at 5:31 am
    Jun 5, 2009 at 10:12 am
  • Hi, I am new to Hive. I would like to know what is the easiest way to get the difference between two sets. For example, how can I convert the following SQL query to Hive? select user from page_views ...
    Rakesh SettyRakesh Setty
    Jun 29, 2009 at 11:03 pm
    Jun 30, 2009 at 12:28 am
  • I'm trying to build a tool which runs a query and writes the results into an automatically generated Hive table. It's all very straightforward, except that I can't find a good way to determine the ...
    David LermanDavid Lerman
    Jun 28, 2009 at 4:07 pm
    Jun 29, 2009 at 7:03 pm
  • We get this error when having ³cast (['time'] as int)/3600.0² in the select clause of a hive query (with join). Is this a known problem/limitation? BTW, this only happens when ...
    Eva TseEva Tse
    Jun 15, 2009 at 12:34 am
    Jun 22, 2009 at 11:42 pm
  • We have a requirement to write an udf returning map/list values. pseudo code of this udf like below public class MyUDF extends UDF { public Map<?,? evaluate(String lhs, String rhs) { ... } } will it ...
    Min ZhouMin Zhou
    Jun 10, 2009 at 11:17 am
    Jun 12, 2009 at 2:26 am
  • I have a query like below, SELECT a.subid,, t.url FROM tbl t JOIN aux_tbl a ON t.url rlike a.url_pattern WHERE t.dt='20090609' AND a.dt='20090609'; and parser reported 'FAILED: Error in semantic ...
    Min ZhouMin Zhou
    Jun 10, 2009 at 6:38 am
    Jun 10, 2009 at 8:38 am
  • Hi all, Yuntao Jia, our intern this summer, did a simple performance benchmark for Hadoop, Hive and Pig based on the queries in the SIGMOD 2009 paper: A Comparison of Approaches to Large-Scale Data ...
    Zheng ShaoZheng Shao
    Jun 19, 2009 at 4:30 am
    Jun 19, 2009 at 6:08 pm
  • I am using a Hive instance using MySQL as the meta-data store. The cluster works fine in command line mode. The problem I am having is that when I attempt to connect using the Hive JDBC connector I ...
    Jun 1, 2009 at 10:15 pm
    Jun 4, 2009 at 7:21 pm
  • Hi all, How can generate plan files like those .q.xml files exist in ql/src/test/results/compiler/plan/** for checking my plan? Thanks, Min -- My research interests are distributed systems, parallel ...
    Min ZhouMin Zhou
    Jun 1, 2009 at 6:43 am
    Jun 1, 2009 at 4:34 pm
  • trying to run the following query: insert overwrite table imp_test3 partition(ds="20090513") select d["query_string"] from imp where imp.ds="20090513" and imp.hour="00"; causes all of the tasks in ...
    Larry OgrodnekLarry Ogrodnek
    Jun 30, 2009 at 11:44 pm
    Jul 1, 2009 at 5:07 pm
  • We are using hive 0.3.0 and are getting the following when joining on Integer fields. We are also getting a very similar problem using the case UDF is this a known problem? ...
    Bill CraigBill Craig
    Jun 15, 2009 at 10:17 pm
    Jun 16, 2009 at 12:30 am
  • Hi guys, Driver.getSchema() obtains current result's tableDesc and assemble it to a String. I found if do select all queries on a table contains partitions, a null pointer exception will happens. See ...
    Min ZhouMin Zhou
    Jun 19, 2009 at 7:18 am
    Jun 19, 2009 at 10:56 pm
  • Hi, I have this error * FAILED: Unknown exception : Wrong FS: hdfs://, expected: hdfs://slave:9000* when i tried to do select statement in hive. I have loaded ...
    Jun 13, 2009 at 5:05 am
    Jun 13, 2009 at 8:52 am
  • Hi, For the second time in two weeks I'm getting errors that blocks that once existed have gone missing from HDFS and I'm baffled as to the cause, or even how to troubleshoot the issue. Any help ...
    Bill GrahamBill Graham
    Jun 12, 2009 at 7:55 pm
    Jun 12, 2009 at 8:27 pm
  • Most probably metastore died. What were you doing when this happened and also what is the log on metastore side? Prasad from bberry ----- Original Message ----- From: Bill Craig < ...
    Prasad ChakkaPrasad Chakka
    Jun 4, 2009 at 7:38 pm
    Jun 4, 2009 at 7:52 pm
  • Hi, I've been testing the Hive JDBC client and I think I've come a across a few bugs, but I wanted to double check my understanding of the expected behavior before opening JIRAs. I'm running the hive ...
    Bill GrahamBill Graham
    Jun 1, 2009 at 10:19 pm
    Jun 1, 2009 at 11:18 pm
  • We have several openings for engineers who can help us build a data warehouse on top of hadoop + hive. We've been running a 30 node cluster for the past year using streaming and cascading and are now ...
    David J. O'DellDavid J. O'Dell
    Jun 30, 2009 at 5:04 pm
    Jun 30, 2009 at 5:04 pm
  • Bay Area Hadoop Fans, We're excited to hold our first Hadoop User Group at Cloudera's office in Burlingame (just south of SFO). We pushed the start time back 30 minutes to allow a little extra time ...
    Christophe BiscigliaChristophe Bisciglia
    Jun 25, 2009 at 12:04 am
    Jun 25, 2009 at 12:04 am
  • Hi facebook guys, Can you synchronise your fb303 thrift code? $thrift -gen java -I metastore/include -I . service/if/hive_service.thrift ...
    Min ZhouMin Zhou
    Jun 16, 2009 at 8:25 am
    Jun 16, 2009 at 8:25 am
  • I've been able to piece together a rough guide for implementation from various blogs / jiras, but does anyone have a basic example publicly available? There are stubs in the wiki but unfortunately ...
    James warrenJames warren
    Jun 12, 2009 at 11:07 pm
    Jun 12, 2009 at 11:07 pm
  • Some progress on getting JDBC to work with Hive using MySQL. The problem was: [java] 09/06/02 08:31:33 INFO metastore.ObjectStore: not found. The jpox default for NontransactionalRead ...
    Jun 2, 2009 at 3:20 pm
    Jun 2, 2009 at 3:20 pm
  • Hadoop Fans, I'm happy to announce a new tool from the Cloudera team. We often found our customers wanting to import data from RDBMSs so they could conduct deeper analysis. To facilitate this, we ...
    Christophe BiscigliaChristophe Bisciglia
    Jun 1, 2009 at 5:10 pm
    Jun 1, 2009 at 5:10 pm
Group Navigation
period‹ prev | Jun 2009 | next ›
Group Overview
groupuser @
categorieshive, hadoop

25 users for June 2009

Min Zhou: 26 posts Zheng Shao: 15 posts Namit Jain: 9 posts Bill Graham: 8 posts Ashish Thusoo: 7 posts Bill Craig: 5 posts Prasad Chakka: 5 posts Eva Tse: 4 posts Raghu Murthy: 4 posts Rakesh Setty: 4 posts Amr Awadallah: 2 posts Christophe Bisciglia: 2 posts David Lerman: 2 posts Zheng Shao: 2 posts David J. O'Dell: 1 post He Yongqiang: 1 post James warren: 1 post Jeff Hammerbacher: 1 post Joydeep Sen Sarma: 1 post Larry Ogrodnek: 1 post
show more