Search Discussions

22 discussions - 105 posts

  • Hi, Is Hive capable of indexing the data and storing them in a way optimized for querying (like a columnar database - bitmap indexes, compression, etc.)? I need to be able to get decent response ...
    Martin MatulaMartin Matula
    Dec 14, 2008 at 2:54 pm
    Dec 16, 2008 at 5:00 am
  • Whatever happened to the compressed storage format? I'd like to keep delimited files in bz2 if possible to save on space, is that sort of thing being considered? Josh
    Josh FergusonJosh Ferguson
    Dec 2, 2008 at 9:09 am
    Dec 3, 2008 at 9:21 pm
  • This Query INSERT OVERWRITE TABLE aggregations PARTITION ( account='80c27664- b047-4c0a-86f3-342c0cdf36c7', application='myproduct', dataset='purchase', hour=341165, aggregation='count', ...
    Josh FergusonJosh Ferguson
    Dec 10, 2008 at 9:06 am
    Dec 12, 2008 at 6:02 am
  • When launching off hive queries using hive -e is there a way to get the job id so that I can just queue them up and go check their statuses later? What's the general pattern for queueing and ...
    Josh FergusonJosh Ferguson
    Dec 8, 2008 at 8:08 pm
    Dec 15, 2008 at 8:29 pm
  • Does anyone have an example of how to setup the hive config file to keep schema information in something other than derby so that multiple users and machines can access it at once? Josh
    Josh FergusonJosh Ferguson
    Dec 9, 2008 at 4:22 am
    Dec 10, 2008 at 6:51 pm
  • Hi. I find IRC to be a very convenient way to communicate with others interested in an open source project. Would people be interested in joining ##hive on irc.freenode.net? This is the IRC network ...
    Johan OskarssonJohan Oskarsson
    Dec 4, 2008 at 5:32 pm
    Jan 5, 2009 at 8:26 pm
  • Hi, I'm just getting started with Hive. I was wondering if anyone can tell me how Hive would translate the following SQL query into MapReduce: select distinct track, count(track) from music where ...
    Shirley CohenShirley Cohen
    Dec 20, 2008 at 4:51 pm
    Dec 22, 2008 at 7:29 pm
  • I'm getting a bunch of errors during my reduce step while doing a join between two string fields for some reason. java.lang.RuntimeException: Hive 2 Internal error: cannot evaluate index expression ...
    Josh FergusonJosh Ferguson
    Dec 2, 2008 at 9:36 am
    Dec 3, 2008 at 4:28 am
  • We store a lot of data in SequenceFiles with the key and value as generated Jute/RecordIO files and would want to process it all using Hive. I noticed that there is a serde/jute package, but I assume ...
    Johan OskarssonJohan Oskarsson
    Dec 8, 2008 at 6:16 pm
    Dec 8, 2008 at 7:01 pm
  • I am new to Hive. I am using Hadoop 0.19.0 and am running the version of Hive packaged with it under contrib/hive. I am following the apache access log example in the Hive user guide Wiki: ...
    Bill AuBill Au
    Dec 4, 2008 at 9:59 pm
    Dec 5, 2008 at 7:41 am
  • FROM hive_test INSERT OVERWRITE TABLE domain_compute SELECT hive_test.domain, COUNT(DISTINCT hive_test.id) GROUP BY hive_test.domain INSERT OVERWRITE LOCAL DIRECTORY '/tmp/hive_test2' SELECT ...
    Dec 4, 2008 at 4:03 am
    Dec 4, 2008 at 9:01 am
  • Hi, I have the standard Hadoop vmware image installed (I think it contains hadoop 0.18). I checked out and run hive based on the instructions on your wiki. I created a simple table containing 7 ...
    Martin MatulaMartin Matula
    Dec 14, 2008 at 3:17 pm
    Dec 14, 2008 at 9:16 pm
  • I filed this JIRA issue. I was wondering, can someone take the time to update the wiki to describe the complete architecture of the system as outlined here?: ...
    Josh FergusonJosh Ferguson
    Dec 3, 2008 at 7:33 am
    Dec 3, 2008 at 9:19 pm
  • yes, it can. But when I write my script to extract the domain, it hangs all the time ,also there is no job page in the job monitor!But it occurs in the cli that: *hive FROM (FROM log_stg2 log SELECT ...
    Dec 3, 2008 at 3:12 am
    Dec 3, 2008 at 4:07 am
  • hive LOAD DATA INPATH 'hdfs://xx-mapred000.xx01:54310/test/shixing/log' INTO TABLE log_stg4; FAILED: Error in semantic analysis: line 1:17 Path is not legal ...
    Dec 2, 2008 at 9:46 am
    Dec 2, 2008 at 8:50 pm
  • My table:a just contains field:url And Now I wanna compute each domain of url's pv? and out put insert into a table:b domain pv. Now I didn't know whether the hive support the udf, maybe also I can ...
    Dec 2, 2008 at 11:07 am
    Dec 2, 2008 at 11:18 am
  • One more thing, can someone explain to me what metastore_db is and what its significance is, apparently if I run hive from separate places certain instances can find some tables and some can't. I'm ...
    Josh FergusonJosh Ferguson
    Dec 2, 2008 at 6:04 am
    Dec 2, 2008 at 6:27 am
  • Hi all, I'm working on configuring a Hive deployment and looking for some guidance on configuration. In hive-default.xml, there's a hive.default.fileformat that currently defaults to text, but can ...
    Aaron KimballAaron Kimball
    Dec 11, 2008 at 9:55 pm
    Dec 11, 2008 at 11:32 pm
  • Did you guys do away with having table_source statements include optional partition clauses? Cause for me it's not working but is listed here: ...
    Josh FergusonJosh Ferguson
    Dec 10, 2008 at 8:13 am
    Dec 10, 2008 at 8:16 am
  • Hive has three date/time types: date, datetime and timestamp. What is their intended (future) purpose and current implementation status?
    David PhillipsDavid Phillips
    Dec 8, 2008 at 11:44 pm
    Dec 8, 2008 at 11:47 pm
  • So say I'm using some ruby code to shell out to hive for data loading such as: hive -e "LOAD ..." What's the preferred method of detecting errors in this situation? Do the failures return some sort ...
    Josh FergusonJosh Ferguson
    Dec 2, 2008 at 6:02 am
    Dec 2, 2008 at 6:06 am
  • Hadoop Fans, I'm happy to announce that Cloudera, in addition to providing commercial support for Hadoop, is now offering a one-day, professional training course for Hadoop. It's open to anyone in ...
    Christophe BiscigliaChristophe Bisciglia
    Dec 13, 2008 at 12:39 am
    Dec 13, 2008 at 12:39 am
Group Navigation
period‹ prev | Dec 2008 | next ›
Group Overview
groupuser @
categorieshive, hadoop