Search Discussions

34 discussions - 146 posts

  • Hello, I'm trying to run the following query where m1 and m2 have the same data ( 29M rows) on a 3-node hadoop cluster. I'm essentially trying to do a self join. It ends up running 269 map jobs and 1 ...
    Nov 4, 2009 at 12:44 am
    Nov 12, 2009 at 6:27 pm
  • Hi, I have a 2 node hadoop/hbase cluster which is working fine. hadoop was installed based on instructions at http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_(Multi-Node_Cluster) I ...
    Massoud MazarMassoud Mazar
    Nov 6, 2009 at 3:16 pm
    Nov 10, 2009 at 5:09 pm
  • Hello, all -- I am trying to build as per GettingStarted and am getting this error from Ivy: Downloaded file size doesn't match expected Content Length for ...hadoop-0.19.0.tar.gz. Please retry. This ...
    Sasha OvsankinSasha Ovsankin
    Nov 10, 2009 at 11:40 pm
    Nov 13, 2009 at 2:29 am
  • Hi, guys, hive create table pokes(foo string, bar int) partitioned by (pt string); OK Time taken: 0.146 seconds hive select * from pokes; Total MapReduce jobs = 1 Number of reduce tasks is set to 0 ...
    Min ZhouMin Zhou
    Nov 11, 2009 at 4:02 am
    Nov 11, 2009 at 6:34 am
  • As I know, hive is a data-warehouse infrastructure, does it support some OLAP operation such as drill-down or roll-up? 好玩贺卡等你发,邮箱贺卡全新上线! http://card.mail.cn.yahoo.com/
    Clark Yang (杨卓荦)Clark Yang (杨卓荦)
    Nov 13, 2009 at 3:42 pm
    Nov 16, 2009 at 4:53 pm
  • Hello, I'm using Cloudera's hive-0.4.0+14.tar.gz with hadoop-0.20.1+152.tar.gz on a Centos machine. I've been able to load syslog files into Hive using the RegexSerDe class - this works great. But ...
    Ken BarclayKen Barclay
    Nov 24, 2009 at 11:24 pm
    Nov 25, 2009 at 12:45 am
  • Is it possible to set up Hive with a metastore in MySQL or NFS? I think that changing the configuration parameters (e.g., javax.jdo.option.ConnectionURL) would make it possible to use MySQL, but I ...
    Tomer ShiranTomer Shiran
    Nov 22, 2009 at 11:19 pm
    Nov 23, 2009 at 4:09 am
  • Hi guys, We have a lot of data stored inside compressed SEQ files. Since SEQ is a sequence of (key,value) pairs we are storing set of columns joined by tab in key part of SEQ, and the same for value ...
    Andrey PankovAndrey Pankov
    Nov 5, 2009 at 4:20 pm
    Nov 6, 2009 at 2:51 pm
  • Hi everyone, So I'm evaluating Hive for an Apache access log processing job (who isn't? ;) and for testing I've got a logfile that's about 1 million lines/245MB. I've loaded it into a table and now I ...
    Andrew O'BrienAndrew O'Brien
    Nov 17, 2009 at 7:24 pm
    Nov 18, 2009 at 5:20 pm
  • Hi, I am using hive over hadoop-0.19.0 , I have created a external table pointing its loaction at certain specified directory, and I have also copied the data file (macthing the table structure) ...
    Mohan AgarwalMohan Agarwal
    Nov 3, 2009 at 1:23 pm
    Nov 4, 2009 at 7:00 am
  • Does any one know how to migrate the metastore data in hive when it's been used in single user mode to server mode? Basically I want to export the metastore from the embedded mode and import it into ...
    Steve MorinSteve Morin
    Nov 11, 2009 at 12:00 am
    Nov 11, 2009 at 6:52 am
  • Hi, Can I run multiple Hive CLI from different systems ponting over common hadoop ? Thanking You Mohan Agarwal
    Mohan AgarwalMohan Agarwal
    Nov 4, 2009 at 7:10 am
    Nov 4, 2009 at 3:47 pm
  • Hi, *I have created a simple User Define Function (namely "my_lower" which converts all the characters of input text to lower case and return the modified text).* *I have to register this function ...
    Mohan AgarwalMohan Agarwal
    Nov 26, 2009 at 6:54 am
    Dec 3, 2009 at 6:07 am
  • *Hi, I have installed hadoop-0.19.2 on my system and I am using Hive to access the data on HDFS. But when I starting the hadoop server and trying to print the data stored in the table using hive CLI, ...
    Mohan AgarwalMohan Agarwal
    Nov 20, 2009 at 5:10 am
    Nov 20, 2009 at 2:36 pm
  • Hi guys, Recently I started to investigate Hive, so far I have several questions. 1). I have a table A partitioned by column x INT. The table has several partitions say x=1, x=2, x=10. Running query ...
    Andrey PankovAndrey Pankov
    Nov 4, 2009 at 4:49 pm
    Nov 4, 2009 at 5:01 pm
  • Hello all, I was trying to do something like this today and Hive didn't like it: SELECT AVG(IF(expression, anotherexpression, NULL)) ... I don't think Hive supports returning NULL as the value of the ...
    Ryan LeCompteRyan LeCompte
    Nov 3, 2009 at 10:35 pm
    Nov 3, 2009 at 11:39 pm
  • java.lang.NullPointerException at org.apache.hadoop.hive.ql.exec.MapOperator.initObjectInspector(MapOperator.j ava:176) at org.apache.hadoop.hive.ql.exec.MapOperator.initializeOp(MapOperator.java:204 ...
    Charles HalpernCharles Halpern
    Nov 23, 2009 at 7:29 pm
    Nov 23, 2009 at 7:56 pm
  • We are planning to hold first Hadoop India user group meet up on 28th November 2009 in Noida. We would be talking about our experiences with Apache Hadoop/Hbase/Hive/PIG/Nutch/etc. The agenda would ...
    Sanjay SharmaSanjay Sharma
    Nov 10, 2009 at 6:20 am
    Nov 22, 2009 at 4:42 pm
  • Hi everyone, A question about partitioning: All of the examples I've seen insert into a single hard-coded partition (usually a date) at a time. If I have a log file that spans a large range of dates, ...
    Andrew O'BrienAndrew O'Brien
    Nov 21, 2009 at 5:14 am
    Nov 22, 2009 at 1:32 am
  • Hello, I was trying the following in the Cloudera training vm 0.3.2 (hadoop 0.20): I'm parsing a syslog file with RegexSerDe. I created the table thus: CREATE TABLE syslog (month STRING, day STRING, ...
    Ken BarclayKen Barclay
    Nov 20, 2009 at 12:27 am
    Nov 20, 2009 at 12:45 am
  • Hi all I've just started looking into Hive, and working my way through some of the tutorials out there (notably the one from Cloudera, which I found really helpful). I think I'm going to want to use ...
    David SalgadoDavid Salgado
    Nov 12, 2009 at 3:13 pm
    Nov 14, 2009 at 12:51 am
  • Hello, I'm trying out the nice mapjoin query hint, and it's basically working great, but I've run into an odd thing, and was wondering if anyone could help out: I have a query which looks like: ...
    Dan MilsteinDan Milstein
    Nov 6, 2009 at 9:34 pm
    Nov 6, 2009 at 9:39 pm
  • What version are people using for the most part? The installation instructions say to check out the trunk, but that can't be stable. What version of Hive is considered stable? thanks, M
    Mayuran YogarajahMayuran Yogarajah
    Nov 6, 2009 at 7:20 pm
    Nov 6, 2009 at 7:30 pm
  • Is all that is necessary for this to happen is just to copy over my custom config from the old release to the new release after I build it? Or is there some other upgrade step? Thanks, Ryan
    Ryan LeCompteRyan LeCompte
    Nov 4, 2009 at 4:32 pm
    Nov 5, 2009 at 2:28 am
  • Hello all, Can anyone describe the implications of the following bug? Would most general queries that use GROUP BY produce erroneous results due to this? I know it's fixed in 0.4.1-rc2, which I've ...
    Ryan LeCompteRyan LeCompte
    Nov 4, 2009 at 6:13 pm
    Nov 5, 2009 at 1:12 am
  • Hello all, It appears that Hive's current ODBC driver does not support SQLRowCount. Is there a plan to support this at some point? We see a comment below on SQLRowCount.c in HiveODBC (Hive 0.4 ...
    Ryan LeCompteRyan LeCompte
    Nov 2, 2009 at 12:25 am
    Nov 2, 2009 at 5:17 am
  • I have put together a tutorial-like blog to describe what takes to build a hadoop/hive cluster on top of CentOS. I thought it may be useful to some: ...
    Massoud MazarMassoud Mazar
    Nov 20, 2009 at 6:47 pm
    Nov 20, 2009 at 6:47 pm
  • The links to documentation for releases 3.0 and 4.0 on the left nav of the Hive homepage are broken FYI: http://hadoop.apache.org/hive/ They send to you these pages that show white apache directory ...
    Bill GrahamBill Graham
    Nov 20, 2009 at 6:09 pm
    Nov 20, 2009 at 6:09 pm
  • Has anyone seen a TransactionNotWritableException during a LOAD DATA command? We get it very sporadically - I'd say one out of every couple hundred loads - and if you rerun the same query, it runs ...
    David LermanDavid Lerman
    Nov 19, 2009 at 9:34 pm
    Nov 19, 2009 at 9:34 pm
  • Hey all, A new ticket has just been created for Hive to support dynamic partitions. https://issues.apache.org/jira/browse/HIVE-936 Register and vote for it to make it a priority. I think this feature ...
    Chris BatesChris Bates
    Nov 17, 2009 at 12:03 pm
    Nov 17, 2009 at 12:03 pm
  • Hi Hive users, Would you please add your company's name and a little description of how you used Hive on the following wiki page? This helps new users get more ideas about how Hive can be used and ...
    Zheng ShaoZheng Shao
    Nov 12, 2009 at 11:43 pm
    Nov 12, 2009 at 11:43 pm
  • Hadoop Fans, we're growing again, and wanted to let the Hadoop community know. If you enjoy working with Hadoop, are excited by doing cool things with interesting data, and have experience working ...
    Christophe BiscigliaChristophe Bisciglia
    Nov 7, 2009 at 8:16 pm
    Nov 7, 2009 at 8:16 pm
  • This is slightly off-topic, but our Hadoop + Hive usage is growing at our company and we're feeling the need to start adding more hardware. I've been tasked with trying to figure out what other ...
    Chris BatesChris Bates
    Nov 6, 2009 at 12:15 am
    Nov 6, 2009 at 12:15 am
  • Hello. I'm trying to write my own UDF in Scala which takes two parameters of array<double type and returns double. I used the next prototype: public DoubleWritable evaluate(ArrayWritable x, ...
    Sergey BartunovSergey Bartunov
    Nov 5, 2009 at 9:08 pm
    Nov 5, 2009 at 9:08 pm
Group Navigation
period‹ prev | Nov 2009 | next ›
Group Overview
groupuser @
categorieshive, hadoop

36 users for November 2009

Massoud Mazar: 12 posts Ning Zhang: 12 posts Ryan LeCompte: 11 posts Zheng Shao: 11 posts Defenestrator: 9 posts Edward Capriolo: 8 posts Namit Jain: 8 posts Carl Steinbach: 7 posts Mohan Agarwal: 7 posts Prasad Chakka: 6 posts Andrey Pankov: 5 posts Min Zhou: 5 posts Ken Barclay: 4 posts Chris Bates: 4 posts Sasha Ovsankin: 4 posts Andrew O'Brien: 3 posts Ashish Thusoo: 3 posts Bobby Rullo: 3 posts Eric Arenas: 2 posts Gang Luo: 2 posts
show more