FAQ

Search Discussions

84 discussions - 317 posts

  • Hi All, I've had some great fun playing with Impala over the past couple of months. Since Impala has now officially moved to version 1.0 GA I thought would be interesting to compare it with HANA 1.0, ...
    Aron MacDonaldAron MacDonald
    May 20, 2013 at 2:31 pm
    Jun 11, 2013 at 9:51 am
  • I'm new to CDH4 and Impala, but thus far, I've been successful in installing CDH4 on SLES 11 SP2 x64, with 8 nodes in the cluster. I've been able to configure mapreduce, hdfs, zookeeper, hbase, and ...
    Ek778Ek778
    May 20, 2013 at 3:12 pm
    May 23, 2013 at 4:12 pm
  • Hi Mark, the error indicates that the job was not able to find an applicable compression codec based on the ".lzo" file extension. The job consults the hadoop configuration files to determine such ...
    Alex BehmAlex Behm
    May 9, 2013 at 11:55 pm
    Jul 9, 2013 at 5:01 pm
  • Hi All I have mapped a hbase table to hive.I have 3 machines in ec2 installed using cloudera Manager. i have total of 2 million records. Query : select count(*) from table Result: 2 Million using ...
    Senthil KumarSenthil Kumar
    May 16, 2013 at 7:44 am
    Jun 27, 2013 at 10:39 pm
  • Hi, I Have one cluster running on impala 1.0 with 10 nodes.Queries without joins run successfuly instead queries with joins failed at starting time, impala-shell returns : Backend 20:Couldn't open ...
    Franck GallosFranck Gallos
    May 16, 2013 at 9:04 am
    May 23, 2013 at 4:16 pm
  • Hi, sorry for dummy question, I ca'nt start Impala 1,0 on my CDH 4.2.1 under CM 4.5.1. The error is: ++ dirname /usr/lib64/cmf/service/impala/impala.sh + cloudera_config=/usr/lib64/cmf/service/impala ...
    Serega SheypakSerega Sheypak
    May 6, 2013 at 8:37 am
    May 8, 2013 at 11:25 am
  • Hi, I was following Impala's tutorial on Cloudera's demo VM. I opened a session (logged on as cloudera) and tried to execute following command to create a HDFS directory. *[cloudera@localhost ~]$ ...
    Supriya BiswasSupriya Biswas
    May 28, 2013 at 4:33 am
    May 29, 2013 at 4:53 pm
  • Yes, this is expected behavior in a column store. This page describes the relative benefits and trade-offs of column stores versus row stores ...
    Patrick AngelesPatrick Angeles
    May 24, 2013 at 10:27 pm
    May 28, 2013 at 3:44 pm
  • how can we load the data in the impala table when it is a internal table.. Is the load data local inpath statement will work on it. kindly help thanks ravi
    Ravi KanthRavi Kanth
    May 9, 2013 at 7:23 am
    May 9, 2013 at 8:38 am
  • I cannot find the commit listed in this jira in impala github. I think it is still in Cloudera's git repo, not synced with github yet. https://issues.cloudera.org/browse/IMPALA-333 Could you share ...
    Bewang TechBewang Tech
    May 31, 2013 at 5:20 pm
    Jun 4, 2013 at 8:16 pm
  • I am using cdh 4.2.0 and impala 1.0 and 3 development machines. On the namenode, i have hbase server , zookeeper server, hiveserver2 ,hive and hive metastore On the datanode, i have installed impala ...
    KarthikKarthik
    May 16, 2013 at 2:00 pm
    May 28, 2013 at 4:32 pm
  • Hi, When building Impala on Ubuntu 13.04 I get the following error: src/base/linuxthreads.cc: In function ‘void ListerThread(ListerParams*)’: src/base/linuxthreads.cc:312:24: error: invalid ...
    DiddyDiddy
    May 15, 2013 at 2:39 pm
    May 22, 2013 at 7:53 pm
  • I have an hbase table (say table_1) of about 1.8 million rows and 12 columns. Each cell value is not more than 30 characters long. Size of table in hbase 12.1 gb of data My CDH cluster is made up of ...
    Abhishek desaiAbhishek desai
    May 7, 2013 at 8:58 am
    May 20, 2013 at 7:46 pm
  • Greetings, I was trying to install guest additions in Cloudera VM but I ran into problems. VM info: CentOS v6.2, CDH v4.2 and Impala v1.0. I was following these instructions ...
    Xan McGregorXan McGregor
    May 17, 2013 at 7:42 pm
    May 19, 2013 at 1:58 am
  • Hi all, After I have built impala 1.0 on CentOS 6.4 I want to start the impalad use ${IMPALA_HOME}/bin/start-impalad.sh -use_statestore=false but there is an error here ...
    FU TianyuanFU Tianyuan
    May 18, 2013 at 1:45 pm
    May 18, 2013 at 4:06 pm
  • Hi Serega, I am not very familiar with Cobbler but here are a couple of thoughts: 1. Can you please verify that the name of the repository is correct? In line with what's mentioned at ...
    Mark GroverMark Grover
    May 16, 2013 at 10:52 pm
    May 17, 2013 at 8:16 pm
  • I'm trying out some of our DW queries on Impala 1.0 with CDH 4.1.4. I'm seeing some good performance, although I'm hitting a strange issue with one query that is returning 5 million rows -- Impala ...
    Joe CrobakJoe Crobak
    May 14, 2013 at 8:58 pm
    May 17, 2013 at 4:01 pm
  • Hi, I have dummy query: select * from table where field1=1 There are 140.000.000.000 rows in total. It takes ~600 seconds to return a result. Query runs without any problmes through impala-shell. Hue ...
    Serega SheypakSerega Sheypak
    May 14, 2013 at 3:28 pm
    May 16, 2013 at 8:35 am
  • Hi all, Just F.Y.I. I evaluated Impala GA on our environment to see the performance of Parquet columner format. I posted its result on slideshare ...
    Yukinori SUDAYukinori SUDA
    May 1, 2013 at 11:45 am
    May 3, 2013 at 1:50 pm
  • Please run the pre-compiled package for Centos and let us know if the problem persists.
    Marcel KornackerMarcel Kornacker
    May 26, 2013 at 7:43 pm
    May 31, 2013 at 8:23 am
  • I've done some tests for finding out the effect of the option -num_threads_per_disk. Contrary to my expectations, the result is that adding more number of threads per disk will decrease the overall ...
    Jung-Yup LeeJung-Yup Lee
    May 27, 2013 at 9:48 pm
    May 30, 2013 at 3:00 pm
  • I run the query "select count(*) from hbase_sdtst_m where month=5;" on our impala cluster, when the progress is near 100%(about 98%), the progress will not move forward anymore and the impala shell ...
    Zesheng WuZesheng Wu
    May 20, 2013 at 9:33 am
    May 21, 2013 at 5:25 pm
  • Hi Yukinori, The short answer is no. Marcel has clarifed the "mem-limit" in an earlier user-group thread. Let me repeat what he said here: The memory limit at the moment only applies to the memory ...
    Alan ChoiAlan Choi
    May 13, 2013 at 9:10 pm
    May 17, 2013 at 3:21 am
  • Thanks for your reply Alex. So, the query performance for a hbase-based table is inefficient? In hive, there is one task per region, and tasks can be executed concurrently. In impala, there is one ...
    Anty RaoAnty Rao
    May 10, 2013 at 3:21 am
    May 16, 2013 at 10:41 pm
  • Looks like a bug with the docs. I dont see a split function listed. https://github.com/cloudera/impala/blob/master/common/function-registry/impala_functions.py
    Greg RahnGreg Rahn
    May 16, 2013 at 4:52 am
    May 16, 2013 at 4:57 pm
  • I learned from the Impala session of the 2013 Strata that Impala is working with Berkeley AMPLab on caching. I’m wondering when the feature is planned to be released, and in what release? Note that ...
    Yan ZhouYan Zhou
    May 7, 2013 at 7:48 pm
    May 15, 2013 at 3:59 am
  • Hi I am trying with impala GA with hadoop 1.0.4. I am facing with Broken Pipe error for all queries.Even for describe query. Does impala works with Hadoop 1.0.X?? Thanks Senthil
    Senthil KumarSenthil Kumar
    May 11, 2013 at 12:10 pm
    May 14, 2013 at 11:57 am
  • I was told that there is a new and improved ODBC Driver for Microstrategy version 2.0. On the site, I only see 1.2. We are trying to Integrate Microstrategy 9.3.1 with Impala 1.0 using port 21050. It ...
    Benjamin KimBenjamin Kim
    May 10, 2013 at 7:15 pm
    May 11, 2013 at 1:02 am
  • Hi, Is it possible to install impala on CDH4 on a Single Linux Node in Pseudo-distributed Mode thanks!
    ASAASA
    May 9, 2013 at 7:01 pm
    May 9, 2013 at 11:13 pm
  • Hey Alex - Thanks for the response, I ran the query you provided, the performance is roughly the same as query *4*, but I believe it supports your claim that the query planner should be handling ...
    Ricky SaltzerRicky Saltzer
    May 2, 2013 at 5:35 pm
    May 2, 2013 at 7:50 pm
  • Hi All, I would like to ask is there any 64 bit ODBC driver? If it happen the case that one of the program written in 64 bit system trying to use the 32 bit ODBC driver, then it will have problem ...
    ChiewyeaChiewyea
    May 30, 2013 at 9:09 am
    Jul 1, 2013 at 4:08 pm
  • Running CDH 4.3.0, Impala 1.0 I've got a Hive table created using a pig script with the HCatStorer. It's an RCFile with Snappy compression format. Impala sees the table and the schema but returns no ...
    SilvioSilvio
    May 30, 2013 at 8:47 pm
    Jun 1, 2013 at 3:32 pm
  • The compression in Parquet is affected by the page size. The bigger the page size the better the compression.
    Julien Le DemJulien Le Dem
    May 29, 2013 at 9:36 pm
    May 30, 2013 at 10:10 pm
  • Hi Cloudera people, I have following 4 questions for this project's development policy. 1. Just from my quriosity but why does Cloudera people use both JIRA and github ? I would like to use the ...
    Masahiro KiuraMasahiro Kiura
    May 21, 2013 at 1:03 pm
    May 30, 2013 at 4:12 pm
  • I'm using JDBC to access impala server. Here are the steps: 1. If the HDFS directory for data exists, delete the directory; 2. Create the HDFS directory; 3. Create the external Hive table if it ...
    Bewang TechBewang Tech
    May 22, 2013 at 11:28 pm
    May 23, 2013 at 5:16 pm
  • Zesheng, This is most likely an environental issue. Could you let us know which OS you're running on? Thanks, .. Ishaan
    Ishaan JoshiIshaan Joshi
    May 13, 2013 at 6:57 pm
    May 20, 2013 at 9:36 am
  • Greetings, I'm new to Impala and I'm basically using it for learning purposes. I've installed VM and ran some querries on Impala but I have problems understanding Impala architecture. I understand ...
    Xan McGregorXan McGregor
    May 11, 2013 at 6:52 pm
    May 15, 2013 at 4:38 pm
  • Hi Saurabh, It sounds like somehow the table is missing from Impala's metadata. Can you try doing a "refresh" from the impala shell? The impalad provided in the hue configuration will act as the ...
    Alan ChoiAlan Choi
    May 13, 2013 at 5:22 pm
    May 14, 2013 at 12:58 am
  • Hi Folks, Cloudera recommends not to install Impala on any HDFS NameNode. Is it really that bad to install Impalad on a *StandBy *NameNode in an HA cluster? We can assume that normally it is indeed ...
    Alex IAlex I
    May 13, 2013 at 3:55 pm
    May 13, 2013 at 4:09 pm
  • Jonathan, thank you for your question. I agree completely that a lot of workloads include some form of single-row lookups or range scans over a small number of rows, and for those particular queries ...
    Marcel KornackerMarcel Kornacker
    May 3, 2013 at 8:51 pm
    May 9, 2013 at 9:03 pm
  • Hi Folks, We have a CDH 4.2.0 Hadoop installation from tarballs and Impala 1.0 installation from RPM. I've decided to install libhadoop.so to Hadoop. However, I don't want to build the library ...
    Alex IAlex I
    May 8, 2013 at 5:39 pm
    May 9, 2013 at 12:45 pm
  • I started impalad and the statestore on the same host If I donot enable the security, the impalad and statestore can start normally, but if I enable the security, the impalad can't start This is the ...
    Zesheng WuZesheng Wu
    May 6, 2013 at 11:50 am
    May 8, 2013 at 12:04 pm
  • I have installed Cloudera Manager 4.2.1 with impala 0.7. I wanted to upgrade to impala 1.0, it seems it runs only on Cloudera Manager 4.5.2. I have upgraded CM as per document ...
    ManiMani
    May 8, 2013 at 12:16 am
    May 8, 2013 at 3:10 am
  • Hi, The docs are not quite right and we are working on getting it fixed. You need to specify the schema for the table you are creating: e.g. create table pcustomer(i int) stored as parquetfile or ...
    Nong LiNong Li
    May 1, 2013 at 9:54 pm
    May 1, 2013 at 10:24 pm
  • Hi Nong, That sounds great. Improving file size will decrease the number of disk I/O. In addition to that, how about giving a configuration option to change the page size of Parquet? I think the ...
    Jung-Yup LeeJung-Yup Lee
    May 30, 2013 at 9:53 pm
    Jun 6, 2013 at 2:37 pm
  • Hi all, I have builidng Impala 1.0 on CentOS 6.4 these days, but always have errors about boost. I have referenced Issue #31 and installed boost 1.46.1, but I have another error .How can I solve this ...
    FU TianyuanFU Tianyuan
    May 17, 2013 at 9:09 am
    May 31, 2013 at 9:44 am
  • Hi All I have two tables - A , B each having 10 million records in Hbase I have a query consisting of Join on two tables. In Hive Query , it results 7Million + records. Whereas in Impala 1.0, it ...
    Senthil KumarSenthil Kumar
    May 30, 2013 at 2:55 pm
    May 30, 2013 at 3:12 pm
  • hi,all I want to start impala with kerberos, I have done these things; 1) start hadoop with kerberos and it work 2) enable kerberos in hive metastore according to the instruction of cloudera ...
    jianan Mojianan Mo
    May 30, 2013 at 10:44 am
    May 30, 2013 at 2:54 pm
  • Hi Anson, Moving this thread over to <span class="m_body_email_addr" title="4f11bca44eddf56fc4f71409e44698ab" impala-user@cloudera.org</span , which should be able to help you with this issue. -- ...
    Aaron T. MyersAaron T. Myers
    May 29, 2013 at 5:58 pm
    May 29, 2013 at 6:13 pm
  • Hi everyone, I have a question about Parquet file size. I have a Text table(text_test1) which contains 8 different files. [hadoop@pdpds03 ~]$ hdfs dfs -ls -h /user/hive/warehouse/text_test1 Found 8 ...
    Jung-Yup LeeJung-Yup Lee
    May 28, 2013 at 4:24 am
    May 28, 2013 at 4:28 am
Group Navigation
period‹ prev | May 2013 | next ›
Group Overview
groupimpala-user @
categorieshadoop
discussions84
posts317
users93
websitecloudera.com
irc#hadoop

93 users for May 2013

Alan Choi: 19 posts Serega Sheypak: 15 posts Jung-Yup Lee: 14 posts Vikas Singh: 11 posts Alex Behm: 10 posts Ricky Saltzer: 10 posts Aron MacDonald: 9 posts Lenni Kuff: 9 posts Marcel Kornacker: 9 posts FU Tianyuan: 8 posts Ek778: 7 posts Henry Robinson: 7 posts Senthil Kumar: 7 posts Bewang Tech: 6 posts Franck Gallos: 6 posts Ishaan Joshi: 5 posts Justin Erickson: 5 posts Ravi Kanth: 5 posts Yukinori SUDA: 5 posts Abhishek desai: 4 posts
show more