FAQ

Search Discussions

50 discussions - 180 posts

  • Can someone explain to me why this is happening when I run a query in Impala? We have a 23 nodes cluster. On 10 nodes with 48GB of memory, Impala Daemon Memory Limit is 16GB, and on 13 nodes with ...
    Benjamin KimBenjamin Kim
    Feb 12, 2014 at 6:48 pm
    Feb 14, 2014 at 9:29 pm
  • Hi all, I am using Storm (http://storm.incubator.apache.org/) for doing some real time analytics. I am also using Hadoop for doing some batch processing and HDFS for storing "historical" data. I am ...
    Alex schufoAlex schufo
    Feb 24, 2014 at 6:20 pm
    Mar 1, 2014 at 12:20 am
  • I'm using Impala with parquet table and staging phase described here: https://github.com/cloudera/cdk-examples/tree/master/dataset-staging All looks good but I'm wondering how Impala actually handles ...
    Tivona HuTivona Hu
    Feb 27, 2014 at 7:36 am
    Mar 11, 2014 at 5:58 pm
  • I try to be more detailed. What I do is to execute the following workflow: 1. One JAVA action where I configure the scan object for Hbase, with some filters, and I launch another map-reduce job to ...
    Mario CasolaMario Casola
    Feb 4, 2014 at 3:30 am
    Feb 24, 2014 at 8:45 pm
  • HI impala team, Now I have configured the file /etc/default/impala as ...
    Hanks TomHanks Tom
    Feb 12, 2014 at 2:34 am
    Feb 17, 2014 at 1:40 am
  • Hi, I could not make lzo format work. I followed the instruction described here ...
    György BaloghGyörgy Balogh
    Feb 24, 2014 at 1:52 pm
    Feb 27, 2014 at 6:39 pm
  • I have tried to run Impala JDBC example from https://github.com/onefoursix/Cloudera-Impala-JDBC-Example on CDH 4.5.0. I am getting the following error: java.lang.NoClassDefFoundError ...
    Aleksei UAleksei U
    Feb 17, 2014 at 3:46 pm
    May 31, 2014 at 12:30 am
  • Hi All, This is a reposting of " https://groups.google.com/a/cloudera.org/forum/#!topic/impala-user/rrPhNDcNtZQ", following is our profile result: Our impala version is 1.1.1, and hive version is ...
    Zesheng WuZesheng Wu
    Feb 25, 2014 at 1:41 am
    Feb 27, 2014 at 6:39 pm
  • hi, Hive vs Impala : Hive gives the answer, whereas Impala gives me a cryptic error about Hive metadata. Any idea ? see the below: hive Select last_name,first_name,middle from parquetdata_partitioned ...
    matt Liebermatt Lieber
    Feb 26, 2014 at 6:10 am
    May 19, 2014 at 7:13 pm
  • Hi, We are streaming data to impala with the following strategy: - load to t1 in text or lzo text - from time to time move from t1 to a parquet t2 To be able to query both a view is defined as ...
    György BaloghGyörgy Balogh
    Feb 24, 2014 at 3:24 pm
    Mar 12, 2014 at 9:19 pm
  • Hi, I am using Impala 1.2.3 and have a simple query that count rows and find Average value of a column. File format is Parquet. Query is: select count(*), avg(col_2) from table1; This query (1billion ...
    VladimirVladimir
    Feb 13, 2014 at 12:08 am
    Feb 19, 2014 at 7:49 pm
  • Hi all, I'm running a simple scan-aggregate query on a text file that completes in 73 sec. I also run the profile statement after having executed the query and would like to see how much time was ...
    Avrilia FloratouAvrilia Floratou
    Feb 12, 2014 at 4:41 pm
    Feb 13, 2014 at 3:09 am
  • Hi Impala Team, Now I am using impala-1.2.0 fior querying data on cdh5 with HA. I have change the variable "fs.defaultFS" value to "hdfs://192.168.0.9:8020",the ip "192.168.0.9" is one of HA ...
    Hanks TomHanks Tom
    Feb 10, 2014 at 8:24 am
    Feb 13, 2014 at 2:19 am
  • Hi, We have a mixed format partitioned table. I would like to set a partition's format to text and specify row format (delimiters, escape). Is there a way to do it with alter table? Thank you Gyorgy ...
    György BaloghGyörgy Balogh
    Feb 25, 2014 at 3:03 pm
    Feb 27, 2014 at 11:28 pm
  • Hi, I want to create external impala table that many user in our group can share. However, this does not work because impala user does not have permission. It will work fine with chmod 775 but we ...
    Silaphet MounkhatySilaphet Mounkhaty
    Feb 27, 2014 at 8:48 pm
    Feb 27, 2014 at 10:48 pm
  • Hi there - In general, Impala *streams* data from disk rather than loading it entirely into memory, allowing Impala to analyze datasets much larger than memory. If your query pertains to three ...
    Ricky SaltzerRicky Saltzer
    Feb 24, 2014 at 4:12 pm
    Feb 26, 2014 at 4:32 pm
  • Can Impala work directly with adding partition as Hive does? For example - in Hive this is possible: ALTER TABLE sales ADD PARTITION (country = 'US', year = 2012, month = 12, day = 22) LOCATION ...
    VladimirVladimir
    Feb 22, 2014 at 2:04 am
    Feb 25, 2014 at 1:02 am
  • Given a table with column surrogate_key, id, and decision_type, while id is UUID and 36 bytes long, different ids have different surrogate_keys, and decision_type is just 0 and 1. create table foo ( ...
    Bewang TechBewang Tech
    Feb 24, 2014 at 6:24 pm
    Feb 24, 2014 at 8:59 pm
  • Hi, Can I have unpartitioned data files in a partitioned table? (For example load stating area for a table). Thank you! Gyorgy To unsubscribe from this group and stop receiving emails from it, send ...
    György BaloghGyörgy Balogh
    Feb 18, 2014 at 3:41 pm
    Feb 18, 2014 at 4:23 pm
  • Hi All, I had tested Hive 0.11 on one of the windowing function which is ROW_NUMBER(). I had wrote one query to select top 2 records for each country, the sample query is as the following: *select ...
    ChiewyeaChiewyea
    Feb 7, 2014 at 8:43 am
    Feb 10, 2014 at 5:27 pm
  • Hi, We are evaluating impala for improving query performance. But we are hitting a road block due to no impala support for complex hive types (and avro serde for these types) and custom serde ...
    Sukhendu chakrabortySukhendu chakraborty
    Feb 28, 2014 at 2:20 am
    Feb 28, 2014 at 3:57 am
  • HI, there~ I did a test about impala performance, I created a table TBL_A with 4 column c1, c2, c3, c4, and block size is 1GB. I found that this sql "select c1, sum(c2) from TBL_A where c3=100 and ...
    Smart SunSmart Sun
    Feb 26, 2014 at 7:59 am
    Feb 27, 2014 at 10:12 am
  • Before Hive 0.11, If I use DISTRIBUTE BY ... SORT BY, it is much easier to write a UDF to implement Analytic/Windowing functions. Because Impala doesn't support DISTRIBUTE BY, I'm wondering if there ...
    Bewang TechBewang Tech
    Feb 12, 2014 at 6:57 pm
    Feb 25, 2014 at 6:37 pm
  • Hi, Calling a "insert into t1 select * from t2" via jdbc returns intermediately. (according to jdbc spec insert should block until it is done). If we wait for the end (with a large sleep) the ...
    György BaloghGyörgy Balogh
    Feb 20, 2014 at 4:11 pm
    Feb 21, 2014 at 12:52 pm
  • I am attempting to get Impala working with Kerberos. It appears to mostly work; however, the impala-state-store doesn't appear to start. It's not clear to me how to diagnose any further? Any pointers ...
    DaveDave
    Feb 13, 2014 at 12:24 am
    Feb 14, 2014 at 7:47 pm
  • Hi all, I set up an impala 1.1.1 cluster, all hdfs tables work fine, but hbase tables don't work, I configured hbase-site.xml and put it in the class path. Here is the concrete symptom: when I ...
    Zesheng WuZesheng Wu
    Feb 10, 2014 at 2:42 am
    Feb 11, 2014 at 7:31 pm
  • Hi, there, I built Impala 1.2.1 from source code, and was able to start impalad successfully. But got the following error when using impala shell: [Impala-impala-v1.2.1]$ . bin/impala-shell.sh ...
    Jessica ZhangJessica Zhang
    Feb 4, 2014 at 9:52 pm
    Feb 6, 2014 at 7:04 pm
  • Hi, I am using impala 1.2.3 and trying to setup ldap authentication + sentry. Its a single node install for testing purposes. I have followed the instructions in the cloudera website. When I try to ...
    Sukhendu chakrabortySukhendu chakraborty
    Feb 25, 2014 at 7:58 pm
    Feb 28, 2014 at 5:58 pm
  • Dear Impala-users, I encountered a strange behaviour in Impala v1.2.3 installed using CM to try out CDH 5b2. Say, I got two tables: a and b. a: bk name city 1 custom_hh hamburg 2 custom_fra frankfurt ...
    Chi HuynhChi Huynh
    Feb 28, 2014 at 10:36 am
    Feb 28, 2014 at 5:45 pm
  • Is there a way to access a hive MAP<String, String type in impala. This is a hive table mapped to a HBase table. To unsubscribe from this group and stop receiving emails from it, send an email to ...
    DanoomistmatisteDanoomistmatiste
    Feb 25, 2014 at 6:41 pm
    Feb 26, 2014 at 6:51 am
  • I have done some experiments like this to get logical fields out of long strings, but not pumped through enough data to know all the performance aspects. (In my sample scenario, I take tennis scores ...
    John RussellJohn Russell
    Feb 25, 2014 at 1:32 am
    Feb 25, 2014 at 11:24 pm
  • Hi, I have some questions about impala combining small files strategy and block size. I have a table TBL_SMALL_FILES which stats are: +-----------+--------+----------+---------+ ...
    Smart SunSmart Sun
    Feb 24, 2014 at 6:31 am
    Feb 24, 2014 at 7:25 pm
  • Moving to impala-user (bcc scm-users) -- Thanks, Darren To unsubscribe from this group and stop receiving emails from it, send an email to impala-user+unsubscribe@cloudera.org.
    Darren LoDarren Lo
    Feb 21, 2014 at 1:35 am
    Feb 21, 2014 at 5:24 pm
  • Hi Rasik, I have the same exact set up as you, Centos 6.1 and Microstrategy 9.3.1, with Impala 1.2.3. I just got it to work and is able to create a MSTR free form report to go against Impala. I ...
    Tenny susantoTenny susanto
    Feb 13, 2014 at 6:07 pm
    Feb 14, 2014 at 12:49 pm
  • Looking at building from the source on github I see there is a tag for 1.2.2 but none for 1.2.3. Is there a plan to add the tag for 1.2.3? Or can someone point to the commit that marked the release? ...
    Parth ChandraParth Chandra
    Feb 13, 2014 at 6:40 pm
    Feb 13, 2014 at 11:30 pm
  • We are pleased to announce the Beta release of Cloudera Enterprise 5.0 (CDH 5.0 and Cloudera Manager 5.0). This release contains a number of new features and component versions including the ones ...
    Wendy TurnerWendy Turner
    Feb 11, 2014 at 12:42 am
    Feb 12, 2014 at 1:52 am
  • Tenni, From the error message, it looks like the attribute you're trying to set is not supported by the ODBC driver. I'm not familiar with the tool you're using, but it looks like it sets certain ...
    Ishaan JoshiIshaan Joshi
    Feb 11, 2014 at 2:08 am
    Feb 11, 2014 at 6:11 pm
  • HIve GenericUDAF supports variable arguments. I'm wondering if C++ impala UDAF support that too? For example, I want to implement a windowing analytic function based on partition columns, and pass ...
    Bewang TechBewang Tech
    Feb 10, 2014 at 9:46 pm
    Feb 10, 2014 at 9:51 pm
  • Hi, We are designing a hardware config for Impala. The Impala hw requirements is a good start but we need some more info to optimize the hw config. The doc says that data node should have 12 disks ...
    György BaloghGyörgy Balogh
    Feb 5, 2014 at 8:55 am
    Feb 5, 2014 at 9:20 am
  • Sorry, view creation would be: create view map_view as select regexp_extract(line, "(\\w+),(\\w+):\\w+,\\w+:(.+)\\/\\w+\\/(\\S+)", 1) user, regexp_extract(line, ...
    SecsubsSecsubs
    Feb 24, 2014 at 10:05 pm
    Feb 24, 2014 at 10:05 pm
  • Parquet does not create a single file per column, all columns are stored separately within a single file. But the table data itself is broken up into multiple separate Parquet files, depending on the ...
    Marcel KornackerMarcel Kornacker
    Feb 18, 2014 at 4:03 pm
    Feb 18, 2014 at 4:03 pm
  • I set up impala1.2.3 cluster(catalog + statestore + impalaserver) but when i start a query ,i get the "ERROR: AnalysisException: This Impala daemon is not ready to accept user requests. Status ...
    永恒的爱永恒的爱
    Feb 14, 2014 at 8:37 am
    Feb 14, 2014 at 8:37 am
  • Moving to impala-user -- Thanks, Darren To unsubscribe from this group and stop receiving emails from it, send an email to impala-user+unsubscribe@cloudera.org.
    Darren LoDarren Lo
    Feb 14, 2014 at 1:33 am
    Feb 14, 2014 at 1:33 am
  • I set up catalog server 1.2.3 in my impala cluster,but after a few minutes ,i get the "OutOfMemoryError: GC overhead limit exceeded" error. refer to the /var/run/impala/hs_err_pidxxxx.log file: The ...
    永恒的爱永恒的爱
    Feb 13, 2014 at 3:01 am
    Feb 13, 2014 at 3:01 am
  • Vincent, Could you try using the octal representation, that should work. More details can on escape sequences used by Imapala can be found here ...
    Ishaan JoshiIshaan Joshi
    Feb 13, 2014 at 2:43 am
    Feb 13, 2014 at 2:43 am
  • Rasik - unfortunately it's very difficult to diagnose the issue without more information. Can you provide more detail, such as your odbc.ini, odbc.sh, etc.? A useful tool for testing is unixODBC, ...
    Jonathan SeidmanJonathan Seidman
    Feb 12, 2014 at 1:45 am
    Feb 12, 2014 at 1:45 am
  • HIve GenericUDAF supports variable arguments. I'm wondering if C++ impala UDAF supports that too? For example, I want to implement a windowing analytic function based on variable number of partition ...
    Bewang TechBewang Tech
    Feb 10, 2014 at 9:50 pm
    Feb 10, 2014 at 9:50 pm
  • hi, When i used udf in impala1.2.2,i got IllegalArgumentException error.but the udf function is ok in hive hive results: hive select nginx_url_parse('www.test.com','GET ...
    倪增光倪增光
    Feb 10, 2014 at 11:37 am
    Feb 10, 2014 at 11:37 am
  • Hi All, I had tested Hive 0.11 on one of the windowing function which is ROW_NUMBER(). I had wrote one query to select top 2 records for each country, the sample query is as the following: select ...
    ChiewyeaChiewyea
    Feb 7, 2014 at 9:22 am
    Feb 7, 2014 at 9:22 am
  • Hi All, I had read the article about Impala performance compare with Hive 0.12 (Stinger) at http://blog.cloudera.com/blog/2014/01/impala-performance-dbms-class-speed/. In the article, the Impala is ...
    ChiewyeaChiewyea
    Feb 7, 2014 at 9:20 am
    Feb 7, 2014 at 9:20 am
Group Navigation
period‹ prev | Feb 2014 | next ›
Group Overview
groupimpala-user @
categorieshadoop
discussions50
posts180
users47
websitecloudera.com
irc#hadoop

47 users for February 2014

Nong Li: 23 posts Ishaan Joshi: 15 posts György Balogh: 12 posts Marcel Kornacker: 11 posts Benjamin Kim: 9 posts Hanks Tom: 9 posts Bewang Tech: 7 posts Alex Behm: 6 posts Ricky Saltzer: 6 posts John Russell: 5 posts Mario Casola: 5 posts Vladimir: 5 posts Chiewyea: 4 posts Alex schufo: 4 posts Darren Lo: 4 posts Zesheng Wu: 4 posts Secsubs: 3 posts Ananth Gundabattula: 3 posts Avrilia Floratou: 3 posts Dejan Prokić: 3 posts
show more