1,329 discussions - 4,589 posts

  • Hi Jung-Yup, Changing the query processing model is not in the immediate plans for Impala (see http://blog.cloudera.com/blog/2014/08/whats-next-for-impala-focus-on-advanced-sql-functionality/). That ...
    Dimitris Tsirogiannis
    Nov 12, 2014 at 5:48 pm
    Nov 12, 2014 at 5:48 pm
  • Hi Sean, Which version of Impala are you using? Can you please send the full query profile for the query that is failing as well as the impalad log? Thanks, Dimitris ...
    Dimitris Tsirogiannis
    Nov 11, 2014 at 10:49 pm
    Nov 11, 2014 at 11:20 pm
  • I'm using CDH 5.2.0 and wanted to know if there is a good collection of UDFs out there that work with Impala. Is it still true that Impala doesn't support nested or composite types? Thanks! ...
    Buntu Dev
    Nov 11, 2014 at 6:44 pm
    Nov 11, 2014 at 6:50 pm
  • I saw Impala's 2.1 roadmap: "Parquet enhancements – continued performance gains including index pages". Will Impala use indexes for Parquet files? ...
    Denis Lamanov
    Nov 11, 2014 at 3:56 pm
    Nov 11, 2014 at 3:56 pm
  • Hi People, I'm using Impala version 2.0.0. There is a weird thing happening with subqueries. I have two tables named "a" and "b": Query: select * from a +----+ +----+ Query: select * from b +----+ ...
    Ravi Sharma
    Nov 11, 2014 at 10:02 am
    Nov 11, 2014 at 2:46 pm
  • Hi all, I have created a simple UDF, AddUdf, on my Red Hat box. Instead of IR, I created a shared object of this UDF and transferred it to the CDH 5.2 quick VM with Impala 2.0. While I am trying to create ...
    Shabnam perween
    Nov 11, 2014 at 3:58 am
    Nov 12, 2014 at 5:28 am
  • Hi there, I have been looking into Impala 2.0.0 source code recently. I saw that Impala is using libhdfs to read/write HDFS. However I didn't see the creation of directories for databases/tables in ...
    Chengbing Liu
    Nov 10, 2014 at 10:20 am
    Nov 12, 2014 at 1:57 am
  • Hi, For a PlanFragment root, my understanding is that it is executed on the Coordinator Node, so it should only run on one host. However, the plan I got from the profile indicates that it is running on 4 ...
    Nov 10, 2014 at 9:42 am
    Nov 11, 2014 at 2:58 am
  • Hello, I have a question about the different partitions listed in the query plan. I'm not sure my understanding is correct. I'd appreciate it if anyone who understands the query plan well could help me figure ...
    Nov 10, 2014 at 9:39 am
    Nov 10, 2014 at 11:01 am
  • Hi, In Impala 1.4, I can set Parquet compression with 'set PARQUET_COMPRESSION_CODEC=snappy;'. After upgrading to Impala 2.0, it reports the following error message: Unknown query option ...
    Zhe Li
    Nov 10, 2014 at 9:03 am
    Nov 11, 2014 at 3:28 am
  • Hi, We have an ETL process that generates Parquet files every so often using a Spark job. I would like to use these files with an external partitioned Impala table. Impala can read the original file ...
    Stephane Drouin
    Nov 10, 2014 at 2:36 am
    Nov 10, 2014 at 5:44 am
  • Hi All: I have done some TPC-H tests for Impala 2.0 and found one misleading point. For example, for TPC-H Query 3 the total duration is 28s, but the total time summed from the summary is only ...
    Nov 9, 2014 at 9:02 am
    Nov 10, 2014 at 5:41 am
  • Hi, I saw there were quite significant updates to the UDA API in Impala 2.0, but I can't find any example of how to use the new framework. Maybe someone knows and can post an example of how to use fixed ...
    Jonas Jarutis
    Nov 8, 2014 at 5:36 am
    Nov 8, 2014 at 5:36 am
  • Hello, I'm trying to enable resource management on Impala using Llama on Yarn. (Running CDH5.2 and CM) I used the "wizard" from the pool's webpage to do so. After a restart of the services, ...
    Philippe Marseille
    Nov 7, 2014 at 8:08 pm
    Nov 11, 2014 at 9:21 pm
  • I am poring through the documentation for CDH 5.2 and can no longer find the steps to configure Impala/Llama to use YARN, along with the YARN configurations to accommodate it. I see in the Static Service ...
    Benjamin Kim
    Nov 7, 2014 at 5:47 pm
    Nov 7, 2014 at 6:35 pm
  • Hello, As I mentioned, I am building a new Impala cluster with 30 data nodes (2 TB HDD per server). But I wonder how big an Impala cluster has been deployed. Thank you very much. ...
    Summer nguyen
    Nov 7, 2014 at 2:30 pm
    Nov 7, 2014 at 2:30 pm
  • Hello All, Right now we are discussing having views for aggregated queries or tables. Our test results indicate that a query on a view takes twice as long as a query issued on a table ...
    Nov 7, 2014 at 5:30 am
    Nov 7, 2014 at 5:30 am
  • When using impala-shell for a query containing Chinese characters, the encoding is sometimes messed up, as shown below. However, when I insert some spaces in the last part (marked in red), the query is now back ...
    Binglin Chang
    Nov 7, 2014 at 3:31 am
    Nov 7, 2014 at 3:31 am
  • Hi all, We'd like to give everyone a heads up on IMPALA-1401 that affects CDH 5.2. Due to this bug, Parquet files created outside Impala (eg Hive or MR) in CDH 5.2 with strings 36 bytes may fail in ...
    Justin Erickson
    Nov 7, 2014 at 2:05 am
    Nov 9, 2014 at 3:10 am
  • Ken, the date_id approach requires dynamic partition pruning in order to avoid having to scan the entire table for each query. Impala doesn't support that at the moment, but this is on our ...
    Marcel Kornacker
    Nov 6, 2014 at 10:20 pm
    Nov 12, 2014 at 11:24 am
  • The documentation is my area of responsibility. Yes, the separate-column approach is what comes across from a lot of customer use cases and questions, so we do address it quite a bit. I personally ...
    John Russell
    Nov 6, 2014 at 9:47 pm
    Nov 6, 2014 at 9:47 pm
  • Hi, I tried playing with a UDAF that collects numeric values from all rows into an array and does some calculation in the Finalize step (probably not the best approach in a distributed environment, but it should be feasible for ...
    Jonas Jarutis
    Nov 6, 2014 at 8:26 am
    Nov 10, 2014 at 7:55 am
  • Stephane, I would take advantage of the fact that you can point external Impala tables to new locations while online (in fact, multiple tables to the same location). So you could keep copy A under ...
    Mauricio Aristizabal
    Nov 4, 2014 at 6:30 pm
    Nov 5, 2014 at 4:20 pm
  • Hi Jason, yes, 10k is the limit. Alex
    Alex Behm
    Nov 4, 2014 at 1:51 am
    Nov 4, 2014 at 1:51 am
  • Hi, It does not seem to be supported in Impala 2.0. I can see https://issues.apache.org/jira/browse/TAJO-710 on that very subject. Can someone share the expected release date for this? Thanks ...
    Stephane Drouin
    Nov 4, 2014 at 1:41 am
    Nov 4, 2014 at 1:41 am
  • Just to make sure: are you referring to table partitioning by range, like http://docs.oracle.com/cd/E17952_01/refman-5.5-en/partitioning-range.html, or to window clauses with ranges, like RANGE BETWEEN ...
    Justin Erickson
    Nov 3, 2014 at 7:37 pm
    Nov 3, 2014 at 7:37 pm
  • Hi Dima - When an Impala daemon goes offline, the following should happen: 1. The statestore should detect that its regular heartbeats are not getting through to the daemon. When that happens, the ...
    Henry Robinson
    Nov 3, 2014 at 5:46 pm
    Nov 12, 2014 at 7:28 pm
  • Hi Dima, We are running CDH4 with Impala 2.0 and we experienced a similar issue when we were doing physical maintenance that involved shutting down a datanode. I'm not sure it took as much as 30 ...
    Tony Bussieres
    Nov 3, 2014 at 3:22 pm
    Nov 3, 2014 at 3:22 pm
  • Hi, I am able to load up to 30 columns into HBase. The regex is working well. I have searched but couldn't find this limit documented. Why am I able to upload only up to 30 columns and no more than that? Is there any limit ...
    Ashish sanadhya
    Nov 3, 2014 at 7:18 am
    Nov 3, 2014 at 3:08 pm
  • We're running some benchmarks on Impala 2.0. In the first part of the blog, we have a detailed look at how Cloudera ran their previous benchmarks ...
    Kris Peeters
    Nov 2, 2014 at 11:42 am
    Nov 4, 2014 at 9:45 am
  • Hey guys, Looks like it's this error, but I am logging this with a Hive versus Impala comparison: https://issues.cloudera.org/browse/IMPALA-1401 I have Hive and Impala installed (CDH 5.2.0). First the RAW ...
    Sanjay Subramanian
    Oct 31, 2014 at 4:44 pm
    Oct 31, 2014 at 7:52 pm
  • Hi, I have a problem using the Impala ODBC Connector (v2.5.20) with the Varchar data type. I'm using Impala 2.0 (CDH 5.2). This is my create statement: create external table type_tests( varchar_field ...
    Simone Battaglia
    Oct 31, 2014 at 12:13 pm
    Nov 3, 2014 at 7:37 pm
  • Hi Dimitris, Unfortunately, the 6-8 second call is the fast one. I can continually issue the same DDL command, and it takes the same(ish) time every call. The initial command takes much longer, ...
    Keith Simmons
    Oct 31, 2014 at 1:09 am
    Nov 12, 2014 at 11:52 pm
  • Hey guys/gals, I'm having problems setting up Impala to authenticate users against LDAPS (Active Directory). I'm running CDH 5.2 (Impala 2.0) in a CM managed environment. I've followed this ...
    Philippe Marseille
    Oct 30, 2014 at 7:22 pm
    Nov 3, 2014 at 11:24 pm
  • Hello, Since upgrading to Impala 2.0/CDH 5.2.0, we've been losing nodes mid-query pretty regularly. In each case that I've checked, the failure (per impala-server.log) has been due to SIGSEGV. I've ...
    Charlie Flowers
    Oct 30, 2014 at 3:26 pm
    Oct 30, 2014 at 3:26 pm
  • Dear All: Our customer may want us to deploy Impala in an HPC environment, so Impala may need to support InfiniBand or the MPI protocol. Does anyone know how to do that, or whether there is a completed ...
    Oct 30, 2014 at 6:33 am
    Oct 30, 2014 at 6:33 am
  • Hi all, What is the root cause of this error? Query: insert OVERWRITE DATA_PARQUET PARTITION(month) SELECT * from DATA_LOAD_PARTITIONED WHERE MONTH = "201406" WARNINGS: Cannot write value that needs ...
    Tony Bussieres
    Oct 29, 2014 at 9:48 pm
    Oct 30, 2014 at 9:39 pm
  • Dear All: One of our clients has a big Oracle DB with 200 tables that need to be imported from Oracle into Impala. So the question is: is there any existing tool that can help us with this migration? For ...
    Oct 29, 2014 at 1:42 am
    Nov 3, 2014 at 7:26 pm
  • Hi, I have CSV files that I load into HDFS and I run queries on them using Impala. I have a few double columns in the CSV file. I understand that I can have rounding errors using floats or doubles when I do ...
    Tony Bussieres
    Oct 28, 2014 at 3:42 pm
    Oct 28, 2014 at 7:03 pm
  • Dear All: 1. Does Impala support record filtering via predicate pushdown in Parquet? Spark did it ...
    Oct 28, 2014 at 2:26 pm
    Oct 31, 2014 at 2:08 am
  • Hi, I am writing to ask for help with the errors I get when I start up the catalogd daemon. The Impala version I use is 1.3 with CDH 5.0, and I compiled it from source code. The commands I used to start ...
    Baoquan Zhang
    Oct 28, 2014 at 11:17 am
    Oct 28, 2014 at 11:17 am
  • Hi, After upgrading to Impala 2.0, my script that updates the stats after a new partition has been added no longer works. The bash script uses impala-shell to execute the following: alter table ...
    Oct 24, 2014 at 10:57 am
    Oct 26, 2014 at 1:52 pm
  • I've built a Docker image for running Impala on Docker (single-node cluster). If you want to try it, it's available on the Docker repo: docker pull codingtony/impala ...
    Tony Bussieres
    Oct 23, 2014 at 9:04 pm
    Oct 23, 2014 at 9:04 pm
  • I run Impala 2.0 on CDH 4.7. I have a problem dropping some Parquet tables: ERROR: ImpalaRuntimeException: Error making 'createTable' RPC to Hive Metastore: CAUSED BY: MetaException ...
    Tony Bussieres
    Oct 23, 2014 at 8:39 pm
    Nov 5, 2014 at 9:54 pm
  • Hi everyone, I have a lot of data :), and I want to partition my dataset by day. I have a timestamp field called "created" and I want to use the dynamic partitioning. So, I've just added 3 calculated ...
    Oct 23, 2014 at 3:14 pm
    Oct 23, 2014 at 3:14 pm
  • Hi, I had a basic question about Impala. We know that Impala allows you to query data that is stored in HDFS. Now, if a file is split into multiple blocks, and let us say a line of text is spread across ...
    Oct 23, 2014 at 8:13 am
    Nov 1, 2014 at 10:35 am
  • Hi, I'm trying to build and test Impala from the source on Ubuntu 12.04. I've done the following: 1. Installed all prerequisites for Impala. 2. Cloned Impala repository ...
    Ammar Bakeer
    Oct 21, 2014 at 11:23 pm
    Oct 21, 2014 at 11:23 pm
  • On the EMR 3-node cluster I am getting this error. AMI version: 3.2.1, Hadoop distribution: Amazon 2.4.0, Applications: Hive 0.13.1, Impala 1.2.4. impala-shell -q "select count(*) from ...
    Sanjay Subramanian
    Oct 21, 2014 at 9:47 pm
    Oct 23, 2014 at 2:43 pm
  • Hi -- I have a Kafka stream producing JSON and wanted to use Spark Streaming or Camus to write to HDFS in Parquet format and use it with Impala. Just wanted to see if anyone has that working and can point me ...
    Buntu Dev
    Oct 21, 2014 at 5:59 pm
    Oct 21, 2014 at 11:36 pm
  • Hi, I am running a few Impala queries right now, and I wondered where I can find information about what Impala is doing right now and about its progress, besides tailing all the datanode Impala logs. In ...
    Oct 21, 2014 at 2:28 pm
    Oct 21, 2014 at 2:36 pm
Group Overview
Group: impala-user

Top users

  • Marcel Kornacker: 178 posts
  • Alex Behm: 152 posts
  • Lenni Kuff: 152 posts
  • Alan: 148 posts
  • Nong: 128 posts
  • Henry Robinson: 115 posts
  • Matthew Jacobs: 95 posts
  • Ricky Saltzer: 87 posts
  • Skye Wanderman-Milne: 70 posts
  • Jrussell: 69 posts
  • Ishaan Joshi: 66 posts
  • Benjamin Kim: 62 posts
  • Greg Rahn: 59 posts
  • Darren Lo: 55 posts
  • Justin Erickson: 53 posts
  • Jung-Yup Lee: 52 posts
  • Keith: 41 posts
  • Bewang Tech: 39 posts
  • Vikas Singh: 39 posts
  • Sean O'Brien: 36 posts