56 discussions - 212 posts

  • Hi, We profiled our queries and saw that most of the query execution time is spent in the aggregation phase. Are there any configuration properties that can be set to increase the speed of ...
    Bulvik, Noam
    Oct 5, 2014 at 8:08 am
    Oct 16, 2014 at 8:01 am
  • Hi Dimitris, Unfortunately, the 6-8 second call is the fast one. I can continually issue the same DDL command, and it takes the same(ish) time every call. The initial command takes much longer, ...
    Keith Simmons
    Oct 31, 2014 at 1:09 am
    Nov 12, 2014 at 11:52 pm
  • I run Impala 2.0 on CDH 4.7. I have a problem dropping some Parquet tables. ERROR: ImpalaRuntimeException: Error making 'createTable' RPC to Hive Metastore: CAUSED BY: MetaException ...
    Tony Bussieres
    Oct 23, 2014 at 8:39 pm
    Nov 5, 2014 at 9:54 pm
  • Hi, guys. Please tell me how to make the "spill to disk" feature work in Impala 2.0. I got an error message like "Memory limit exceeded" when running a query. I have set the option ...
    Guangwen Liu
    Oct 21, 2014 at 8:18 am
    Oct 28, 2014 at 1:52 am
  • Hey guys/gals, I'm having problems setting up Impala to authenticate users against LDAPS (Active Directory). I'm running CDH 5.2 (Impala 2.0) in a CM-managed environment. I've followed this ...
    Philippe Marseille
    Oct 30, 2014 at 7:22 pm
    Nov 3, 2014 at 11:24 pm
  • Hi, After upgrading to Impala 2.0, my script that updates the stats after a new partition has been added no longer works. The bash script uses impala-shell to execute the following: alter table ...
    M.
    Oct 24, 2014 at 10:57 am
    Oct 26, 2014 at 1:52 pm
  • Hi All, I'm trying to implement a simple UDA in Impala to find the minimum distance between a line and a point. When I run the function, the result I'm getting is not the expected one, it ...
    Impala explorer
    Oct 21, 2014 at 1:02 am
    Oct 21, 2014 at 2:57 am
  • Dear All: One of our clients has a big Oracle DB with 200 tables that need to be imported from Oracle to Impala. So the question is: is there any current tool that can help us with this migration? For ...
    吴朱华
    Oct 29, 2014 at 1:42 am
    Nov 3, 2014 at 7:26 pm
  • How do I know whether or not my Parquet table is compressed? I am using the code below to create the Impala table with Parquet file format and Snappy compression. use USATPSA; set ...
    Venkat Ankam
    Oct 20, 2014 at 9:05 pm
    Oct 27, 2014 at 2:16 pm
  • there are 3848763 rows, but only 3772560 rows were read. And: The files seem to be fine otherwise - we can write M/R jobs against them without any problems. Is there a specific version of parquet-mr ...
    Colin Marc
    Oct 20, 2014 at 4:27 pm
    Oct 20, 2014 at 6:05 pm
  • Hi Guys, Just noticed you released Impala 2.0 a few days ago ( http://www.cloudera.com/content/cloudera/en/about/press-center/press-releases/2014/10/14/cloudera-releases-impala-2-0.html), and I even ...
    Keith Simmons
    Oct 16, 2014 at 11:04 pm
    Oct 17, 2014 at 3:08 pm
  • Hi, Can you please send some more information about this error? For example: - What version of Impala did you use? - Can you send us the query? - Can you send us the log file? - How many queries ...
    Ippokratis Pandis
    Oct 20, 2014 at 4:58 pm
    Oct 25, 2014 at 9:53 am
  • On a 3-node EMR cluster I am getting this error. AMI version: 3.2.1, Hadoop distribution: Amazon 2.4.0, Applications: Hive 0.13.1, Impala 1.2.4. impala-shell -q "select count(*) from ...
    Sanjay Subramanian
    Oct 21, 2014 at 9:47 pm
    Oct 23, 2014 at 2:43 pm
  • Hi, According to my understanding, Impala currently uses the impala user to read/write the warehouse directory, even if Sentry authorization is enabled. The problem is, I have existing data in Hive ...
    Chengbing Liu
    Oct 11, 2014 at 9:49 am
    Oct 15, 2014 at 7:03 am
  • Hi Matt, What client are you using to execute the INSERT statements (impala-shell, JDBC, etc)? Could you also send your catalogd and impalad service log files? Thanks, Lenni
    Lenni Kuff
    Oct 9, 2014 at 8:38 pm
    Oct 12, 2014 at 4:06 am
  • Dear All: 1. Does Impala support record filtering via predicate pushdown in Parquet? Spark did it ...
    吴朱华
    Oct 28, 2014 at 2:26 pm
    Oct 31, 2014 at 2:08 am
  • Hi, We've upgraded to CDH 5.2 and are itching to try out the new Impala 2.0 features. One question though: When Impala 2 spills to disk, where does it spill to? And is it configurable somewhere? ...
    Fredrik Ragnar
    Oct 20, 2014 at 9:18 am
    Oct 21, 2014 at 5:19 am
  • Does anybody know what new data types were added in Impala 2.0? Was a DATE data type added? Regards, Venkat
    Venkat Ankam
    Oct 17, 2014 at 3:12 pm
    Oct 21, 2014 at 2:30 am
  • Hi, If I join from a large fact table to one or more small dimension tables, Impala will do a broadcast join. What I was curious about was if I turn up the replication on the dimension tables so that ...
    David Sinclair
    Oct 13, 2014 at 1:10 pm
    Oct 14, 2014 at 12:43 pm
  • Hi, I have a basic question about Impala. We know that Impala allows you to query data that is stored in HDFS. Now, if a file is split into multiple blocks, and let us say a line of text is spread across ...
    Sanjay
    Oct 23, 2014 at 8:13 am
    Nov 1, 2014 at 10:35 am
  • Hey guys, Looks like it's this error, but I am logging this with a Hive versus Impala comparison: https://issues.cloudera.org/browse/IMPALA-1401 I have Hive and Impala installed (CDH 5.2.0). First the RAW ...
    Sanjay Subramanian
    Oct 31, 2014 at 4:44 pm
    Oct 31, 2014 at 7:52 pm
  • Hi all, What is the root cause of this error? Query: insert OVERWRITE DATA_PARQUET PARTITION(month) SELECT * from DATA_LOAD_PARTITIONED WHERE MONTH = "201406" WARNINGS: Cannot write value that needs ...
    Tony Bussieres
    Oct 29, 2014 at 9:48 pm
    Oct 30, 2014 at 9:39 pm
  • Hi -- I have a Kafka stream producing JSON and wanted to use Spark Streaming or Camus to write to HDFS in Parquet format and use it with Impala. Just wanted to see if anyone has that working and point me ...
    Buntu Dev
    Oct 21, 2014 at 5:59 pm
    Oct 21, 2014 at 11:36 pm
  • Hi All, I'm switching from Hive JDBC to Cloudera JDBC 2.5 and have encountered a SQL exception with the following SQL: stmt.execute("insert into TESTDB (STR_, NUM_) values (null, 1)"); the SQL exception stacktrace ...
    Chen
    Oct 20, 2014 at 2:55 am
    Oct 21, 2014 at 8:53 am
  • Hello, Does Impala expose its metrics anywhere for consumption by monitoring tools like SPM <http://sematext.com/spm/>? I see ...
    Otis Gospodnetic
    Oct 17, 2014 at 9:56 pm
    Oct 17, 2014 at 10:58 pm
  • Hi, I am trying to create an external table with partitions using LIKE PARQUET but am running into problems. I have a directory in HDFS, /user/matth/test_data, which has a few hundred parquet files ...
    Matt Hollingsworth
    Oct 17, 2014 at 9:16 pm
    Oct 17, 2014 at 9:43 pm
  • Hi, when trying to connect to Impala using impala-shell on a Kerberos cluster we are getting the following error: [root@hadoopclient ~]# impala-shell -k -i hadoopnode Starting Impala Shell using ...
    MrAkhe83
    Oct 13, 2014 at 8:46 pm
    Oct 14, 2014 at 5:29 pm
  • I am looking forward to version 2.0, since it will support features such as windowing functions. Can anyone tell me the release date for v2.0?
    吴朱华
    Oct 6, 2014 at 6:00 am
    Oct 9, 2014 at 3:32 am
  • Hello, I have been trying to test the timestamp support that was recently added to hive-14. Unfortunately, timestamp values inserted through Hive are not readable by Impala, and vice versa. I am ...
    Dilip Biswal
    Oct 7, 2014 at 6:08 am
    Oct 7, 2014 at 6:25 pm
  • Hi, I have a problem using the Impala ODBC Connector (v2.5.20) with the VARCHAR data type. I'm using Impala 2.0 (CDH 5.2). This is my create statement: create external table type_tests( varchar_field ...
    Simone Battaglia
    Oct 31, 2014 at 12:13 pm
    Nov 3, 2014 at 7:37 pm
  • Hi, I have CSV files that I load into HDFS and I run queries on them using Impala. I have a few double columns in the CSV file. I understand that I can have rounding errors using floats or doubles when I do ...
    Tony Bussieres
    Oct 28, 2014 at 3:42 pm
    Oct 28, 2014 at 7:03 pm
  • Hi, I am trying to automate some Impala-related processes by writing a little JDBC tool that frequently drops a table and builds a new one from some *.csv files in HDFS. The weird thing is that ...
    Klausen Schaefersinho
    Oct 21, 2014 at 1:10 pm
    Oct 23, 2014 at 8:40 pm
  • Greetings, We recently upgraded to CDH 5.2 which includes Impala 2.0 and it appears that regexp_extract and regexp_replace functions no longer work when using shorthand character classes. Posix ...
    Shaun Sit
    Oct 21, 2014 at 4:47 am
    Oct 21, 2014 at 6:19 pm
  • I'm also experiencing the same issue with Parquet files on Impala 2.0 and CDH 5.2.
    Shaun Sit
    Oct 21, 2014 at 10:21 am
    Oct 21, 2014 at 4:42 pm
  • Hi, I am running a few Impala queries right now, and I wondered where I can find information about what Impala is doing and about its progress, besides tailing all the datanode Impala logs. In ...
    Ran
    Oct 21, 2014 at 2:28 pm
    Oct 21, 2014 at 2:36 pm
  • I cannot access the URL "http://util-1.ent.cloudera.com/impala-test-data/", so copy-test-data.sh cannot get the Impala test data. Can anyone tell me where I can get the Impala test data? Thanks! yycoder ...
    692895299
    Oct 20, 2014 at 5:40 am
    Oct 20, 2014 at 6:59 pm
  • Hello, I am trying to understand Impala's query plan for one of the TPC-DS queries. I have attached the plan. My question is about the scan operation on the store_sales table. Here is the snippet of the ...
    Dilip Biswal
    Oct 17, 2014 at 6:58 am
    Oct 17, 2014 at 6:30 pm
  • Hey folks, I'm trying to port existing Hive work to Impala, and we have made heavy use of parameterized scripts. In Hive: $hive -f download.q -hiveconf WHERE=year=2012 Is something similar possible in ...
    Tim Robertson
    Oct 17, 2014 at 12:46 pm
    Oct 17, 2014 at 3:15 pm
  • Hi Tim, I'm afraid the current options are limited: 1. Create an uber jar as you suggested 2. Put all dependencies on Impala's classpath, i.e., manually distribute all dependency jars to all nodes ...
    Alex Behm
    Oct 17, 2014 at 1:09 am
    Oct 17, 2014 at 5:33 am
  • Existing cluster uses Hortonworks; cluster's Hadoop version: 2.4; Hadoop client on Impala node is 2.4; Impala 1.4 is installed from tarball ...
    Donald fossouo
    Oct 14, 2014 at 8:39 am
    Oct 15, 2014 at 1:51 am
  • Look at the pow() builtin.
    Nong Li
    Oct 14, 2014 at 6:11 pm
    Oct 14, 2014 at 6:44 pm
  • This should go to the impala-user list. What versions of Impala and CDH are you running?
    Patrick Angeles
    Oct 3, 2014 at 8:09 pm
    Oct 3, 2014 at 8:11 pm
  • Will Impala 2.0 depend on Cloudera 5.0?
    Keith Simmons
    Oct 2, 2014 at 8:35 pm
    Oct 2, 2014 at 9:30 pm
  • Currently it seems that when an external table is created like CREATE EXTERNAL TABLE raw_impression( a STRING, b INT, d STRING, ) STORED AS PARQUET and a parquet file is loaded with the schema (these ...
    Marius van Niekerk
    Oct 1, 2014 at 12:06 pm
    Oct 1, 2014 at 8:17 pm
  • Hello, Since upgrading to Impala 2.0/CDH 5.2.0, we've been losing nodes mid-query pretty regularly. In each case that I've checked, the failure (per impala-server.log) has been due to SIGSEGV. I've ...
    Charlie Flowers
    Oct 30, 2014 at 3:26 pm
    Oct 30, 2014 at 3:26 pm
  • Dear All: Our customer may want us to deploy Impala in an HPC environment, so Impala may need to support InfiniBand or the MPI protocol. Does anyone know how to do that, or whether there is a completed ...
    吴朱华
    Oct 30, 2014 at 6:33 am
    Oct 30, 2014 at 6:33 am
  • Hi, I am writing to ask for help with the errors I get when I start up the catalogd daemon. The Impala version I use is 1.3 with CDH 5.0, and I compiled it from source code. The commands I used to start ...
    Baoquan Zhang
    Oct 28, 2014 at 11:17 am
    Oct 28, 2014 at 11:17 am
  • I've built a Docker image for running Impala on Docker (single-node cluster). If you want to try it, it's available on the Docker repo: docker pull codingtony/impala ...
    Tony Bussieres
    Oct 23, 2014 at 9:04 pm
    Oct 23, 2014 at 9:04 pm
  • Hi everyone, I have a lot of data :), and I want to partition my dataset by day. I have a timestamp field called "created" and I want to use dynamic partitioning. So, I've just added 3 calculated ...
    Guillaume
    Oct 23, 2014 at 3:14 pm
    Oct 23, 2014 at 3:14 pm
  • Hi, I'm trying to build and test Impala from source on Ubuntu 12.04. I've done the following: 1. Installed all prerequisites for Impala. 2. Cloned the Impala repository ...
    Ammar Bakeer
    Oct 21, 2014 at 11:23 pm
    Oct 21, 2014 at 11:23 pm
Group Overview
group: impala-user @
categories: hadoop
discussions: 56
posts: 212
users: 67
website: cloudera.com
irc: #hadoop

67 users for October 2014

  • Matthew Jacobs: 27 posts
  • 吴朱华: 15 posts
  • Tony Bussieres: 12 posts
  • Alex Behm: 10 posts
  • Keith Simmons: 7 posts
  • Sanjay Subramanian: 7 posts
  • Serega Sheypak: 6 posts
  • Skye Wanderman-Milne: 6 posts
  • Venkat Ankam: 6 posts
  • Ippokratis Pandis: 5 posts
  • Bulvik, Noam: 4 posts
  • Charlie Flowers: 4 posts
  • Colin Marc: 4 posts
  • Impala explorer: 4 posts
  • Lenni Kuff: 4 posts
  • Mehant Baid: 4 posts
  • Nong Li: 4 posts
  • Buntu Dev: 3 posts
  • Chen: 3 posts
  • Dilip Biswal: 3 posts