On behalf of the Impala team at Cloudera, I am happy to announce the beta
release of Cloudera Impala, the industry’s first real time query framework
Impala is an open source real-time query engine that allows users to query
data stored in HDFS and HBase via a SQL interface. It leverages the same
metadata, SQL syntax (HiveQL), ODBC driver and Hue user interface (Beeswax)
as Apache Hive to provide a unified platform. But rather than using batch
processing MapReduce framework, Impala directly accesses the data locally
through a distributed query engine architecture. The result is an order of
magnitude performance improvement over Apache Hive depending on the type of
query. For more details on Impala, please see:
As part of the open beta, we encourage the CDH community to try it out.
Here is how you can get started:
- Access the download of Impala from
https://ccp.cloudera.com/display/SUPPORT/Downloads, install Impala and
try it out. Please note that you need to have CDH4.1 installed on
RHEL/CentOS 6.2. We highly encourage the use of Cloudera Manager 4.1 to
deploy and manage Impala.
- Access the demo VM of Impala from
The VM includes instructions that show you the power of Impala.
- Access the Impala source code at: https://github.com/cloudera/impala
- Download and review the Impala documentation at
- Download and review the Cloudera Manager 4.1 documentation for how to
deploy Impala at:
- Free edition:
- Enterprise edition:
Once you get started, we encourage you to provide feedback. We have the
following mechanisms set up to do this:
- An Impala specific user group has been set up. Click here to join:
Please use this to ask questions and provide feedback.
- An Impala specific Jira project has been set up. You can reach it at:
We look forward to hearing about your experiences with Impala.