Cloudera Impala is a modern, open source, distributed SQL query engine
for Apache Hadoop. Follow us at @RideImpala!
Impala provides low latency and high concurrency for BI/analytic queries on Hadoop (not delivered by batch frameworks such as Apache Hive). Impala also scales linearly, even in multitenant environments.
Utilize the same file and data formats and metadata, security, and resource management frameworks as your Hadoop deployment -- no redundant infrastructure or data conversion/duplication.
For Apache Hive users, Impala utilizes the same metadata, ODBC driver, SQL syntax, and user interface as Hive -- so you don't have to worry about re-inventing the implementation wheel.
Impala is integrated with native Hadoop security and Kerberos for authentication, and via the Sentry module, you can ensure that the right users and applications are authorized for the right data.
Impala is open source (Apache License), so you can self-support in perpetuity if you wish. However, technical support for those who want it is available via a Cloudera Enterprise subscription add-on.
With Impala, more users -- whether using SQL queries or BI applications -- can interact with more data through a single repository and metadata store from source through analysis.
How-to: Get Started Writing Impala UDFs (Jan. 24, 2014)
Impala Performance Update: Now Reaching DBMS-Class Speed (Jan. 13, 2014)
How-to: Use Impala on Amazon EMR (Dec. 16, 2013)
How-to: Do Statistical Analysis with Impala and R (Dec. 16, 2013)
How-to: Use MADlib Pre-built Analytic Functions with Impala (Oct. 29, 2013)
Explore the Impala App in Hue (Oct. 11, 2013)