Follow Datanami:
November 17, 2015

Cloudera Proposes to Donate Impala and Kudu to the ASF

PALO ALTO, Calif., Nov. 17 — Cloudera, the global provider of the fastest, easiest, and most secure data management and analytics platform built on Apache Hadoop and the latest open source technologies, announced today that it will submit proposals for Impala and Kudu to join the Apache Software Foundation (ASF). By donating its leading analytic database and columnar storage projects to the ASF, Cloudera aims to accelerate the growth and diversity of their respective developer communities.

Since its launch in 2012, Impala has been broadly adopted and widely embraced as the leading analytic database for Hadoop, with support across multiple industry vendors and integrations with all the leading business intelligence (BI) tools. Initially unlocking interactive SQL performance on Hadoop with core SQL-92 functionality, Impala 2.0 further matured its analytic SQL capabilities — enabling extremely low-latency operational BI and data discovery solutions while supporting multi-user concurrency across analysts and casual users alike. Impala 2.0 became a game changer for many businesses, delivering these popular business use cases and familiar access natively to Hadoop.

A majority of Cloudera customers across all industries now rely on Impala for their enterprise analytic and BI workloads, including Epsilon, Magnify Analytic Solutions, Premier, Quaero, RelayHealth, and SFR. With a continued focus on reliability at massive scale, Impala has also increasingly supported customers running millions of queries, running on clusters ranging from tens to hundreds of nodes, and over a thousand concurrent users.

Impala has been Apache-licensed since its public launch and has since become an open source standard in the Hadoop ecosystem. Since opening up Impala for community contributions earlier this year, there has been increasing development activity, with Google developing integrations between Impala and BigTable, as well as contributions from Arcadia Data, Intel, and others. By donating the project to the ASF, this diverse community can further drive the vision of Impala from its well-established foundation.

The recent beta release of Kudu, the new updateable storage engine for Hadoop, brought another critical milestone for analytics on Hadoop. For the first time, the community has both an interactive query engine with Impala and an updateable storage engine with Kudu — enabling fast analytic use cases on data as it changes. In a short period since its release, Kudu has experienced widespread interest within the open source community. Through its application to join the ASF, Kudu will continue to benefit from the broader development community and the collaboration of these projects will dramatically expand the use cases they can serve as more companies look to develop real-time analytic applications.

“We believe that the ASF is the right venue to foster an open source community around both Impala’s and Kudu’s development. We expect that these projects will benefit from more productive collaboration with related Apache projects, and under the auspices of the ASF, will attract talented contributors who will push development forward at pace,” said Doug Cutting, chief architect at Cloudera and co-creator of Hadoop. “With a strong architectural foundation set for both these projects, we believe that the timing is right for development to move wholesale to the ASF.”

Cloudera will continue to actively develop and support the Impala and Kudu projects, as it has with a number of successful ASF projects. Cloudera employees have founded and launched several open source projects with the ASF, including Apache Hadoop, Apache Flume, Apache HBase, Apache Parquet, and ZooKeeper. Within the Apache Software Foundation, Cloudera also has 13 company employees who are Apache members, a title that holds the highest merit within the foundation. These members are: Sean Busbey, Jarek Jarcec Cecho, Doug Cutting, Patrick Hunt, Todd Lipcon, Mark Miller, Aaron T. Myers, Sean Owen, Yonik Seeley, Michael Stack, Tom White, and Xuefu Zhang.

“Impala has made a significant impact on the widespread adoption of Hadoop, and its move into the ASF is a signal of the high demand for Hadoop to be accessible to all businesses,” said Marcel Kornacker, founder of Impala and lead architect at Cloudera. “We’ve built a strong foundation for Impala and are excited to continue shaping its vision with the open source community.”

“In the short time since Kudu’s beta release in September, we’ve been thrilled at the community reception from open source developers and users alike across many different organizations. The need for an updateable storage engine in the Hadoop ecosystem has resonated, especially with the increased interest in building real-time analytic applications,” said Todd Lipcon, founder and lead engineer of the Kudu project at Cloudera. “With the submission of Kudu to the Apache Incubator, we are excited to work with the broader community of Hadoop ecosystem developers to continue to build out Kudu integrations with the rest of the Hadoop ecosystem and develop a new open standard storage solution.”

About Cloudera

Cloudera delivers the modern data management and analytics platform built on Apache Hadoop and the latest open source technologies. The world’s leading organizations trust Cloudera to help solve their most challenging business problems with Cloudera Enterprise, the fastest, easiest and most secure data platform available for the modern world. Our customers efficiently capture, store, process and analyze vast amounts of data, empowering them to use advanced analytics to drive business decisions quickly, flexibly and at lower cost than has been possible before. To ensure our customers are successful, we offer comprehensive support, training and professional services. Learn more at http://cloudera.com.

Source: Cloudera

Datanami