Follow Datanami:
October 1, 2014

Cloudera Acquires DataPad Technology Assets

PALO ALTO, Calif., Oct. 1 — Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, today announced that it has acquired the technology assets of DataPad, an innovator in the exploration and analysis of big data sets. The acquisition will further strengthen Cloudera’s enterprise data hub offering by simplifying data processing and analysis on Big Data. DataPad’s Python-based framework will accelerate adoption of Cloudera’s leading Big Data management and analytics platform and the team’s expertise further expands the breadth of open source committers and contributions from Cloudera. Terms of the deal were not disclosed.

DataPad co-founders, Wes McKinney and Chang She, well-known open source contributors and system architects, and the DataPad team, will join Cloudera. McKinney is the creator of the open source project Pandas, the Python open source library and is also the author of the best-selling Python for Data Analysis. She, a long-time colleague of McKinney, is also a core developer of Pandas. They will lead the company’s efforts to build high-performance data backends for business intelligence and analytic use cases, simplifying use of Cloudera’s products.

As a part of Cloudera, the DataPad technology and team will drive the Cloudera Enterprise big data platform to be more accessible to developers, data scientists, and the company’s ecosystem of partners.

“We are thrilled to have the DataPad team join Cloudera and look forward to their contributions to the Cloudera roadmap,” said Peter Cooper-Ellis, vice president, Engineering, Cloudera. “We’ve long been supporters of the DataPad team and have been impressed with their engineering work. Together, we possess some of the best talent in the data engineering sphere. The deep Python expertise that DataPad brings to Cloudera will further accelerate our data engineering capability.”

Python is arguably the most important programing language for today’s software developers to gain fluency and has tens of thousands of users. This open source language can be used on any operating system and easily processes text, numbers, images, scientific data and more. Many of today’s largest industries depend on Python in their daily operations. Big Data application developers find the language powerful and easy to use.

“Since the early days of DataPad, we’ve collaborated closely with the Cloudera team on engineering challenges for analytics systems,” said McKinney. “Working side by side with some of the best and brightest open source and data engineering talent in the industry can only benefit and improve additional innovation. Bringing our Python expertise to the Cloudera team will allow Cloudera to deliver better developer and data analysis capabilities.”

Datanami