October 16, 2014

Waterline Data Science Joins MapR Advantage Partner Program

NEW YORK, N.Y., Oct. 16 — Waterline Data Science today announced at Strata + Hadoop World New York that it has joined the MapR Advantage Partner Program. Waterline Data Science will integrate the MapR Distribution including Apache Hadoop with Waterline Data Inventory to enable data self-service on Hadoop, allowing users to find, understand, and help govern Hadoop data.

Oliver Claude, Waterline Data Science CMO, states, “We’re pleased to have Anoop Dawar, senior director, product management, MapR, join our Advisory Board to help steer the partnership and make Hadoop more business and mission critical by leveraging the enterprise-grade MapR Distribution and Waterline Data Inventory’s enterprise-ready data self-service with built-in data governance.”

Jon Posnik, vice president of business development, MapR Technologies, said, “Waterline Data Science is a great addition to our growing partner ecosystem. The combination of our technologies will help continue the adoption of Hadoop across business units with easier data self-service and a foundation for data governance.”

Companies are deploying Hadoop “data lakes” to give data science and analytics teams unprecedented access to data and to uncover new business insight. But Hadoop’s advantages of frictionless ingest and flexible schema on read, combined with its lack of data governance, present problems for users trying to find and understand the data. Waterline Data Inventory addresses these problems by building a complete inventory of data assets in Hadoop and by opening access to Hadoop data through data self-service. As a result, data scientists can be more productive, business analysts can easily augment reporting and BI with Hadoop data without coding, and data governance teams can start controlling Hadoop data.

“There is no point building a predictive model of the wrong column, and without a data inventory, you don’t know if you have the wrong column,” said John Mount, co-author of the book, Practical Data Science with R. A data inventory is also valuable for Hadoop data governance, according to Sunil Soares, author of Big Data Governance.

Alex Gorelik, Founder and CEO, states, “A major complaint with Hadoop is once you’ve loaded the data, extracting value is like finding a needle in a stack of needles. Waterline Data Inventory lets business users find the best needles in the stack of needles, without having to write code, and without having to wrangle the entire stack. That’s our secret sauce, and key to deliver faster time to value and broad Hadoop adoption.”

Datanami