Follow Datanami:
August 28, 2014

Trifacta Teams with Tableau to Simplify the Analysis of Big Data on Hadoop

SAN FRANCISCO, Calif., Aug. 28 — Trifacta, a leading Data Transformation platform provider, today announced deep integration with Tableau, a global leader in rapid-fire, easy-to-use business analytics software. The integration builds on Trifacta’s strategic partnership with Tableau. With the release of Trifacta Data Transformation Platform 1.5, Tableau customers now have the option of writing the output of Trifacta data transformations directly to a Tableau Data Extract format or registering the output with Hadoop’s HCatalog for more scalable, interactive data discovery on Hadoop.

“We believe that accessing and analyzing data should be simple, intuitive and fast for everyone. With the proliferation of big data, we’re seeing the growing need for this same type of self-service experience on Hadoop. Trifacta provides Tableau users with an intuitive Data Transformation Platform for Hadoop so they can more efficiently transform and analyze common data formats in Hadoop such as text and JSON,” said Dan Jewett, Vice President of Product Management at Tableau.

The integration between Trifacta and Tableau removes a key barrier between the raw, semi-structured data commonly stored in Hadoop and the self-service process for analyzing, visualizing and sharing of insights that users of Tableau value. Trifacta customers currently use Tableau as a delivery mechanism for analysis and visualization and are looking for ways to extend self-service data access beyond databases and spreadsheets to Hadoop. Trifacta’s 1.5 release makes the process of preparing data in Trifacta and visualizing it in Tableau seamless and efficient with the addition of native support for the Tableau Data Extract format and HCatalog.

“Tableau creates huge opportunities for effectively analyzing data, but working with big data poses specific challenges. The most significant barriers come from structuring, distilling and automating the transfer of data from Hadoop. Our integration removes these barriers in a way that complements self-service data analysis,” said Joe Hellerstein, Chief Strategy Officer of Trifacta. “Now, Trifacta and Tableau users can move directly from big data in Hadoop to powerful, interactive visualizations.”

Trifacta and Tableau are key elements of a Hadoop-centric data processing pipeline. Hadoop allows for semi-structured data to be stored easily in raw formats, including log and machine generated data. Much of that data appears in JSON and text formats. Trifacta removes the need for additional conversions and enables end-users to visually transform both raw and structured data with Trifacta’s Predictive Interaction technology to assess, structure, clean, enrich and distill data into a final data set ready for business exploration.

Datanami