Data Quality Unites with Integration at Syncsort
Syncsort has established itself as a major player in the market for big data integration software with its DMX-h product. Following its acquisition of Trillium last year, the company is taking the next step of giving customers the convenience of accessing data quality and governance capabilities from within its integration software.
Last week the New York software company announced two new solutions resulting from the acquisition of Trillium. The first is a data governance solution that brings together Trillium’s Discover product and Syncsort’s DMX-h integration software to perform data profiling on data brought into data lakes., while the second is a Customer 360 solution that also brings together technology from the previously separate vendors.
The new data governance solution addresses a need in the market to ensure that data lakes don’t turn into data swamps. As customers fill up their data lake –possibly by using DMX-h to transform and load mainframe or other operational data sources into Hadoop – they can use Discover to gain a better understanding of the quality of the data flowing in.
It’s all about ensuring good data governance practices, explains Syncsort CTO Tendü Yoğurtçu. “As part of their data governance initiatives, many of our customers are looking for ways to assess the quality of the data as it is integrated from diverse data sources, including legacy and mainframes, and accessed in data lakes to support next-generation business analytics,” Yoğurtçu says. “This process ensures they can trust that data, and that they are targeting the right data to fit the purpose.”
In addition to supporting mainframe data sources and Hadoop data targets, the new data governance solution also supports relational and NoSQL databases, MPP analytic warehouses, and other file systems.
Syncsort’s second new solution, dubbed Trusted Customer 360, provides another vehicle for combining DMX-h with Trillium technology. In this case, Syncsort is tapping into Trillium’s data validation and verification capabilities.
“We have gotten strong feedback that our customers not only need a single view of their customers, but it is critical that they can trust the accuracy of that view,” says Syncsort’s Vice President of Product Management, Keith Kohl, who came over with the Trillium acquisition.
Earlier this year, Trillium announced that it can not only provide verification and validation of postal information such as addresses, but non-postal data, such as people’s emails and phone numbers. Trillium’s technology also gives customers the capability to determine, for example, if a phone number is connected to a landline or a cell phone, or if an email address is connected to a domain from a known spammer.
While there are many cases where Syncsort will seek to integrate the data quality and data integration capabilities together, the product families will continue to live apart. The company doesn’t want to create a giant, unwieldy stack that forces customers to buy more than they need, executives with the company told Datanami at the Strata + Hadoop World conference earlier this year.