
Tag: CDC
ClickHouse Acquires PeerDB to Advance Postgres CDC Integration Solutions
ClickHouse, a high-performance, real-time analytics database, has announced the acquisition of PeerDB, a leading data replication and synchronization platform specializing in change data capture (CDC) solutions for Postg Read more…
The Spark-to-Ray Migration That Will Save Amazon $100M+ Per Year
When Ray first emerged from the UC Berkeley RISELab back in 2017, it was positioned as a possible replacement for Apache Spark. But as Anyscale, the commercial outfit behind Ray, scaled up its own operations, the “Ray Read more…
Databricks Make a CDC Play with Arcion Acquisition
The quicker you can access transactional data directly from the change log of a database, the quicker you can do analytics or AI on it. That’s the basic math behind Databricks’ announcement today that it intends to a Read more…
Striim Bolsters Cloud Data Integration to Google
Two months after enhancing real-time data connections to Snowflake and Databricks, streaming data integration provider Striim today unveiled an improved, more secure connection to Google Cloud with the launch of Striim C Read more…
Redis Delivers First Big Release Under New CEO Trollope
Redis today unleashed a torrent of new functionality in the database, including vector search, native triggers, and a built-in change data capture capability, among others. The features reflect priorities that new CEO Ro Read more…
Microsoft Bolsters Azure Integration with SQL Server 2022
SQL Server 2022 customers will see benefits in the areas of data integration, analytics, governance, and high availability thanks to the database’s enhanced connectivity to the Azure cloud, Microsoft announced today. Read more…
Matillion Debuts Data Integration Service on K8S
Matillion yesterday debuted the Data Productivity Cloud, a new service that brings all of the vendor’s ETL and data integration tools together in a single software as a service (SaaS) offering running atop Kubernetes i Read more…
Exploring the Top Options for Real-Time ELT
Competitive advantage in today’s world rests on a company’s ability to innovate and adapt to a rapidly changing environment. To do that, organizations must adopt real-time thinking in the way they approach the design Read more…
Rockset Delivers Fast Analytics on SQL Server Data
Rockset today announced launched a new integration into SQL Server that will allow customers to continuously index production data as it hits the relational database and deliver sub-second analytics on that data. Rock Read more…
Google Cloud Spanner Gets Change Streams
Prospective Google Cloud Spanner users who want to suck data out of the globally consistent cloud database in real time without resorting to hacking the system will get their wishes following Google Cloud’s announcemen Read more…
Databricks Ships New ETL Data Pipeline Solution
Databricks today announced the general availability (GA) of Delta Live Tables (DLT), a new offering designed to simplify the building and maintenance of data pipelines for extract, transform, and load (ETL) processes usi Read more…
Matillion Unveils Streaming CDC in the Cloud
Matillion made its initial entry into the world of cloud-based ETL at the AWS re:Invent conference in 2015. So it was fitting that the company chose last week’s re:Invent as the venue to announce Matillion Data Loader Read more…
Fivetran Raises $565 Million, Buys CDC Vendor HVR
Fivetran took a big step into the world of enterprise data integration today when it announced an Andreessen Horowitz-led $565 million round of financing and plans to acquire change data capture (CDC) vendor HVR for $700 Read more…
Hands-Off: Manual Data Integration Tasks Plummeting, Gartner Says
While the need to integrate data has never been greater, the addition of machine learning and other forms of automation is driving a large reduction in the amount of manual data management tasks that human workers are re Read more…
Still Wanted: (Much) Better COVID Data
It wasn’t supposed to happen this way. With three effective COVID vaccines, we were supposed to be on the tail-end of the pandemic by now. But that hasn’t happened. Infection rates are increasing quickly as the Delta Read more…
Future Proofing Data Pipelines
Data pipelines are critical structures for moving data from its source to a destination. For decades, companies have used data pipelines to move information, such as from transactional with analytic systems. However, as Read more…
Informatica Accelerates DataOps with Spark, GPUs
Informatica today announced that customers can see up to a 5x performance boost for ETL and data management workloads when they run them under its new cloud-based data integration engine that’s powered by Apache Spark Read more…
To Centralize or Not to Centralize Your Data–That Is the Question
Should you strive to centralize your data, or leave it scattered about? It seems like it should be a simple question, but it’s actually a tough one to answer, particularly because it has so many ramifications for how d Read more…
Poor Data Hurts COVID-19 Contact Tracing, CDC Director Says
The lack of accurate and timely data about COVID-19 infections has hampered the government’s ability to perform contact tracing, the director of the Centers for Disease Control and Prevention, Robert Redfield, testifie Read more…
COVID-19 Has a Data Governance Problem
The COVID-19 pandemic has put many of the world’s activities at a stand-still. Scientists and government officials have been working furiously behind the scenes to forecast the spread and growth of the virus since the Read more…