
Tag: streaming data
Confluent Works to Hide Streaming Complexity
Technical complexity is inherent when building distributed stream processing systems, particularly when integrating new real-time applications with traditional IT systems. But Confluent, the company behind Apache Kafka, Read more…
Analytics Predictions for 2023
The world is awash in data, and the pace of data generation is increasing. You don’t need a crystal ball to tell you that. But what will the new year bring in the field of big data analytics? We leave that to our panel Read more…
New Flink Startup Immerok Gets Off the Ground
There’s a new startup dedicated to making a business out of Apache Flink, the distrubuted stream processing framework and dataflow architecture that emerged from Germany over 10 years ago. With offices in Berlin and Ne Read more…
Developing Kafka Data Pipelines Just Got Easier
Developing data pipelines in Apache Kafka just got easier thanks to the launch of Stream Designer, a graphical design tool that is now available on Confluent Cloud. Confluent also used this week’s Kafka conference, dub Read more…
Deephaven Streamlines Access to Real-Time Analytics Platform
Getting Deephaven’s real-time analytics system up and running will be easier thanks to a new installation technique using a standard Python library. The open source software also sports a new integration with Jupyter a Read more…
Can Streaming Graphs Clean Up the Data Pipeline Mess?
The previously separate worlds of graph databases and streaming data are coming together in an open source project called Quine. According to its creator, the Akka-based distributed framework is capable of returning Cyph Read more…
StarTree Keeps Real-Time Analytics Fresh with New Options for Pinot
Since it was created at LinkedIn in 2015, interest in Apache Pinot--the distributed storage and analytics engine for real-time analytics--has steadily grown. But over the past year, the number of downloads and active par Read more…
Apache Druid Gets Multi-Stage Reporting Engine, Cloud Service from Imply
Imply, the Northern California company behind the open source analytics database called Apache Druid, made a pair of announcements today during a virtual event, including the beta of a new multi-stage query engine as wel Read more…
It’s About Time for InfluxData
These are heady times for InfluxDB, which is the world’s most popular time-series database, which has been the fastest growing category of databases the past two years, per DB-Engines.com. But when Paul Dix and his par Read more…
Matillion Unveils Streaming CDC in the Cloud
Matillion made its initial entry into the world of cloud-based ETL at the AWS re:Invent conference in 2015. So it was fitting that the company chose last week’s re:Invent as the venue to announce Matillion Data Loader Read more…
Confluent Ships ‘Cluster Linking’ in Kafka Platform Update
Companies that are pushing the envelope in terms of the scale of their real-time streaming data deployments will be happy to know they can now run a Kafka cluster that spans on-prem and cloud resources via cluster linkin Read more…
Did Rockset Just Solve Real-Time Analytics?
Companies have been pushing the envelope of real-time analytics and what technology is capable of for many years. Along that vein, Rockset today claimed it has smashed through the barriers preventing customers from runni Read more…
Hazelcast Platform to Bring Historical, Real-Time Data Together
Hazelcast is best known as a developer of in-memory data grid (IMDB) technology, a RAM-loving layer for speeding up operational applications. But with the Hazelcast Platform launch currently slated for September, the San Read more…
Intimidated by Kafka? Check Out Confluent’s New Developer Site
If you’re new to Kafka and developing event streaming platforms, it’s easy to get overwhelmed. This is an entirely different architecture than what most developers are used to, after all. But that fear should be only Read more…
Confluent Raises More Than $800M in IPO
Confluent, the company behind Apache Kafka, can now count a successful IPO among its accomplishments, with more than $820 million in shares sold since debuting on the stock market last week. Confluent, which trades on Read more…
DataStax Taps Pulsar for Streaming Data Platform
DataStax today unveiled Astra Streaming, a new event streaming platform based on Apache Pulsar, a publish and subscribe (pub-sub) data platform that competes with Apache Kafka. Astra Streaming is pre-integrated in the cl Read more…
Three Takeaways from Jay Kreps’ Kafka Summit Keynote
We’re always interested in hearing what Apache Kafka co-creator Jay Kreps has to say, since he’s one of the smartest folks in distributed systems today. The Confluent CEO didn’t disappoint in delivering insight int Read more…
With Integration Complete, Cloudera Re-Launches SQL Stream Builder
In October, Cloudera acquired Eventador, which developed a SQL-based streaming data analysis solution based on Apache Flink. With the work to integrate that product with the Cloudera Data Platform (CDP) complete, the com Read more…
How Jumping into Data Lake Automation Can Help Enterprises Ride the Wave
According to Gartner, over 90% of deployed data lakes will become useless as they are overwhelmed with information assets captured for uncertain use cases. This is indeed, an alarming situation. While companies are spend Read more…