
Tag: Flink
Ververica, the company behind open source Apache Flink, this week unveiled Stateful Functions, a new framework designed to extend Flink into the world of distributed, stateful applications.
Stateful Functions is a collection of tools designed to give developers the ability to create stateful applications that run in the modern serverless manner. Read more…
Lyft was a late entrant to the ride-sharing business model, at least compared to its competitor Uber, which pioneered the concept and remains the largest provider. That delay in starting out actually gave Lyft a bit of an advantage in terms of architecting its big data infrastructure in the cloud, as it was able to sidestep some of the challenges that Uber faced in building out its on-prem system. Read more…
When it comes to streaming data, it’s tough to find a company operating on a more massive scale than Netflix, which streams more than 125 million hours of TV shows and movies — Read more…
A startup named ParallelM today unveiled new software aimed at relieving data scientists of the burden of manually deploying, monitoring, and managing machine learning pipelines in production.
Dubbed MLOps, ParallelM’s software helps to automate many of the operational tasks required to turn a machine learning model from a promising piece of code running on Spark, Flink, TensorFlow, or PyTorch processing engines into a secure, governed, and production-ready machine learning system. Read more…
Trade-offs are a part of life, in personal matters as well as in computers. You typically cannot have something built quickly, built inexpensively, and built well. Pick two, as your grandfather would tell you. Read more…
The media often likes to proclaim “The Year of This” or “The Year of That.” With the greater attention given to advancing capabilities in artificial intelligence and machine learning, it seemed like a no-brainer to declare 2017 “The Year of AI.” Read more…
The company behind Apache Flink, data Artisans, today launched the first commercial distribution of the upstart stream processing engine. The new software product, called the dA Platform, is identical to the open source Flink project at this time, and comes with 24/7 technical support from the team that develops Flink. Read more…
Bottlenecks are a fact of life in IT. No matter how fast you build something, somebody will find a way to max it out. While the performance headroom has been elevated dramatically since Hadoop introduced distributed computing to the commodity masses, the bottleneck has shifted, but it hasn’t disappeared. Read more…
It wasn’t long ago that developers looked to the Lambda architecture for hints on how to design big data applications that needed elements of both batch and streaming data. But already, the Lambda architecture is falling out of favor, especially in light of a new crop of frameworks like Apache Spark and Apache Flink that can do it all. Read more…
If you’re tired of using multiple technologies to accomplish various big data tasks, you may want to consider Apache Beam, a new distributed processing tool from Google that’s now incubating at the ASF. Read more…