Tag: Flink
New Hadoop and Flink Hacks Leveraging Known Configuration Vulnerability
Security researchers at Aqua Nautilus say they are tracking a new set of attacks against Apache Hadoop and Apache Flink applications. The attackers are employing stealthy techniques to exploit a known security vulnerabil Read more…
EMR Serverless Now Available from AWS
Amazon EMR, which ostensibly is the world’s most popular hosted Hadoop environment, is now generally available as a serverless offering, AWS announced today. Amazon EMR Serverless will save customers time and money Read more…
Kubernetes Adoption Widespread for Big Data, But Monitoring and Tuning Are Issues, Survey Finds
Kubernetes may be a complex piece of software that can be difficult to monitor and manage. But the benefits of running applications in the popular container orchestration system appear to outweigh the disadvantages, beca Read more…
Flink Gets Extension into Stateful Functions
Ververica, the company behind open source Apache Flink, this week unveiled Stateful Functions, a new framework designed to extend Flink into the world of distributed, stateful applications. Stateful Functions is a col Read more…
What’s Behind Lyft’s Choices in Big Data Tech
Lyft was a late entrant to the ride-sharing business model, at least compared to its competitor Uber, which pioneered the concept and remains the largest provider. That delay in starting out actually gave Lyft a bit of a Read more…
How Netflix Optimized Flink for Massive Scale on AWS
When it comes to streaming data, it's tough to find a company operating on a more massive scale than Netflix, which streams more than 125 million hours of TV shows and movies -- per day. Netflix captures billions of pi Read more…
ParallelM Aims to Close the Gap in ML Operationalization
A startup named ParallelM today unveiled new software aimed at alleviating data scientists from the burden of manually deploying, monitoring, and managing machine learning pipelines in production. Dubbed MLOps, Parall Read more…
Google/ASF Tackle Big Computing Trade-Offs with Apache Beam 2.0
Trade-offs are a part of life, in personal matters as well as in computers. You typically cannot have something built quickly, built inexpensively, and built well. Pick two, as your grandfather would tell you. But appare Read more…
2017 Is the Year of AI. Or Is It?
The media often likes to proclaim "The Year of This" or "The Year of That." With the greater attention given to advancing capabilities in artificial intelligence and machine learning, it seemed like a no-brainer to decla Read more…
Flink Distro Now Available from data Artisans
The company behind Apache Flink, data Artisans, today launched the first commercial distribution of the upstart stream processing engine. The new software product, called the dA Platform, is identical to the open source Read more…
Tracking the Ever-Shifting Big Data Bottleneck
Bottlenecks are a fact of life in IT. No matter how fast you build something, somebody will find a way to max it out. While the performance headroom has been elevated dramatically since Hadoop introduced distributed comp Read more…
Merging Batch and Stream Processing in a Post Lambda World
It wasn't long ago that developers looked to the Lamba architecture for hints on how to design big data applications that needed elements of both batch and streaming data. But already, the Lamba architecture is falling o Read more…
Apache Beam’s Ambitious Goal: Unify Big Data Development
If you're tired of using multiple technologies to accomplish various big data tasks, you may want to consider Apache Beam, a new distributed processing tool from Google that's now incubating at the ASF. One of the cha Read more…