Follow Datanami:

Tag: apache hadoop

New Hadoop and Flink Hacks Leveraging Known Configuration Vulnerability

Security researchers at Aqua Nautilus say they are tracking a new set of attacks against Apache Hadoop and Apache Flink applications. The attackers are employing stealthy techniques to exploit a known security vulnerabil Read more…

Machine Learning, from Single Core to Whole Cluster

The demand for production-quality software for mining insights from datasets across scales has exploded in the last several years. The growing size of datasets throughout industry, government, and other fields has increa Read more…

Microsoft Now Developing Its Own Hadoop

Hadoop might be dead, but that’s not stopping public cloud providers from using it. The latest to make a move is Microsoft Azure, which in July announced that it would begin developing its own distribution under its HD Read more…

Google Cloud’s Dataproc Gets a GPU-Powered Spark Boost

Google Cloud’s Dataproc – its big data platform that allows users to run Apache Hadoop and Spark jobs – is getting a boost. Apache Spark 3 and Hadoop 3 have launched general availability, enhancing users’ data an Read more…

Apache Spark Is Great, But It’s Not Perfect

Apache Spark is one of the most widely used tools in the big data space, and will continue to be a critical piece of the technology puzzle for data scientists and data engineers for the foreseeable future. With that said Read more…

Here’s What Doug Cutting Says Is Hadoop’s Biggest Contribution

Apache Hadoop isn't the center of attention in the IT world anymore, and much of the hype has dissipated (or at least regrouped behind AI). But the open source software project still has a place for on-premise workloads, Read more…

New Cloudera Plots a Course Toward a Unified Future

The merger of Hortonworks and Cloudera will eliminate competition in the market for big data platforms and create a clear leader in the space. Once the transaction is complete, the new Cloudera will embark upon the chall Read more…

Is Hadoop Officially Dead?

The merger of Cloudera and Hortonworks was applauded by many people in the big data community, and even Wall Street liked the news initially. But as the confetti from the party clears, some are asking tough questions, li Read more…

Reaction to Hortonworks-Cloudera Mega Merger

"I didn't see this coming." That was a common reaction to yesterday's news that Hortonworks and Cloudera are combining forces in a blockbuster $5.2-billion merger. Sentiment was mostly positive, especially among people w Read more…

Hadoop 3.0 Ships, But What Does the Roadmap Reveal?

As promised, the Apache Software Foundation delivered Hadoop version 3.0 before the end of the year. Now the Hadoop community turns its attention to versions 3.1 and 3.2, which are slated to bring even more good stuff du Read more…

Hadoop 3.0 Likely to Arrive Before Christmas

It's looking like big data developers will get an early holiday present as work on Hadoop version 3.0 nears completion. And while Hadoop 3.0 brings compelling new features, including a 50% increase in capacity and upward Read more…

Application Management Gets Unraveled

It's all about enterprise applications, we are told, with big data apps among the most critical. Hence, a growing focus on managing application performance has fueled new monitoring approaches such as operational data sc Read more…

How Pandora Uses Kafka

As a big Hadoop user, Pandora Media is no stranger to distributed processing technologies. But when the music streaming service decided to transition its ad tracking system from a batch-oriented system into a real-time o Read more…

Committers Talk Hadoop 3 at Apache Big Data

The upcoming delivery of Apache Hadoop 3 later this year will bring big changes to how customers store and process data on clusters. Here at the annual Apache Big Data show in Miami, Florida, a pair of Hadoop project com Read more…

Anatomy of a Hadoop Project Failure

Several years ago, the educational technology company Blackboard selected Apache Hadoop to run a new data analytics application designed to turn data exhaust into actionable insight. Months later, the failed project was Read more…

MapR Unveils Spark-Only Distro

Big data practitioners who want to get started quickly with Apache Spark but don't want to mess around with Hadoop may be interested in new software that MapR Technologies announced today. MapR's new Apache Spark Dist Read more…

Apache Foundation Keeps Eyes Wide Open with ODPi

If you're looking for controversy in the Apache Hadoop community, you need look no further than the 2015 launch of the Open Data Platform Initiative (ODPi), which some perceived as an attempt to wrest control of Apache H Read more…

Hadoop 3 Poised to Boost Storage Capacity, Resilience with Erasure Coding

The next major version of Apache Hadoop could effectively double storage capacity while increasing data resiliency by 50 percent through the addition of erasure coding, according to a presentation at the Apache Big Data Read more…

Apache’s Wacky But Winning Recipe for Big Data Development

When Doug Cutting set out to develop an open source Web search engine in the late 1990s, he initially chose the GPL license to distribute his wares. When that failed, he decided to give the Apache Software Foundation a s Read more…

ODPi Offers Olive Branch to Apache Software Foundation

The rift between the Open Data Platform Initiative (ODPi) and the Apache Software Foundation (ASF) is on the mend, thanks in part to a peace offering by ODPi, an admission of being indelicate, and a $40,000 check. It may Read more…

Datanami