Tag: apache hadoop

Apache Spark Is Great, But It’s Not Perfect

Apr 3, 2019

Apache Spark is one of the most widely used tools in the big data space, and will continue to be a critical piece of the technology puzzle for data scientists and data engineers for the foreseeable future. Read more…

Here’s What Doug Cutting Says Is Hadoop’s Biggest Contribution

Apr 1, 2019

Apache Hadoop isn’t the center of attention in the IT world anymore, and much of the hype has dissipated (or at least regrouped behind AI). But the open source software project still has a place for on-premises workloads, according to Hadoop co-creator Doug Cutting, who says Hadoop will be remembered most of all for a single contribution it made to IT. Read more…

New Cloudera Plots a Course Toward a Unified Future

Oct 24, 2018

The merger of Hortonworks and Cloudera will eliminate competition in the market for big data platforms and create a clear leader in the space. Once the transaction is complete, the new Cloudera will embark upon the challenging task of merging the two companies’ Read more…

Is Hadoop Officially Dead?

Oct 18, 2018

The merger of Cloudera and Hortonworks was applauded by many people in the big data community, and even Wall Street liked the news initially. But as the confetti from the party clears, some are asking tough questions, like whether the merger signals the death of Hadoop as a viable computing platform moving forward. Read more…

Reaction to Hortonworks-Cloudera Mega Merger

Oct 4, 2018

“I didn’t see this coming.” That was a common reaction to yesterday’s news that Hortonworks and Cloudera are combining forces in a blockbuster $5.2-billion merger. Sentiment was mostly positive, especially among people who worked with the two vendors, but questions remain about the merger’s impact. Read more…

Hadoop 3.0 Ships, But What Does the Roadmap Reveal?

Dec 15, 2017

As promised, the Apache Software Foundation delivered Hadoop version 3.0 before the end of the year. Now the Hadoop community turns its attention to versions 3.1 and 3.2, which are slated to bring even more good stuff during the first half of 2018. Read more…

Hadoop 3.0 Likely to Arrive Before Christmas

Dec 5, 2017

It’s looking like big data developers will get an early holiday present as work on Hadoop version 3.0 nears completion. And while Hadoop 3.0 brings compelling new features, including a 50% increase in capacity and upwards of a 4x improvement in scalability, more exciting stuff – like support for Docker, support for GPUs, and an S3-compatible storage API – Read more…

Application Management Gets Unraveled

Jun 6, 2017

It’s all about enterprise applications, we are told, with big data apps among the most critical. Hence, a growing focus on managing application performance has fueled new monitoring approaches such as operational data science. Read more…

How Pandora Uses Kafka

May 31, 2017

As a big Hadoop user, Pandora Media is no stranger to distributed processing technologies. But when the music streaming service decided to transition its ad tracking system from a batch-oriented system into a real-time one, it brought in a new technological underpinning to serve as the core foundational element. Read more…

Committers Talk Hadoop 3 at Apache Big Data

May 18, 2017

The upcoming delivery of Apache Hadoop 3 later this year will bring big changes to how customers store and process data on clusters. Here at the annual Apache Big Data show in Miami, Florida, a pair of Hadoop project committers from Cloudera shared details on how the changes will impact YARN and HDFS. Read more…