Tag: yarn

Committers Talk Hadoop 3 at Apache Big Data

May 18, 2017 |

The upcoming delivery of Apache Hadoop 3 later this year will bring big changes to how customers store and process data on clusters. Here at the annual Apache Big Data show in Miami, Florida, a pair of Hadoop project committers from Cloudera shared details on how the changes will impact YARN and HDFS. Read more…

Can Hadoop Be Simple Again?

Sep 19, 2016 |

In the beginning, Hadoop had two pieces: HDFS and MapReduce. Developers knew how to use them to build applications, and IT teams knew what it took to operate them. Fast forward to 2016, and developers have a cornucopia of technologies and frameworks at their disposal. Read more…

From Spiders to Elephants: The History of Hadoop

Apr 15, 2015 |

Have you ever wonder where this thing called Hadoop came from, or even why it’s here? Marko Bonaci has wondered such things, too. In fact, he wondered about them so much that he decided to write a History of Hadoop chapter for his upcoming book, “Spark in Action.” Read more…

Hortonworks Hatches a Roadmap to Improve Apache Spark

Sep 24, 2014 |

Hortonworks today issued a broad and detailed roadmap outlining the investment it would like to see made to Apache Spark, the in-memory processing framework that has become one of Hadoop’s most popular subprojects. Read more…

Enforcing Hadoop SLAs in a Big YARN World

Jul 23, 2014 |

The Apache Hadoop community has done a truly amazing job developing a scalable and versatile platform for big data analytic workloads. And with the recent introduction of YARN in Hadoop 2, we’re now able to run multiple analytic engines on our clusters simultaneously. Read more…

Moving Beyond ‘Traditional’ Hadoop: What Comes Next?

Jun 5, 2014 |

The phrase “traditional Hadoop” was heard early and often at this week’s 2014 Hadoop Summit. While first-generation Hadoop technologies unlocked previously unseen potential in big data sets, it pales in comparison to what we’ll see with second-gen Hadoop, we’re told. Read more…

Yahoo: We Run the Whole Company on Hadoop

Jun 4, 2014 |

Hadoop is absolutely critical to the operations of Yahoo, executives with the company said this week at the Hadoop Summit. While the company, which spun out Hortonworks in 2011, is moving away from “traditional” Hadoop components like MapReduce in favor of YARN, Tez, and Hive, the Hadoop platform remains absolutely core to its operations. Read more…

Hortonworks Spins Up a YARN Readiness Program

Jun 3, 2014 |

Hortonworks today launched its YARN Ready Program to help Hadoop application vendors adopt the technology that’s at the heart of the modern Hadoop v2 infrastructure.

YARN is the key piece of technology that enables multiple data engines to run simultaneously on the same Hadoop cluster, and is the centerpiece of Hadoop v2, which launched last October. Read more…

Big Data Confusion Drives MongoDB and Cloudera Together

Apr 29, 2014 |At first glance, the partnership that Cloudera and MongoDB unveiled today is a bit of a head scratcher. While the two companies are arguably the biggest software vendors in the nascent space, they swim in opposite ends of the big data pool. It turns out, that's exactly why the companies felt they needed to work together. Read more…

Hortonworks Keen on Cascading-Tez Combo

Apr 21, 2014 |In the future, it will be easier to build big data applications, and they'll run faster and utilize more real-time data than today's apps, too. Two vendors working to make that future a reality, Hortonworks and Concurrent, today announced they'll work together to build and assemble the next generation of Hadoop apps running on YARN, Tez, and Apache Spark. Read more…