Follow Datanami:

Tag: yarn

Big Data Confusion Drives MongoDB and Cloudera Together

At first glance, the partnership that Cloudera and MongoDB unveiled today is a bit of a head scratcher. While the two companies are arguably the biggest software vendors in the nascent space, they swim in opposite ends of the big data pool. It turns out, that's exactly why the companies felt they needed to work together. Read more…

Hortonworks Keen on Cascading-Tez Combo

In the future, it will be easier to build big data applications, and they'll run faster and utilize more real-time data than today's apps, too. Two vendors working to make that future a reality, Hortonworks and Concurrent, today announced they'll work together to build and assemble the next generation of Hadoop apps running on YARN, Tez, and Apache Spark. Read more…

Hortonworks Drives Stinger Home with HDP 2.1

Hortonworks today unveiled a major new release of its Hadoop distribution that puts significant new capabilities into the hands of its customers. The speed and scale of SQL processing in Apache Hive were improved with the final phase of the Stinger initiative, while the additions of Apache Storm and Apache Solr in HDP 2.1 open up new ways for customers to manipulate their data. Security and data governance were bolstered with Apache Knox and Apache Falcon, respectively, while Apache Spark is now available as a tech preview. Read more…

Pivotal Refreshes Hadoop Offering, Adds In-Memory Processing

As the commercial Hadoop field grows increasingly competitive, providers of this popular big data framework are working to differentiate their offerings. Pivotal, for one, has been honing the technologies developed and acquired by its parent companies as part of its vision to help enterprises realize the full potential of big data. Read more…

MapR Embraces Co-Existence with Hadoop Update

MapR Technologies today unveiled new products based on the Hadoop version 2 codebase that it says will allow customers to continue to run MapReduce version 1 applications while also reaping the rewards of a post-YARN Hadoop world. The company also announced the capability to run the HP Vertica columnar analytic database directly on its Hadoop stack. Read more…

HDP 2.0: Rise of the Hadoop Data Lake

Hortonworks became the first Hadoop distributor to ship the new Hadoop version 2 software today when it announced the general availability of Hortonworks Data Platform (HDP) 2.0. The update will enable customers with small Hadoop clusters to upgrade their big data platform into a shared Hadoop service, or a data lake, a Hortonworks executive explains. Read more…

HortonWorks Reaches Out to SAS and Storm

Hortonworks this week revealed a new partnership with SAS that will enable the analytics giant to use its tools to analyze data stored in Hortonworks' Hadoop distribution. It also announced plans to integrate the Apache Storm stream processing engine into its distribution, and to ship a preview by the end of the year. Read more…

Hadoop Version 2: One Step Closer to the Big Data Goal

The wait for Hadoop 2.0 ended yesterday when the Apache Software Foundation (ASF) announced the availability of the new big data platform. Among the most anticipated features is the new YARN scheduler, which will make it easier for users to run different workloads--such as MapReduce, HBase, SQL, graph analysis, and stream processing--on the same hardware, and all at the same time. Better availability and scalability, and a smoother upgrade process, round out the new platform, as Hadoop creator Doug Cutting explains, but still not everybody is happy with Hadoop. Read more…

LinkedIn Open Sources Samza Stream Processor

LinkedIn has donated Samza, its lightweight, real-time stream processing framework, to the Apache Software Foundation, thereby putting a promising new Hadoop engine into the open source realm for anybody to use. Read more…

Yahoo! Spinning Continuous Computing with YARN

YARN was the big news this week, with the announcement that the Hadoop resource manager is finally hitting the streets as part of the Hortonworks Data Platform (HDP) “Community Preview.” According to Bruno Fernandez-Ruiz, who spoke at Hadoop Summit this week, Yahoo! has been able to leverage YARN to transform the processing in their Hadoop cluster from simple, stodgy MapReduce, to a nimble micro-batch engine processing machine – a change which they refer to as “continuous computing.” Read more…

The Art of Scheduling in Big Data Infrastructures Today

Arun Murthy, architect with Hortonworks, said recently that the Hadoop community wanted to “fundamentally re-architect Hadoop...in a way where multiple types of applications can operate efficiently and predictable within the same cluster”. The starting point to do this, he says, is YARN, which has the potential to “turn Hadoop from a single application system to a multi-application operating system”. Fritz Ferstl, CTO with Univa argues that such efforts may run the risk of reinventing the wheel. Read more…

YARN to Spin Hadoop into Big Data Operating System

Hadoop is about to see a fundamental reset in its base functionality, says Arun Murthy, architect with Hortonworks and the Apache Software Foundation, who says that SQL in Hadoop via YARN is a part of the core of this metamorphosis. Read more…

Baldeschwieler: Looking at the Future of Hadoop

Hadoop has come a long way, and with projects currently underway it’s got plenty of fuel to drive enterprise innovation for years to come said Hortonworks co-founder and CTO, Eric Baldeschwieler in his recent Hadoop Summit Keynote in Amsterdam, Netherlands. Read more…

Intel Hitches Xeon to Hadoop Wagon

No longer content to sit on the sidelines of major Hadoop events, Intel has unveiled its strategy to tap into the momentum around the open source platform with its own purpose-built distribution. Read more…

Apache Hadoop 2.0.3-Alpha Released With Future Outlook

The next generation of the Apache Hadoop open-source software framework has been given an alpha release and set free in the wild, delivering the next major milestone for the Apache Hadoop community. Read more…

Datanami