February 20, 2013

Hortonworks Proposes New Hadoop Incubation Projects


Hortonworks today announced the submission of two new incubation projects to the Apache Software Foundation and the launch of the Stinger Initiative, three efforts aimed at enhancing the security and performance of Hadoop applications.

“Our approach to innovation has been consistent from when our team began their work on the Hadoop project at Yahoo! more than seven years ago,” said Greg Pavlik, vice president of engineering for Hortonworks. “Simply put, we believe that the fastest way to innovate is to do our work within the open source community, introduce enterprise feature requirements into the public domain, and collaborate with others to progress existing open source projects or incubate new projects to meet those needs. By staying true to our 100-percent open source philosophy and applying enterprise software rigor to the test and release process, we can continue to accelerate the adoption of Hadoop within mainstream enterprises.”

The proposed incubation projects and the Stinger Initiative follow on the success of similar innovative projects initiated within the community by Hortonworks engineers, including Apache Ambari, Apache HCatalog and Apache Hadoop YARN, which are now foundational elements of Apache Hadoop for the enterprise.

The new efforts focus on enterprise requirements that are essential for broad adoption across the Hadoop ecosystem:

  • The Stinger Initiative to Optimize Apache Hive for Interactive Queries: Stinger represents a concerted effort by Hortonworks and the broader Apache community to improve Hive performance and better serve business intelligence use cases such as interactive data exploration, visualization and parameterized reporting. It is complementary to best-of-breed data warehouse and analytic platforms. Because approximately 50 percent of Hadoop users depend on Hive for SQL-based operational data processing, enhancing Hive’s SQL capabilities and optimizing its query performance for user-focused SQL interactions is critical to ensuring Hive remains the de facto standard for SQL queries with Hadoop.
  • Tez Next-generation Runtime Proposed as Apache Incubator Project: The Tez proposal aims to enhance the performance of Hadoop components that currently run on MapReduce, such as Apache Hive, by providing an alternative, next-generation runtime built on Hadoop YARN that significantly improves latency and throughput of Hadoop applications.
  • Hadoop Gateway Proposed as Apache Incubator Project: The Hadoop Gateway proposal addresses the need for a single point of authentication and secure access for Apache Hadoop services in a cluster, which will simplify Hadoop security both for users who access data and execute jobs and for operators who control and manage the cluster.

Hortonworks has proposed Tez and Hadoop Gateway as incubator projects to the Apache Software Foundation and looks forward to engaging with the community on these proposals.
