Hortonworks Hocks Hadoop Upgrade
Apache Hadoop contributor Hortonworks announced Hortonworks Data Platform version 2. HDPv2 will be using the most recent version of Hadoop (0.23). According to the Apache Software Foundation, curators and cultivators of Hadoop, the newest release is enterprise ready.
The Hortonworks Data Platform, which is powered by Hadoop, is the company’s scalable open source platform for handling big enterprise and research data. As with the other Hadoop distros floating around out there, the key to the success of the platform is the ability to integrate data from just about any source imaginable and provide a more simplified way to make use of it.
The company describes how they differentiate themselves from others offering Hadoop simplification for the enterprise, noting:
“Unlike other Hadoop solutions that lock away management features within proprietary extensions, Hortonworks Data Platform includes Ambari, an open source installation and management system out of the box. Hortonworks Data Platform also includes HCatalog, a metadata management service for simplifying data sharing between Hadoop and other enterprise information systems, along with a complete set of open APIs, including WebHDFS and those for Ambari and HCatalog, to make it easier for ISVs to integrate and extend Apache Hadoop.”
On Jan.6th, when the Apache Software Foundation made news announcing Hadoop v1.0 after 6 years of development, a number of notable new features and enhancements were made. With the release of Hadoop version 0.23, improvements have been made to both HDFS and MapReduce including:
- NextGen MapReduce (also known as YARN)
- HDFS Federation, which allows Namenodes to act independently and without coordination with eachother
- Splitting MapReduce JobTracker into 2 components (resource management and life-cycle management)
- The Resource manager will now manage global assignment of compute resources for each application while ApplicationMaster will manage scheduling and coordination.
According to Eric Baldeschwieler, CEO of Hortonworks, “With more than three years of development and much anticipation, Apache Hadoop 0.23 delivers important advancements in scalability, performance, high availability and data integrit.
He continued, “Apache Hadoop 0.23 is currently being tested across hundreds of applications in the world’s largest Hadoop deployment. We are excited to make the technology advancements in Apache Hadoop 0.23 available through an easily consumable version via the Hortonworks Data Platform v2.”
HDP was created to extremely scalable and fully open-source platform for storage, processing, analysis of large scale data. Along with HDFS and MapReduce, Hortonworks Data Platform includes Pig, Hive, HBase and Zookeeper.
Hortonworks was created by Yahoo! and Benchmark Capital to facilitate Apache Hadoop development. They provide tech support, training and certifications for vendors, enterprises, service providers and systems integrators.
Related Stories
Hadoop Hits Primetime with Production Release
RainStor Brings Database to Hadoop
Karmasphere Ushers in New Hadoop Partner
September 6, 2024
- Hakkoda Demonstrates Strong Business Momentum in FY 2023-2024
- Elastic Accelerates Logs Onboarding with Automatic Import Powered by Search AI
- Domo Partners with Brooklyn Data to Simplify the AI and Data Journey for Businesses
- SingleStore Now Returns to the Chase Center, Diving Deeper Into Building Intelligent Applications at Enterprise Scale
- Informatica Recognized as an Industry Leader in Enterprise Data Catalogs
- Fosfor Launches Decision Cloud with Snowflake Integration, Amplifying Business Outcomes
- DiffusionData Launches 6.11 with New Monitoring and Filtering Capabilities
September 5, 2024
- BCG Experiment Reveals GenAI’s Impact on Expanding Workers’ Skillsets
- Domino Data Lab Named a Leader 2 Years in a Row Across Multiple Dresner Advisory Reports on AI
- Dresner Advisory Publishes 2024 ModelOps, AI, Data Science, and ML Research
- Red Hat Enterprise Linux AI Now Available Across Hybrid Cloud for Gen AI Model Development
- HPE Introduces One-Click-Deploy AI Applications in HPE Private Cloud AI
- Aerospike Year-over-Year Recurring Revenue Soars by 51% as Demand for AI Rises
- Dataset Providers Alliance Releases Comprehensive AI Data Licensing Position Paper
- InfluxData Enhances InfluxDB 3.0 for High-Performance Time Series Workloads at Scale
September 4, 2024
- Revefi Announces $20M Series A Funding for AI Data Engineering Innovation
- Acceldata Executives to Present on the Power of Data Observability and AI at 2024 Industry Events
- GridGain Sponsoring Strategic AI and Kafka Conferences This Month
- Carruthers and Jackson’s CDO Summer School Inaugurates 500 New Data Leaders
- O’Reilly to Host Free Online Event Discussing Deepfakes and Regulatory Challenges in AI
Most Read Features
Sorry. No data so far.
Most Read News In Brief
Sorry. No data so far.
Most Read This Just In
Sorry. No data so far.
Sponsored Partner Content
-
Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!
-
Supercharge Your Data Lake with Spark 3.3
-
Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]
-
Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]
-
Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023
-
The Art of Mastering Data Quality for AI and Analytics
Sponsored Whitepapers
Contributors
Featured Events
-
Efficient Generative AI Summit
September 9 -
AI Hardware & Edge AI Summit 2024
September 10 - September 12San Jose CA United States -
HPC + AI Wall Street 2024
September 17 - September 18 -
CDAO Government 2024
September 18 - September 19Washington DC United States -
AI & Big Data Expo Europe 2024
October 1 - October 2Netherlands