
Hortonworks Hocks Hadoop Upgrade
Apache Hadoop contributor Hortonworks has announced version 2 of the Hortonworks Data Platform (HDP). HDP v2 will use the most recent version of Hadoop (0.23). According to the Apache Software Foundation, the curators and cultivators of Hadoop, the newest release is enterprise ready.
The Hortonworks Data Platform, which is powered by Hadoop, is the company’s scalable open source platform for handling big enterprise and research data. As with the other Hadoop distros floating around out there, the key to the platform’s success is its ability to integrate data from just about any source imaginable and provide a simpler way to make use of it.
The company describes how it differentiates itself from others offering Hadoop simplification for the enterprise, noting:
“Unlike other Hadoop solutions that lock away management features within proprietary extensions, Hortonworks Data Platform includes Ambari, an open source installation and management system out of the box. Hortonworks Data Platform also includes HCatalog, a metadata management service for simplifying data sharing between Hadoop and other enterprise information systems, along with a complete set of open APIs, including WebHDFS and those for Ambari and HCatalog, to make it easier for ISVs to integrate and extend Apache Hadoop.”
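To give a rough sense of what an open REST API such as WebHDFS looks like to an integrator, the sketch below builds a request URL in the `/webhdfs/v1/<path>?op=...` form that WebHDFS uses. The host name, port, and HDFS path are placeholder assumptions for illustration and do not come from Hortonworks; the helper name `webhdfs_url` is likewise hypothetical.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op, **params):
    """Build a WebHDFS-style URL: http://host:port/webhdfs/v1/<path>?op=<OP>."""
    if not path.startswith("/"):
        raise ValueError("HDFS path must be absolute")
    # 'op' selects the file system operation, e.g. LISTSTATUS or OPEN.
    query = urlencode({"op": op, **params})
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?{query}"


# Placeholder host, port, and path (assumptions, not real endpoints).
print(webhdfs_url("namenode.example.com", 50070, "/user/demo", "LISTSTATUS"))
```

An HTTP GET against such a URL would ask the cluster to list the directory; the sketch only constructs the address and does not contact any server.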
On Jan. 6, the Apache Software Foundation made news by announcing Hadoop v1.0 after six years of development, a release that brought a number of notable new features and enhancements. With the release of Hadoop version 0.23, improvements have been made to both HDFS and MapReduce, including:
- NextGen MapReduce (also known as YARN)
- HDFS Federation, which allows Namenodes to act independently and without coordination with each other
- Splitting the MapReduce JobTracker into two components (resource management and life-cycle management)
- The ResourceManager now manages the global assignment of compute resources to applications, while a per-application ApplicationMaster manages scheduling and coordination
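The division of labor described in the last two items can be pictured with a toy model. This is a minimal sketch, not Hadoop code: the class and method names below are hypothetical and merely mirror the roles named above, and "containers" are reduced to a simple counter.

```python
from dataclasses import dataclass, field


@dataclass
class ResourceManager:
    """Toy global resource manager: it only hands out and reclaims containers."""
    total_containers: int
    allocated: dict = field(default_factory=dict)  # app_id -> containers granted

    def request(self, app_id, wanted):
        free = self.total_containers - sum(self.allocated.values())
        granted = min(wanted, free)
        self.allocated[app_id] = self.allocated.get(app_id, 0) + granted
        return granted

    def release(self, app_id):
        self.allocated.pop(app_id, None)


@dataclass
class ApplicationMaster:
    """Toy per-application master: it schedules its own tasks on what it was granted."""
    app_id: str
    tasks: list

    def run(self, rm):
        granted = rm.request(self.app_id, len(self.tasks))
        try:
            if granted == 0:
                return []
            # Run tasks in waves, each wave no larger than the granted containers.
            return [self.tasks[i:i + granted]
                    for i in range(0, len(self.tasks), granted)]
        finally:
            rm.release(self.app_id)


rm = ResourceManager(total_containers=4)
am = ApplicationMaster("job-1", ["t1", "t2", "t3", "t4", "t5", "t6"])
print(am.run(rm))
```

The point of the split is that the resource manager never needs to know what a task is; it tracks only capacity, while each application decides how to use its share.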
According to Eric Baldeschwieler, CEO of Hortonworks, “With more than three years of development and much anticipation, Apache Hadoop 0.23 delivers important advancements in scalability, performance, high availability and data integrity.”
He continued, “Apache Hadoop 0.23 is currently being tested across hundreds of applications in the world’s largest Hadoop deployment. We are excited to make the technology advancements in Apache Hadoop 0.23 available through an easily consumable version via the Hortonworks Data Platform v2.”
HDP was created to be an extremely scalable and fully open source platform for the storage, processing and analysis of large-scale data. Along with HDFS and MapReduce, Hortonworks Data Platform includes Pig, Hive, HBase and ZooKeeper.
Hortonworks was created by Yahoo! and Benchmark Capital to facilitate Apache Hadoop development. The company provides technical support, training and certifications for vendors, enterprises, service providers and systems integrators.