Follow Datanami:

Tag: Hadoop

Putting Your Data On the Table

One of the big breakthroughs in data engineering over the past seven to eight years is the emergence of table formats. Typically layered atop column-oriented Parquet files, table formats like Apache Iceberg, Delta, and A Read more…

New Hadoop and Flink Hacks Leveraging Known Configuration Vulnerability

Security researchers at Aqua Nautilus say they are tracking a new set of attacks against Apache Hadoop and Apache Flink applications. The attackers are employing stealthy techniques to exploit a known security vulnerabil Read more…

How Acceldata Helped T-Mobile’s Data Modernization Strategy

When T-Mobile started migrating some of its data estate from an on-prem Hadoop system to cloud-based data platforms, it found the move liberating. But as it settled into a hybrid-cloud world, T-Mobile realized costs were Read more…

Meet 2023 Person to Watch Justin Borgman

Following the dissolution of the Hadoop elephant, Presto, the successor to Apache Hive, emerged as one of the most promising open source projects. As the CEO and co-founder of Starburst, the largest commercial entity beh Read more…

Duke Energy, GE Tap AWS to Help Balance the Green Grid of the Future

The electricity grid of the future will not only be powered from greener sources, but it will also be several times bigger than it is today, thanks mandates to eliminate the use of gasoline and diesel for transportation Read more…

Snowflake Reflects on 10 Years Passed, Ponders 10 Years Ahead

When Snowflake Computing was founded 10 years ago, the big data market looked much different than it does today. Momentum was building behind something called Hadoop, while cloud computing was viewed with suspicion. Desp Read more…

EMR Serverless Now Available from AWS

Amazon EMR, which ostensibly is the world’s most popular hosted Hadoop environment, is now generally available as a serverless offering, AWS announced today. Amazon EMR Serverless will save customers time and money Read more…

In Search of the Data Dream Team

When it comes to succeeding at big data, the people you put in place are just as important--if not more important--than the products and technologies you use. One of the folks exploring the intersection of people and dat Read more…

Onehouse Emerges from Stealth to Deliver Data Lakes in ‘Months, Not Years’

Onehouse, a data lakehouse management company, has emerged from stealth today with $8 million in seed funding from investment firms Greylock Ventures and Addition. Onehouse’s cloud-native managed lakehouse service i Read more…

2022 Big Data Predictions from the Cloud

The pandemic marked an inflection point for the growth of cloud platforms in 2020, as organizations scrambled to keep their applications running. That general pattern has continued through 2021, and the cloud looms even Read more…

LinkedIn Open Sources Tech Behind 10,000-Node Hadoop Cluster

LinkedIn last week open sourced DynoYARN, a key piece of technology that allows it to predict how appliacation performance will be impacted as it scales Hadoop to gargantuan proportions, including one 10,000-node cluster Read more…

Who’s Winning in Open Source Data Tech

Organizations understandably want the innovation that comes with using open source data management tech, but how do you know when it’s too green for adoption? A new survey from OpenLogic seeks the opinions of the masse Read more…

COVID-Driven Cloud Surge Takes a Toll on the Data

The stepped-up pace of migration to the cloud over the past 18 months wasn’t exactly forced, but neither was it completely voluntary. For many companies, accelerating digital transformation during COVID was a matter of Read more…

Starburst Backs Data Mesh Architecture

The emerging data mesh architecture has the potential to keep AI and analytics projects moving forward even as data storage and processing continues to disperse far and wide. One independent backer of the data mesh conce Read more…

Cloudera To Go Private in $5.3 Billion Buyout by Wall Street Firms

The long tale of Hadoop got another twist today when Cloudera announced that it has agreed to be acquired by a pair of investment firms for $5.3 billion. The acquiring companies, Clayton, Dubilier & Rice (CD&R) a Read more…

Do Customers Want Open Data Platforms?

Snowflake turned some heads in the big data market with recent blog posts and articles that cast doubt on the benefits of open data architectures. Customers should “choose open wisely,” the data warehouse giant says, Read more…

Drowning In a Data Lake? Gartner Analyst Offers a Life Preserver

Gartner this week convened its annual Data and Analytics Summit Americas conference, which was held online again due to the coronavirus pandemic. From the role of AI in data management to avoiding data lake failures, Gar Read more…

Data Privacy Startup Privacera Raises More Cash

Growing demand for data governance, privacy and security tools needed to comply with an expanding list of national data-privacy rules continues to attract investors as more sensitive data shifts to the cloud. The late Read more…

How Jumping into Data Lake Automation Can Help Enterprises Ride the Wave

According to Gartner, over 90% of deployed data lakes will become useless as they are overwhelmed with information assets captured for uncertain use cases. This is indeed, an alarming situation. While companies are spend Read more…

Apache Iceberg: The Hub of an Emerging Data Service Ecosystem?

Engineers at Netflix and Apple created Apache Iceberg several years ago to address the performance and usability challenges of using Apache Hive tables in large and demanding data lake environments. Now the data table fo Read more…

Datanami