Follow Datanami:

Tag: Hadoop

Snowflake Reflects on 10 Years Passed, Ponders 10 Years Ahead

When Snowflake Computing was founded 10 years ago, the big data market looked much different than it does today. Momentum was building behind something called Hadoop, while cloud computing was viewed with suspicion. Desp Read more…

EMR Serverless Now Available from AWS

Amazon EMR, which ostensibly is the world’s most popular hosted Hadoop environment, is now generally available as a serverless offering, AWS announced today. Amazon EMR Serverless will save customers time and money Read more…

In Search of the Data Dream Team

When it comes to succeeding at big data, the people you put in place are just as important--if not more important--than the products and technologies you use. One of the folks exploring the intersection of people and dat Read more…

Onehouse Emerges from Stealth to Deliver Data Lakes in ‘Months, Not Years’

Onehouse, a data lakehouse management company, has emerged from stealth today with $8 million in seed funding from investment firms Greylock Ventures and Addition. Onehouse’s cloud-native managed lakehouse service i Read more…

2022 Big Data Predictions from the Cloud

The pandemic marked an inflection point for the growth of cloud platforms in 2020, as organizations scrambled to keep their applications running. That general pattern has continued through 2021, and the cloud looms even Read more…

LinkedIn Open Sources Tech Behind 10,000-Node Hadoop Cluster

LinkedIn last week open sourced DynoYARN, a key piece of technology that allows it to predict how appliacation performance will be impacted as it scales Hadoop to gargantuan proportions, including one 10,000-node cluster Read more…

Who’s Winning in Open Source Data Tech

Organizations understandably want the innovation that comes with using open source data management tech, but how do you know when it’s too green for adoption? A new survey from OpenLogic seeks the opinions of the masse Read more…

COVID-Driven Cloud Surge Takes a Toll on the Data

The stepped-up pace of migration to the cloud over the past 18 months wasn’t exactly forced, but neither was it completely voluntary. For many companies, accelerating digital transformation during COVID was a matter of Read more…

Starburst Backs Data Mesh Architecture

The emerging data mesh architecture has the potential to keep AI and analytics projects moving forward even as data storage and processing continues to disperse far and wide. One independent backer of the data mesh conce Read more…

Cloudera To Go Private in $5.3 Billion Buyout by Wall Street Firms

The long tale of Hadoop got another twist today when Cloudera announced that it has agreed to be acquired by a pair of investment firms for $5.3 billion. The acquiring companies, Clayton, Dubilier & Rice (CD&R) a Read more…

Do Customers Want Open Data Platforms?

Snowflake turned some heads in the big data market with recent blog posts and articles that cast doubt on the benefits of open data architectures. Customers should “choose open wisely,” the data warehouse giant says, Read more…

Drowning In a Data Lake? Gartner Analyst Offers a Life Preserver

Gartner this week convened its annual Data and Analytics Summit Americas conference, which was held online again due to the coronavirus pandemic. From the role of AI in data management to avoiding data lake failures, Gar Read more…

Data Privacy Startup Privacera Raises More Cash

Growing demand for data governance, privacy and security tools needed to comply with an expanding list of national data-privacy rules continues to attract investors as more sensitive data shifts to the cloud. The late Read more…

How Jumping into Data Lake Automation Can Help Enterprises Ride the Wave

According to Gartner, over 90% of deployed data lakes will become useless as they are overwhelmed with information assets captured for uncertain use cases. This is indeed, an alarming situation. While companies are spend Read more…

Apache Iceberg: The Hub of an Emerging Data Service Ecosystem?

Engineers at Netflix and Apple created Apache Iceberg several years ago to address the performance and usability challenges of using Apache Hive tables in large and demanding data lake environments. Now the data table fo Read more…

Google, Twitter Expand Data Cloud Partnership

Google and Twitter are expanding a partnership that will see the social media platform complete the move of its analytics, data processing and machine learning workloads to Google’s Data Cloud. The partners said the sh Read more…

Cloud: The Big Data Budget-Buster

Think you’re going to save money by moving your big data apps to the cloud? Think again, according to a new survey from Pepperdata, which found that about a third of companies are exceeding their cloud budgets by 20% t Read more…

Cloudera CEO: Enterprise Data Cloud Vision Nearly Complete

The hard work may be paying off for Cloudera, the embattled former Hadoop distributor that’s been pivoting to an enterprise data cloud strategy for the past year-and-a-half. With the delivery of an on-prem version of i Read more…

Data Exchange Maker Harbr Closes Series A

Harbr, a London startup that helps organizations like Moody’s Analytics to create their own custom data exchanges, yesterday announced that it has completed a Series A round of financing, netting $38.5 million for the Read more…

The Past and Future of In-Memory Computing

When Nikita Ivanov co-founded GridGain Systems back in 2005, he envisioned in-memory computing going mainstream and becoming a massive category unto itself within a few years. That obviously didn’t pan out, but on the Read more…

Datanami