Tag: data lake

LinkedIn Implements New Data Trigger Solution to Reduce Resource Usage For Data Lakes

With its vast user base and the numerous interactions that occur daily, LinkedIn generates an enormous amount of data. The billions of data points fuel various applications, from rankings to search. The additio Read more…

Data Observability in 2024: A Guide

In today's data-driven world, data observability is a critical concept for organizations aiming to effectively manage their data. Simply put, it means having the ability to constantly monitor and understand the status of Read more…

The Data Lakehouse Is On the Horizon, But It’s Not Smooth Sailing Yet

Data warehouses and data lakes serve clear and distinct purposes. Typically, data warehouses store structured data according to a predefined schema to generate fast query speeds for reporting purposes. Data lakes, on t Read more…

Data Engineering in 2024: Predictions For Data Lakes and The Serving Layer

The data landscape experienced significant changes in 2023, presenting new opportunities (and potential challenges) for data engineering teams. I believe we will see the following this year in the areas of analytics, Read more…

Inside AWS’s Plans to Make S3 Faster and Better

As far as big data storage goes, Amazon S3 has won the war. Even among storage vendors whose initials are not A.W.S., S3 is the de facto standard for storing lots of data. But AWS isn’t resting on its laurels with S3, a Read more…

Unveiling Chaos LakeDB: First Lake Database for Live Search, SQL, and Gen AI Analytics

With the rapidly evolving digital landscape, the ascent of generative AI is not just a passing phase. Organizations that are able to harness the potential of gen AI are set to gain a substantial competitive advantage. Ye Read more…

There Are Many Paths to the Data Lakehouse. Choose Wisely

You don’t need a crystal ball to see that the data lakehouse is the future. At some point soon, it will be the default way of interacting with data, combining scale with cost-effectiveness. Also easy to predict is t Read more…

Oracle Announces GA of MySQL HeatWave Lakehouse

Oracle recently announced the general availability of MySQL HeatWave Lakehouse, a fully managed database service. The company previously debuted the service at its CloudWorld event last October. This lakehouse is the Read more…

A Truce in the Cloud Data Lake Vs. Data Warehouse War?

At the 2nd Annual Semantic Layer Summit, which took place April 26, AtScale founder and CTO Dave Mariani sat down with Bill Inmon, recognized by many as the father of the data warehouse, to discuss the evolution of moder Read more…

Cyberspooks Need Big Data Portability, Too

The problem of how to effectively move and manage large amounts of data is one that impacts all organizations of a certain size, including U.S. Government agencies working in cybersecurity. Now a new partnership between Read more…

Starburst Bolsters Trino Platform as Datanova Begins

Starburst today rolled out a host of enhancements to its Trino-based analytics platform for the cloud, called Galaxy, including support for Python, new caching and indexing features, and a new data catalog. The company u Read more…

Onehouse Announces $25M Series A, New Feature for Its Managed Lakehouse Platform

Managed data lakehouse firm Onehouse has announced a $25 million Series A funding round, bringing its total funding to $33 million. Additionally, the company announced a new feature of its platform called Onetable. On Read more…

Datanova: The Coolest Data Conference of the Year

This event is for those who seek to not just ‘do data’ incrementally better, but differently. For the leaders who want to help their companies become truly data-driven. For the analyst in all of us that seeks sim Read more…

Two Cancer-Fighting Startups Gain a Foothold in AWS

The cloud is a natural place for startups that have large computing and data storage needs but also have uncertain futures. For two startups aiming to stop cancer, including Lyell Immunopharma and Hurone AI, the public c Read more…

Mastering the Mesh: Finding Clarity in the Data Lake

Data lakes are great in theory, but their application in the real world often leaves the user wanting more. A data mesh is one approach to cleaning up chaos left by data lakes and the resulting swing back to data decentr Read more…

AWS Bolsters Glue ETL Tool with Data Observability, Ray Support

AWS has made a big push into data management during re:Invent this week, with the unveiling of DataZone and launch of zero-ETL capabilities in Redshift. But AWS also bolstered its ETL tool with the launch of Amazon Glue Read more…

The Key Tech Enabling Cloudera’s New Lakehouse

Cloudera today debuted CDP One, its new software-as-a-service (SaaS) lakehouse offering. For the first time, Cloudera is taking over management of its data platform on behalf of its customers. It’s also Cloudera’s fi Read more…

Will the Data Lakehouse Lead to Warehouse-Style Lock-In?

Lakehouse architectures are gaining steam as a preferred method for doing big data analytics in the cloud, thanks to the way they blend traditional data warehousing concepts with today’s cloud tech. But could lakehouse Read more…

Five Ways Big Data Projects Can Go Wrong (And What You Can Do About Them)

So your big data project isn’t panning out the way you wanted? You’re not alone. The poor success rate of big data projects has been a persistent theme over the past 10 years, and the same types of struggles are show Read more…

Google Cloud Opens Door to the Lakehouse with BigLake

Google Cloud made its way into the lakehouse arena today with the launch of BigLake, a new storage engine that melds the governance of its data warehousing offering, BigQuery, with the flexibility of open data formats a Read more…
