Follow Datanami:

Tag: big data

Three Critical Factors to Consider When Preparing Data for Generative AI

Thanks in part to the excitement around breakthrough generative artificial intelligence (AI) tools like ChatGPT, industry analysts are projecting rapid growth of business investment in AI and machine learning (ML) techno Read more…

Why a Universal Semantic Layer is the Key to Unlock Value from Your Data

A semantic layer is a way to represent data so that it can be easily understood by business users, making it easier for them to interpret and use it directly, without a dependence on data engineering teams. Wikipedia def Read more…

The Business Case for Privacy Enhancing Technologies

In a world that’s constantly hyping the next big thing, it’s natural to be wary when a family of technologies is described as transformational. This label has been recently associated with Privacy Enhancing Technolog Read more…

Privacera Streamlines Data Access Control Decisions

To be data-driven, one needs access to data. That’s pretty obvious. But data also carries risk, which complicates the math. One vendor hoping to streamline the data access workflow and safely get more data into the han Read more…

Rockset Looks to Compute-Compute Isolation for Real-Time Advantage

The separation of compute and storage is a bedrock of big data architecture and has enabled nearly infinite scalability in cloud storage. Now a related concept called compute-compute isolation is being introduced to data Read more…

The Age of Data Productivity is Here

Black Swan events like extreme weather, financial crises, pandemics, or war used to be an anomaly. Today, these events occur at a surprisingly regular cadence, and such life-altering disruptions can induce extreme stress Read more…

Yes, Real-Time Streaming Data Is Still Growing

A funny thing happened while the tech world was focused almost exclusively on ChatGPT over the past eight months: Adoption of other cutting-edge technologies kept growing. One of those is real-time stream data processing Read more…

OpenAI Releases ChatGPT Code Interpreter, ‘Your Personal Data Analyst’

You can now run Python functions to analyze your own data in a ChatGPT session thanks to the new Code Interpreter that OpenAI is releasing as a beta to subscribers this week. ChatGPT Code Interpreter “lets ChatGPT r Read more…

DataRobot CEO Sees Success at Junction of Gen AI and ‘Classical AI’

What’s the next generation of enterprise AI going to look like? If you ask DataRobot CEO Debanjan Saha, enterprises will see the most business benefits by combining new generative AI tools and techniques with the class Read more…

What Is MosaicML, and Why Is Databricks Buying It For $1.3B?

Databricks shocked the big data world last week when it announced plans to acquire MosaicML for a cool $1.3 billion. With just $1 million in revenue at the end of 2022 and $20 million so far this year, some speculated th Read more…

Will Gen AI Help the Texas Rangers Win the World Series in ’23?

The Texas Rangers are one of six MLB teams that hasn’t won a World Series. If they win it all this year--and they are currently leading a surprisingly competitive AL West at the season’s midway point with one of the Read more…

Security Risks of Gen AI Raise Eyebrows

Unless you’ve been hiding under a rock the past eight months, you’ve undoubtedly heard how large language models (LLMs) and generative AI will change everything. Businesses are eagerly adopting things like ChatGPT to Read more…

Databricks Unleashes New Tools for Gen AI in the Lakehouse

Fresh off its announcement of the acquisition of MosaicML on Monday, Databricks today unleashed a torrent of new AI capabilities at its Data + AI Summit designed to enable its customers to create generative AI applicatio Read more…

Databricks Puts Unified Data Format on the Table with Delta Lake 3.0

Databricks today rolled out a new open table format in Delta Lake 3.0 that it says will eliminate the possibility of picking the wrong one. Dubbed Universal Format, or UniForm, the new table format can read and write dat Read more…

Cloudera Sees Iceberg Everywhere

Cloudera gave its hybrid cloud customers a big boost today when it announced on-prem support for the Apache Iceberg table format. The move gives customers the capability to access and process on-prem data with any Iceber Read more…

Snowflake Gives Everybody a Little Something at Summit

Whether you’re a data engineer building data pipelines, a data scientist creating AI models, or a CFO trying to minimize cloud spending, Snowflake gave you something today at its annual user conference in Las Vegas, Ne Read more…

Acryl Data Raises $21M in Bid to Unify Data Stack

Acryl Data, the company driving the open source data catalog called the DataHub Project, got a boost in its quest to unify the fragmented data stack today when it announced the completion of a $21 million Series A round. Read more…

AI to Goose Demand for All Flash Arrays, Pure Storage Says

These are still early days for AI, but the trajectory from things like ChatGPT makes it pretty clear to Pure Storage: The need to store and serve huge amounts of data to train AI models on GPUs will almost certainly requ Read more…

How to Build Great Data Products

By now, it’s common knowledge that data is everywhere and being produced and consumed at an astonishing rate. But the more important thing to focus on—especially for enterprises that have invested in creating data en Read more…

Numbers Station Sees Big Potential In Using Foundation Models for Data Wrangling

A startup called Numbers Station is applying the generative power of pre-trained foundation models such as GPT-4 to help with data wrangling. The company, which is based on research conducted at the Stanford AI Lab, has Read more…

Datanami