Follow Datanami:

Tag: Parquet

Tabular Seeks to Remake Cloud Data Lakes in Iceberg’s Image

The creators of the table format Apache Iceberg launched a new company this summer called Tabular that’s aiming to remake how companies store data in the cloud. If the company has its way, much of the minutia of how da Read more…

A Peek at the Future of the Open Data Architecture

Hadoop may have fizzled out as a data platform, but it laid the groundwork for an open data architecture that continues to grow and evolve today, largely in the cloud. We got a peek at the future of this open data archit Read more…

Presto the Future of Open Data Analytics, Foundation Says

The openness of Presto, its adherence to standard SQL, and the ubiquity and performance of modern cloud storage have combined to put Presto in the driver’s seat of the big data analytics stack for the foreseeable futur Read more…

Data Headaches Targeted with a Dose of .BIG

Working with large numbers of files--and large files--remains a roadblock to productivity for data professionals around the world. Now a software startup named Exponam says it has come up with a potential solution to the Read more…

Return of the Living Data

When Google published a paper on its proprietary BigQuery engine about nine years ago, the open source community reproduced the technology as best they could, just as they did with MapReduce and the Google File System, w Read more…

Data Startup Aims to Make S3 ‘Work Like Dropbox’

Quilt Data emerged from stealth today with a new service that aims to make S3 work more like Dropbox, the handy file sharing service. For about $500 per month, Quilt Data allows teams to securely large share files that a Read more…

Celebrating Data Independence

Every company wants the independence to do what they wish with their data. That's one of the first assumptions underlying this whole big data movement. But depending on where and how a business stores its data -- such as Read more…

Big Data File Formats Demystified

So you're filling your Hadoop cluster with reams of raw data, and your data analysts and scientists are champing at the bit to get started. Then the question hits you: How are you going to store all this data so they can Read more…

Datanami