July 20, 2020

Building an Open Cloud Data Lake Future

Sponsored Content by Dremio

The explosion of data and the need for business agility to leverage that data for competitive advantage are driving a massive surge of data lake innovation. We’ve moved past first-generation on-premises Hadoop-based data lakes to focus on building next-generation data platforms in the cloud. Organizations of all sizes recognize that cloud data lakes, with separation of compute and data, give them the flexibility and freedom they need both today and tomorrow.

A key advantage of cloud data lakes is their open architecture, which minimizes the risk of vendor lock-in as well as the risk of being locked out of future industry innovation. As the cloud data lake evolves to support a wide range of production analytical and data processing use cases, it’s important to ensure that it maintains this open architecture in the future. A rich ecosystem of open source projects, technology vendors and cloud providers has emerged to make that a reality.

What’s been missing is a rallying point for this rapidly expanding community — an industry event dedicated to showcasing the newest innovations and the best ways to put them to work. And that’s why we’re excited to introduce and host Subsurface, the industry’s first cloud data lake conference.

Subsurface is an industry conference spanning the entire open cloud data lake architecture. It’s the only event designed to bring the broader community together around the fast-growing world of cloud data lakes. A crucial part of that community is the open source creators and committers whose innovations will fuel the next generation of cloud data lakes. At Subsurface, you’ll be able to dive into innovative open source projects such as Apache Arrow, Iceberg, Parquet, Marquez and Superset and how companies like Expedia and Netflix are using them to build open cloud data lakes.

Open source sessions include:

Technical Keynote: The Future of Intelligent Storage in Big Data – Daniel Weeks, Big Data Compute Team Lead at Netflix and Apache Iceberg and Parquet committer
Apache Arrow: A New Gold Standard for Dataset Transport – Wes McKinney, director at Ursa Labs, Pandas creator and Apache Arrow co-creator
Functional Data Engineering: A Set of Best Practices – Maxime Beauchemin, CEO and co-founder at Preset, Apache Superset creator and Airflow creator
Data Lineage and Observability with Marquez – Julien Le Dem, CTO and co-founder at Datakin and Apache Parquet co-creator
Lessons Learned From Running Apache Iceberg at Petabyte Scale – Anton Okolnychyi, Apache Iceberg PMC member and Apache Spark contributor
Hiveberg: Integrating Apache Iceberg with the Hive Metastore – Adrian Woodhead, principal engineer and Christine Mathiesen, software development at Expedia Group

And it’s not just about the technical sessions — Subsurface is the catalyst for a long-term cloud data lake community. We’re creating a dedicated Slack instance for Subsurface which you’ll be able to use both during and after the conference. You’ll be able to jump into topic-based Slack channels with attendees, speakers, event sponsors and open source project leads to get questions answered, watch demos and collaborate on making your cloud data lake initiatives a success.

So, whether you are looking to expand your technical knowledge or hear from your peers about their cloud data lake use cases and architectures, Subsurface provides plenty of opportunities to learn, network and be inspired. We’ll even have a little fun along the way.

Register for Subsurface today!

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Building an Open Cloud Data Lake Future

Register for Subsurface today!

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 23, 2024

April 22, 2024

April 19, 2024

April 18, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Building an Open Cloud Data Lake Future

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 23, 2024

April 22, 2024

April 19, 2024

April 18, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link