Follow Datanami:

Tag: ETL

Cloud Is the New Center of Gravity for Data Warehousing

The great migration of data into the cloud didn’t start in 2020, but it certainly accelerated throughout the year. And according to a new survey from IDG, the overwhelming majority of companies are planning to expand t Read more…

AWS Bolsters Its Lakehouse

Amazon Web Services wants you to create data silos to ensure you get the best performance when processing data. AWS also wants to help unify your data to ensure that insights don’t fall between the cracks. If you think Read more…

Optimizing AI and Deep Learning Performance

As AI and deep learning uses skyrocket, organizations are finding they are running these systems on similar resource as they do with high-performance computing (HPC) systems – and wondering if this is the path to peak Read more…

Snowflake Extends Its Data Warehouse with Pipelines, Services

Customers running atop Snowflake’s cloud data warehouse soon will find new functionality, including the ability to build ETL data pipelines , as well as the ability to expose pre-built analytic routines as data service Read more…

AWS Launches Visual Data Prep Tool

AWS this week unveiled Glue DataBrew, a new visual data preparation tool for AWS Glue that’s designed to help users clean and normalize data without writing code. Data preparation is the Achille’s Heel of advanced Read more…

Can Isima Be the Nutanix of Data Management?

Despite the technological advances in big data, companies continue to struggle to put it all together and manage data in an effective way. Now a company called Isima is stepping forward with a plan to build a platform th Read more…

Informatica Likes Its Chances in the Cloud

Quick: Name a company that made its name in the 1990s and 2000s by providing data integration tools for enterprise analytics running in on-prem data centers, but has since pivoted the cloud and was even named Snowflake� Read more…

Running Sideline to Sideline with Big Data

What can you do with big data? A better question might be what can’t you do. From a big data point of view, we are living in extremely resource-rich times, with a huge assortment of tools, framework, and platforms to c Read more…

Zaloni Pivots to DataOps

Zaloni once was focused on helping customers to manage data in Hadoop. But under new CEO Susan Cook, the company has broadened its scope and is now aiming to help customers manage the entire supply chain of data, or what Read more…

Fivetran Launches Pay-As-You-Go Option for ETL

Fivetran wants to make it “stupidly simple” for customers to load data into cloud data warehouses, and judging from the company’s rapid growth, it seems to be working. Last week, the extract, transformation, and lo Read more…

Do You Have Customer Data You Can Trust?

Customer data is the new business success factor, yet over 76% of the 4,500+ companies we’ve worked with did not have accurate customer data. Surprised? Here’s more. Forty-seven percent of these companies did not Read more…

Pachyderm Gains Microsoft Funding, Launches Hub

A startup launched as a Hadoop alternative in the form of a container-based big data platform continues to attract investors to its open source data science framework. Pachyderm Inc. said this week its $16 million Ser Read more…

Planning an ETL Proof of Concept? Here Is What You Need to Consider

Picture this: You are trying to track the sentiment of your product using first-party customer data, social data, and social listening data to determine the success of a new feature. Getting a daily report on this can he Read more…

Taking the Pain Out of Buying and Selling Data

We’re well into big data’s second decade, and we’ve made a ton of progress on many fronts. We have cloud-based systems with infinite storage capacity, sophisticated machine learning software that improves by the mo Read more…

To Centralize or Not to Centralize Your Data–That Is the Question

Should you strive to centralize your data, or leave it scattered about? It seems like it should be a simple question, but it’s actually a tough one to answer, particularly because it has so many ramifications for how d Read more…

Reproducibility in Data Analytics Under Fire in Stanford Report

Armed with the same data and told to test the same hypotheses, dozens of independent researchers instead came to widely different conclusions using a variety of analytics techniques, according to a new report from Stanfo Read more…

Spark 3.0 to Get Native GPU Acceleration

NVIDIA today announced that it’s working with Apache Spark’s open source community to bring native GPU acceleration to the next version of the big data processing framework. With Spark version 3.0, which is due out n Read more…

Top 12 Datanami Stories of 2019

2019 was an eventful year in the big data space, with enough intersecting story lines to keep a big data watcher enmeshed for hours – if not days -- on end. We did our best to trace the story lines out for you, dear re Read more…

Beyond BI: Looker Seeks Bigger Role for Data

Looker is best known as a business intelligence platform, which it definitely is. But with today's release of Looker 7, the company is making a strong case that it's much more than that. In fact, here at its user confere Read more…

Datanami