Tag: ETL

AWS Launches Visual Data Prep Tool

AWS this week unveiled Glue DataBrew, a new visual data preparation tool for AWS Glue that’s designed to help users clean and normalize data without writing code. Data preparation is the Achilles’ heel of advanced Read more…

Can Isima Be the Nutanix of Data Management?

Despite the technological advances in big data, companies continue to struggle to put it all together and manage data in an effective way. Now a company called Isima is stepping forward with a plan to build a platform th Read more…

Informatica Likes Its Chances in the Cloud

Quick: Name a company that made its name in the 1990s and 2000s by providing data integration tools for enterprise analytics running in on-prem data centers, but has since pivoted to the cloud and was even named Snowflake Read more…

Running Sideline to Sideline with Big Data

What can you do with big data? A better question might be what can’t you do. From a big data point of view, we are living in extremely resource-rich times, with a huge assortment of tools, frameworks, and platforms to c Read more…

Zaloni Pivots to DataOps

Zaloni once was focused on helping customers to manage data in Hadoop. But under new CEO Susan Cook, the company has broadened its scope and is now aiming to help customers manage the entire supply chain of data, or what Read more…

Fivetran Launches Pay-As-You-Go Option for ETL

Fivetran wants to make it “stupidly simple” for customers to load data into cloud data warehouses, and judging from the company’s rapid growth, it seems to be working. Last week, the extract, transformation, and lo Read more…

Do You Have Customer Data You Can Trust?

Customer data is the new business success factor, yet over 76% of the 4,500+ companies we’ve worked with did not have accurate customer data. Surprised? Here’s more. Forty-seven percent of these companies did not Read more…

Pachyderm Gains Microsoft Funding, Launches Hub

A startup launched as a Hadoop alternative in the form of a container-based big data platform continues to attract investors to its open source data science framework. Pachyderm Inc. said this week its $16 million Ser Read more…

Planning an ETL Proof of Concept? Here Is What You Need to Consider

Picture this: You are trying to track the sentiment of your product using first-party customer data, social data, and social listening data to determine the success of a new feature. Getting a daily report on this can he Read more…

Taking the Pain Out of Buying and Selling Data

We’re well into big data’s second decade, and we’ve made a ton of progress on many fronts. We have cloud-based systems with infinite storage capacity, sophisticated machine learning software that improves by the mo Read more…

To Centralize or Not to Centralize Your Data–That Is the Question

Should you strive to centralize your data, or leave it scattered about? It seems like it should be a simple question, but it’s actually a tough one to answer, particularly because it has so many ramifications for how d Read more…

Reproducibility in Data Analytics Under Fire in Stanford Report

Armed with the same data and told to test the same hypotheses, dozens of independent researchers instead came to widely different conclusions using a variety of analytics techniques, according to a new report from Stanfo Read more…

Spark 3.0 to Get Native GPU Acceleration

NVIDIA today announced that it’s working with Apache Spark’s open source community to bring native GPU acceleration to the next version of the big data processing framework. With Spark version 3.0, which is due out n Read more…

Top 12 Datanami Stories of 2019

2019 was an eventful year in the big data space, with enough intersecting story lines to keep a big data watcher enmeshed for hours, if not days, on end. We did our best to trace the story lines out for you, dear re Read more…

Beyond BI: Looker Seeks Bigger Role for Data

Looker is best known as a business intelligence platform, which it definitely is. But with today's release of Looker 7, the company is making a strong case that it's much more than that. In fact, here at its user confere Read more…

How ML Helps Solve the Big Data Transform/Mastering Problem

Despite the astounding technological progress in big data analytics, we largely have yet to move past manual techniques for important tasks, such as data transformation and master data management. As data volumes grow, t Read more…

Dremio Noses Into Cloud Lakes with Analytics Speedup

Most of today's big data action is occurring in the cloud, where companies are building massive data lakes atop object storage systems like AWS S3 and Microsoft ADLS. While object stores offer tremendous scalability, the Read more…

StreamSets Eases Spark-ETL Pipeline Development

Apache Spark gives developers a powerful tool for creating data pipelines for ETL workflows, but the framework is complex and can be difficult to troubleshoot. StreamSets is aiming to simplify Spark pipeline development Read more…

Can We Stop Doing ETL Yet?

Despite the advances we've made in data science and advanced analytics in recent years, many projects still are beholden to a technological holdover from the 1980s: extract, transform, and load, or ETL. It's uncanny how Read more…

BigDATAwire