Follow Datanami:

Tag: data pipeline

How Airflow 2.8 Makes Building and Running Data Pipelines Easier

Apache Airflow is one of the world’s most popular open source tools for building and managing data pipelines, with around 16 million downloads per month. Those users will see several compelling new features that help t Read more…

AWS Plots Zero-ETL Connections to Azure and Google

At the recent re:Invent show, AWS unveiled new zero-ETL connections that will eliminate the need for customers to build and maintain data pipelines between various AWS data services, including Redshift, Aurora, DynamoDB, Read more…

Pantomath on a Mission To Enhance End-To-End Data Pipeline Observability Across Complex Data Ecosystems

In our data-intensive business world, organizations are striving to use new and innovative methods to derive valuable and actionable insights from the data. Unfortunately, data quality issues are a major challenge and re Read more…

Six Common Signs It’s Time to Invest in Data Reliability

Every enterprise relies heavily on data to make decisions.  This makes data reliability crucial. Without it, you may not find the path to streamlined customer experience and revenue generation. However, data reliability Read more…

Meet Maxime Beauchemin, a 2023 Person to Watch

When it comes to prolific contributors to open source projects in the big data space, Maxime Beauchemin is definitely somebody you should know. As a data engineer at Airbnb, Beauchemin created multiple tools that he subs Read more…

Data Mesh Creator Takes Next Data Step

Zhamak Dehghani, who is credited with popularizing the data mesh concept, announced earlier this month the founding of Nextdata. The new outfit will develop software designed to help customers implement “data product c Read more…

Data Integration and Observability Provider Crux Nabs $50m in Funding

Crux, a cloud-based provider of data integration and observability tools that claims to have more than 250 data connectors, today announced that it raised $50 million in a Series B round of venture capital. The San Franc Read more…

AWS Seeks an End to ETL

Extract, transform, and load. It’s a simple and ubiquitous thing in IT. And yet everybody seems to hate it. The latest company to pile on to ETL is AWS, which declared an effort to end ETL yesterday at re:Invent. Ad Read more…

How Snowplow Breaks Down Data Barriers

If you have suspicions about your data, you’re not alone. The AI and data analytics dreams of many a company have been broken by poor data management and defective ETL pipelines. But by enforcing data schema at the poi Read more…

A Data Platform for Chatbot Development

One of the most compelling use cases for AI at the moment is developing chatbots and conversational agents. While the AI part of the equation works reasonably well, getting the training data organized to build and train Read more…

CI/CD Pipeline: 7 Advantages To A Continuous Integration Approach to Data Pipelines

When it comes to modern software development, it’s not surprising that companies have a need for speed. But if you develop software too quickly, it can mean sacrificing quality, security and compliance. DevOps and con Read more…

Exploring the Top Options for Real-Time ELT

Competitive advantage in today’s world rests on a company’s ability to innovate and adapt to a rapidly changing environment. To do that, organizations must adopt real-time thinking in the way they approach the design Read more…

Airflow Available as a New Managed Service Called Astro

Companies can now get an Apache Airflow data orchestration environment up and running in less than an hour via Astro, a new managed service launched today by Astronomer, the commercial entity behind the popular open-sour Read more…

Monte Carlo Raises $135 Million to Grow Data Observability Biz

Data observability has emerged as one of the hottest sectors in the big data market, thanks to its focus on fixing broken data pipelines. One of the hottest players in the field is Monte Carlo, which this week announced Read more…

Building Continuous Data Observability at the Infrastructure Layer

Data is the lifeblood of business today, but getting it where it needs to go is hard, especially as data volumes grow. Data pipelines become the repeatable method for moving this digital crude, but monitoring the flows f Read more…

Monte Carlo Hits the Circuit Breaker on Bad Data

Data pipelines are critical conduits of information for data-driven companies. But what happens when the data in the pipeline becomes corrupted? In some situations, you want to immediately stop the flow of data, which is Read more…

Databricks Ships New ETL Data Pipeline Solution

Databricks today announced the general availability (GA) of Delta Live Tables (DLT), a new offering designed to simplify the building and maintenance of data pipelines for extract, transform, and load (ETL) processes usi Read more…

ETL Tool Apache Hop Graduates Incubator

Apache Hop, a metadata-driven data orchestration tool used to design and build pipelines, today emerged from incubator status and was named a Top-Level Project at the Apache Software Foundation, clearing the way for more Read more…

Inside AutoTrader UK’s Data Observability Pipeline

In the course of shifting its analytics estate to the cloud, AutoTrader UK has adopted many new tools and technologies, including BigQuery, Looker, and dbt, which have helped to democratize data access among users. Along Read more…

Bad Data Pipelines Costing Companies Big, Fivetran Finds

Stop us if you’ve heard this one before: Overworked data engineer builds faulty data pipeline, which leads to bad data, which leads to a bad outcome. It may be the same old song, but it’s also the current state of af Read more…

Datanami