Follow Datanami:

Tag: data pipeline

Databricks Donates Delta Code to Open Source

Apr 24, 2019 |

Databricks today announced that it’s open sourcing the code behind Databricks Delta, the Apache Spark-based product it designed to help keep data neat and clean as it flows from sources into its cloud-based analytics environment. Read more…

Data Pipeline Automation: The Next Step Forward in DataOps

Apr 24, 2019 |

The industry has largely settled on the notion of a data pipeline as a means of encapsulating the engineering work that goes into collecting, transforming, and preparing data for downstream advanced analytics and machine learning workloads. Read more…

From Big Beer to Big Data: Inside AB InBev’s Digital Transformation

Apr 11, 2019 |

With more than 500 beer brands and $55 billion in sales, Anheuser-Busch InBev is already the world’s biggest beer company. And if all goes as planned with its digital transformation project, it will be the best beer company in the world, too. Read more…

Google Doubles Down on Cloud Data Migration

Feb 21, 2019 |

Data integration startups have become prime acquisition targets as cloud analytics vendors look to beef up their migration capabilities.

What that in mind, Google Cloud announced this week it intends to acquire data migration specialist Alooma, a five-year-old Bay Area startup. Read more…

Streamsets Gets $35M for DataOps

Sep 11, 2018 |

StreamSets, which bills itself as the “air traffic control” tasked with preventing collisions from occurring with big data, today announced that it raised $35 million, which it will use to continue building its data operations, or DataOps, platform. Read more…

Machine Teaching Will Drive Crowdsourced Cognition into the AI Pipeline

Jun 25, 2018 |

Building high-quality artificial intelligence (AI) is hard work. It’s a specialized discipline that historically has required highly skilled specialists, aka data scientists.

Any time you require some highly skilled, highly paid practitioner to accomplish something of value, you’ve introduced a bottleneck into that process. Read more…

How Disney Built a Pipeline for Streaming Analytics

May 14, 2018 |

The explosion of on-demand video content is having a huge impact on how we watch television. You can now binge watch an entire season’s worth of Grey’s Anatomy at one sitting, if that suits your fancy. Read more…

Apache Airflow to Power Google’s New Workflow Service

May 1, 2018 |

Apache Airflow, the workload management system developed by Airbnb, will power the new workflow service that Google rolled out today. Called Cloud Composer, the new Airflow-based service allows data analysts and application developers to create repeatable data workflows that automate and execute data tasks across heterogeneous systems. Read more…

How Netflix Optimized Flink for Massive Scale on AWS

Apr 30, 2018 |

When it comes to streaming data, it’s tough to find a company operating on a more massive scale than Netflix, which streams more than 125 million hours of TV shows and movies —  Read more…

Do NOT follow this link or you will be banned from the site!