Follow Datanami:

Tag: ETL

Spark 3.0 to Get Native GPU Acceleration

May 14, 2020 |

NVIDIA today announced that it’s working with Apache Spark’s open source community to bring native GPU acceleration to the next version of the big data processing framework. With Spark version 3.0, which is due out next month, organizations will be able to speed up all of their Spark workloads, from ETL jobs to machine learning training, without making wholesale changes to their code. Read more…

Top 12 Datanami Stories of 2019

Jan 3, 2020 |

2019 was an eventful year in the big data space, with enough intersecting story lines to keep a big data watcher enmeshed for hours – if not days — on end. Read more…

Beyond BI: Looker Seeks Bigger Role for Data

Nov 7, 2019 |

Looker is best known as a business intelligence platform, which it definitely is. But with today’s release of Looker 7, the company is making a strong case that it’s much more than that. Read more…

How ML Helps Solve the Big Data Transform/Mastering Problem

Oct 10, 2019 |

Despite the astounding technological progress in big data analytics, we largely have yet to move past manual techniques for important tasks, such as data transformation and master data management. As data volumes grow, the productivity gap posed by manual methods grows wider, putting the dreams of AI- and machine learning-powered automation further out of reach. Read more…

Dremio Noses Into Cloud Lakes with Analytics Speedup

Sep 17, 2019 |

Most of today’s big data action is occurring in the cloud, where companies are building massive data lakes atop object storage systems like AWS S3 and Microsoft ADLS. While object stores offer tremendous scalability, they’re notoriously slow. Read more…

StreamSets Eases Spark-ETL Pipeline Development

Sep 5, 2019 |

Apache Spark gives developers a powerful tool for creating data pipelines for ETL workflows, but the framework is complex and can be difficult to troubleshoot. StreamSets is aiming to simplify Spark pipeline development with Transformer, the latest addition to its DataOps platform. Read more…

Can We Stop Doing ETL Yet?

Sep 3, 2019 |

Despite the advances we’ve made in data science and advanced analytics in recent years, many projects still are beholden to a technological holdover from the 1980s: extract, transform, and load, or ETL. Read more…

Skills Are Critical in Data Science Job Hunt

Aug 22, 2019 |

Those planning a career in data science have a healthy job outlook, as demand for data scientists continues to grow. While an advanced data science degree can definitely help, it’s becoming increasingly apparent that having the right skills is a more critical factor in landing your dream job. Read more…

The Critical Element for a Successful Digital Transformation? HTAP Powered by In-Memory Computing

Aug 8, 2019 |

Many of today’s digital transformation and omnichannel customer experience initiatives demand real-time analysis of data. For example, banks need to analyze transactions across their systems in real time to detect and prevent fraud. Read more…

Do NOT follow this link or you will be banned from the site!