Tag: data lake

Databricks Donates Delta Code to Open Source

Apr 24, 2019

Databricks today announced that it’s open sourcing the code behind Databricks Delta, the Apache Spark-based product it designed to help keep data neat and clean as it flows from sources into its cloud-based analytics environment. Read more…

How Databricks Keeps Data Quality High with Delta

Apr 8, 2019

Data lakes have sprung up everywhere as organizations look for ways to store all their data. But the quality of data in those lakes has posed a major barrier to getting a return on data lake investments. Read more…

Nvidia Sees Green in Data Science Workloads

Mar 19, 2019

We already knew that GPUs are useful for lots of things besides making Fortnite uncomfortably realistic. All the biggest supercomputers in the world use GPUs to accelerate math, and more recently they’ve been used to power deep neural networks in public clouds. Read more…

Building a Successful Data Governance Strategy

Dec 7, 2018

One of the core elements of data analytics that organizations struggle with today is data governance. An organization could do everything right and still wonder why their analytics projects are failing if they haven’t taken the time to build and implement a governance strategy. Read more…

Hitachi Ups Game for Managing Unstructured Data

Dec 4, 2018

Most enterprise data goes unused, and according to some studies very little unstructured data in the form of text, audio and video makes its way into the hands of data analysts. Read more…

AWS To Build You a Data Lake in ‘A Few Clicks’

Nov 29, 2018

AWS yesterday announced Lake Formation, a new service that it says will let users build their own data lake on S3 — complete with the requisite provisions for security, access control, data transformation, and cataloging — Read more…

Is it Time to Drain the Data Lake?

Oct 15, 2018

The term “Big Data” has been a major point of enterprise technology conversation for decades, and with the rise of the data lake, it’s back in the spotlight.

Early on, the idea of big data was how to store the mass amounts of it being generated by the newly created Web-scale infrastructures. Read more…

ML Powers Discovery In GE’s 500 PB Lake

Sep 25, 2018

Like most Fortune 50 firms, General Electric relies on an abundance of computer systems to power its enterprise. And like most firms that size, synching up and aligning the data emitted by different systems is a major challenge. Read more…

Survey: Excel Remains Go-To Data Prep Tool

Apr 2, 2018

Skyrocketing data volumes and a complex mix of data types are bogging down data preparation and processing efforts, according to a recent overview of enterprise data quality.

The snapshot released by data prep vendor Paxata surprisingly found that about two-thirds of the organizations it surveyed late last year are still relying on profiling tools like Excel spreadsheets to help ingest and profile data. Read more…

Tech’s Hottest New Trend: Data Governance

Jan 24, 2018

What’s the hottest new trend in information technology? If you said “artificial intelligence,” give yourself partial credit, because AI definitely is hot. But for technology decision-makers in the business world, there’s something even bigger simmering just below the surface. Read more…
