Follow Datanami:

Tag: databricks

A Cloud Twist to the Farm-to-Table Movement

Aug 27, 2019 |

Mail-order services, or what a cloud-enabled industry calls “subscription box” services, have become all the rage for everything from fine wine to pet food. Consumers can also order Kansas City steaks online, but a partnership between a startup and Microsoft has yielded a data-driven approach to subscription box sales that allows one entrepreneur to better track beef, poultry and pork shipments, determine which of the 21 different varieties are selling and how quickly they go from freezer to dinner table. Read more…

Databricks Offers Something for Everybody with AutoML Solution

Aug 20, 2019 |

Databricks today took the covers off a new automated machine learning solution that promises to reduce the amount of manual coding required to develop predictive applications. But while other AutoML solutions tend to focus on core data science aspects of predictive apps, like model selection and hyperparameter tuning, Databricks new offering is designed to be used by a broad swath of personas to automate a range of data science and engineering activities, from data prep to production deployment. Read more…

Databricks Donates Delta Code to Open Source

Apr 24, 2019 |

Databricks today announced that it’s open sourcing the code behind Databricks Delta, the Apache Spark-based product it designed to help keep data neat and clean as it flows from sources into its cloud-based analytics environment. Read more…

Apache Spark Is Great, But It’s Not Perfect

Apr 3, 2019 |

Apache Spark is one of the most widely used tools in the big data space, and will continue to be a critical piece of the technology puzzle for data scientists and data engineers for the foreseeable future. Read more…

A Decade Later, Apache Spark Still Going Strong

Mar 8, 2019 |

Don’t look now but Apache Spark is about to turn 10 years old. The open source project began quietly at UC Berkeley in 2009 before emerging as an open source project in 2010. Read more…

Databricks Open Sources MLflow to Simplify Machine Learning Lifecycle

Jun 5, 2018 |

Databricks today unveiled MLflow, a new open source project that aims to provide some standardization to the complex processes that data scientists oversee during the course of building, testing, and deploying machine learning models. Read more…

Top 3 New Features in Apache Spark 2.3

Mar 14, 2018 |

It’s tough to find a big data project that’s had as much impact as Apache Spark over the past five years. The folks at Databricks, who contribute heavily to Spark (along with the wider Spark community) are keeping the project on the cutting edge with version 2.3. Read more…

Databricks Puts ‘Delta’ at the Confluence of Lakes, Streams, and Warehouses

Oct 25, 2017 |

Databricks today launched a new managed cloud offering called Delta that seeks to combine the advantages of MPP data warehouses, Hadoop data lakes, and streaming data analytics in a unifying platform designed to let users analyze their freshest data without incurring enormous complexity and costs. Read more…

The Data Science Behind Dollar Shave Club

Sep 14, 2017 |

Dollar Shave Club burst onto the men’s hygiene scene in 2011 with a hilarious video and preposterous business plan: selling subscriptions for razor blades at a ridiculously low price. Six years later, the company keeps getting laughs with viral YouTube spots, while a sophisticated Apache Spark-based data mining operation running on Databricks’ Read more…

Now Trending: AI Washing

Jul 26, 2017 |

First there was “green washing,” where companies exaggerated the environmental benefits of their products in order to boost sales. Now technology experts are warning us about “AI washing,” an equally questionable tactic pursued by software and technology vendors to boost their artificial intelligence bona fides. Read more…

Do NOT follow this link or you will be banned from the site!