Follow Datanami:

Tag: apache spark

Anaconda: Data Science Exiting Hadoop for the Cloud

Jun 14, 2018 |

Data scientists are embracing cloud-native frameworks as they move on from on-premises data infrastructure previously dominated by Hadoop, concludes a survey on the state of data science.

The shift is driven in part by the enterprise transition from merely managing big data to using machine learning and other connected data tools to glean insights in real time, according to the data science survey released this week by Python platform specialist Anaconda Inc. Read more…

Databricks Open Sources MLflow to Simplify Machine Learning Lifecycle

Jun 5, 2018 |

Databricks today unveiled MLflow, a new open source project that aims to provide some standardization to the complex processes that data scientists oversee during the course of building, testing, and deploying machine learning models. Read more…

Project Hydrogen Unites Apache Spark with DL Frameworks

Jun 5, 2018 |

The folks behind Apache Spark today unveiled Project Hydrogen, a new endeavor that aims to eliminate barriers preventing organizations from using Spark with deep learning frameworks like TensorFlow and MXnet. Read more…

Google Cloud Adds Cask Data

May 22, 2018 |

Leading cloud providers continue to snap up analytics startups with an eye toward expanding access to big data technologies. Cask Data, developers of an application platform that among other things integrates Hadoop and Apache Spark, is the latest acquisition by Google Cloud, which earlier this month bought cloud migration specialist Velostrata. Read more…

Apache Zeppelin Launches Latest Data Science Notebook

Mar 23, 2018 |

ZEPL, the startup founded by the creators of interactive data analytics tool Apache Zeppelin, has moved its multi-tenant analytics platform out of beta, announcing its general availability this week.

The platform is among a growing list of data science notebooks aimed at enterprise collaboration in conducting analytics via a single notebook interface. Read more…

Top 3 New Features in Apache Spark 2.3

Mar 14, 2018 |

It’s tough to find a big data project that’s had as much impact as Apache Spark over the past five years. The folks at Databricks, who contribute heavily to Spark (along with the wider Spark community) are keeping the project on the cutting edge with version 2.3. Read more…

Data Lakes Crest In Drive to Boost Quality

Dec 19, 2017 |

As more data moves to the cloud, the composition of data lakes is shifting to new sources such as NoSQL databases while cloud data repositories emerge amid hybrid deployments, according to a big data survey. Read more…

The Data Science Behind Dollar Shave Club

Sep 14, 2017 |

Dollar Shave Club burst onto the men’s hygiene scene in 2011 with a hilarious video and preposterous business plan: selling subscriptions for razor blades at a ridiculously low price. Six years later, the company keeps getting laughs with viral YouTube spots, while a sophisticated Apache Spark-based data mining operation running on Databricks’ Read more…

Databricks, Flush With Cash, Steers Spark at AI

Aug 22, 2017 |

Momentum around the Apache Spark cluster computing framework continues to build with the announcement of hefty late-stage funding round that will help push the analytics platform and related artificial intelligence applications deeper into enterprises. Read more…

Open Source Tool Emerges For Cyber Defense

Jul 26, 2017 |

As banks, hospitals and retailers continue to lose ground to hackers, the open source community has stepped into the fray with a cyber security project designed to bring advanced analytics to IT monitoring data. Read more…

Do NOT follow this link or you will be banned from the site!