Tag: apache spark

The Data Science Behind Dollar Shave Club

Sep 14, 2017 |

Dollar Shave Club burst onto the men’s hygiene scene in 2011 with a hilarious video and preposterous business plan: selling subscriptions for razor blades at a ridiculously low price. Six years later, the company keeps getting laughs with viral YouTube spots, while a sophisticated Apache Spark-based data mining operation running on Databricks’ Read more…

Databricks, Flush With Cash, Steers Spark at AI

Aug 22, 2017 |

Momentum around the Apache Spark cluster computing framework continues to build with the announcement of hefty late-stage funding round that will help push the analytics platform and related artificial intelligence applications deeper into enterprises. Read more…

Open Source Tool Emerges For Cyber Defense

Jul 26, 2017 |

As banks, hospitals and retailers continue to lose ground to hackers, the open source community has stepped into the fray with a cyber security project designed to bring advanced analytics to IT monitoring data. Read more…

GigaSpaces Closes Analytics-App Gap With Spark

Jul 19, 2017 |

Data analytics and cloud vendors are rushing to support enhancements to the latest version of Apache Spark that boost streaming performance while adding new features such as data set APIs and support for continuous, real-time applications. Read more…

IBM Bolsters Spark Ties with Latest SQL Engine

Jul 18, 2017 |

IBM is extending its commitment to Apache Spark as a key component of in-memory analytics with the latest release of its SQL engine for Hadoop.

The new version of IBM Big SQL released last week also solidifies the company’s joint distribution deal with Hortonworks announced last month that includes Hortonwork’s Hadoop and stream processing distributions. Read more…

NEC Claims Vector CPU Outperforms Spark

Jul 6, 2017 |

An arms race is shaping up in the machine-learning sector with the claim by NEC Corp. that its approach based on its vector processor accelerates data processing by more than a factor of 50 compared to the Apache Spark cluster-computing framework. Read more…

Spark’s New Deep Learning Tricks

Jun 7, 2017 |

Imagine being able to use your Apache Spark skills to build and execute deep learning workflows to analyze images or otherwise crunch vast reams of unstructured data. That’s the gist behind Deep Learning Pipelines, a new open source package unveiled yesterday by Databricks. Read more…

What’s In the Pipeline for Apache Spark?

Mar 6, 2017 |

According to Apache Spark creator Matei Zaharia, Spark will see a number of new features and enhancements to existing features in 2017, including the introduction of a standard binary data format, better integration with Kafka, and even the capability to run Spark on a laptop. Read more…

Platform Incorporates Spark to Boost Collaboration

Feb 23, 2017 |

In-memory tools such as Apache Spark continue to mark inroads on predictive analytics platforms designed to allow data scientists and analysts to apply machine learning to large and diverse data sets. Read more…

Anaconda Gets Big Iron Support From IBM

Feb 10, 2017 |IBM is expanding its embrace of big data analytics to include support on its open source mainframe for the Anaconda stack used by Python programmers. The company (NYSE: IBM) announced Thursday (Feb. 9) that it is collaborating with Anaconda developer Continuum Analytics and Rocket Software to host the open source analytics platform on IBM z/OS mainframes. Read more…