Tag: apache spark

Data Lakes Crest In Drive to Boost Quality

Dec 19, 2017 |

As more data moves to the cloud, the composition of data lakes is shifting to new sources such as NoSQL databases while cloud data repositories emerge amid hybrid deployments, according to a big data survey. Read more…

The Data Science Behind Dollar Shave Club

Sep 14, 2017 |

Dollar Shave Club burst onto the men’s hygiene scene in 2011 with a hilarious video and preposterous business plan: selling subscriptions for razor blades at a ridiculously low price. Six years later, the company keeps getting laughs with viral YouTube spots, while a sophisticated Apache Spark-based data mining operation running on Databricks’ Read more…

Databricks, Flush With Cash, Steers Spark at AI

Aug 22, 2017 |

Momentum around the Apache Spark cluster computing framework continues to build with the announcement of hefty late-stage funding round that will help push the analytics platform and related artificial intelligence applications deeper into enterprises. Read more…

Open Source Tool Emerges For Cyber Defense

Jul 26, 2017 |

As banks, hospitals and retailers continue to lose ground to hackers, the open source community has stepped into the fray with a cyber security project designed to bring advanced analytics to IT monitoring data. Read more…

GigaSpaces Closes Analytics-App Gap With Spark

Jul 19, 2017 |

Data analytics and cloud vendors are rushing to support enhancements to the latest version of Apache Spark that boost streaming performance while adding new features such as data set APIs and support for continuous, real-time applications. Read more…

IBM Bolsters Spark Ties with Latest SQL Engine

Jul 18, 2017 |

IBM is extending its commitment to Apache Spark as a key component of in-memory analytics with the latest release of its SQL engine for Hadoop.

The new version of IBM Big SQL released last week also solidifies the company’s joint distribution deal with Hortonworks announced last month that includes Hortonwork’s Hadoop and stream processing distributions. Read more…

NEC Claims Vector CPU Outperforms Spark

Jul 6, 2017 |

An arms race is shaping up in the machine-learning sector with the claim by NEC Corp. that its approach based on its vector processor accelerates data processing by more than a factor of 50 compared to the Apache Spark cluster-computing framework. Read more…

Spark’s New Deep Learning Tricks

Jun 7, 2017 |

Imagine being able to use your Apache Spark skills to build and execute deep learning workflows to analyze images or otherwise crunch vast reams of unstructured data. That’s the gist behind Deep Learning Pipelines, a new open source package unveiled yesterday by Databricks. Read more…

What’s In the Pipeline for Apache Spark?

Mar 6, 2017 |

According to Apache Spark creator Matei Zaharia, Spark will see a number of new features and enhancements to existing features in 2017, including the introduction of a standard binary data format, better integration with Kafka, and even the capability to run Spark on a laptop. Read more…

Platform Incorporates Spark to Boost Collaboration

Feb 23, 2017 |

In-memory tools such as Apache Spark continue to mark inroads on predictive analytics platforms designed to allow data scientists and analysts to apply machine learning to large and diverse data sets. Read more…