Spring Strata 2017 Coverage


Streaming Analytics Picks Up Where Hadoop Lakes Leave Off

If phase one of the big data boom was about storing as much data as your lake will hold, phase two will be about extracting information from the data as quickly as you can. For many organizations, that means using streaming analytics to both shrink the decision window and reduce the flow of data into the lake.

As we’ve talked about in recent articles, Hadoop-based data lakes aren’t going to disappear into the digital night. Read more…

Feature Articles from Spring Strata 2017

Anatomy of a Hadoop Project Failure


Several years ago, the educational technology company Blackboard selected Apache Hadoop to run a new data analytics application designed to turn data exhaust into actionable insight. Months later, the failed project was cancelled, and Blackboard implemented a hosted relational data warehousing product instead.

The reasons behind Blackboard’s initial selection of Hadoop for this project will sound familiar: Read more…

Sports Follies Exemplify Need for Instant Analysis of Streaming Data


Michael Lewis’ book “Moneyball” tells the well-known story of how the Oakland A’s used data analytics to gain a competitive advantage in major league baseball (MLB) in the late 80s and early 90s. In the 25 years since, all major sports, including basketball, football, hockey and soccer, have embraced analytics as a key to success; it has become part of the ethos of sports. Read more…

Scrutinizing the Inscrutability of Deep Learning


Most people can’t make heads or tails out of most algorithms. By “most people,” I’m referring to anyone who didn’t literally develop the algorithm in question or doesn’t manage the rules, data, and other artifacts that govern how its results drive automated decision processes.

An algorithm only starts to make sense if an expert can explain it in the proverbial “plain English.” Algorithms’ daunting size, complexity, and obscurity create a serious challenge: Read more…

Cloudera Gives Data Scientists a New Workbench


Data scientists will get the flexibility of a data science notebook with the security and governance of a locked-down Hadoop cluster through the Data Science Workbench, a new Web-based product unveiled by Cloudera today at the Strata + Hadoop World conference.

Companies are suffering from an impedance mismatch that exists between the open source Python- and R-based tools that data scientists want to use and what Hadoop provides, says Cloudera director of product management Matt Brandwein. Read more…

Hadoop Has Failed Us, Tech Experts Say


The Hadoop dream of unifying data and compute in a distributed manner has all but failed in a smoking heap of cost and complexity, according to technology experts and executives who spoke to Datanami.

“I can’t find a happy Hadoop customer. It’s sort of as simple as that,” Read more…

News in Brief from Spring Strata 2017

DataOps Dominates Strata Startup Winners


The top big data startups at this year’s Strata + Hadoop World conference include data platform specialists along with an analytics startup focused on the gaming industry.

The winner of this year’s Startup Showcase was Nexla Inc., developers of a platform that leverages machine learning to automate data operations. Read more…

Consumer Analytics Platform Gets ‘Granular’


The latest version of a retail analytics platform attempts to squeeze more insights from store loyalty programs as a way of gauging consumer preferences and whether, for example, a new product is selling and why.

New York-based 1010data Inc. said this week version 3.03 of its self-service consumer insights platform adds reporting capabilities designed to allow retailers and their suppliers to leverage loyalty programs to expand the range of analytics. Read more…

SAP Vora Gets Analytics, Cloud Upgrades


Building on its acquisition of Hadoop specialist Altiscale Inc., SAP is combining the latest release of its Vora in-memory distributed computing platform with its big data cloud as it extends the Apache Spark framework to deliver interactive analytics on Hadoop.

Vora software targets distributed data to provide contextual awareness while improving “business process awareness” across enterprise applications and analytics. Read more…

MapR Extends Its Platform to the Edge


MapR Technologies today unveiled MapR Edge, an extension of its converged data platform that lets customers install MapR nodes practically anywhere they want.

The new offering runs on small portable PCs like the Intel NUC (pictured above) and delivers the full breadth of MapR’s capabilities, including Hadoop, NoSQL, and data streaming functionality, anywhere customers want, from autonomous cars driving rural highways to wellheads in the oil field. Read more…

This Just In from Spring Strata 2017