MLB Unravels Home Run Mystery with Analytics, Modeling and Simulation

MLB Unravels Home Run Mystery with Analytics, Modeling and Simulation

Since the 2015 season, the ball has been flying out of Major League ballparks at a remarkable clip. Read more…

Data Regulation D-Day Arrives: Final Thoughts on GDPR

Data Regulation D-Day Arrives: Final Thoughts on GDPR

Finally, after years of waiting, the General Data Protection Regulation (GDPR) goes into effect tomorrow, dramatically changing the rules for what you can and can’t do with the personal data of European citizens. Read more…

Analytics Power Discoveries in ICIJ’s West Africa Leaks

Analytics Power Discoveries in ICIJ’s West Africa Leaks

Graph analytic technologies and techniques played a major role in West Africa Leaks, the latest International Consortium of Investigative Journalists (ICIJ) project to unearth illegal or unethical activities or investments that powerful individuals have hidden using offshore law firms and tax havens. Read more…

Okera Emerges from Stealth with Big Data Fabric

Okera Emerges from Stealth with Big Data Fabric

While data platforms like Hadoop and object stores like S3 have become popular places for organizations to stash massive amounts of data, the platforms haven’t solved companies’ Read more…

Datanami Headlines

ID Verify Vendor Mitek Acquires AI-Based Authenticator

To get a sense of which AI capabilities beyond chatbots are most likely to have an immediate commercial impact, it’s useful to monitor which AI developers are being snapped up by established technology vendors.

Another clue surfaced this week when Mitek Systems Inc., a specialist in digital identity verification for banks, announced a deal to acquire Paris-based A2iA, a developer of software tools used to capture data, process official documents and automate workflows. Read more…

Intel Adopts ‘Holistic’ Approach to AI

A developers’ conference is generally regarded as an indication the host company is all-in on a particular technology. Intel Corp.’s inaugural AI DevCon highlights its strategy of moving beyond its dominant position in the server and other processor markets to focus on “the AI-driven future of computing.”

The two-day event in San Francisco focused on forging a “holistic” Read more…

Google Cloud Adds Cask Data

Leading cloud providers continue to snap up analytics startups with an eye toward expanding access to big data technologies. Cask Data, developers of an application platform that among other things integrates Hadoop and Apache Spark, is the latest acquisition by Google Cloud, which earlier this month bought cloud migration specialist Velostrata. Read more…

Cloudera Gives Data Scientists More Options for ML

Cloudera unleashed a collection of new software today that’s geared at accelerating the development and deployment of machine learning programs. In addition to a new release of its Data Science Workbench that lets data scientists deploy ML models as APIs with the push of a button, it released a new iteration of its enterprise suite of software based around Apache Hadoop, Cloudera Enterprise 6.0, that offers first-class support for GPUs, among other new features. Read more…

Google Unleashes TPUs on Cloud ML Engine

As the amount of machine learning training data soars, so too does demand for new tools that will accelerate the process. With that in mind, Google Cloud announced the beta release of a new feature that allows users to speed training by running Tensor processing units (TPU) on its machine learning engine. Read more…

The Seven Sins of Data Prep

Data preparation is often considered a necessary precursor to the “real” work found in visualizing or analyzing data, but this framing sells data prep short. The ways in which we cleanse and shape data for downstream use have significant bearing on our final analytic output, and cutting corners on data prep can run up a huge cost for companies. Read more…

Why Developers Need to Think like Data Scientists

Data is growing faster than is even fathomable. By 2020, roughly 1.7 megabytes of new information will be created every second for every human being on the planet. Given the immense amount of data, it’s no wonder there is a call in all industries for talent that can collect and analyze the data. Read more…

Oracle Deal Adds Data Tools to its Cloud Services

Oracle moved this week to expand its portfolio of data tools and libraries with the acquisition of DataScience.com, developers of a collaboration platform that connects with popular workflows and cloud infrastructure.

Oracle said Wednesday (May 16) the addition of the Los Angeles-area data science startup would help boost utilization of machine learning on Oracle Cloud. Read more…

Danger and Difficulty Temper Data’s Huge Potential

It has been called the new oil, the new currency, the new religion. It is data, of course, and it’s having a monumental impact on how we build business systems in the 21st century. However, for all the potential benefits that data bring can bring to organizations, many are discovering that data is also full of hidden pitfalls and risks. Read more…

Big Data File Formats Demystified

So you’re filling your Hadoop cluster with reams of raw data, and your data analysts and scientists are champing at the bit to get started. Then the question hits you: How are you going to store all this data so they can actually use it? Read more…

MemSQL Gains Traction, New Investors

MemSQL, the database vendor that touts its architecture as capable of handling both data analytics and transactions, announced a $30 million funding round led by, GV, the former Google Ventures.

The hybrid SQL database developer said this week its latest funding round brings its total to $110 million. Read more…

A Wave of Purpose-Built AI Hardware Is Building

Google last week unveiled the third version of its Tensor Processing Unit (TPU), which is designed to accelerate deep learning workloads developed in its TensorFlow environment. But that’s just the start of a groundswell of new processors and processing architectures, including Wave Computing, which claims its soon-to-be-launched processor will dramatically lower the barrier of entry for running artificial intelligence workloads. Read more…

Hive Rolls Productivity Platform

Among the latest pitches for adopting predictive analytics are worker productivity tools that help enterprises understand what’s around the corner. The ability to forecast tomorrow’s workload is touted as a way to improve efficiency and even reduce workplace stress by eliminating “reactive work.”

That’s the value proposition being offered by Hive, the productivity startup that unveiled an analytics feature this week combining task management and request messaging. Read more…