Follow Datanami:

Tag: Spark

How Auto Insurers Detect and Use Your Driving ‘Fingerprint’

You may not know it, but the way you drive is unique--sort of like a fingerprint. How fast you drive, how tight you turn, and how long you idle in the driveway before hitting the road all help to identify you from others Read more…

Supercharging Apache Spark with Flash and NoSQL

Apache Spark has become the defacto standard computational engine in the big data world. But as an in-memory technology, Spark has limitations. One of the ways people are getting around those limitations is by pairing Sp Read more…

Concord Claims 10x Performance Edge on Spark Streaming

Organizations that are looking for a stream processing engine upon which to build fast data applications featuring high-throughput and low-latency may want to check out Concord, a new framework that emerged from the ad-t Read more…

How TransUnion Maximizes Data Science Tools and Talent

You may know TransUnion as one of the credit bureaus that controls the interest rate on your new loan. But in fact the company does much more, and has solutions around fraud detection, collections, and marketing, among o Read more…

Investments in Fast Data Analytics Surge

Companies are quickly ramping up their investments fast data analytics and real-time stream processing frameworks and lowering spending on batch technologies in an attempt to get on top of growing data volumes and veloci Read more…

Actian Reasserts Performance Claims With VectorH

The latest version of SQL-on-Hadoop specialist Actian Corp.'s Vector database tightens integration with Apache Spark to widen access to new data sources while adding enterprise features required to move Hadoop-based anal Read more…

MongoDB Struts Its NoSQL Stuff in NYC

When you think about giants of the technology world, MongoDB may not come to mind. But judging by the big strides this up-and-coming NoSQL database vendor is making, and the aggressive roadmap it put forth today at the t Read more…

What’s Hot This Summer: Data Science Bootcamps

Summer is here and temperatures are rising. While some of us take vacations or cool off at the beach, prospective data scientists are heating up their job prospects by participating in one of a growing number of data sci Read more…

Apache Spark Adoption by the Numbers

It's been about three years since Apache Spark burst onto the big data scene and became one of the hottest technologies on the planet. Judging by the numbers surrounding Spark's adoption—including things like salaries, Read more…

IBM Seeks Data Science Unity with New Spark-Based ‘Experience’

IBM today launched what it's calling the first enterprise application for data science collaboration. Called the Data Science Experience, the free, cloud-based offering is aimed at enabling data scientists to perform tas Read more…

Big Data Benchmark Gauges Hadoop Platforms

In another indication of a maturing technology and growing demand, an industry group has released a big data analytics benchmark designed to gauge the performance of Hadoop-based systems. The Transaction Processing Pe Read more…

Merging Batch and Stream Processing in a Post Lambda World

It wasn't long ago that developers looked to the Lamba architecture for hints on how to design big data applications that needed elements of both batch and streaming data. But already, the Lamba architecture is falling o Read more…

How Spark and Hadoop Are Advancing Cancer Research

The combination of Spark and Hadoop has supercharged big data analysis across many industries and use cases by lowering the barrier of entry to advanced analytics and thereby enabling data scientists to create data-drive Read more…

Hadoop Past, Present, and Future

Every few years the technology industry seems to be consumed with a shiny new object that gets hyped far beyond reality. At worst, the inevitable bursting of the hype bubble leads to the disappearance of the technology f Read more…

DataRobot Looks to Cut Data Science Backlog

The data science automation specialist DataRobot Inc. is gaining traction in the big data market for its machine-learning application as new investors like Intel Capital fund its expanding operations. Boston-based Dat Read more…

SnappyData Gets Funded for Spark-GemFire Combo

SnappyData today announced it has received $3.65 million in Series A funding to build a business around its real-time analytics platform that combines Apache Spark, Pivotal's GemFire data grid, and an innovative data app Read more…

Apache Beam’s Ambitious Goal: Unify Big Data Development

If you're tired of using multiple technologies to accomplish various big data tasks, you may want to consider Apache Beam, a new distributed processing tool from Google that's now incubating at the ASF. One of the cha Read more…

LinkedIn Diagnostics Help Tune Hadoop Jobs

An open source tool released last by LinkedIn developers is intended to help Hadoop and Spark users analyze, tune and improve the performance of their workflows. The self-service performance-tuning tool for Hadoop dub Read more…

Reporter’s Notebook: 6 Key Takeaways from Strata + Hadoop World

The big data ecosystem was on full display at last week's Strata + Hadoop World conference in San Jose. At the ripe old age of 10, Hadoop is still the driving force, but newer frameworks like Spark and Kafka are gaining Read more…

Cutting On Random Digital Mutations and Peak Hadoop

In a wide-ranging Strata + Hadoop World talk on Wednesday that reminds us why we like Doug Cutting so much, the father of Hadoop riffed on the evolution of big data tech, the power of open source, the promise of Flink, a Read more…

Datanami