Tag: Spark

MongoDB Struts Its NoSQL Stuff in NYC

When you think about giants of the technology world, MongoDB may not come to mind. But judging by the big strides this up-and-coming NoSQL database vendor is making, and the aggressive roadmap it put forth today at the t Read more…

What’s Hot This Summer: Data Science Bootcamps

Summer is here and temperatures are rising. While some of us take vacations or cool off at the beach, prospective data scientists are heating up their job prospects by participating in one of a growing number of data sci Read more…

Apache Spark Adoption by the Numbers

It's been about three years since Apache Spark burst onto the big data scene and became one of the hottest technologies on the planet. Judging by the numbers surrounding Spark's adoption—including things like salaries, Read more…

IBM Seeks Data Science Unity with New Spark-Based ‘Experience’

IBM today launched what it's calling the first enterprise application for data science collaboration. Called the Data Science Experience, the free, cloud-based offering is aimed at enabling data scientists to perform tas Read more…

Big Data Benchmark Gauges Hadoop Platforms

In another indication of a maturing technology and growing demand, an industry group has released a big data analytics benchmark designed to gauge the performance of Hadoop-based systems. The Transaction Processing Pe Read more…

Merging Batch and Stream Processing in a Post Lambda World

It wasn't long ago that developers looked to the Lambda architecture for hints on how to design big data applications that needed elements of both batch and streaming data. But already, the Lambda architecture is falling o Read more…

How Spark and Hadoop Are Advancing Cancer Research

The combination of Spark and Hadoop has supercharged big data analysis across many industries and use cases by lowering the barrier of entry to advanced analytics and thereby enabling data scientists to create data-drive Read more…

Hadoop Past, Present, and Future

Every few years the technology industry seems to be consumed with a shiny new object that gets hyped far beyond reality. At worst, the inevitable bursting of the hype bubble leads to the disappearance of the technology f Read more…

DataRobot Looks to Cut Data Science Backlog

The data science automation specialist DataRobot Inc. is gaining traction in the big data market for its machine-learning application as new investors like Intel Capital fund its expanding operations. Boston-based Dat Read more…

SnappyData Gets Funded for Spark-GemFire Combo

SnappyData today announced it has received $3.65 million in Series A funding to build a business around its real-time analytics platform that combines Apache Spark, Pivotal's GemFire data grid, and an innovative data app Read more…

Apache Beam’s Ambitious Goal: Unify Big Data Development

If you're tired of using multiple technologies to accomplish various big data tasks, you may want to consider Apache Beam, a new distributed processing tool from Google that's now incubating at the ASF. One of the cha Read more…

LinkedIn Diagnostics Help Tune Hadoop Jobs

An open source tool released last by LinkedIn developers is intended to help Hadoop and Spark users analyze, tune and improve the performance of their workflows. The self-service performance-tuning tool for Hadoop dub Read more…

Reporter’s Notebook: 6 Key Takeaways from Strata + Hadoop World

The big data ecosystem was on full display at last week's Strata + Hadoop World conference in San Jose. At the ripe old age of 10, Hadoop is still the driving force, but newer frameworks like Spark and Kafka are gaining Read more…

Cutting On Random Digital Mutations and Peak Hadoop

In a wide-ranging Strata + Hadoop World talk on Wednesday that reminds us why we like Doug Cutting so much, the father of Hadoop riffed on the evolution of big data tech, the power of open source, the promise of Flink, a Read more…

Apache Flink Creators Get $6M to Simplify Stream Processing

Real-time stream processing is one of the hottest topics this week at Strata + Hadoop World, and one of the new frameworks turning heads is Apache Flink. Developed by the German company data Artisans, Flink is unique in Read more…

Finding Long-Term Solutions to the Data Scientist Shortage

As we learned in the first part of this series, the gap between demand for skilled data scientists and supply is driving salaries north of $200,000 in some areas of the country. If big data analytics is to be democratize Read more…

Machine-Learning Platform Certified For Cloudera

In the run-up to next week's Hadoop confab in Silicon Valley, vendors are releasing a flock of automation and other tools aimed at beefing up the mainstream data processing framework. Among them is an attempt to incorpor Read more…

Why Hadoop Must Evolve Toward Greater Simplicity

Developers have been filing the rough edges off Apache Hadoop ever since the open source project started to gain traction in the enterprise. But if Hadoop is going to take the next step and become the backbone of analyti Read more…

From Hadoop to Zeta: Inside MapR’s Convergence Conversion

If you're a regular Datanami reader, you likely know MapR Technologies as a Hadoop distributor, one of the three "pure play" providers alongside Hortonworks and Cloudera. But with its integrated NoSQL database, a modifie Read more…

How Big Data Can Empower B2B Sales

As consumers, we've grown accustomed to having Big Data look over us. We're no longer surprised when Amazon recommends a perfect pair of headphones for a 14-year-old girl, or when Target reminds us it's time to buy laundry de Read more…

Datanami