January 19, 2016

Survey Sees Spark Emerging in 2016

George Leopold

This is the “Year of Spark,” asserts a new big data survey on analytics priorities.

The survey of more than 250 data scientists and architects, IT managers and business intelligence analysts released on Tuesday (Jan. 19) found that nearly 70 percent of users expressed interest in deploying Apache Spark in the coming year. While current leader MapReduce is expected to remain the dominant compute framework in production, survey sponsor Syncsort noted that the “high level of interest should translate into more Spark deployments, mostly running on Hadoop.”

“The ability to design data transformations once and then run them anywhere—across Hadoop MapReduce, Spark, Linux, Windows, or Unix, on premise or in the cloud—is critical,” the survey concluded.

Another conclusion drawn from the survey was that the “ability to transform and prepare data in flight will be more important, eliminating the need for staging increasing volumes of data.”

Added Tendü Yoğurtçu, general manager of Syncsort’s Big Data unit: “Though challenging, this will also create an opportunity to deliver next generation data integration products, future-proofing user’s applications while taking advantage of highly scalable and distributed platforms like Apache Hadoop and Spark,” either on-premise or in the cloud.

Syncsort, Woodcliff Hills, N.J., also said its “Hadoop Perspectives” survey found that the number of users switching to Hadoop will continue to increase based on a range of cost and operational benefits. Among them is the desire to make more data available to more business users across organizations.

At the same time, more respondents said they want to leverage Hadoop for “advanced use cases” like crunching unstructured data from social media sources as well as rolling out Internet of Things (IoT) strategies. Data executives also cited the desire to make greater use of predictive analytics and visualization to gain deeper insights into the customers’ preferences.

Meanwhile, 40 percent of respondents said they currently use Hadoop as a cheaper alternative for storage and processing in their data warehouses. The survey also noted that Hadoop has “yet to be leveraged for mobile apps and software,” with a mere 4.9 percent of respondents reporting utility for those use cases.

Based on the results of the survey conducted late last year, Syncsort also predicted greater use of streaming, real-time data sources along with greater emphasis on data governance and security as the pace of production deployments quickens in 2016.

In the first instance, “the best business decisions often require the most recent data available,” the survey found. The most popular use cases included fraud detection, analytics on telemetry and security data, insurance claim validation and IoT deployments.

Meanwhile, the survey predicted that more organizations would adopt a “Hadoop first” approach to data management, “skipping traditional and more expensive platforms and applying metadata, lineage, security, and other data management measures on Hadoop from the start.”

Recent items:

3 Major Things You Should Know About Apache Spark 1.6

Spark Streaming: What Is It and Who’s Using It?

Applications: Predictive Analytics, Visualization

Technologies: Frameworks

Tags: apache spark, Hadoop, mapreduce, not, syncsort

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Survey Sees Spark Emerging in 2016

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

May 10, 2024

May 9, 2024

May 8, 2024

May 7, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Top 6 Strategies for Reducing Data Warehouse Costs

Building an Operational Data Warehouse for Real-time Analytics

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

AI & Big Data Expo North America 2024

CDAO Canada Public Sector 2024

AI Hardware & Edge AI Summit Europe

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Survey Sees Spark Emerging in 2016

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

May 10, 2024

May 9, 2024

May 8, 2024

May 7, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link