August 22, 2018

Three Myths of Big Data: Busted

Alex Woodie


Big data is a scary thing. If you’re tasked with moving, storing, or analyzing it, big data can cause all sorts of headaches. But as troublesome as it is, we shouldn’t create monsters where none exist.

Anil Gadre, the chief product officer with MapR Technologies, kicked off last week’s MapR Convergence event in San Diego with some myth busting. His first myth — that succeeding in AI is all about picking the right algorithm — didn’t stand a chance.

“In reality, it’s a continuum,” says Gadre, a Silicon Valley veteran who previously worked at Sun Microsystems. Many MapR customers employ a range of computational models in their big data operations, including batch, micro-batch, streaming analytics, and event processing, in addition to deep learning.

The key to succeeding with big data, Gadre says, comes down to combining informational context with speed to generate action. Many MapR customers want to stay open to the next popular big data tool, whether it’s Spark or TensorFlow or Caffe. And while many MapR customers are getting value from algorithms, maintaining the models as part of a DataOps strategy is a bigger chore than many realize.

The second myth busted by Gadre was that containers are just for stateless applications. In fact, the MapR platform uses containers to support stateful applications too, he says. What’s more, data science workloads are a perfect fit for containerization technologies like Docker and Kubernetes because of their constantly changing nature.

IT professionals should embrace containers as a way to appease data scientists’ insatiable appetite for new technologies and processing capacity. “They’re too demanding. They want too much freedom,” Gadre says, referring to how IT pros view data scientists. Containers are the answer, he says.

Think going all cloud with your big data setup is the way to go? Then you probably didn’t hear Gadre’s third busted myth at the La Jolla Hyatt Regency, which is that customers are better off putting their data and infrastructure in the hands of a cloud provider.

While you may think that moving into a cloud provider’s collection of big data services is a way to keep your options open, Gadre argues that you’re actually narrowing your options. “You’re really choosing a software stack,” he says of cloud adoption.

One way to keep your options open is to run a data platform, like MapR’s offering, on cloud infrastructure. And because of the different processing and scheduling options supported by MapR, customers can actually save money by running it in the cloud, Gadre says.


