March 20, 2013

GPUs Push Big Data’s Need for Speed

Nicole Hemsoth

As we noted on Monday during our first report from the GPU Technology Conference in San Jose, graphics processor giant, NVIDIA is diving into big data in a formal way with some core announcements around new functionality, programming options and now use cases for data-intensive enterprise apps.

During his keynote, NVIDIA CEO, Jen-Hsun Huang delved into more detail about how GPU computing is invading the realm of commercial and web-scale apps.

Huang noted that when it comes to large-scale enterprise and mobile applications, using “best effort” approaches to performance (namely vanilla datacenters) just won’t cut it. Users are demanding services based on highly complex algorithms fed by constant streams of ever-changing data. Further, in many cases, especially for things like social media analysis, it doesn’t make much sense to bother storing the data—it needs to be quickly ingested and barfed out in real-time for instant analysis.

To highlight how big data is being accelerated by GPUs, one of the company’s superstar data-intensive application use cases, audio recognition service, Shazam, revealed how they are able to snip their time to results, extend their services and increase efficiency.

Shazam boasts 300 million users and is adding two million more each week. This ever-growing group of users hits the service with roughly 10 million requests to identify songs based on an audio “fingerprint” that scans a database of over 25 million songs to find the singular match. As if that process isn’t enough, the company wants to make sure they can turn over results before people get tired of waiting.

While it’s much more entertaining to simplify what the folks at Shazam are doing on the big data front, their operations grow in complexity with added scale. However, adding GPUs into the mix cut their costs down by one-third, says the company’s CTO, Jason Titus. Further, they’re able to cut down the processing time, even with the addition of new content and users.

The idea of using GPUs to boost performance of massive systems is catching on for some of the biggest of data problems (not to mention computational challenges). The top supercomputer on the planet, Oak Ridge National Lab’s Titan, is a 18,688-node GPU powerhouse, with each node boasting one of the Opteron 16-core CPUs, 32 GB of memory, and of course, an NVIDIA Tesla K20x GPU tucked in to push the system to almost 300,000 cores.

While the large majority of companies running big web-scale operations that power apps or analytics aren’t likely to plunk down the $80 million for a cluster of that magnitude (and imagine power and cooling), they are looking to GPUs to add performance while increasing overall efficiency with the hope that their investment in GPUs will pay off in terms of new capabilities and operational savings.

But then again, even with a major investment in hardware, it’s not as simple as snapping in a few graphics cards—the porting process can be a bit of challenge, although the company is working to push a new generation of big data developers into its CUDA framework, partnering to bring CUDA support to more mainstream languages, including Python.

NVIDIA has big plans for some of its new architectures. While some are more relevant to the mobile, gaming and general consumer space, Kepler and its successors could find a soft spot in the big data world–especially once the developer community is engaged.

On that note, take a look at the company’s roadmap for GPGPU computing–and the ecosystem as a whole.

More on the big data and GPU angle coming this week…

Applications: Enterprise Analytics

Technologies: Processors, Systems

Vendors: NVIDIA

Tags: big data, gt13, huang, Nvidia, shazam

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

GPUs Push Big Data’s Need for Speed

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 26, 2024

April 25, 2024

April 24, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Top 6 Strategies for Reducing Data Warehouse Costs

Building an Operational Data Warehouse for Real-time Analytics

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

AI & Big Data Expo North America 2024

CDAO Canada Public Sector 2024

AI Hardware & Edge AI Summit Europe

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

GPUs Push Big Data’s Need for Speed

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 26, 2024

April 25, 2024

April 24, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link