July 14, 2015

Univa Gives ‘Pause’ to Big Data Apps

Alex Woodie

Scheduling workloads on today’s big analytic clusters can be a big challenge. Your team may have carefully everything lined up, only to have a last-minute change leave your schedule in shambles. One company that’s close to a solution is Univa, which today announced the addition of its “preemption” feature that allows admins to momentarily “pause” workloads so they can run a higher priority application.

Historically, admins were loathe to stop HPC or big data jobs before they ended, because it could be so expensive to get the job started again. But the preemption feature that Univa is shipping in Grid Engine 8.3 gives admins the power to pause a workload, and then resume it moments later, without the need to start from the beginning.

“You could liken the function to a home DVR system,” says explains Bill Bryce, Vice President of Products at Univa. “Users can be confident that if they pause one program to watch another, they will always be able to come back to finish the first.”

According to Univa, which made the announcement at the International Supercomputing Conference being held this week in Frankfurt, Germany, says this is the first time a preemption feature has been able to work with big data apps.

While the Univa Grid Engine is commonly found in HPC environments, it also supports big data workloads, such as Hadoop and MapReduce. In fact, the Univa Grid Engine provides essentially the same basic functionality that resource schedulers like YARN and Mesos—as well as more advanced functions, such as the new preemption feature.

Univa CTO Fritz Ferstl provided a good description of the Univa Grid Engine’s place in a big data world in a 2013 Datanami story. He writes:

“Univa Grid Engine and similar workload management tools have matured over the past two decades in technical and scientific computing, as well as in HPC, and have evolved to become an essential cog in Big Data infrastructures. Today Univa Grid Engine supports an impressive and essentially open-ended spectrum of use case scenarios. They are being deployed across all market sectors and typically in business and performance- critical situations.

“Univa Grid Engine, in particular,” he continues, “also supports a scale exceeding 100,000 cores with massive throughput of jobs of any size. Scale and throughput is essential in Big Data environments because Big Data workloads (also known as ‘jobs’) often have comparatively short runtimes, while the amount of data to be analyzed is massive. This results in a large count of jobs to be processed on growing cluster sizes as data volumes and time-to-result requirements increase.”

Managing MapReduce Applications in a Shared Infrastructure

Applications: Enterprise Analytics, Research Analytics

Technologies: Frameworks, Processors, Systems

Sectors: Academia, Biosciences, Financial Services, Other

Vendors: Univa

Tags: Hadoop, hpc, mapreduce, Univa Grid Engine

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Univa Gives ‘Pause’ to Big Data Apps

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 16, 2024

April 15, 2024

April 12, 2024

April 11, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Univa Gives ‘Pause’ to Big Data Apps

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 16, 2024

April 15, 2024

April 12, 2024

April 11, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link