Language Flags

Translation Disclaimer

HPCwire HPC in the Cloud Digital Manufacturing Report Green Computing Report
Rogue Wave

March 01, 2013

SDSC Creating BigData Top100 List


The San Diego Supercomputer Center (SDSC) at the University of California, San Diego, today announced plans for a community-based effort to create the BigData Top100 List, the first global ranking of its kind for systems designed for big data applications.

The BigData Top100 List will rank systems according to their performance on an application-level workload specification, while also reporting on system efficiencies in terms of price/performance. As an application-level benchmark, the list will complement other rankings of high-performance computing (HPC) systems, such as the Top500 and Graph500.

The National Science Foundation (NSF) has defined big data as “large, diverse, complex, longitudinal, and/or distributed data sets generated from instruments, sensors, Internet transactions, email, video, click streams, and/or all other digital sources available today and in the future.”

The BigData Top100 List initiative was announced at the O’Reilly Strata Conference in Santa Clara, California this week in a joint presentation by Baru and Milind Bhandarkar, Chief Scientist, Greenplum, a division of EMC and an industry sponsor of the CLDS. Baru and Bhandarkar are members of the initial BigData Top100 List steering group, which also includes Dhruba Borthakur (Facebook), Eyal Gutkind (Mellanox), Jian Li (IBM), Raghunath Nambiar (Cisco), Ken Osterberg (Seagate), Scott Pearson (Brocade), Meikel Poess (Oracle), Tilmann Rabl (University of Toronto), Richard Treadway (NetApp), and Jerry Zhao (Google).

A comprehensive article outlining the introduction of the new list and describing the benchmarking initiative was published in the March 2013 inaugural issue of the quarterly journal Big Data.

“The creation of a new journal focused solely on big data underscores the importance of this trend across all applications domains, in science as well as business,” said Bhandarkar. “The benchmarking effort is a pioneering activity in big data, and creating such a benchmark is a vital step toward fostering competition and innovation in the field.”

In addition, an online competition for refining the benchmark dataset and benchmark workload will be announced shortly on kaggle.com, a leading platform for predictive modeling competitions.

“We were excited when SDSC approached us with the idea of benchmarking data systems,” said Will Cukierski, who co-leads development of public competitions for Kaggle.  “Competitions reward objective merit, and merit is what ought to matter as this industry matures.”

“The existence of such benchmarks enables healthy competition among technology and solution providers, resulting eventually in product improvements and evolution of new technologies,” added Baru.

Related Articles:

SDSC’s Chaitan Baru Named Associate Director, Data Initiatives

Share Options


Subscribe

» Subscribe to our weekly e-newsletter


Discussion

There are 0 discussion items posted.

 
Xyratex

Sponsored Links

Sponsored Whitepapers

Parallel Performance of the IMSL C Numerical Library with OpenMP

05/21/2013 | Rogue Wave Software

Download whitepaper containing benchmark results depicting the speedup achieved as a result of incorporating OpenMP directives in the IMSL C Numerical Library, for portable, cross platform analytics.

Download this Whitepaper...

Best Practices in Big Data Storage - Sponsored by Cleversafe, Cray, DDN, NetApp, & Panasas

05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas

From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.

Download this Whitepaper...

View the White Paper Library

Sponsored Multimedia

SGI President and CEO, Jorge Titinger, on Big Data

SGI President and CEO, Jorge Titinger, talks about SGI's history and leadership in HPC and how that has converged into Big Data Solutions.

View Multimedia

Cray CS300-AC Cluster Supercomputer Air Cooling Technology Video

The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.

View Multimedia

More Multimedia



Job Bank

Datanami Conferences Ad

Featured Events

May 22-23, 2013
Business Intelligence Innovation Summit
Chicago, IL
United States

June 4-4, 2013
The Economist's Information Forum
San Francisco, CA
United States

June 10-13, 2013
Cloud & Big Data Expo
New York City, NY
United States

June 19-20, 2013
GigaOM Structure
San Francisco, CA
United States

June 26-27, 2013
2013 Hadoop Summit
San Jose, CA
United States

June 26-27, 2013
Big Data World Congress
London
United Kingdom

» View/Search Events

» Post an Event