Language Flags

Translation Disclaimer

HPCwire HPC in the Cloud Digital Manufacturing Report Green Computing Report


January 14, 2013

Thumbtack Releases Benchmark Analysis of NoSQL Databases


Thumbtack Technology has published a report analyzing how NoSQL databases perform under conditions of extremely high load—addressing an increasingly common use case with big data.

The Thumbtack benchmark report, entitled Ultra-High Performance NoSQL Benchmarking: Analyzing Durability and Performance Tradeoffs, is intended to help organizations obtain an independent understanding of the strengths and weaknesses of various NoSQL database approaches to supporting applications that process huge volumes of data and need durable and consistent storage semantics. In it, Thumbtack takes a focused, tuned and optimized approach to answering specific real-world questions by evaluating four databases (Aerospike, Cassandra, Couchbase, and MongoDB) as they would be optimized for this scenario.

In particular, the report focused on how durability and consistency needs affect raw performance. The data strongly suggests that for synchronously replicated, durable data, Aerospike was able to outperform the other databases by a wide margin, whereas when those constraints are relaxed, both Aerospike and Couchbase are able to process transaction volumes approaching 1 million operations per second.

“Previous published NoSQL benchmarks typically have run out-of-the-box queries against default settings on a broad set of databases, but not all use cases are the same. That is why we have focused on testing a few NoSQL databases for their ability to address a specific class of real-world problem with big data,” said Ben Engber, Thumbtack Technology CEO and co-author of the benchmark study. “We believe IT decision-makers will benefit from the focused results of our report, as they seek to gain a clearer understanding of how databases can support their application needs.”

Benchmark Methodology

Thumbtack performed its tests using a modified version of the Yahoo Cloud Serving Benchmark (YCSB) from Yahoo! Research, which is rapidly becoming the standard for benchmarking NoSQL databases. However, because the YCSB hasn’t been optimized for testing at the highest volumes, Thumbtack altered and extended the YCSB tool and supporting scripts to overcome these limitations as part of this benchmarking project. In conjunction with publishing the report, Thumbtack has also made available its changes to the core YCSB framework on which the tests were run. These include various performance and functional changes that allow YCSB to create a higher load and support a variety of different kinds of replication and durability models.

Today’s report is the first in a series of reports Thumbtack plans to produce to help organizations make the correct decisions in the very broadly defined NoSQL space. Upcoming reports will provide detailed analysis of aspects other than raw key-value performance, including fault-tolerance and secondary index support. Additional databases and hosting environments will also be incorporated into the analysis.

Share Options


Subscribe

» Subscribe to our weekly e-newsletter


Discussion

There are 0 discussion items posted.

 
Xyratex

Sponsored Links

Sponsored Whitepapers

Parallel Performance of the IMSL C Numerical Library with OpenMP

05/21/2013 | Rogue Wave Software

Download whitepaper containing benchmark results depicting the speedup achieved as a result of incorporating OpenMP directives in the IMSL C Numerical Library, for portable, cross platform analytics.

Download this Whitepaper...

Best Practices in Big Data Storage - Sponsored by Cleversafe, Cray, DDN, NetApp, & Panasas

05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas

From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.

Download this Whitepaper...

View the White Paper Library

Sponsored Multimedia

SGI President and CEO, Jorge Titinger, on Big Data

SGI President and CEO, Jorge Titinger, talks about SGI's history and leadership in HPC and how that has converged into Big Data Solutions.

View Multimedia

Cray CS300-AC Cluster Supercomputer Air Cooling Technology Video

The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.

View Multimedia

More Multimedia

SGI DataRaptor with MarkLogic Database

Job Bank

Datanami Conferences Ad

Featured Events

June 4-4, 2013
The Economist's Information Forum
San Francisco, CA
United States

June 10-13, 2013
Cloud & Big Data Expo
New York City, NY
United States

June 17-18, 2013
Forecast 2013
San Francisco, CA
United States

June 19-20, 2013
GigaOM Structure
San Francisco, CA
United States

June 26-27, 2013
2013 Hadoop Summit
San Jose, CA
United States

June 26-27, 2013
Big Data World Congress
London
United Kingdom

» View/Search Events

» Post an Event