Language Flags

Translation Disclaimer

HPCwire HPC in the Cloud Digital Manufacturing Report Green Computing Report


January 30, 2013

Collecting Big Data in Big Oceans


We have explored nearly every nook and cranny of the earth’s terrain. We have looked into space and examined galaxies billions of light years away. Yet what lies below the Earth’s waters had largely remained a mystery.

As such, the Census of Marine Life (CoML) was formed in 2000 to investigate the life in the oceans. The project was a $650 million effort that lasted ten year and involved 2700 scientists from over 80 nations. It resulted in 2600 scientific publications, over 6000 potential new species discoveries, and over 30 million species records, and counting.

While the CoML officially ended in 2010, the technological infrastructure that housed the information, known as the Ocean Biogeographic Information System, or OBIS, remains to prompt collaboration among marine biologists.

Niki Vermeulen of the Centre for the History of Science, Technology, and Medicine at the University of Manchester discussed this significant technological undertaking in a study looking to historically contextualize the CoML. From that historical standpoint, the CoML brought the discipline of marine biology into a similar open-access, collaborative, multi-team effort that is currently being experienced in genetics, proteomics, particle physics, and more.

The database itself, according to Vermeulen, consisted of information on living organisms drawn from taxonomic databases (OBIS drew from 12 of these at the time of the study) and geographical information from GIS systems that noted where species were found. The system was freely available on the internet, openly accessible to all participants in the CoML.

One of the goals of the CoML project, particularly the technological side of it, was to model ocean life’s future. That proved markedly difficult, according to Vermeulen, for various reasons. For one, it was difficult to obtain a proper picture of the past when faced with insufficient data. Further, the current models are unable to accommodate the vast amount of variables present among the diverse species that exist.

With that said, they were able to conclude that commercial fishing has cut into certain species’ population by almost 50 percent, a finding produced from studying trends from the 1950s to 1999 and then aggregating that with CoML data before and running it through OBIS. That realization was one of the first major big data-related findings from CoML and OBIS.

The experiments and discoveries studied in grade school are mostly dominated by lone researchers who stumble upon revelations. Rutherford prompted the discovery of the nucleus by shooting particles at gold foil to find one in ten thousand shooting right back; Watson and Crick found the shape of DNA using crystallography; Mendel developed genetics by examining pea pods. However, with most of the fundamentals known, the days where a single scientist changes the worldview with a single experiment are waning, and CoML’s effort represented one more example of this trend.

Further, while the oceans remain relatively unexplored and several species remain to be discovered, the focus has shifted to forming a large-scale understanding of marine life. The comprehensive OBIS database is a step in that direction.

Related Articles

Can Extragalactic Data Be Standardized?

Digging Mutants Out of Massive Data

A Different Einstein on Another Old Problem

Share Options


Subscribe

» Subscribe to our weekly e-newsletter


Discussion

There are 0 discussion items posted.

 
SGI Hadoop

Sponsored Links

Sponsored Whitepapers

Best Practices in Big Data Storage - Sponsored by Cleversafe, Cray, DDN, NetApp, & Panasas

05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas

From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.

Download this Whitepaper...

Big Data, Big Brains – Sponsored By NetApp

04/22/2013 | NetApp

Big data has proven to be one of the most promising yet challenging technologies for both government and industry. But, before IT leaders can harness the full potential of big data, there are key issues to address surrounding infrastructure, storage, personnel, and training.
MeriTalk surveyed 17 visionary big data leaders to find out what they see as the big data challenges and opportunities as well as how government can best leverage big data. Download the “Big Data, Big Brains Report”.

Download this Whitepaper...

View the White Paper Library

Sponsored Multimedia

SGI President and CEO, Jorge Titinger, on Big Data

SGI President and CEO, Jorge Titinger, talks about SGI's history and leadership in HPC and how that has converged into Big Data Solutions.

View Multimedia

Cray CS300-AC Cluster Supercomputer Air Cooling Technology Video

The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.

View Multimedia

More Multimedia



Job Bank

Datanami Conferences Ad

Featured Events

May 22-23, 2013
Business Intelligence Innovation Summit
Chicago, IL
United States

June 4-4, 2013
The Economist's Information Forum
San Francisco, CA
United States

June 10-13, 2013
Cloud & Big Data Expo
New York City, NY
United States

June 19-20, 2013
GigaOM Structure
San Francisco, CA
United States

June 26-27, 2013
2013 Hadoop Summit
San Jose, CA
United States

June 26-27, 2013
Big Data World Congress
London
United Kingdom

» View/Search Events

» Post an Event