October 30, 2012

Digging Mutants Out of Massive Data


Genetics has been at the forefront of big data research out of necessity. A lot of data is required just to hold genomes, whether they be human, mouse, or any other organism, not to mention the amount of memory and processing needed to actually compare and analyze those genomes.

Professor Bruce Beutler of the Beutler Laboratory, now located at the University of Texas Southwestern Medical Center in Dallas, has been helping to advance that research. With the help of BioMed Central’s database Mutagenetix, the datasets involved are becoming more accessible to Beutler and the rest of the scientific world. His research involves creating random mutations in mouse DNA in an attempt to connect those mutations to certain phenotypes.

“Over a period of eleven years,” Beutler said, “we accumulated hundreds of mutations, induced in the mouse genome using the germ line mutagen ENU. Some of these mutations caused striking phenotypes, and broke new ground in immunology, metabolism, neuroscience, and basic cell biology.”

The research is intended to build a better understanding of how genetic mutations, which can cause anything from disease to evolutionary advancement, are created. That mutation formation process is aptly named mutagenesis. Studying mutagenesis is understandably quite data intensive, given the number of genes within the genomes that have to be mapped and followed.

This is where Mutagenetix comes in: it is an open database of ENU-generated data, where ENU (short for N-ethyl-N-nitrosourea) is the primary driver of phenotype creation in mice.

“The dataset we have acquired allows us to make strong inferences about how mutagenesis works,” Beutler said. “Our data also provide the basic information needed to model mutagenesis in vitro, to predict just how many genes will be damaged or destroyed in a given population of mice.”

Beutler won the 2011 Nobel Prize in Physiology or Medicine for his work, which has implications for determining how human phenotypes and diseases are created. The Mutagenetix database extends his work and allows scientists to conduct their own experiments based on the information freely available.

Per BioMed Central’s Iain Hrynaszkiewicz, “As a result of our partnership with LabArchives, data which extend the Beutler lab's Nobel Prize-winning discoveries are available for public use, and the data are put into context with an associated peer-reviewed journal publication in BMC Research Notes.”

Hrynaszkiewicz hopes the availability will prompt other scientists not only to use the Nobel Prize-winning data, but also to publish their own results in a similar open environment.
