Sectors » Biosciences

Features

Big Data Outlier Detection, for Fun and Profit

Sep 30, 2014 |

As we discussed in the first part of this series, how you handle data outliers can determine whether your big data project ends with a bang or flames out in failure. But before you even decide what to do with outliers, you need to be able to detect them. That is easier said than done. Because they can mean different things at different times, outliers can be extremely challenging to deal with in a big data context. On the one Read more…

How a Facebook-Like Graph Powers Drug Discovery

Sep 29, 2014 |

Researchers have long sought to identify the key proteins involved in the development of diseases like cancer. However, the time and effort required to check each combination of proteins can be daunting. But thanks to the advent of graph analytics, researchers can now build models of protein networks, thereby enabling mass parallelization of the protein problem and powering a more efficient drug discovery process. One of the companies employing advanced graph analytics in drug discovery is e-Therapeutics, a British biotech Read more…

Under New CEO, Lucidworks Aims to Redefine Search and Itself

Sep 18, 2014 |

Lucidworks today unveiled an ambitious new enterprise search application called Fusion that uses advanced signal processing and analytics to drive a new level of personalization and push-delivery of information to users. The product is the culmination of a year-long development initiative under new CEO Will Hayes, who aims to carve a new big data niche for the Solr backer. Search is a ubiquitous component of the Internet, the original “killer app.” Type in your query, hit enter, and voila: a Read more…

Tape Gets Second Wind as Big Data Mounts

Sep 16, 2014 |

Think tape is dead in our big data world? Think again. This week, the National Center for Supercomputing Applications (NCSA) announced that it bought 20PB of tape capacity to expand the world’s biggest data archive. Meanwhile, the LTO Program has plotted out a roadmap for the next decade that will eventually see a single LTO tape cartridge storing 120 TB of data. While tape is looked down upon in our speed-obsessed culture, the old standby has proven its relevance in Read more…

Big Data Outliers: Friend or Foe?

Sep 2, 2014 |

The bigger your dataset, the greater your chance of stumbling into an outlier. It’s practically a certainty you’ll find isolated, unexpected, and possibly bizarre data you never expected to see in your data. But how you respond to these outliers could mean the difference between big data success and failure. How should you deal with data outliers? The answer is simple: It depends. On the one hand, the presence of outliers may be a sign of serious data quality issues, Read more…

News In Brief

Food For Thought: Startup Targets Nutritional Data

Sep 16, 2014 |

A U.K. data startup that has developed a semantic platform targeting nutrition and health applications said it has added more than 500,000 U.S. products to its dataset in hopes of promoting new applications for everything from nutritional information to exercise devices to restaurant menus. London-based Klappo said earlier this month that the expansion of its dataset includes 525,000 U.S. product barcodes containing ingredient information about the nutritional value of U.S. food products. It said about 100,000 cover new food ingredients Read more…

Data Tools Help Researchers Tackle Pediatric Cancer

Sep 10, 2014 |

A genomic data analysis platform being installed at the National Cancer Institute will deliver rapid analysis of billions of data points required when sequencing human DNA and other genomic data. Translational Genomics Research (TGen) said Sept. 10 it is providing the National Cancer Institute with high performance computing and bioinformatics support along with specialized tools designed to support pediatric cancer research programs. Those efforts include personalized medicine trial for pediatric cancer patients being conducted by the Neuroblastoma and Medulloblastoma Translational Read more…

Data Startup Targets Machine Learning for Healthcare

Aug 27, 2014 |

A medical diagnostic startup is attempting to use recent advances in machine learning as a way to make it easier for doctors to sort through medical information in the form of images, unstructured data like notes on a patient’s history and structured laboratory test results. “Medical diagnostics is, at its heart, a data problem,” notes Jeremy Howard, founder and CEO of Enlitic, a San Francisco-based startup that wants to use machine-learning technology to transform diagnostic healthcare. “Recent applied machine learning Read more…

Databases Used to Trace Western Migrations

Aug 15, 2014 |

Researchers at the University of Texas at Dallas used three databases of “notable individuals” over the last 2,000 years to trace western migration patterns. While Rome remained the cultural center of the European continent through the early Middle Ages, the researchers also found that Paris, locations in what is now the Netherlands and northern Italy were among the emerging cities that attracted European intellectuals and those elites who’s birth and death records made it into the databases. According to the Read more…

Climate Researchers Crunch Data on Weather Extremes

Aug 5, 2014 |

Few data science problems can top weather forecasting and climate modeling for sheer complexity. Hence, big data is playing a key role in tracking changes in global climate and, with it, weather and climate extremes. The application of data analysis to decades of weather and climate data can provide insights that inform billion-dollar investments in agriculture, commodities, insurance and the financial sector, experts stress. More insights could be drawn from recent research at Northeastern University in Boston that used climate Read more…

This Just In