Big Data • Big Analytics • Big Insight

Sectors » Biosciences

Features

Taming the Wild Side of Hadoop Data

Jan 28, 2015 |

Organizations are attracted to Hadoop because it lets them store huge amounts and different types of data, and worry about structuring it later. But that “anything goes” philosophy has a downside, and can threaten to turn a big data lake into bottomless pit. Today Hortonworks unveiled a plan to give data management on Hadoop just a bit more structure. Dubbed the Data Governance Initiative (DGI), Hortonworks aims to lead the development of new open source software that help organizations track Read more…

Three Ways Big Data and HPC Are Converging

Jan 27, 2015 |

Big data is becoming much more than just widespread distribution of cheap storage and cheap computation on commodity hardware.  Big data analytics may soon become the new “killer app” for high performance computing (HPC). There is more to big data than large amounts of information.  It also pertains to massive distributed activities such as complex queries and computations (a.k.a analytics).  In other words, deriving value through computation is just as “big” as the size of the data sets themselves.  In Read more…

Can Big Data Give Us Bionic Brains?

Jan 20, 2015 |

The rise of big data and analytics promises to transform many aspects of our lives, chief among them how we synthesize data and make decisions. Some theorize that big data will augment our own cognitive capabilities to the point where it’s like we have bionic brains. According to Deloitte Analytics, the field of cognitive analytics is an extension of cognitive computing, and is made up of three main components: machine learning, natural language processing, and advanced analytics. These technologies, combined Read more…

Dato Aims to Unleash Machine Learning

Jan 8, 2015 |

The new year started with a bang for GraphLab, the big data analytics startup headquartered near Seattle. Today the company announced the completion of an $18.5 million Series B round of venture funding. It also just changed its name to Dato. CEO Carlos Guestrin gave Datanami the low down on the new name and the state of machine learning in a recent telephone briefing. “Our goal has been to make machine learning accessible to more and more people, to allow Read more…

Graph Analytics Poised to Solve Tough Big Data Problems

Nov 3, 2014 |

Hadoop has emerged as the go-to platform for sifting through massive amounts of data on commodity machines. But when it comes to certain types of analytic workloads with open-ended problems, nothing beats a graph database, which may or may not run on Hadoop. The product category is maturing quickly and well positioned to make major inroads in the field of analytics over the next few years. There’s no denying that graphs are hot. “Graph analysis,” analyst firm Gartner says, “is Read more…

News In Brief

U.S. Names First Chief Data Scientist

Feb 24, 2015 |

An industry veteran and college math professor who is partially credited with coining the title “data scientist” has been named the nation’s first chief data scientist. The White House announced the appointment of DJ Patil to the new post last week. Patil also will serve as the Obama administration’s deputy chief technology officer for data policy, the White House said. Patil most recently served as a vice president at RelateIQ, a customer relationship management specialist acquired by Salesforce in July Read more…

Connecting the Dots on Dark Data

Feb 17, 2015 |

It’s estimated that 90 percent of the data in the average enterprise is “dark data”–that is, data that isn’t readily available for analytics—leaving only about 10 percent data exposed for doing analytics. While the inclusion of pertinent metadata and master data management (MDM) structure is the long term answer to the dark data problem, enterprises need a way to get value out of that data now. The dark data problem runs deep in many organizations, especially those that have implemented Read more…

Data Companies Work With Citizen Scientists on Climate

Dec 11, 2014 |

Storage vendor EMC Corp. is joining forces with big data and cloud specialist Pivotal and the EarthWatch Institute in an effort to apply analytical tools to the study the impact of climate change. The partners along with Schoodic Institute at Acadia National Park will study the interactions between nature and climate as part of a broader effort to promote citizen science using big data lakes, analytical tools and visualizations, the partners said. The “Big Data vs. Climate Change” initiative was Read more…

Google Targets Big Genome Data

Nov 12, 2014 |

Google wants to leverage the infrastructure that runs its dominant search engine, Google Maps and Gmail to help analyze and share big genomic data. Google Genomics is being promoted as a way to store growing DNA data in the cloud. Users could load and export genomic data for free, then pay about $25 a year for storage and queries. Storage is calculated in terabytes per month, queries in millions of API calls. The search giant joins other cloud giants in Read more…

Machine Learning Gets a Boost from Google

Oct 23, 2014 |

Search giant Google announced a partnership with Oxford University researchers that will target artificial intelligence applications such as image recognition and natural language understanding. Google said U.K. researchers with its recently acquired DeepMind initiative would work with Oxford AI specialists who earlier this year cofounded Dark Blue Labs. The cofounders, Nando de Freitas, Phil Blunson, Edward Grefenstette and Karl Moritz Hermann, are considering leading experts in the use of “deep learning” for machine understanding of natural language. The researchers “will Read more…

This Just In

PSC to Provide HPC Resources for Pitt Big Data Project

Oct 9, 2014 |

Oct. 9 — The National Institutes of Health has awarded the University of Pittsburgh an $11 million, four-year grant to lead a Big Data to Knowledge Center of Excellence, an initiative that will help scientists capitalize more fully on large amounts of available data and to make data science a more prominent component of biomedical research. The Pittsburgh Supercomputing Center will provide HPC resources and expertise to support the effort. Much of science focuses on understanding the “why” or “how” Read more…