Technologies » Middleware

Features

Enforcing Hadoop SLAs in a Big YARN World

Jul 23, 2014 |

The Apache Hadoop community has done a truly amazing job developing a scalable and versatile platform for big data analytic workloads. And with the recent introduction of YARN in Hadoop 2, we’re now able to run multiple analytic engines on our clusters simultaneously. Unfortunately, the potential for resource contention has also gone up, and that will likely increase demand for service level agreement (SLA) enforcement. YARN made its big introduction just as companies started to move their Hadoop deployments out Read more…

FDA Mines Billing Data for Drug Interaction Insight

Jul 22, 2014 |

The Food and Drug Administration is five years into a pilot program aimed at identifying hazardous drug interactions by mining the medical billing records of millions of Americans. The program, dubbed Mini-Sentinel, is a creative application of big data technologies that has the potential to improve people’s lives. The FDA started funding the Mini-Sentinel project in 2009 with the goal of coming up with a better way to monitor for unintended side effects of prescription drugs. It’s impossible to eliminate all Read more…

Streaming Analytics Ready for Prime Time, Forrester Says

Jul 22, 2014 |

Analytic platforms that generate insights from data in real time are mature enough for enterprises to begin adopting them, Forrester says in its latest report. While open source streaming analytic products like Apache Storm are proving popular, Forrester says they lack key functionality found in the offerings of proprietary vendors, such as top-rated Software AG. You don’t need a Forrester analyst to know that streaming analytics is red hot at the moment. If Hadoop has opened our eyes to what Read more…

Inside Sibyl, Google’s Massively Parallel Machine Learning Platform

Jul 17, 2014 |

If you’ve ever wondered how your spam gets identified in Gmail or where personal video recommendations come from on YouTube, the answer is likely Sibyl, a massively parallel machine learning system that Google developed to make predictions and recommendations with user-specific data culled from its Internet applications. Dr. Tushar Chandra, a distinguished Google Research engineer, recently shared some information on Sibyl in a keynote presentation at the annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). Sibyl is not Read more…

Can You Trust Your Algorithms?

Jul 15, 2014 |

Algorithms are critical to how we interact with data. And as the volume and variety of data increase, so does our reliance on algorithms to give us the answers we seek. But how much faith should you put into those algorithms, and how can you be sure they’re not misleading you? They’re not simple questions, but through the use of algorithmic differentiation techniques, data scientists can get more precise answers. Algorithmic differentiation, sometimes called automatic differentiation, is a technique used Read more…

News In Brief

HP Taps Hortonworks to Supply Hadoop for HAVEn

Jul 24, 2014 |

Hewlett-Packard and Hortonworks today announced a strategic partnership that will see the vendors work together around Hadoop. In exchange for HP’s $50 million investment in Hortonworks, HP gets the right to distribute the Hortonworks Data Platform (HDP) as the Hadoop component of its HAVEn big data suite. While HP offers a range of IT solutions for big data problems, the IT giant has not had a very aggressive Hadoop strategy. It strengthened its Hadoop story in June 2013, when it Read more…

See Spark Run on NoSQL, DataStax Says

Jun 30, 2014 |

DataStax today announced that Apache Spark is included in the latest release of its NoSQL databases, including open source Cassandra and DataStax Enterprise version 4.5. Those databases now include the in-memory Spark tools, thereby giving customers a new option for running analytic workloads on transactional data. Apache Spark has garnered loads of attention as a potent analytic toolset for Hadoop. Developers are eager for Spark, which offers a single API for accessing a treasure trove of machine learning, graph, SQL, Read more…

MapR Announces $110M Investment Led by Google

Jun 30, 2014 |

Hadoop distributor MapR Technologies said today it has completed a $110 million financing round led by heavy hitters Google Capital and the venture capital arm of mobile chip specialist Qualcomm. San Jose-based MapR said Google Capital led the equity-financing round with $80 million. Also participating was Qualcomm Ventures, along with existing investors Lightspeed Venture Partners, Mayfield Fund, New Enterprise Associates and Redpoint Ventures. MapR also announced completion of a $30 million line of credit led by Silicon Valley Bank. The financing Read more…

Couchbase Doesn’t Take $60M Round Lying Down

Jun 26, 2014 |

NoSQL database developer Couchbase today announced a $60 million round of venture funding, bringing its total funding to $115 million. Couchbase’s CEO Bob Wiederhold tells Datanami the $60 million will be used primarily to accelerate product development. “We have a number of projects that we’re very excited about and that have gotten a lot of interest in the market,” Wiederhold says. “The biggest one is our mobile strategy. We also will invest more aggressively in expanding our Read more…

MIT Spinoff Mines Text With Analytics Engine

Jun 13, 2014 |

A data analytics spinoff from the Massachusetts Institute of Technology’s Media Lab has developed a new text analytics engine it claims could give computers the ability to understand humans the way people understand each other. The startup, Luminoso, says it is attempting to give computing devices “a structural foundation of common-sense reasoning that sits at the forward edge of text analytics.” Luminoso’s founders spent years at MIT’s Media Lab building a cloud-based, multi-lingual tool for machine learning and, ultimately, “understanding Read more…

This Just In