Features

Rethinking Enterprise Search for the Big Data Age

Aug 31, 2015 |

The humble search engine has retained a prominent place in the toolboxes of would-be data explorers. You see Solr and Lucene sitting at the Hadoop table, right alongside SQL and machine learning. But some search experts–including one who helped build Microsoft Bing–says traditional search engines are too long in the tooth for today’s big data challenges. There’s no doubt that Internet search engines, such as Google and Bing, have changed how we retrieve information, says Donald Thompson, the co-founder and CTO Read more…

Cast Your Models Into Data Streams and Take Advantage of the IoT

Aug 28, 2015 |

Using data in real-time used to be a challenge. Lengthy extraction, transformation and loading processes make analyzing data an activity more akin to glancing in a rearview window than reading road signs as you pass. And with excitement building around the Internet of Things many just assume that harnessing sensor technology is an IT budget breaker. However, that doesn’t have to be the case. Event stream processing combines high throughput (millions of events per second) and low latency (delays undetectable Read more…

Struggling Under Multiple Databases? Get Used To It

Aug 27, 2015 |

Programmers are a fickle bunch. One day they’re all using Java, and the next day Python is the hot new thing. While developers indelibly will dabble in different languages, we’re now seeing them do similar things with databases, particularly those of a NoSQL bent. The database explosion benefits developers, but it’s a headache for the admins tasked with managing the stuff. Time was, you installed the Oracle database and were happy with it, damn it. Oracle defined what you could Read more…

Medical Insight Set to Flow from Semantic Data Lakes

Aug 26, 2015 |

The potential for data analytics to disrupt healthcare delivery is large, and getting larger by the day. But in many cases, the need to hammer data into a structured format creates a barrier to productivity. Now a hospital chain in New York City is hoping to change that by adopting a Hadoop-based semantic data lake. Located in the Bronx, Montefiore Health System is the first hospital to implement a semantic data lake as part of the New York City Clinical Read more…

Five Reasons for Leaving Your Data Where It Is

Aug 25, 2015 |

Not a week goes by without a major story about vulnerable data in the hands of the wrong people. Whether it’s a data breech at a government agency or global retailer exposing financial information, cloud service providers like Microsoft, Google and Amazon battling government agencies over access to data, or potential HIPAA violations resulting from compromised health data, we are constantly reminded that, unfortunately, people are after our data – and they’ve gotten pretty good at getting it. Despite this, Read more…

Hortonworks Boosts Streaming Analytics, IoT Plays with NiFi Deal

Aug 25, 2015 |

Hortonworks today announced a definitive agreement to acquire Onyara, the company behind the data routing and streaming technology called Apache NiFi. The Hadoop heavy also announced NiFi will be the basis for its second major product line, Hortonworks DataFlow, which will underlie real-time streaming analytics and Internet of Things (IoT) applications. The National Security Agency created Apache NiFi eight years ago to address its real-time data collection needs. Short for Niagrafiles (a play on Niagara Falls), the technology was designed Read more…

How Spark Democratizes Analytic Value from Hadoop Lakes

Aug 24, 2015 |

So you’ve installed Hadoop and built a data lake to house all the bits and bytes that your organization previously discarded. So now what? If you follow the advice from industry experts, the next step on your analytics journey is to add Apache Spark to the mix. It’s common for people to confuse Hadoop with analytics, says Rob Thomas, vice president of product development at IBM Analytics. “Hadoop itself doesn’t do analytics,” Thomas tells Datanami. “Hadoop is the data storage Read more…

Big Data Workloads in the Cloud: One Size Does Not Fit All

Aug 21, 2015 |

There is a bewildering array of big data platforms in the cloud today including multiple flavors of Hadoop and varied technologies from major vendors. The difficulty of selecting the “right cloud platform” is exacerbated by the fact that there are no established guidelines about how to configure the right cloud infrastructure to support a specific use case around a set of technologies and related workloads. There is also a gap in the service levels needed by enterprise clients compared to Read more…

How NVIDIA Is Unlocking the Potential of GPU-Powered Deep Learning

Aug 20, 2015 |

Companies across nearly all industries are exploring how to use GPU-powered deep learning to extract insights from big data. From self-driving cars and voice-directed phones to disease-detecting mirrors and high-speed securities trading, the potential use cases for the technology are large and expanding by the day. Ever since computer scientist Geoff Hinton decided to try training a neural network on a GPU and did much to help popularize the field of deep neural networks several years back, researchers have been Read more…

Beware the Dangers of Dark Data

Aug 18, 2015 |

The amount of data we’re generating is doubling roughly every 18 months, which is eerily similar to Moore’s Law scale for the growth of processing power. But much of that new data will remain invisible to those who would use it. The situation around this dark data threatens to derail big data initiatives before they can get off the ground. Gartner defines dark data as “the information assets organizations collect, process and store during regular business activities, but generally fail Read more…