Features

How Super Bowl 50 Will Feed Our Super Appetite…for Data

Feb 5, 2016 |

The Super Bowl is a lot of things to a lot of people. It’s the championship game for the National Football League, a showcase for clever ads from Madison Avenue, and for the rest of us, an excuse to splurge on food and drink. But the game is also a big test for the IT professionals tasked with keeping up with ever-increasing demands for data. On Sunday, more than 72,000 people are expected to cram into Levi’s Stadium in Santa Read more…

Apache Spark Surrounded By Cloud Data Services at IBM

Feb 4, 2016 |

IBM has made no secret about its admiration for Apache Spark, which it sees as the future for in-memory analytics. Today the IT giant unveiled a host of new cloud-based data services that bolsters its hosted Apache Spark business with NoSQL, graph, and machine learning capabilities. IBM added a number of new offerings to Cloud Data Services, which is a portfolio of more than two dozen database engines and related data-oriented services that it runs for customers as part of Read more…

Keeping on Top of Data Drift

Feb 3, 2016 |

Data is often thought to be constant and immutable. A given piece of data is defined by 1s and 0s, and it never changes. But there’s an emerging school of thought in the big data world that sees data as constantly drifting and mutating in response to forces around it. Without processes put in place to detect it, this data drift can wreak havoc on one’s understanding, expert warn. Organizations that collect huge amounts of data from many different sources, and Read more…

Hadoop’s Second Decade: Where Do We Go From Here?

Feb 2, 2016 |

As we reported last week, the first Hadoop cluster went online at Yahoo 10 years ago. The platform has enjoyed phenomenal, if improbable, growth since then. But where does it go from here? Once again, we tapped the knowledge of Hadoop creator Doug Cutting and other experts in the big data industry to get the low down on the high technology. Cutting, who is the chief architect at commercial Hadoop pioneer Cloudera, sees the loose confederation of open source projects Read more…

Knowing What’s Possible a Big Obstacle for Big Data

Feb 1, 2016 |

There are many reasons why a given organization may be lagging on big data analytics adoption. The shortage of data scientists is one, and the tightening of budgets is another. But according to analytic experts, one of the biggest obstacles to adopting big data is just knowing what is possible. “The biggest obstacle we’re running into is not knowing what’s possible. This is the single biggest problem we’re running into,” says Praveen Kankariya, the founder and CEO of Impetus Technologies, Read more…

Lifting the Fog of Spark Adoption

Feb 1, 2016 |

Clients are often confused about Apache Spark, and this confusion sometimes hinders its adoption. The confusion is not about the features of Spark per se, but about installing and running the big data framework. One client was convinced that they needed MapR M5 to even make use of Spark and they were really confused on how it runs on the cluster, thinking multiple Spark jobs interacted directly. To help illustrate the flexibility with deploying Spark, I explained the following to Read more…

Happy Birthday, Hadoop: Celebrating 10 Years of Improbable Growth

Jan 28, 2016 |

It’s hard to believe, but the first Hadoop cluster went into production at Yahoo 10 years ago today. What began as an experiment in distributed computing for an Internet search engine has turned into a global phenomenon and a focal point for a big data ecosystem driving billions in spending. Here are some thoughts on the big yellow elephant’s milestone from the people involved in Hadoop’s early days. Hadoop’s story started before January 2006, of course. In the early 2000s, Doug Read more…

Distributed Computing Tops List of Hottest Job Skills

Jan 27, 2016 |

If you have cloud and distributed computing skills, your job prospects for 2016 are golden. That’s because those particular job skills—which parallel the rise of Hadoop and other distributed computing frameworks–topped a LinkedIn analysis of the top 25 skills to help you find a new job this year. The sudden arrival of cloud and distributed computing as the hottest skills in the land was somewhat unexpected, according to Sohan Murthy, the head of research for data analytics and strategy at LinkedIn Read more…

Data Quality Trending Down? C’est La Vie

Jan 26, 2016 |

One of the biggest impediments to becoming a data-driven organization is tackling the problem of data quality. Data is often too dirty and discombobulated for use in high-end decision-making, and the increasing volume and diversity of data compounds this problem. But according to data quality experts, executives brush off the problem with a choice French phrase or two. Blazent today unveiled the results of a study commissioned by analysts at 451 Group that looked into data quality. Executives with the Read more…

Finding Your Way in the New Data Economy

Jan 25, 2016 |

In big data analytics today, the goal is to capture, cleanse, and analyze as much data as possible. The leading practitioners are blending dozens of external sources with their own data sets, maybe more. But in the emerging data economy, you’ll not only be tapping into a pool of available data that’s exponentially larger, but you’ll be contributing your data to it as well. There is so much activity underway in the data and analytic space at the moment that Read more…