Tag: Hadoop

The Bright Future of Semantic Graphs and Big Connected Data

Feb 8, 2016 |

The big data revolution is generating a mess of unruly data that’s difficult to parse and understand. This is to be expected–explosions don’t generally occur in a nice, orderly fashion, after all. But if the folks at Cloudera and Franz have their way, the world of connected data will become more accessible and useful when viewed through the lens of semantic graph technologies. Semantic graph technology is shaping up to play a key role in how organizations access the growing Read more…

Hadoop’s Second Decade: Where Do We Go From Here?

Feb 2, 2016 |

As we reported last week, the first Hadoop cluster went online at Yahoo 10 years ago. The platform has enjoyed phenomenal, if improbable, growth since then. But where does it go from here? Once again, we tapped the knowledge of Hadoop creator Doug Cutting and other experts in the big data industry to get the low down on the high technology. Cutting, who is the chief architect at commercial Hadoop pioneer Cloudera, sees the loose confederation of open source projects Read more…

Distributed Computing Tops List of Hottest Job Skills

Jan 27, 2016 |

If you have cloud and distributed computing skills, your job prospects for 2016 are golden. That’s because those particular job skills—which parallel the rise of Hadoop and other distributed computing frameworks–topped a LinkedIn analysis of the top 25 skills to help you find a new job this year. The sudden arrival of cloud and distributed computing as the hottest skills in the land was somewhat unexpected, according to Sohan Murthy, the head of research for data analytics and strategy at LinkedIn Read more…

Finding Your Way in the New Data Economy

Jan 25, 2016 |

In big data analytics today, the goal is to capture, cleanse, and analyze as much data as possible. The leading practitioners are blending dozens of external sources with their own data sets, maybe more. But in the emerging data economy, you’ll not only be tapping into a pool of available data that’s exponentially larger, but you’ll be contributing your data to it as well. There is so much activity underway in the data and analytic space at the moment that Read more…

Hadoop Market is Neck and Neck, Forrester Says

Jan 20, 2016 |

If you’re shopping for a Hadoop distribution on which to hang your big data hat, you have your work cut out for you, according to Forrester Research, which found four strong performers in a market it says is neck and neck. “Choosing a Hadoop distribution will be difficult for most AD&D [application development and delivery] pros who carefully consider each of these Leaders,” write Forrester analysts Mike Gualtieri and Noel Yuhanna. “Forrester doesn’t think there is a wrong choice among Read more…

Survey Sees Spark Emerging in 2016

Jan 19, 2016 |

This is the “Year of Spark,” asserts a new big data survey on analytics priorities. The survey of more than 250 data scientists and architects, IT managers and business intelligence analysts released on Tuesday (Jan. 19) found that nearly 70 percent of users expressed interest in deploying Apache Spark in the coming year. While current leader MapReduce is expected to remain the dominant compute framework in production, survey sponsor Syncsort noted that the “high level of interest should translate into Read more…

Picking the Right SQL-on-Hadoop Tool for the Job

Jan 13, 2016 |

SQL is, arguably, the biggest workload many organizations run on their Hadoop clusters. And there’s good reason why: The combination of a familiar interface (SQL) along with a modern computing architecture (Hadoop) enables people to manipulate and query data in new and powerful ways. But not all SQL-on-Hadoop tools are equal, and that makes picking the right tool a challenge. There’s no shortage of SQL on Hadoop offerings, and each Hadoop distributor seems to have its preferred flavor.  The list Read more…

What Data Science Skills Employers Want Now

Jan 7, 2016 |

There’s good news if you’re for a job in data science in 2016 — the number of job openings in the field appears to be rising as companies look to leverage big data for competitive advantage. But actually landing a coveted data science job means having the right mix of skills, and you may be surprised to learn what skills are most in demand by employers. The folks at CrowdFlower recently did an analysis of the 3,490 postings for data Read more…

New TPC Benchmark Puts an End to Tall SQL-on-Hadoop Tales

Dec 17, 2015 |

You take certain things for granted in the big data world. Data will continue to grow at a geometric rate. Amazing new technologies will regularly appear out of nowhere. And software vendors will squabble endlessly over whose SQL-on-Hadoop engine is fastest and best. Thanks to a new TPC-DS 2.0 benchmark unveiled today, we may take the last one off that list. The Transaction Processing Performance Council (TPC) organization today unveiled a new benchmark for gauging the performance of SQL engines running on Read more…

Meet Your Friendly Neighborhood Spark Sherpa

Dec 9, 2015 |

Apache Spark is the most popular big data project at the moment, with thousands of contributors cranking out code on a weekly basis. Keeping up with Spark releases is hard, and it’s why Hadoop distributor Hortonworks views itself as a Sherpa that guides customers on how best to use the explosive big data tech. The amount of activity on Apache Spark is extraordinarily high, with hundreds of JIRA issues being addressed every week, and thousands more in the queue. The Read more…