Big Data • Big Analytics • Big Insight

Tag: Hadoop

Why Pay for Analytics When Open Source Is ‘Free?’

Oct 23, 2014

The free analytics question comes up over and over again, especially as it pertains to open source analytic offerings. I don’t think a day goes by when someone at a company doesn’t ask, “Why should I pay for analytics when I can use (fill in the blank) open source statistics/analytics?” You will find many lengthy discussions all over the Web on just this topic. Often the discussion is much more impassioned than it needs to be. I will address the Read more…

What You May Have Missed at Strata + Hadoop World 2014

Oct 21, 2014

Talk about information overload. If you were one of the lucky 5,000 to attend the Strata + Hadoop World conference last week, then you were subjected to a marathon session of big data keynotes delivered continually for the better part of two days. It’s understandable that you missed out on some of the big data news announced at the show, including Cray’s new Hadoop appliance, or the latest tools from Revolution Analytics and Tableau. Don’t worry: We’ll get you back Read more…

Congratulations Hadoop, You Made It–Now Disappear

Oct 20, 2014

Hadoop has matured at a rapid rate since it broke into the mainstream several years ago. With billions in venture funding, an eager community of developers, and a thriving ISV community, the open source big data platform seems poised to take a big step into the wider enterprise. But according to Cloudera’s chief strategy officer Mike Olson, Hadoop’s next big move may be to just disappear. Olson predicted Hadoop’s disappearance during a keynote address at last week’s Strata + Hadoop Read more…

Israeli Startup Looks to Ease Hadoop Rollouts

Oct 20, 2014

An Israeli big data processing startup said this week it has secured $3 million in early-round funding and will launch its Hadoop-based platform in the U.S. Tel Aviv-based Xplenty announced Oct. 20 the Series A funding from Waarde Capital and existing seed investor Magma Venture Partners. The funding will be used to expand platform features on its cloud-based processing engine while the company expands globally. Xplenty also announced plans to offer its big data processing technology directly to U.S. customers. Read more…

A Storyboard Approach to Big Data Insights

Oct 17, 2014

Big data by itself is just a worthless collection of numbers and characters. To make big data work, you need to show how the information is meaningful. Taking a storytelling approach to analytics is one way to put big data in context. One of the analytics vendors that’s well-versed in telling data-driven stories is ClearStory Data, which develops a Hadoop-based application that lets users explore big-data feeds, find relevant insights, and share them with others. ClearStory is one of the Read more…

Hadoop ISVs Break Away from MapReduce, Embrace Spark, In-Memory Processing

Oct 16, 2014

Big data analytic software vendors who run on Hadoop are increasingly replacing their MapReduce engines with Apache Spark and other in-memory analytic engines as the runtime of choice. Many of these next-gen Hadoop vendors are showcasing their upgraded goods at this week’s Strata + Hadoop World conference. One of the Hadoop vendors making the move to Spark is Platfora, which unveiled a new version of its Big Data Analytics offering during the first day of the Strata + Hadoop World Read more…

DataTorrent Raises the Bar for Real-Time Streaming

Oct 15, 2014

There’s a lot of talk these days about real-time streaming applications. If analyzing and acting on new data is good, then doing it immediately must be better. The truth is, building real-time streaming applications is not easy work. One company that’s raising the bar in this area is DataTorrent. DataTorrent develops a distributed Hadoop 2 application called Real Time Streaming (RTS) that enables users to act upon and analyze time-sensitive data, including call data records, log and machine data, Read more…

EMC, Pivotal Add Compute to Hadoop Data Lake

Oct 15, 2014

Storage vendor EMC Corp. and cloud specialist Pivotal have partnered to roll out a new version of a Data Lake Hadoop Bundle that adds a compute option to the big data product, along with accelerated analytics that come from combining scaled-out storage and computing with analytics software. As data lakes gain momentum as scalable repositories for data generated from current and advanced workloads, EMC and Pivotal are positioning Data Lake Hadoop Bundle 2.0 as a tool for plumbing the Read more…

Hortonworks Goes Broad and Deep with HDP 2.2

Oct 15, 2014

From full support for Apache Spark, Apache Kafka, and the Cascading framework to updated management consoles and SQL enhancements in Hive, there’s something for everybody in Hortonworks’ latest Hadoop distribution, which was revealed today at the Strata + Hadoop World conference in New York City. Hadoop started out with two main pieces: MapReduce and HDFS. But today’s distributions are massive vehicles that wrap up and deliver a host of sub-components, engines, and consoles that are desired and needed for running Read more…

Spark Smashes MapReduce in Big Data Benchmark

Oct 10, 2014

Databricks today released benchmark results for Apache Spark running the Sort Benchmark, a competition for measuring the sorting performance of large clusters. Spark running on Hadoop sorted 100 TB of data in 23 minutes, three times faster than the previous record held by Yahoo using MapReduce on Hadoop. The results, Databricks says, are due to targeted performance improvements made by the Spark community, and should lay to rest any concerns about Spark’s scalability. Databricks, which is the commercial outfit Read more…