Big Data • Big Analytics • Big Insight

Tag: Hadoop

Wanted: Intelligent Middleware That Simplifies Big Data Analytics

Mar 23, 2015

We’ve seen tremendous technological innovation in the data analytics space over the past 10 years. Platforms like Hadoop have emerged and machine learning techniques are going mainstream. But according to respected leaders in the big data community, there’s still a sizable gap in the marketplace when it comes to intelligent middleware that can orchestrate and organize how data flows, not just in Hadoop but across the enterprise. Lars George was one of the first technologists to begin working with Hadoop in Europe Read more…

Pinterest Shoots ‘Pinball’ Into Open Source

Mar 12, 2015

Pinterest announced yesterday that it’s making the workflow management software it developed to manage big data pipelines, called Pinball, available as open source. Now anybody can use the same technology that Pinterest uses to manage the flow of work on Hadoop and other cluster resources. Pinterest came onto the social media scene in 2010 with a relatively simple Web and mobile app that lets users share pictures and videos by “pinning” them to their pinboards. Today, Pinterest has become the Read more…

Beyond the 3 Vs: Where Is Big Data Now?

Mar 12, 2015

Once defined by the “three Vs” of volume, velocity, and variety, the term “big data” has overflowed the small buckets we gave it and taken on a life of its own. Today, big data refers not only to the ongoing information explosion, but also to an entire ecosystem of new technologies, as well as a unique way of thinking about data itself. Here’s a quick recap of the short history of big data. The World Wide Web was just starting to Read more…

FPGA System Smokes Spark on Streaming Analytics

Mar 10, 2015

Technologists with decades of experience building field-programmable gate array (FPGA) systems for the federal government today unveiled a commercial FPGA offering that they claim holds a 100x performance advantage over Apache Spark for specific streaming analytic jobs. By all appearances, the Ryft One looks like just another general-purpose X86 Linux server. Indeed, the 1U device sports an Ubuntu OS and Intel processor, largely for purposes of familiarity. But as you dig beyond that generic façade and into its more exotic innards, you’ll Read more…

Taming Apache Storm for Real-Time Analytics

Mar 5, 2015

Apache Storm is gaining a foothold among organizations looking to do real-time analytics on streaming data. However, the difficulty in working with the distributed processing framework is proving to be a major hurdle to Storm adoption. Now, a company called Impetus says it’s simplifying development on Storm with a new product. Released as open source by Twitter several years ago, Apache Storm plays a key role as the real-time processing layer of the emerging big data technology stack. While the Read more…

Where Does InfiniDB Go From Here?

Mar 5, 2015

Last September, the company behind InfiniDB, Calpont, went out of business. Up stepped MariaDB, the company behind the open source relational database, to serve as a steward for the product and provide support to customers. The big question on everybody’s mind is, where does the product go from here? InfiniDB is a columnar database management system designed to power analytic applications. Originally debuting in the year 2000, the software was built upon the MySQL database and includes its own SQL Read more…

How to Get a ‘Network Effect’ from Your Big Data Lake

Mar 3, 2015

One of the hidden benefits of being a data-driven organization is a so-called “network effect” that occurs around data and analytics. When an organization has several successful big data analytics projects under its belt, it often becomes easier to see how data can be used to benefit the organization in profound new ways. Creating a Hadoop-based data lake is often the first step in going down the big data analytics road. Without data and a place to put it—often a Read more…

Novetta Throws Entity Analytics Hat Into Hadoop Ring

Mar 2, 2015

One of the new big data analytic vendors exhibiting at the recent Strata + Hadoop World conference was Novetta, a firm that’s well known in the Washington, D.C. area for its cyber analytic offerings. But now the company is widening its reach into the commercial market with a Hadoop-based solution called Novetta Entity Analytics. One of Novetta’s first customers in the big data space was an unnamed government security agency that was having trouble pulling useful information out of an 8-billion Read more…

Accelerating Hadoop® Workflows to Yield Greater Application Efficiency

Mar 2, 2015

As enterprise-critical decision support fully embraces big data, confusion has grown over how best to satisfy increasing demand for ever-larger data analytics. Some have questioned whether Hadoop will continue to scale reliably and serve as the primary workhorse for enterprise production-level data analytics. Rising to satisfy the need for more scale, breakthrough technologies have recently removed any question about how to extend the useful life and scale of enterprise-critical Hadoop applications. How is this possible? By Read more…

Big Data So Easy a Caveman Could Do It?

Feb 26, 2015

Let’s face it: big data isn’t easy. If you’re building a big data application today, you’re up to your eyeballs in things like R and Java, MapReduce and Pig, and Storm and Kafka. There’s a reason data scientists are so hard to find that they’re compared to unicorns. But in the future, the big data application assembly process may be dumbed down to the point where, as the insurance commercial says, even a caveman could do it. That’s the approach Read more…