Tag: Hadoop

Google Cloud Dataflow Now Open for Business

Apr 16, 2015

Google today formally took the wraps off Cloud Dataflow, the hosted offering designed to allow developers with average Java and Python skills to build sophisticated analytic “pipelines” that process huge amounts of data. Google introduced Cloud Dataflow about a year ago as a next-gen platform for building systems that can ingest, transform, normalize, and analyze huge amounts of data—well into the exabyte range, Google executives said. The software is built on the infrastructure and technology that powers Google’s own applications, Read more…

From Spiders to Elephants: The History of Hadoop

Apr 15, 2015

Have you ever wondered where this thing called Hadoop came from, or even why it’s here? Marko Bonaci has wondered such things, too. In fact, he wondered about them so much that he decided to write a History of Hadoop chapter for his upcoming book, “Spark in Action.” Bonaci’s History of Hadoop starts humbly enough in 1997, when Doug Cutting sat down to write the first edition of the Lucene search engine. In 2000, Cutting placed Lucene into the open Read more…

Hadoop: The Tale of Data Storage to Data Processing

Apr 14, 2015

The expectations of CIOs around Hadoop are changing. They’re demanding more ROI from their data and saying goodbye to ‘experimentation.’ Of course, Hadoop is everywhere right now, but we’re seeing people (and companies) coining phrases such as “businesses becoming more data-driven.” Since its inception, Hadoop has served as the quintessential landing zone for all lines of business and the data that resides in those lines. The open-source framework was a data storage trailblazer that got us to where we are Read more…

Why Cybersecurity Needs Big Data Tech, Especially Hadoop

Apr 9, 2015

When cyber criminals really started ramping up their activities several years ago, security experts realized they needed bigger and better tools if they had any hope of slowing them down. In the wake of 2014, or “The Year of the Data Breach,” it’s clear those efforts largely failed, which is why so many security experts today are banking their hopes on Hadoop. Cybersecurity has become a big data problem because the size and complexity of the data have simply grown Read more…

AtScale Claims to Mask Hadoop Complexity for OLAP-Style BI

Apr 7, 2015

AtScale came out of stealth mode today with new software designed to trick business intelligence tools into thinking that Hadoop is a standard database upon which they can perform OLAP-style analysis, as opposed to the huge distributed file system that it really is. Business intelligence tools like Tableau, Qlik, TIBCO Spotfire, and MicroStrategy weren’t designed with Hadoop in mind, but they’re commonly used to analyze data stored in HDFS just the same. In response to this demand, the Hadoop community Read more…

Tableau Aims to Speed Analytics with V9

Apr 6, 2015

Tableau Software has long positioned itself as a provider of tools that help you visualize big data, but hooking up to big data sources has not always been easy, and the resulting analyses have not always run as fast as people would like. With the upcoming release of Tableau 9.0, the speed of analysis will be a big point of emphasis. If you had to pick one tool that all big data users seem to have, it’s Tableau’s. With Read more…

Deep Dive Into Oracle’s Emerging Big Data Stack

Apr 6, 2015

Oracle has a lot of turf to protect in the multi-billion-dollar relational database market, where it owns a dominant share. That creates a natural tension when it comes to big data technologies like Hadoop and NoSQL. While the IT giant isn’t embarking upon a wholesale re-architecting of its business plan around these emerging open source technologies, it is on its way toward building a comprehensive big data stack that solves many emerging use cases. Oracle has made Read more…

Does Hadoop Need a Reality Check?

Apr 3, 2015

Hadoop garners a lot of the attention when it comes to big data, to the point where “Hadoop” and “big data” are practically synonymous in many people’s minds. But by all accounts, few companies outside of the Fortune 1000 are using Hadoop directly, and despite the attention it receives, Hadoop is driving little in actual revenues. Hadoop was first conceived at Yahoo as a distributed file system (HDFS) and a processing framework (MapReduce) for indexing the Internet. It worked so Read more…

Wanted: Intelligent Middleware That Simplifies Big Data Analytics

Mar 23, 2015

We’ve seen tremendous technological innovation in the data analytics space over the past 10 years. Platforms like Hadoop have emerged and machine learning techniques are going mainstream. But according to respected leaders in the big data community, there’s still a sizable gap in the marketplace when it comes to intelligent middleware that can orchestrate and organize how data flows, not just in Hadoop but across the enterprise. Lars George was one of the first technologists to begin working with Hadoop in Europe Read more…

Pinterest Shoots ‘Pinball’ Into Open Source

Mar 12, 2015

Pinterest announced yesterday that it’s making the workflow management software it developed to manage big data pipelines, called Pinball, available as open source. Now anybody can use the same technology that Pinterest uses to manage the flow of work on Hadoop and other cluster resources. Pinterest came onto the social media scene in 2010 with a relatively simple Web and mobile app that lets users share pictures and videos by “pinning” them to their pinboards. Today, Pinterest has become the Read more…