Tag: Hadoop

Solving Hadoop Problems, For Fun and Profit

Jul 6, 2015

Things move quickly in the Hadoop world, and keeping up can be hard to do. Just ask Chris Wensel, the creator of the popular open source development tool Cascading and CTO at Concurrent. While Wensel spends many hours keeping Cascading current with every Hadoop release as a service to the community, he’s got bigger fish to fry solving production Hadoop problems in enterprise accounts. “I spend a lot of CPU cycles and dollars on Amazon testing Cascading on every vendor Read more…

Big Data’s Dirty Little Secret

Jul 2, 2015

The twin phenomena of big data and machine learning are combining to give organizations previously unheard of predictive power to drive their businesses in new ways. But behind the big data headlines that tease us with tales of amazing insight and business optimization lurks an inconvenient truth: raw data is very dirty and requires an enormous amount of effort to clean. Data scientists are undoubtedly the rock stars of the big data movement, as they use their keen understanding of Read more…

Inside WebTrends’ Big Data Analytics Pipeline

Jul 1, 2015

WebTrends has been collecting and analyzing Web data on behalf of its customers since it was founded way back in 1993. Considering the exponential growth of the Net since then, it’s not a stretch to say WebTrends was doing big data before big data was a “thing.” But following the recent creation of a data analytics pipeline built with technologies like Hadoop, Spark, and Kafka, the company is taking its big data analytic services to a whole new level. WebTrends Read more…

Kyvos Debuts OLAP for Hadoop

Jun 30, 2015

Many technology pros view OLAP as a legacy technology, a holdover from the days of data warehousing that doesn’t have a place in today’s big data world. But several startups are fighting to change that perception, including Kyvos Insights, which today unveiled its OLAP-on-Hadoop solution. Twenty years ago, online analytical processing (OLAP) was the center of many enterprise data warehouse (EDW) initiatives. The technology, which is largely synonymous with the term “multi-dimensional database,” gave organizations a way to pre-index and Read more…

DDN Tackles Enterprise Storage Needs as ‘Wolfcreek’ Looms

Jun 30, 2015

When it comes to keeping supercomputers fed with data, there are few storage makers that can keep up with DataDirect Networks. But increasingly, DDN is feeling pressure from enterprises that are struggling to keep up with the ongoing data explosion and mixed I/O workloads. That’s where DDN’s forthcoming high-end storage array for the broader enterprise market, codenamed “Wolfcreek,” comes into play. Wolfcreek is DDN’s next-generation converged architecture for enterprise customers. The system borrows technology from DDN’s SFA12k line of Read more…

Zettaset Patents Data Access Approach

Jun 26, 2015

Big data security specialist Zettaset said it has been awarded a U.S. patent for a technique designed to boost data access and performance in distributed computing frameworks like Hadoop and NoSQL. Zettaset said this week the U.S. Patent and Trademark Office issued a patent for its DiamondLane technology on June 23. The U.S. patent covers “distributed storage medium management for heterogeneous storage media in high availability clusters” (U.S. Patent No. 9,063,939-B2). Zettaset, Mountain View, Calif., said DiamondLane could be used Read more…

How Apache Spark Is Helping IT Asset Management

Jun 24, 2015

There’s been a lot of energy focused on how big data technology can improve the sales, marketing, service, and support departments of corporations. Tools like Hadoop, Spark, and NoSQL databases are changing the rules for how work gets done, and it’s very exciting. But big data tech is slowly creeping into the IT department itself, which might need it most of all. While the typical IT department may be familiar with big data concepts and technologies, they’re typically working with Read more…

Ex-Googler Now Helping Cloudera Build Hadoop

Jun 17, 2015

Cloudera scored a coup recently when it nabbed former Google executive Daniel Sturman to head up its engineering department. In an interview with Datanami, Sturman explains how he intends to use his experience designing distributed systems at the Internet giant to help evolve Hadoop. Sturman was intimately involved in designing and running the software infrastructure that Google uses to run its massive online business. As vice president of engineering at Google, he led the teams responsible for the Google Compute Read more…

How TrueCar Uses Hadoop to Deliver Price Transparency

Jun 15, 2015

If you’re in the market for a new car, you might be using the car pricing service from TrueCar to figure out how much you should pay. With just a few clicks, you can find out what a car is worth on the company’s mobile app. It all seems very simple, but behind the scenes at TrueCar is a sophisticated big data system powered by Hadoop. TrueCar was founded about 10 years ago with a premise Read more…

8 New Big Data Projects To Watch

Jun 12, 2015

The big data community has a secret weapon when it comes to innovation: open source. The granddaddy of big data, Apache Hadoop, was born in open source, and its growth will come from continued innovation done by the community in the open. Here are eight open source projects generating buzz now in the community. 1. Apache Zeppelin No other big data project at the moment is as popular as Apache Spark, the in-memory analytics framework developed at AMPLab. But Read more…