Features

Inside WebTrends’ Big Data Analytics Pipeline

Jul 1, 2015 |

WebTrends has been collecting and analyzing Web data on behalf of its customers since it was founded way back in 1993. Considering the exponenetial growth of the Net since then, it’s not a stretch to say WebTrends was doing big data before big data was a “thing.” But following the recent creation of a data analytics pipeline built with technologies like Hadoop, Spark, and Kafka, the company is taking its big data analytic services to a whole new level. WebTrends Read more…

DDN Tackles Enterprise Storage Needs as ‘Wolfcreek’ Looms

Jun 30, 2015 |

When it comes to keeping supercomputers fed with data, there are few storage makers that can keep up with Data Direct Networks. But increasingly, DDN is feeling pressure from enterprises that are struggling to keep up with the ongoing data explosion and mixed I/O workloads. That’s where DDN’s forthcoming high-end storage array for the broader enterprise market, codenamed “Wolfcreek,” comes into play. Wolfcreek is DDN‘s next generation converged architecture for enterprise customers. The system borrows technology from DDN’s SFA12k line Read more…

One on One with LinkedIn’s VP of Engineering

Jun 29, 2015 |

Why are data scientists tripping over themselves to get their hands on LinkedIn’s data? What’s it like to run one of the world’s biggest social media sites, and how can machine learning algorithms contribute to the creation of economic opportunity for a global workforce? We recently posed those questions (and more!) to Igor Perisic, Vice President of Engineering at LinkedIn. Alex Woodie: Igor, thank you for agreeing to this interview. First, please tell us about yourself and your role at Read more…

Future of Connected Cities on Display in L.A.

Jun 26, 2015 |

Cities are sitting on mounds of data generated by everything from parking meters to traffic lights to water meters, but they’re having difficulty harnessing that data effectively. As the big data and Internet of Things (IoT) trends evolve, we’ll see cities competing with one another to use data to serve citizens, and that’s where vendors like Los Angeles-based Civic Resource Group International are hoping to step in. Yesterday in downtown LA, CRGI showcased what the future of a connected city Read more…

Going In-Memory? Consider These Three Things First

Jun 25, 2015 |

Businesses, large and small, know that if they aren’t using the data they collect in some kind of intelligent way, their companies will be out of business in a few short years. Quickly turning data into business insights is downright essential to increasing and retaining customers as well as delivering new products and services. You know how online retailers have those flash sales and replace items that have sold out with similar ones as shoppers are viewing the screen? They Read more…

How Apache Spark Is Helping IT Asset Management

Jun 24, 2015 |

There’s been a lot of energy focused on how big data technology can improve the sales, marketing, service, and support departments of corporations. Tools like Hadoop, Spark, and NoSQL databases are changing the rules for how work gets done, and it’s very exciting. But big data tech is slowly creeping into the IT department itself, which might need it most of all. While the typical IT department may be familiar with big data concepts and technologies, they’re typically working with Read more…

Data Lake Showdown: Object Store or HDFS?

Jun 23, 2015 |

The explosion of data is causing people to rethink their long-term storage strategies. Most agree that distributed systems, one way or another, will be involved. But when it comes down to picking the distributed system–be it a file-based system like HDFS or an object-based file store such as Amazon S3–the agreement ends and the debate begins. The Hadoop Distributed File System (HDFS) has emerged as a top contender for building a data lake. The scalability, reliability, and cost-effectiveness of Hadoop Read more…

Why Big Data Prep Is Booming

Jun 22, 2015 |

The big data analytics space is growing as organizations attempt to turn massive sets of data into a competitive advantage. But this trend is also driving a secondary boom in the market for data preparation tools that clean big data prior to analysis. The hype surrounding big data masks a dirty little secret: Most data sets are relatively dirty and must be thoroughly cleaned, lest the resulting analytic results be tainted and unusable. Necessity is the mother of all invention, Read more…

Getting MDM Off the Ground

Jun 18, 2015 |

Anyone who has been exposed to Master Data Management (MDM) realizes that the practice often comes with a price. While most individuals applaud the concept of implementing MDM, too often what initially appeared to be an adequately funded and well-planned project starts to miss key deadlines and milestones while accumulating unexpected costs. Before trying to tackle MDM, it’s important to understand exactly what MDM entails and how it will work within a larger big data management framework. This may sound Read more…

Ex-Googler Now Helping Cloudera Build Hadoop

Jun 17, 2015 |

Cloudera scored a coup recently when it nabbed former Google executive Daniel Sturman to head up its engineering department. In an interview with Datanami, Sturman explains how he intends to use his experience designing distributed systems at the Internet giant to help evolve Hadoop. Sturman was intimately involved in designing and running the software infrastructure that Google uses to run its massive online business. As vice president of engineering at Google, he led the teams responsible for the Google Compute Read more…