Tag: big data

Big Data’s Dirty Little Secret

Jul 2, 2015

The twin phenomena of big data and machine learning are combining to give organizations previously unheard-of predictive power to drive their businesses in new ways. But behind the big data headlines that tease us with tales of amazing insight and business optimization lurks an inconvenient truth: raw data is very dirty and requires an enormous amount of effort to clean. Data scientists are undoubtedly the rock stars of the big data movement, as they use their keen understanding of Read more…

Inside WebTrends’ Big Data Analytics Pipeline

Jul 1, 2015

WebTrends has been collecting and analyzing Web data on behalf of its customers since it was founded way back in 1993. Considering the exponential growth of the Net since then, it’s not a stretch to say WebTrends was doing big data before big data was a “thing.” But following the recent creation of a data analytics pipeline built with technologies like Hadoop, Spark, and Kafka, the company is taking its big data analytic services to a whole new level. WebTrends Read more…

DDN Tackles Enterprise Storage Needs as ‘Wolfcreek’ Looms

Jun 30, 2015

When it comes to keeping supercomputers fed with data, there are few storage makers that can keep up with DataDirect Networks. But increasingly, DDN is feeling pressure from enterprises that are struggling to keep up with the ongoing data explosion and mixed I/O workloads. That’s where DDN’s forthcoming high-end storage array for the broader enterprise market, codenamed “Wolfcreek,” comes into play. Wolfcreek is DDN’s next-generation converged architecture for enterprise customers. The system borrows technology from DDN’s SFA12k line of Read more…

8 New Big Data Projects To Watch

Jun 12, 2015

The big data community has a secret weapon when it comes to innovation: open source. The granddaddy of big data, Apache Hadoop, was born in open source, and its growth will come from continued innovation done by the community in the open. Here are eight open source projects generating buzz now in the community. 1. Apache Zeppelin No other big data project at the moment is as popular as Apache Spark, the in-memory analytics framework developed at AMPLab. But Read more…

Four Ways Your Data is Lying to You

Jun 8, 2015

Every second, new online data emerges in the form of posts, tweets, emails, and comments from your customers, clients and constituents to provide insights into economic trends, customer behavior and competitive threats. Measuring in the billions, data points provide an endless and ongoing stream of valuable opportunities for organizations to optimize their relationships, products and operations, making the Internet essentially the world’s largest focus group. The reams of data available today are probably our most mission-critical and valuable connection to Read more…

Deep Dive Into HP’s New HPC & Big Data Business Unit

Jun 5, 2015

When HP finally divides into two pieces – HP Inc. (PCs and printers) and Hewlett Packard Enterprise (servers and services) – how will the HPC portfolio fare? Views vary, of course. The split is meant to let the ‘new’ companies shed distraction and sharpen focus. HPC will live within HP Enterprise, but perhaps surprisingly not by itself. Instead, HPC is being combined with big data into a single global business unit, HPC & Big Data, created in March and led Read more…

So You Want To Be a Data Scientist: A Guide for College Grads

Jun 4, 2015

Congratulations, recent college graduate, and welcome to the workforce! Of all the jobs that you’ll apply for, the one with the sexy title “data scientist” may be the toughest to get–and potentially the most rewarding too. But never fear: Datanami is here with advice from actual data scientists on how to become one of them. The first piece of advice for budding data scientists is not to get frustrated by the job requirements. No recent college grad can fill is Read more…

Survey Casts Doubt on Big Data, Hadoop Efforts

Jun 4, 2015

A rule of thumb in markets like consumer electronics is don’t be an early adopter; wait until the vendor works out the kinks in its product and go with a later, more mature version. The same could be said for the growing number of big data initiatives as well as Hadoop distributions used by a growing number of enterprises: A survey of more than 100 senior executives found that about three-quarters are dissatisfied with their big data and analytics deployments. Read more…

Basho Goes Vertical with Big Data Stack

May 27, 2015

Basho Technologies made a name for itself in the NoSQL database world by developing a scalable key-value store called Riak that’s used by the likes of Time Warner, The Weather Company, and Comcast. Today the company disclosed plans to move up the stack by integrating other big data products–including Apache Spark, Apache Solr, and Redis–into its new Basho Data Platform. Nobody solves a big data problem with a single set of tools or skills. Big data is bigger than just Read more…

Forget Big Data–Small Data Is Where the Money Lies

May 20, 2015

Big Data continues to be a big topic. Experts promise a world where the Internet of Things (IoT) proliferates and tiny, inexpensive sensors collect massive amounts of data. New, powerful databases coupled with information storage technology and analytical tools offer the promise of gleaning amazing insights and finding patterns we never before imagined. However, for most businesses, it’s still too early to become actual practitioners of Big Data. Some may remember the 2013 findings from research organization SINTEF that 90% Read more…