Big Data • Big Analytics • Big Insight

Tag: big data

Snowflake Differentiates Itself in Strata Startup Showcase

Feb 23, 2015

Snowflake Computing, a big data warehousing-as-a-service provider, took home top honors at the Startup Showcase event held during last week’s Strata + Hadoop World conference. The award is a boost to the Silicon Valley company, which aims to be a one-stop shop for analyzing data generated in the cloud. Snowflake emerged from stealth mode in October with $26 million in cash and a vision to create an “elastic data warehouse” that lives in the cloud. The company, Read more…

The Wild West and Last Frontier of Big Data

Feb 23, 2015

We are in the Wild West of big data. The speed of processing keeps getting faster, while the volume of data that can be processed is beyond what could have been imagined just a few years ago. The Last Frontier of big data, meanwhile, is the discovery of value hidden in disparate data sources that have yet to be blended and harmonized. Just like the gold-seeking pioneers from centuries past, big data pioneers who embrace this challenge and blaze their Read more…

Why ‘Data Lakes’ May Create Drowning Risks

Feb 16, 2015

Many organizations tackling Big Data projects find themselves swimming in uncharted waters, but the concept of a “data lake” may be at least one way to keep them from wading in too deep. A data lake can be defined as an environment where a data warehouse resides within Hadoop. The idea is to bring greater efficiency to managing unstructured information. The trade-off is that those using the data lake approach are putting all of their eggs in one basket, which Read more…

A Black Mark(et) for Big Data

Feb 6, 2015

Yesterday’s revelation of a massive data breach at Anthem Blue Cross is a potent reminder of the tenuousness of our personal information in today’s digital environment. While corporations maintain large databases by necessity, the ease with which hackers extract that data and sell it on the black market should be a concern to everybody. Anthem disclosed that the personal data of about 80 million people was stolen in a “very sophisticated external cyber attack” that it says it discovered last Read more…

Will Big Metadata Rat You Out?

Feb 5, 2015

Data scientists are usually taught to be cautious with personally identifiable information (PII) and take pains to ensure that it’s properly anonymized and aggregated for authorized uses. But as MIT researchers recently showed, it’s quite possible to “reidentify” a person by analyzing credit card metadata even after it has been anonymized. Writing in the journal Science, four MIT researchers demonstrated a method that can be used to identify individuals by finding links among relatively coarse-grained credit card metadata. While the Read more…

Tachyon Support Coming to Big Data Hypervisor

Feb 3, 2015

Organizations that are deploying Apache Spark to do data science on big data may be inclined to invest in Tachyon, the in-memory file system that was developed alongside Spark at the AMPLab. Getting Spark and Tachyon spun up and deployed on bare metal can be a hassle, but it’s a business opportunity for BlueData, which is aiming to be the VMware of big data. Tachyon is a distributed, in-memory file system designed to enable reliable file sharing at memory speed Read more…

Microsoft Readies Major Push Into Big Data

Feb 2, 2015

Microsoft has a lot of irons in the fire. Always has and always will. But judging from its recent acquisition of Revolution Analytics, the early success of its hosted machine learning service, and the forthcoming public launch of a MapReduce analog called “Cosmos,” the Redmond, Washington software giant is set to make big data an even bigger part of its go-to-market strategy. Microsoft is reportedly gearing up to publicly launch a new big data storage and crunching service called Cosmos, Read more…

9 Places to Get Big Data Now

Jan 29, 2015

Discussions of big data often revolve around what new technologies and processes people are using to analyze data. Hadoop, in-memory databases, and machine learning algorithms are getting lots of attention in this regard. But sometimes we tend to overlook the most important ingredient in big data analytics: the data itself. Much of the big data that organizations want to analyze exists within their own four walls. Relational databases make great repositories for structured data like account records, orders, and customer lists. Depending Read more…

Three Ways Big Data and HPC Are Converging

Jan 27, 2015

Big data is becoming much more than just widespread distribution of cheap storage and cheap computation on commodity hardware. Big data analytics may soon become the new “killer app” for high performance computing (HPC). There is more to big data than large amounts of information. It also pertains to massive distributed activities such as complex queries and computations (a.k.a. analytics). In other words, deriving value through computation is just as “big” as the size of the data sets themselves. In Read more…

Why Machine Learning Is A Priority for Andreessen Horowitz in 2015

Jan 26, 2015

Marc Andreessen and Ben Horowitz are legendary in Silicon Valley, having invested early in firms like Twitter, Facebook, Box, Skype, and Groupon. So when the Sand Hill Road experts with an eye for picking winners put machine learning near the top of their list of trends to watch out for in 2015, it was a cue that the technology is on the cusp of having a breakout year. The way that Andreessen Horowitz sees things, machine learning and big data Read more…