Big Data • Big Analytics • Big Insight

Applications » Data Mining

Features

Rating the Advanced Analytics Vendors

Feb 27, 2015 |

There are several ways you can go about obtaining the advanced analytic capabilities needed to extract insights from large amounts of data. You can outsource the whole thing to a services firm, you can buy pre-built applications for a specific industry, or you can buy tools that will let you build what you need. Last week, Gartner rated the top 16 such build-it-yourself tools in the advanced analytics category. The “Magic Quadrant for Advanced Analytics Platforms” that Gartner delivered last Read more…

Big Data So Easy a Caveman Could Do It?

Feb 26, 2015 |

Let’s face it: big data isn’t easy. If you’re building a big data application today, you’re up to your eyeballs in things like R and Java, MapReduce and Pig, and Storm and Kafka. There’s a reason data scientists are so hard to find that they’re compared to unicorns. But in the future, the big data application assembly process may be dumbed down to the point where, as the insurance commercial says, even a caveman could do it. That’s the approach Read more…

Spark Steals the Show at Strata

Feb 25, 2015 |

There was a lot of good stuff on display at last week’s Strata + Hadoop World conference. But if there was one product or technology that stood out from the pack, that would have to be Apache Spark, the versatile in-memory framework that is taking the big data world by storm. At Strata, Spark creator Matei Zaharia showed how the technology will get even more powerful in the months to come. Spark has garnered an incredible amount of momentum, largely running Read more…

The Wild West and Last Frontier of Big Data

Feb 23, 2015 |

We are in the Wild West of big data. The speed of processing keeps getting faster, while the volume of data that can be processed is beyond what could have been imagined just a few years ago. The Last Frontier of big data, meanwhile, is the discovery of value hidden in disparate data sources that have yet to be blended and harmonized. Just like the gold-seeking pioneers from centuries past, big data pioneers who embrace this challenge and blaze their Read more…

How Advances in SQL on Hadoop Are Democratizing Big Data–Part 2

Feb 19, 2015 |

In a previous article, we discussed several key advances in SQL on Hadoop that are making Big Data capabilities increasingly accessible to analytics organizations. While SQL democratizes Big Data by leveraging conventional skillsets, SQL access to data in isolation is not a guarantee of sustainable analytic agility. Architects successfully integrating Hadoop to accelerate and expand analytic capabilities understand the value of data governance, transparency and common vocabulary to ensure actionable value-add analytic capabilities. Similarly, effective managers engaging new SQL on Read more…

News In Brief

Apache Spark Ecosystem Continues To Build

Feb 25, 2015 |

Apache Spark was everywhere at the recent Strata + Hadoop World conference. From Tableau’s new Spark interface to the new Spark as a service (SaaS) offerings and Intel’s new Spark initiative, the big data framework was very hard to miss. Intel jumped on Spark’s bandwagon last week when it announced it was forming a new initiative around the in-memory framework. “We have engaged with Databricks, one of the pioneers of Apache Spark, to advance analytics capability for the Spark on Read more…

Snowflake Differentiates Itself in Strata Startup Showcase

Feb 23, 2015 |

Snowflake Computing, a big data warehousing as a service provider, took home top honors at the Startup Showcase event held during last week’s Strata + Hadoop World conference. The award is a boost to the Silicon Valley company, which aims to be a one-stop shop for analyzing data generated on the cloud. Snowflake emerged from stealth mode in October with $26 million in cash and a vision to create an “elastic data warehouse” that lives in the cloud. The company, Read more…

Cloudera Brings Kafka Under Its ‘Data Hub’ Wing

Feb 18, 2015 |

Cloudera is making Apache Kafka a supported part of its Hadoop distribution, the company announced today. While Kafka still doesn’t run on Hadoop, Cloudera says the changes it is instituting will help CDH customers build real-time analytics applications that span Hadoop and Kafka. Kafka is an open source message broker that’s designed to handle massive flows of streaming, real-time data, such as log data. The software was originally developed at LinkedIn, which uses it to process hundreds of millions of Read more…

MapR Delivers Bi-Directional Replication with Distro Refresh

Feb 18, 2015 |

A new release of the MapR Distribution including Hadoop unveiled today will enable companies to perform real-time, bi-directional data replication between Hadoop clusters that are thousands of miles apart. The new table replication feature was added to MapR-DB, the NoSQL database included with the high-end edition of MapR’s commercial Hadoop offering. As Hadoop adoption grows, companies are finding it increasingly difficult to ensure that they’re acting on the latest, freshest data. This fast-data problem is particularly evident in organizations that Read more…

Plugging Leaks in Big Data Lakes

Feb 17, 2015 |

The big data lake phenomenon is in full swing at the moment, with Hadoop playing a central role in the storage and processing of massive amounts of data. But without certain processes in place, a data lake will not stand the test of time. Unfortunately, most of those processes must be implemented manually today. People today are expecting too much out of Hadoop, and therefore setting themselves up for failure. While Hadoop provides the basic structure for storing and analyzing Read more…

This Just In

Intel and Mitsubishi Collaborate

Sep 30, 2014 |

TOKYO, Japan, Sept. 30 — Intel Corporation and Mitsubishi Electric Corporation today announced a new collaboration to develop next-generation factory automation (FA) systems with Internet of Things (IoT) technologies and a pilot program at Intel’s backend manufacturing facility in Malaysia. The pilot demonstrates the benefits of IoT in a factory setting with a focus on delivering productivity enhancement through innovative functions, such as predictive failure, by combining Intel’s expertise developing solutions for IoT and Mitsubishi Electric’s “e-F@ctory” automation capabilities. Intel realized Read more…

SAS to Host Two Big Data Events in October

Sep 24, 2014 |

CARY, N.C., Sept. 24 — From data visualization to cybersecurity, marketing analytics to the Internet of Things, the sources and uses of data are rapidly evolving. This October in Las Vegas, decision makers and data scientists from around the world will gather at two events to explore and share how critical technologies — data mining, data visualization, Hadoop, forecasting and more — create value from big data. And they’ll examine big data threats too, including cyber-attacks and fraud. The Premier Read more…

RapidMiner World Conference Concludes

Aug 25, 2014 |

BOSTON, Mass., Aug. 25 — Pioneering predictive analytics leader RapidMiner last week concluded its RapidMiner World conference, which brought together over 100 RapidMiner users and other data analytics industry experts from around the globe. The four-day conference, which took place from August 18 – 21 in Boston, explored the latest in predictive analytics, data mining, and the future of RapidMiner. With practical use cases and industry discussions, RapidMiner World attracted a wide-range of participants, ranging in skill-level and company role. In this inaugural U.S. Read more…

ProfitBricks Accelerating Big Data Projects with SDN and Infiniband

Apr 2, 2014 |

ProfitBricks, the price/performance leader in Cloud Computing IaaS, is now helping big data systems integrators like Altoros bring the power of the cloud to the high demand of its customers. Thanks to second generation technologies like software-defined networking (SDN) and InfiniBand that are at the heart of the ProfitBricks cloud, Altoros customers can easily deploy Hadoop clusters for both temporary projects and continuous big data analysis programs.

Dell Acquires StatSoft

Mar 24, 2014 |

Dell today announced the acquisition of StatSoft, a leading provider of advanced analytics solutions that deliver a wide range of data mining, predictive analytics and data visualization capabilities. StatSoft combines comprehensive statistical analysis with advanced analytics to help organizations better understand their businesses, predict change, increase agility and control critical systems.