Big Data • Big Analytics • Big Insight

Applications » Predictive Analytics


Training Day: CrowdFlower Sets Human-Generated Data Free

Mar 4, 2015 |

Data scientists who are looking for high quality sets of curated data on which to train their machine learning models may want to check out CrowdFlower, which today unleashed a veritable treasure trove of free human-generated data. CrowdFlower today released about 40 data sets as part of its Data for Everyone campaign (see But over the coming weeks, the San Francisco company expects to make thousands of data sets available for download from its website, covering millions of records. Read more…

The 3 Key Steps to Building a Predictive App with Machine Learning

Mar 3, 2015 |

Machine learning is the technology that allows businesses to make sense of vast quantities of data, make better decisions, and ultimately bring better services to consumers. From personalized recommendations to fraud detection, from sentiment analysis to personalized medicine, machine learning provides the technology to adapt services to individual needs. For all the value that it brings, machine learning technology has a high cost. Building a predictive application is a multi-stage and iterative process that requires a plethora of people, systems Read more…

Rating the Advanced Analytics Vendors

Feb 27, 2015 |

There are several ways you can go about obtaining the advanced analytic capabilities needed to extract insights from large amounts of data. You can outsource the whole thing to a services firm, you can buy pre-built applications for a specific industry, or you can buy tools that will let you build what you need. Last week, Gartner rated the top 16 such build-it-yourself tools in the advanced analytics category. The “Magic Quadrant for Advanced Analytics Platforms” that Gartner delivered last Read more…

Big Data So Easy a Caveman Could Do It?

Feb 26, 2015 |

Let’s face it: big data isn’t easy. If you’re building a big data application today, you’re up to your eyeballs in things like R and Java, MapReduce and Pig, and Storm and Kafka. There’s a reason data scientists are so hard to find that they’re compared to unicorns. But in the future, the big data application assembly process may be dumbed down to the point where, as the insurance commercial says, even a caveman could do it. That’s the approach Read more…

Spark Steals the Show at Strata

Feb 25, 2015 |

There was a lot of good stuff on display at last week’s Strata + Hadoop World conference. But if there was one product or technology that stood out from the pack, that would have to be Apache Spark, the versatile in-memory framework that is taking the big data world by storm. At Strata, Spark creator Matei Zaharia showed how the technology will get even more powerful in the months to come. Spark has garnered an incredible amount of momentum, largely running Read more…

News In Brief

Apache Spark Ecosystem Continues To Build

Feb 25, 2015 |

Apache Spark was everywhere at the recent Strata + Hadoop World conference. From Tableau’s new Spark interface to the new Spark as a service (SaaS) offerings and Intel’s new Spark initiative, the big data framework was very hard to miss. Intel jumped on Spark’s bandwagon last week when it announced it was forming a new initiative around the in-memory framework. “We have engaged with Databricks, one of the pioneers of Apache Spark, to advance analytics capability for the Spark on Read more…

Snowflake Differentiates Itself in Strata Startup Showcase

Feb 23, 2015 |

Snowflake Computing, a big data warehousing as a service provider, took home top honors at the Startup Showcase event held during last week’s Strata + Hadoop World conference. The award is a boost to the Silicon Valley company, which aims to be a one-stop shop for analyzing data generated on the cloud. Snowflake emerged from stealth mode in October with $26 million in cash and a vision to create an “elastic data warehouse” that lives in the cloud. The company, Read more…

IBM Embraces Hadoop in ‘BigInsight’ Push

Feb 19, 2015 |

IBM jumped onto the Hadoop bandwagon this week with the introduction of its BigInsights for Apache Hadoop offering along with machine learning with R statistical computing and other features designed to handle data analysis at massive scale. The introduction coincides with the launch of an industry initiative by IBM and others to promote Apache Hadoop and big data technologies in enterprises. IBM BigInsights for Apache Hadoop comes with a broad data science toolset to query data, visualize and carry out Read more…

Survey Finds Uneven Success With Big Data Rollouts

Feb 4, 2015 |

Despite heavy investments in big data deployments, a survey finds that few respondents have either shifted their operations to production or are satisfied with their big data initiatives. The key to success, the survey found, is establishing a centralized organizational structure when rolling out their big data and analytics units. The troubling findings are contained in a recent survey released by Capgemini Consulting, which found that only 27 percent of respondents considered their big data initiatives to be “successful” and Read more…

Microsoft Readies Major Push Into Big Data

Feb 2, 2015 |

Microsoft has a lot of irons in the fire. Always has and always will. But judging from its recent acquisition of Revolution Analytics, the early success of its hosted machine learning service, and the forthcoming public launch of a MapReduce analog called “Cosmos,” the Redmond, Washington software giant is set to make big data an even bigger part of its go-to-market strategy. Microsoft is reportedly gearing up to publicly launch a new big data storage and crunching service called Cosmos, Read more…

This Just In

IBM and Juniper Networks Partner

Feb 25, 2015 |

ARMONK, N.Y. and SUNNYVALE, Calif., Feb. 25 — IBM and Juniper Networks today announced plans to provide real-time network behavior insights to help customers dramatically improve mobile experiences, address increasing Internet of Things (IoT) application demands and uncover new opportunities gleaned from Big Data. IBM and Juniper Networks will work together to enable the design and delivery of next generation high-performance network analytics to help communications service providers (CSP) and enterprises become more agile and efficient, reducing time to deployment and cost, while enhancing end user application experiences. With close Read more…

HP Unveils Haven Predictive Analytics

Feb 17, 2015 |

SAN JOSE, Calif., Feb. 17 — HP today unveiled HP Haven Predictive Analytics, a new offering that accelerates and operationalizes large-scale machine learning and statistical analysis, and ultimately provides organizations with much deeper insights and understanding into today’s rapidly evolving data volumes. Powered by HP’s innovative Distributed R offering, the new release dramatically improves performance and enables users to analyze much larger data sets than was previously possible with the popular R statistical programing language. Available now at, the new offering includes the following key components and Read more…

RapidMiner Makes Self-Service Advanced Analytics Available for Hadoop

Feb 17, 2015 |

SAN JOSE, Calif. and BOSTON, Mass., Feb. 17 — RapidMiner, the industry’s easiest-to-use Modern Analytics platform, today announced significant updates to the most comprehensive advanced analytics offering on the market today. In a world where data lakes are often used solely as a repository for information, underutilized due to the state of the market and limits of technology, RapidMiner’s aggressive advances turn the tide for data scientists and business users alike to extract business value from Big Data. Most analytics vendors Read more…

New Functionality Announced Within Lavastorm Analytics Engine Platform

Feb 3, 2015 |

Feb. 3 — Lavastorm Analytics, a leading agile data management and analytics software company, today announced new functionality within its Lavastorm Analytics Engine platform that enables business analysts who have a limited knowledge of complex data science to deliver business-impacting insights using predictive analytics. Business analysts can now leverage key aspects of data science without requiring a specialized educational background in advanced analytics. Business analysts typically encounter a host of core problems when trying to utilize predictive analytics. They lack the Read more…

Skytree Partners with Tableau to Combine Machine Learning and Data Visualization

Jan 29, 2015 |

SAN JOSE, Calif., Jan. 29 — Skytree, the Machine Learning Company, today announced its partnership with Tableau, a global leader in rapid-fire, easy-to-use business analytics software, to bring Skytree Infinity’s advanced machine learning technology to Tableau’s data analytics software. The partnership furthers Skytree’s footprint within the big data ecosystem. Now, Tableau’s global user base can benefit from Skytree’s best-in-class machine learning without having to leave their dashboard. “Machine learning is an important part of gaining the greatest amount of value from Read more…