Sectors » Retail


How Spark Helps ClearStory Achieve Data Harmony

Jul 28, 2014 |

When the folks at ClearStory Data set out to build a big data harmonization tool that could automatically converge diverse data streams into a single coherent view for analysis, they figured they’d have to develop much of the underlying technology themselves. Then they heard of the Apache Spark project at Cal Berkeley’s AMPlab, and the rest, as they say, is history. ClearStory Data was founded several years ago by two high-tech veterans, Sharmila Mulligan and Vaibhav Nivargi, with the idea Read more…

AutoTrader Zooms Ahead with Automated ETL

Jul 25, 2014 |

If you’re in the market for a car, you’re probably using the Internet to check inventory and compare prices. From the consumer point of view, it all appears to work seamlessly, but keeping all that automotive data accurate and up-to-date is no easy task. For the folks at AutoTrader and its data integration subsidiary, VinSolutions, the move to an automated ETL process proved beneficial. Founded in 1996, today is one of the most popular sources for market data on Read more…

Enforcing Hadoop SLAs in a Big YARN World

Jul 23, 2014 |

The Apache Hadoop community has done a truly amazing job developing a scalable and versatile platform for big data analytic workloads. And with the recent introduction of YARN in Hadoop 2, we’re now able to run multiple analytic engines on our clusters simultaneously. Unfortunately, the prospect for resource contention has also gone up, and that will likely increase demand for service level agreement (SLA) enforcement. YARN made its big introduction just as companies started to move their Hadoop deployments out Read more…

Streaming Analytics Ready for Prime Time, Forrester Says

Jul 22, 2014 |

Analytic platforms that generate insights from data in real time are mature enough for enterprises to begin adopting them, Forrester says in its latest report. While open source streaming analytic products like Apache Storm are proving popular, Forrester says they lack key functionality found in the offerings of proprietary vendors, such as top-rated Software AG. You don’t need a Forrester analyst to know that streaming analytics is red hot at the moment. If Hadoop has opened our eyes to what Read more…

Slicing and Dicing Music Data for Fun and Profit

Jul 21, 2014 |

The advent of big data analytics promises to have a profound impact on many aspects of human life, including how we work and play. Big data is even influencing the arts, where the field of music data science is rearranging our relationship with music. We’re in the midst of a boom in music data science that can be traced back to 1999, when two important events occurred. First, Shawn Fanning unleashed Napster to the world, thereby giving people the power Read more…

News In Brief

NFL’s 49ers Launch Data Drive to Boost Fan Base

Jul 28, 2014 |

The San Francisco 49ers are now brought to you by the $1.2 billion Levi’s Stadium in Santa Clara. The NFL team’s “fan engagement program,” dubbed “Faithful 49″ is presented by team sponsor Esurance and a couple of data collectors you may have heard of: Google and Facebook. As the Niners prepare to open their spanking new 68,500-seat stadium in Silicon Valley, the team is jumping into the data collection business by offering fans incentives like flying in the team charter Read more…

Survey Finds Business Insights in Operational Data

Jul 18, 2014 |

Operational intelligence derived from automated data collection and traditionally used to manage IT infrastructure could also be leveraged to gauge how transaction-based businesses are performing. Furthermore, machine-generated data could even be exploited to adjust business systems and processes in real-time to account for customer preferences, concludes a recent study that looked at ways of gleaning business insights from seemingly mundane operational intelligence. The study, “Masters of Machines: Business insight from IT operational intelligence,” argues that turning operational intelligence into business Read more…

GraphLabs Wises Up Machine Learning Platform

Jul 16, 2014 |

GraphLabs Inc., a machine learning startup, rolled out a new software platform this week it says will bring expanded machine-learning capabilities to enterprises struggling to find data scientists. The GraphLab Create 1.0 platform is said to automate key aspects of advanced big data analytics so a diverse range of companies can use machine learning to squeeze more business intelligence out of operational and other data. Seattle-based GraphLabs also claims its Create platform is the first to apply advanced machine learning Read more…

Survey: Variety, Not Volume, Stymies Data Scientists

Jul 1, 2014 |

Diverse data types, not just the volume of data, is the biggest challenge faced by data scientists, according to a new survey of big data practitioners. One consequence, warns the survey by computational database specialist Paradigm4, is that data variety is causing frustrated scientists to “leave data on the table.” Of the 111 data scientists responding to the survey, fully 71 percent said big data made analytics more difficult. Data variety rather than volume was most often cited as the Read more…

See Spark Run on NoSQL, DataStax Says

Jun 30, 2014 |

DataStax today announced that Apache Spark is included in the latest release of its NoSQL databases, including open source Cassandra and DataStax Enterprise version 4.5. Those databases now include the in-memory Spark tools, thereby giving customers a new option for running analytic workloads on transactional data. Apache Spark has garnered loads of attention as a potent analytic toolset for Hadoop. Developers are eager for Spark, which offers a single API for accessing a treasure trove of machine learning, graph, SQL, Read more…

This Just In

Tresata Releases TREE 3.3

Jun 5, 2014 |

CHARLOTTE, N.C., June 5 – Tresata announced the release of TREE 3.3, its third generation of its most successful software application that has been used by both financial services and retail industry giants to integrate disparate datasets in Hadoop at a segment of one. TREE is the only 100% Hadoop-powered software application that collects, curates and collapses multiple datasets ranging from transactional data, market data, customer service data and social network data and is able to resolve all critical variables to a Read more…

Hadoop Market Expected to Reach 20.9 Billion by 2018

Jan 3, 2014 |

According to a new market report published by Transparency Market Research “Hadoop Market – Global Industry Analysis, Size, Share, Growth, Trends, and Forecast, 2012- 2018,” the global Hadoop market was worth USD 1.5 billion in 2012 and is expected to reach USD 20.9 billion in 2018, growing at a CAGR of 54.7% from 2012 to 2018.