Sectors » Retail


Enforcing Hadoop SLAs in a Big YARN World

Jul 23, 2014 |

The Apache Hadoop community has done a truly amazing job developing a scalable and versatile platform for big data analytic workloads. And with the recent introduction of YARN in Hadoop 2, we’re now able to run multiple analytic engines on our clusters simultaneously. Unfortunately, the prospect for resource contention has also gone up, and that will likely increase demand for service level agreement (SLA) enforcement. YARN made its big introduction just as companies started to move their Hadoop deployments out Read more…

Streaming Analytics Ready for Prime Time, Forrester Says

Jul 22, 2014 |

Analytic platforms that generate insights from data in real time are mature enough for enterprises to begin adopting them, Forrester says in its latest report. While open source streaming analytic products like Apache Storm are proving popular, Forrester says they lack key functionality found in the offerings of proprietary vendors, such as top-rated Software AG. You don’t need a Forrester analyst to know that streaming analytics is red hot at the moment. If Hadoop has opened our eyes to what Read more…

Slicing and Dicing Music Data for Fun and Profit

Jul 21, 2014 |

The advent of big data analytics promises to have a profound impact on many aspects of human life, including how we work and play. Big data is even influencing the arts, where the field of music data science is rearranging our relationship with music. We’re in the midst of a boom in music data science that can be traced back to 1999, when two important events occurred. First, Shawn Fanning unleashed Napster to the world, thereby giving people the power Read more…

For Esri, Analytics All About Location, Location, Location

Jul 18, 2014 |

Certain analytic tools excel at manipulating with certain types of data. When it comes to data with a geographic bent, there may be no more influential vendor than Esri, a Southern California company that has quietly gobbled up a majority share of the geographic information systems (GIS) market. But now the company is positioning GIS as powerful way to visualize all types of data. Esri owns anywhere from 40 to 70 percent of the market for GIS software, according to Read more…

Inside Sibyl, Google’s Massively Parallel Machine Learning Platform

Jul 17, 2014 |

If you’ve ever wondered how your spam gets identified in Gmail or where personal video recommendations come from on YouTube, the answer is likely Sibyl, a massively parallel machine learning system that Google developed to make predictions and recommendations with user-specific data culled from its Internet applications. Dr. Tushar Chandra, a distinguished Google Research engineer, recently shared some information on Sibyl in a keynote presentation at the annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). Sibyl is not Read more…

News In Brief

Survey Finds Business Insights in Operational Data

Jul 18, 2014 |

Operational intelligence derived from automated data collection and traditionally used to manage IT infrastructure could also be leveraged to gauge how transaction-based businesses are performing. Furthermore, machine-generated data could even be exploited to adjust business systems and processes in real-time to account for customer preferences, concludes a recent study that looked at ways of gleaning business insights from seemingly mundane operational intelligence. The study, “Masters of Machines: Business insight from IT operational intelligence,” argues that turning operational intelligence into business Read more…

GraphLabs Wises Up Machine Learning Platform

Jul 16, 2014 |

GraphLabs Inc., a machine learning startup, rolled out a new software platform this week it says will bring expanded machine-learning capabilities to enterprises struggling to find data scientists. The GraphLab Create 1.0 platform is said to automate key aspects of advanced big data analytics so a diverse range of companies can use machine learning to squeeze more business intelligence out of operational and other data. Seattle-based GraphLabs also claims its Create platform is the first to apply advanced machine learning Read more…

Survey: Variety, Not Volume, Stymies Data Scientists

Jul 1, 2014 |

Diverse data types, not just the volume of data, is the biggest challenge faced by data scientists, according to a new survey of big data practitioners. One consequence, warns the survey by computational database specialist Paradigm4, is that data variety is causing frustrated scientists to “leave data on the table.” Of the 111 data scientists responding to the survey, fully 71 percent said big data made analytics more difficult. Data variety rather than volume was most often cited as the Read more…

See Spark Run on NoSQL, DataStax Says

Jun 30, 2014 |

DataStax today announced that Apache Spark is included in the latest release of its NoSQL databases, including open source Cassandra and DataStax Enterprise version 4.5. Those databases now include the in-memory Spark tools, thereby giving customers a new option for running analytic workloads on transactional data. Apache Spark has garnered loads of attention as a potent analytic toolset for Hadoop. Developers are eager for Spark, which offers a single API for accessing a treasure trove of machine learning, graph, SQL, Read more…

Report: Dropbox Acquires Stealthy Data Startup

Jun 19, 2014 |

Dropbox, the secure file-sharing specialist, has reportedly acquired a stealthy big data startup, TechCrunch reported earlier in the week. According to the June 16 report, Dropbox has acquired Parastructure, which develops data analysis software based on open source infrastructure. Little else is known about the startup beyond what was posted on its homepage. While neither company confirmed the deal, Parastructure’s web site now redirects to Dropbox. What is known is that the San Francisco-based startup was formed in 2012 by Read more…

This Just In

Tresata Releases TREE 3.3

Jun 5, 2014 |

CHARLOTTE, N.C., June 5 – Tresata announced the release of TREE 3.3, its third generation of its most successful software application that has been used by both financial services and retail industry giants to integrate disparate datasets in Hadoop at a segment of one. TREE is the only 100% Hadoop-powered software application that collects, curates and collapses multiple datasets ranging from transactional data, market data, customer service data and social network data and is able to resolve all critical variables to a Read more…

Hadoop Market Expected to Reach 20.9 Billion by 2018

Jan 3, 2014 |

According to a new market report published by Transparency Market Research “Hadoop Market – Global Industry Analysis, Size, Share, Growth, Trends, and Forecast, 2012- 2018,” the global Hadoop market was worth USD 1.5 billion in 2012 and is expected to reach USD 20.9 billion in 2018, growing at a CAGR of 54.7% from 2012 to 2018.