Sears Rides Hadoop Up Retail Mountain
Falling behind Walmart and Target in retail store sales, Sears hopes to rebound by investing fairly heavily in Hadoop. Sears revenue had decreased from $50 billion in 2008 to $42 billion last year. However, smarter marketing as a result of being able to keep all their data and target customers individually has resulted in sizable growth over the last year, with sales over this year’s quarter ending on July 28 up 163% from the same quarter in 2011.
“With Hadoop we can keep everything, which is crucial because we don’t want to archive or delete meaningful data,” said Sears Chief Technology Officer Phil Shelley. Sears has seen their big data processing, especially with regard to evaluating marketing campaigns, quicken as a result of moving their data from Teradata and SAS onto Hadoop. According to Shelley, what took six weeks’ time now happens within a week on Hadoop. Their current 300-node cluster which contains 2PB allows the company to keep 100% of their data instead of a meager 10% according to Shelley.
Sears’s view and strategy regarding big data is an interesting one. Along with being Sears’s CTO and Executive VP, Shelley runs MetaScale, a Sears subsidiary whose goal it is to move into providing Hadoop services for other companies similar to Amazon and their Amazon Web Services.
It would seem that, in an effort to compete with Amazon, they would have a little catching up to do. On the other hand, Sears’s big data efforts currently surpass that of Walmart’s, who just started running ten Hadoop test nodes for experimental e-commerce analysis. Sears did that in 2010.
It is unfair to compare Sears, which makes it money historically from their physical stores, to online stores like Amazon. It may not even be fair to compare them to Target and Walmart, as Sears has more of an appliance focus while Target and Walmart are more general.
With that being said, Sears, and specifically MetaScale, wants to exist in the big data market. They have some interesting viewpoints regarding that. For example, Shelley sees little value in the modern era for ETL.
“ETL is an antiquated technique, and for large companies it’s inefficient and wasteful because you create multiple copies of data,” Shelley says. .“Everybody used ETL because they couldn’t put everything in one place, but that has changed with Hadoop, and now we copy data, as a matter of principle, only when we absolutely have to copy.”
Shelley’s principles are sound and may have led to the drastic reported reduction, $500,000, in their mainframe costs per year. Some, like Cloudera CEO Mike Olson, warn against the complete departure from ETL. But to Shelley the move is intuitive. “If in three years you come up with a new query or analysis, it doesn’t matter because there’s no schema,” Shelley says. “You just go get the raw data and transform it into any format you need.”
July 21, 2016
- Datawatch Reports Third Quarter Fiscal 2016 Financial Results
- Cloudera Enterprise 5.8 and Navigator Optimizer Now Available
- The Scripps Research Institute Leverages DDN Storage
- Hazelcast and Heimdall Data Launch Intelligent SQL Optimization Solution
July 20, 2016
- TIBCO Mashery Enterprise Introduced
- Oregon State University’s New Data Analytics Programs Aim to Address National Shortage
- Deloitte Advisory Cyber Risk Services and Cray Introduce Cyber Reconnaissance and Analytics
- Snowflake Reports New Product Innovations and Strategic Partnerships
July 19, 2016
- Zettaset Big Data Encryption Solution Achieves Certification With MapR Converged Data Platform
- MapR Risk Management Quick Start Solution for Financial Services Now Available
- Global Corporation Selects Attunity Replicate for Hadoop
- NICE Actimize Signs Agreement With Tableau and Launches Visual Analytics Solution
- Nuix to Integrate Data Processing and Indexing Engine With Teradata Aster Analytics
- Metis Becomes First and Only Data Science Training Bootcamp in the U.S. to Accept International Students
- Ryanair Utilizing QlikView and Qlik Sense
- Inspur Announces Availability of Memory1 Servers for Big Data Applications
July 18, 2016
- Talena Secures HPE Vertica and Hortonworks Certifications
- Splice Machine’s New Open-Source RDBMS Sandbox Goes Live on AWS
- Hortonworks to Present at Pacific Crest Global Technology Leadership Forum
- Waterline Data, Trifacta, and Cloudera Present Webinar on How to Govern Data Without Hampering its Value
Most Read Features
- 9 Must-Have Skills to Land Top Big Data Jobs in 2015
- Concord Claims 10x Performance Edge on Spark Streaming
- Solr or Elasticsearch–That Is the Question
- What’s Hot This Summer: Data Science Bootcamps
- Avoid These Five Big Data Governance Mistakes
- Spark Streaming: What Is It and Who’s Using It?
- How Uber Uses Spark and Hadoop to Optimize Customer Experience
- Which Type of SSD is Best: SATA, SAS, or PCIe?
- Three NoSQL Databases You’ve Never Heard Of
- Skip the Ph.D and Learn Spark, Data Science Salary Survey Says
- More Features…
Most Read News In Brief
- Six Big Name Schools with Big Data Programs
- Investments in Fast Data Analytics Surge
- Why Gartner Dropped Big Data Off the Hype Curve
- Companies Struggle to Find an ROI on Analytics
- Report: Machine Learning Driving AI
- Crunchy Data Container Suite Packages PostgresSQL
- The Rise and Fall of Qlik
- Doubts Mount About Data Payoffs
- Talend Rolls New Cloud Data Manager Platform
- Big Data Career Notes
- More News In Brief…
Most Read This Just In
- H2O.ai Unveils Sparkling Water 2.0
- Continuum Analytics Releases Anaconda Mosaic
- MapR Unveils New Converge Partners Program
- Kyvos Insights Delivers New Version of Big Data Analytics Solution for Hadoop
- TransUnion Powers New Prama Self-Service Analytics Platform With MapR
- BI Leaders Join With Teradata to Enhance Presto for the Enterprise
- Qualys Introduces App for Splunk Enterprise
- Qubole and WANdisco Launch Cloudera Migration Program
- Bigstep Launches Bare-Metal Cloud for Big Data in the U.S.
- IMS Health Chooses Cloudera Enterprise to Support Big Data Factory for Life Sciences and Healthcare
- More This Just In…
September 8 - September 9Boston MA United States
September 19 - September 20Malaysia
September 19New York NY United States
September 26 - September 29New York United States
September 26 - September 27New York United States