Sears Rides Hadoop Up Retail Mountain
Falling behind Walmart and Target in retail store sales, Sears hopes to rebound by investing fairly heavily in Hadoop. Sears revenue had decreased from $50 billion in 2008 to $42 billion last year. However, smarter marketing as a result of being able to keep all their data and target customers individually has resulted in sizable growth over the last year, with sales over this year’s quarter ending on July 28 up 163% from the same quarter in 2011.
“With Hadoop we can keep everything, which is crucial because we don’t want to archive or delete meaningful data,” said Sears Chief Technology Officer Phil Shelley. Sears has seen their big data processing, especially with regard to evaluating marketing campaigns, quicken as a result of moving their data from Teradata and SAS onto Hadoop. According to Shelley, what took six weeks’ time now happens within a week on Hadoop. Their current 300-node cluster which contains 2PB allows the company to keep 100% of their data instead of a meager 10% according to Shelley.
Sears’s view and strategy regarding big data is an interesting one. Along with being Sears’s CTO and Executive VP, Shelley runs MetaScale, a Sears subsidiary whose goal it is to move into providing Hadoop services for other companies similar to Amazon and their Amazon Web Services.
It would seem that, in an effort to compete with Amazon, they would have a little catching up to do. On the other hand, Sears’s big data efforts currently surpass that of Walmart’s, who just started running ten Hadoop test nodes for experimental e-commerce analysis. Sears did that in 2010.
It is unfair to compare Sears, which makes it money historically from their physical stores, to online stores like Amazon. It may not even be fair to compare them to Target and Walmart, as Sears has more of an appliance focus while Target and Walmart are more general.
With that being said, Sears, and specifically MetaScale, wants to exist in the big data market. They have some interesting viewpoints regarding that. For example, Shelley sees little value in the modern era for ETL.
“ETL is an antiquated technique, and for large companies it’s inefficient and wasteful because you create multiple copies of data,” Shelley says. .“Everybody used ETL because they couldn’t put everything in one place, but that has changed with Hadoop, and now we copy data, as a matter of principle, only when we absolutely have to copy.”
Shelley’s principles are sound and may have led to the drastic reported reduction, $500,000, in their mainframe costs per year. Some, like Cloudera CEO Mike Olson, warn against the complete departure from ETL. But to Shelley the move is intuitive. “If in three years you come up with a new query or analysis, it doesn’t matter because there’s no schema,” Shelley says. “You just go get the raw data and transform it into any format you need.”
May 25, 2017
- Talend and Snowflake Team to Drive Cloud Data Migration
- Interana Survey: 70% of Organizations Struggle to Find Critical Data Insights
May 24, 2017
- IBM, Google and Lyft Announce New Open Source Cloud Project
- Infosys Launches Boundaryless Data Lake on AWS
- Qlik Appoints New Regional Director for Eastern Europe
- Talend Announces Support for Cloudera Altus
- Cloudera Launches Altus to Simplify Big Data Workloads in the Cloud
- Dataguise Answers Enterprise GDPR Compliance Challenges with Complete Sensitive Data Governance
- Trifacta Reveals Spring ’17 Release
- IBM Accelerates Open Database-as-a-Service on IBM Power Systems
- Breakthrough for Large-Scale Computing: ‘Memory Disaggregation’ Made Practical
May 23, 2017
- Pepperdata Code Analyzer for Apache Spark Highlights Bottlenecks for Developers
- MapR Announces New Quick Start Solution
- MariaDB Announces TX 2.0 for Enterprise
- Attunity Wins Bronze in the 12th Annual 2017 IT World Awards
- IBM and the tranSMART Foundation Bring Translational Medicine Data to Scientists
- Life Sciences and Medical Research Turn to Qumulo for Scale-Out Storage
May 22, 2017
- Bright Computing Announces Integration with BeeGFS from ThinkParQ
- Kim Hammonds and Mike Stankey Join Cloudera Board of Directors
- Kodiak Data Launches MemCloud to Address Big Data Infrastructure Chasm
Most Read Features
- Machine Learning, Deep Learning, and AI: What’s the Difference?
- 9 Must-Have Skills to Land Top Big Data Jobs in 2015
- Kafka ‘Massively Simplifies’ Data Infrastructure, Report Says
- Hadoop Has Failed Us, Tech Experts Say
- How Uber Uses Spark and Hadoop to Optimize Customer Experience
- Which Type of SSD is Best: SATA, SAS, or PCIe?
- Spark Streaming: What Is It and Who’s Using It?
- Speed: The No. 1 Reason to Adopt Graph Tech
- How ‘Purple Rain’ Bolsters Security Intelligence for Capital One
- New AI Chips to Give GPUs a Run for Deep Learning Money
- More Features…
Most Read News In Brief
- Why Gartner Dropped Big Data Off the Hype Curve
- Infor Buys Cloud Analytics Vendor Birst
- Microsoft Surges in Gartner Quadrant with Power BI
- Data Deals Heat Up
- Six Big Name Schools with Big Data Programs
- ‘Cloud TPU’ Bolsters Google’s ‘AI-First’ Strategy
- GOAI Publishes Python Data Frame for GPU Analytics
- Startup Patents ‘Whole Brain’ AI Approach
- Demand, Salaries Grow for Data Scientists
- Data Warehouse Market Ripe for Disruption, Gartner Says
- More News In Brief…
Most Read This Just In
- Fuzzy Logix, Kinetica to Provide Enhanced GPU-Accelerated In-Database Analytics
- NorCom Selects MapR to Accelerate Deep Learning in Autonomous Driving Applications
- NVIDIA & H2O.ai Announce Major Partnership News
- MarkLogic Launches Version 9 of Database Platform
- MariaDB Names Jon Bakke as Chief Revenue Officer
- Gartner Names Arcadia Data Cool Vendor in IoT Analytics
- Talend Reports First Quarter 2017 Financial Results
- TIBCO Software to Acquire Data Science Platform Leader Statistica
- Apache Software Foundation Announces Beam v2.0.0
- MapR Receives Back-to-Back Honors from CRN
- More This Just In…
June 5 - June 7San Francisco CA United States
June 5 - June 6San Francisco CA United States
June 6 - June 8New York United States
July 18 - July 20Las Vegas NV United States
September 18 - September 19Germany