Sears Rides Hadoop Up Retail Mountain
Falling behind Walmart and Target in retail store sales, Sears hopes to rebound by investing fairly heavily in Hadoop. Sears revenue had decreased from $50 billion in 2008 to $42 billion last year. However, smarter marketing as a result of being able to keep all their data and target customers individually has resulted in sizable growth over the last year, with sales over this year’s quarter ending on July 28 up 163% from the same quarter in 2011.
“With Hadoop we can keep everything, which is crucial because we don’t want to archive or delete meaningful data,” said Sears Chief Technology Officer Phil Shelley. Sears has seen their big data processing, especially with regard to evaluating marketing campaigns, quicken as a result of moving their data from Teradata and SAS onto Hadoop. According to Shelley, what took six weeks’ time now happens within a week on Hadoop. Their current 300-node cluster which contains 2PB allows the company to keep 100% of their data instead of a meager 10% according to Shelley.
Sears’s view and strategy regarding big data is an interesting one. Along with being Sears’s CTO and Executive VP, Shelley runs MetaScale, a Sears subsidiary whose goal it is to move into providing Hadoop services for other companies similar to Amazon and their Amazon Web Services.
It would seem that, in an effort to compete with Amazon, they would have a little catching up to do. On the other hand, Sears’s big data efforts currently surpass that of Walmart’s, who just started running ten Hadoop test nodes for experimental e-commerce analysis. Sears did that in 2010.
It is unfair to compare Sears, which makes it money historically from their physical stores, to online stores like Amazon. It may not even be fair to compare them to Target and Walmart, as Sears has more of an appliance focus while Target and Walmart are more general.
With that being said, Sears, and specifically MetaScale, wants to exist in the big data market. They have some interesting viewpoints regarding that. For example, Shelley sees little value in the modern era for ETL.
“ETL is an antiquated technique, and for large companies it’s inefficient and wasteful because you create multiple copies of data,” Shelley says. .“Everybody used ETL because they couldn’t put everything in one place, but that has changed with Hadoop, and now we copy data, as a matter of principle, only when we absolutely have to copy.”
Shelley’s principles are sound and may have led to the drastic reported reduction, $500,000, in their mainframe costs per year. Some, like Cloudera CEO Mike Olson, warn against the complete departure from ETL. But to Shelley the move is intuitive. “If in three years you come up with a new query or analysis, it doesn’t matter because there’s no schema,” Shelley says. “You just go get the raw data and transform it into any format you need.”
September 26, 2016
- Quantum Provides Petascale Data Storage and Management for Major European Research Institutions
- MemSQL 5.5 Released
- Pentaho Announces Five New Data Integration Enhancements
- NetApp Announces New Software, Flash Systems, and Expanded Cloud Support
- Zoomdata and Cloudera Partner to Deliver Customer Insights Solution
- Red Hat Introduces Ansible Tower App for Splunk
- University of Tokyo and JCAHPC to Deploy DDN’s IME14K for New Systems
- MemSQL Unveils Roadmap for Ongoing Integration With Apache Spark
- Zoomdata Announces Partnership to Provide Unified Visual Analytics Front End for Teradata Data Platforms
- Maana Unveils Winter ’17 Knowledge Platform
- Lightbend Announces Early Access Program for New Fast Data Platform
- Zoomdata Offering Free 30-Day Trial in AWS Marketplace
September 23, 2016
- Finalists Announced for 2016 Data Impact Awards
- NetApp to Showcase Flash-Optimized Solutions for Data Analytics at .conf2016
September 22, 2016
- TIBCO BusinessWorks Container Edition Now Available on Pivotal Network
- Informatica to Expand Data Lake Management Solution
- RapidMiner Launches New Marketplace of Data Science Experts
- Cloudera and CenturyLink Expand Alliance to Deliver Big Data as a Service
- Moody’s Analytics Chooses Qlik to Deliver Insights for Risk Management Assessment Solution
- Pepperdata Introduces New Offering That Speeds Amazon EMR
Most Read Features
- 9 Must-Have Skills to Land Top Big Data Jobs in 2015
- Which Type of SSD is Best: SATA, SAS, or PCIe?
- Spark Streaming: What Is It and Who’s Using It?
- Solr or Elasticsearch–That Is the Question
- Python Eats Into R as SAS Dominance Fades
- Yahoo’s New Pulsar: A Kafka Competitor?
- 5 Factors Driving the Graph Database Explosion
- 9 Paths to a Data Science Interview
- Apache Spark: 3 Real-World Use Cases
- Workforce Analytics: How Big Data Is Shaping the Labor Pool
- More Features…
Most Read News In Brief
- AWS Redshift Feels the Heat
- Six Big Name Schools with Big Data Programs
- Why Gartner Dropped Big Data Off the Hype Curve
- Veritas Disses Dell/EMC As It Preps Big Info Management Push
- MIT Programmers Attack Big Data Memory Gap
- ‘Smart Machines’ Top the Hype Cycle, Gartner Says
- Altiscale Deal Would Boost SAP Hadoop Offerings
- Huawei, Startup Collaborate on Big Data Object Storage
- U.S. Visa Program Would Scan Social Media Data
- Pentagon Eyes AI on the Battlefield
- More News In Brief…
Most Read This Just In
- Datanami Reveals Winners of Inaugural Readers’ and Editors’ Choice Awards
- New Report Says Data Lakes Market to be Worth $8.81 Billion by 2021
- Continuum Analytics Teams Up With Intel for Python Distribution Powered by Anaconda
- Teradata Introduces Borderless Analytics
- SAP Launches BW/4HANA
- Huawei and Alluxio Jointly Release Big Data Storage Acceleration Solution
- Cask Releases Preview of First Unified Integration Platform for Big Data
- Informatica to Expand Data Lake Management Solution
- Munich Re Relying on SAS Analytics and HDP for Big Data Initiative
- Elastic Acquires Prelert
- More This Just In…
September 26 - September 29New York United States
September 26 - September 27New York United States
October 19San Francisco CA United States
October 23 - October 27New York United States