AWS Looks to ‘Demystify’ Machine Learning
Amazon Web Services used a big data conference in the backyard of some of its largest government customers to showcase its AI and machine learning tools that are helping to funnel ever-larger volumes of data into its storage and computing infrastructure.
Making a pitch for better data management tools like metadata systems, AWS executives addressing a big data conference in Tysons Corner, Va., said the public cloud giant aims to go beyond democratizing big data to “demystify” AI and machine learning.
The combination of organized data and analytics will accelerate the building and deployment of machine learning models, many of which currently never make it to production. Those that are deployed often require up to 18 months to roll out, noted Ben Snively, a solution architect at AWS (NASDAQ: AMZN).
Open source tools for model development often advance a generation or two in the time it takes many enterprises to develop, train and launch a machine learning model, he added.
Snively asserted that the combination of big data and analytics along with AI and machine learning creates a “flywheel effect” in which organized, accessible data leads to faster insights, better products and—completing the cycle—more data.
(Hence, the cloud vendor forecasts as much as 180 zettabytes of widely varied and fast-moving data by 2025.)
As AWS seeks to demystify machine automation technologies and move beyond the current technology “hype phase,” its executives note that deployment of machine learning models and, eventually, full-blown platforms remains hard. Among the reasons is “dirty” data that must be cleansed to foster access. The company estimates that 80 percent of data lakes currently lack metadata management systems that help determine data sources, formats and other attributes needed to wrangle big data.
That makes the heavy investments in data lakes “inefficient,” stressed Alan Halamachi, a senior manager for AWS solution architectures. “If data is not in a format where it can be widely consumed and accessible,” Halamachi said, machine learning developers will find themselves in “data jail.”
Once big data is wrangled and secured (“Hackers would like nothing more than to engineer a single breach with access to all of it,” Halamachi said), it can be combined with analytics on the inference side to accelerate training of machine learning models, Snively said.
Noting that most machine learning models built by enterprises never make it to production, the AWS engineers pitched several new tools, including the company’s SageMaker machine and deep learning stack introduced in November. Snively described SageMaker as a tool for taking the “muck” out of developing machine learning models and said it is also designed to free data scientists from IT chores like standing up a server for model development.
The cloud giant is seeing more experimentation among its customers as they seek to connect big data with machine learning development. “Voice [recognition] systems are here to stay,” Snively asserted, and developers are investigating “new ways of interacting with those systems.”
“It’s really about demystifying AI and machine learning” and getting beyond the “magic box” phase, he added.