AWS Looks to ‘Demystify’ Machine Learning
Amazon Web Services used a big data conference in the backyard of some of its largest government customers to showcase its AI and machine learning tools that are helping to funnel ever-larger volumes of data into its storage and computing infrastructure.
Making a pitch for better data management tools like metadata systems, AWS executives addressing a big data conference in Tysons Corner, Va., said the the public cloud giant aims to go beyond democratizing big data to “demystify” AI and machine learning.
The combination of organized data and analytics will accelerate the building and deployment of machine learning models, many that currently never make it to production. Those that are deployed often require up to 18 months to roll out, noted Ben Snively, a solution architect at AWS (NASDAQ: AMZN).
Open source tools for model development often advance a generation or two in the time it takes many enterprises to develop, train and launch a machine learning model, he added.
Snively asserted that the combination of big data and analytics along with AI and machine learning creates a “flywheel effect” in which organized, accessible data leads to faster insights, better products and—completing the cycle—more data.
(Hence, the cloud vendor forecasts as much as 180 zettabytes of widely varied and fast-moving data by 2025.)
As it seeks to demystify machine automation technologies and move beyond the current technology “hype phase,” AWS executives note that deployment of machine learning models and, eventually, full-blown platforms, remains hard. Among the reasons are “dirty” data that must be cleansed to foster access. The company estimates that 80 percent of data lakes currently lack metadata management systems that help determine data sources, formats and other attributes needed to wrangle big data.
That makes the heavy investments in data lakes “inefficient,” stressed Alan Halamachi, a senior manager for AWS solution architectures. “If data is not in a format where it can be widely consumed and accessible,” Halamachi stressed, machine learning developers will find themselves in “data jail.”
Once big data is wrangled and secured—“Hackers would like nothing more than to engineer a single breach with access to all of it,” Hamachi said—it can be combined with analytics on the inference side to accelerate training of machine learning models, Snively said.
Noting that most machine learning models built by enterprises never make it to production, the AWS engineers pitched several new tools including its SageMaker machine and deep learning stack introduced in November. Described as a tool for taking the “muck” out of developing machine learning models, Snively said Sagemaker is also designed to free data scientists from IT chores like standing up a server for model development.
The cloud giant is seeing more experimentation among its customers as they seek to connect big data with machine learning development. “Voice [recognition] systems are here to stay,” Snively asserted, and developers are investigating “new ways of interacting with those systems.”
“It’s really about demystifying AI and machine learning” and getting beyond the “magic box” phase, he added.
September 13, 2019
- Duality Technologies Named a “Cool Vendor” by Gartner for Privacy Preservation in Analytics
- Valen Analytics Announces InsureRight Manage 2.0
- HPCwire and EnterpriseAI to Cover Silicon & Systems for Deep Learning at AI Hardware Summit as Headline Media Partners
September 12, 2019
- Odaseva Announces Growth with One Trillion Documents Supported and Over 10 Million Users
- IBM and ŠKODA AUTO University Collaborate on new Digital Skills for Students
- Trifacta Raises $100M to Support Explosive Growth of Data Wrangling for AI and the Cloud
- Snowflake and Fedresults Bring Cloud Smart Technology to Federal Government
- IBM Unveils z15 With Industry-First Data Privacy Capabilities
- StorageCraft Research Reveals Rampant Data Growth, and Inadequate IT Infrastructures are a Source of Global Concern and Risk
- Sumo Logic Accelerates Continuous Intelligence for Modern Enterprises with New Product Innovations
September 11, 2019
- StackRox Launches New Sumo Logic App for Kubernetes Security
- Sumo Logic Showcases the Intelligence Economy at Illuminate 2019
- Multi-Cloud on the Rise and Open Source Tech Like Kubernetes is Disrupting the Modern Application Stack, According to Sumo Logic Research
- Accenture Acquires Pragsis Bidoop
- Lucidworks Fusion 5.0 Features Data Science Toolkit Integration & Microservices Architecture Orchestrated by Kubernetes
- TIBCO and Asia Pacific University of Technology and Innovation Announce Enriched Collaboration
- Looker Brings the Data Community Together at JOIN 2019
- InfluxData Launches InfluxDB Cloud 2.0
- Nationwide Drives Data-enabled Culture with ‘Fit to Fly’ Analytics Strategy
- Hazelcast Enhances Real-Time Capabilities for Financial Services Industry
Most Read Features
- Is Python Strangling R to Death?
- Can We Stop Doing ETL Yet?
- Big Data File Formats Demystified
- Re-Imagining Big Data in a Post-Hadoop World
- Seeing the Big Picture on Big Data Market Shift
- How to Build a Better Machine Learning Pipeline
- Is Hadoop Officially Dead?
- 10 Big Data Trends to Watch in 2019
- Why Knowledge Graphs Are Foundational to Artificial Intelligence
- AutoML Tools Emerge as Data Science Difference Makers
- More Features…
Most Read News In Brief
- HPE Acquires MapR
- R Backers Tout Funding Milestone, Seek Comeback
- H2O.ai Tops Off Funding to Accelerate AI Adoption
- Startup Rockset Adds SQL to DynamoDB
- MapR Says It’s Close to Deal to Sell Company
- AI, Analytics Help to Propel Wind Power
- War Unfolding for Control of Elasticsearch
- StreamSets Eases Spark-ETL Pipeline Development
- California’s New Data Privacy Law Takes Effect in 2020
- How IBM Is Turning Db2 into an ‘AI Database’
- More News In Brief…
Most Read This Just In
- Cray ARM-based System ‘Ookami’ to Serve as Testbed for Computational Studies at Stony Brook
- Cloudera Agrees to Acquire Arcadia Data
- Illumina to Share their Data Virtualization Journey at Gartner Catalyst Conference
- Ascend Introduces Queryable Dataflows for Faster Pipeline Development and Overall Time to Big Data Success
- Report: SAS Sees 105% Growth in AI Revenue
- New Graph Database Performance Benchmark Confirms Graph Databases are Ready for Solving Real-World Business Intelligence, Data Challenges
- SnapLogic Delivers AI-powered Pipeline Recommendations and Azure Databricks Support with Latest Platform Release
- Accenture to Acquire Analytics8, Australian Analytics and Data Specialists
- Databricks Offers Automation Throughout the End-to-End Data and Machine Learning Lifecycle
- ThoughtSpot Raises $248M to Transform Enterprises with Search & AI-Driven Analytics
- More This Just In…
September 23 - September 26New York United States
October 20 - October 22Charlotte NC United States
October 23 - October 24Berlin Germany