AWS Looks to ‘Demystify’ Machine Learning
Amazon Web Services used a big data conference in the backyard of some of its largest government customers to showcase its AI and machine learning tools that are helping to funnel ever-larger volumes of data into its storage and computing infrastructure.
Making a pitch for better data management tools like metadata systems, AWS executives addressing a big data conference in Tysons Corner, Va., said the public cloud giant aims to go beyond democratizing big data to “demystify” AI and machine learning.
The combination of organized data and analytics will accelerate the building and deployment of machine learning models, many of which currently never make it to production. Those that are deployed often require up to 18 months to roll out, noted Ben Snively, a solution architect at AWS (NASDAQ: AMZN).
Open source tools for model development often advance a generation or two in the time it takes many enterprises to develop, train and launch a machine learning model, he added.
Snively asserted that the combination of big data and analytics along with AI and machine learning creates a “flywheel effect” in which organized, accessible data leads to faster insights, better products and—completing the cycle—more data.
(Hence, the cloud vendor forecasts as much as 180 zettabytes of widely varied and fast-moving data by 2025.)
As the company seeks to demystify machine automation technologies and move beyond the current technology “hype phase,” AWS executives note that deploying machine learning models and, eventually, full-blown platforms remains hard. Among the reasons is “dirty” data that must be cleansed to foster access. The company estimates that 80 percent of data lakes currently lack metadata management systems that help determine data sources, formats and other attributes needed to wrangle big data.
That makes the heavy investments in data lakes “inefficient,” stressed Alan Halamachi, a senior manager for AWS solution architectures. “If data is not in a format where it can be widely consumed and accessible,” Halamachi said, machine learning developers will find themselves in “data jail.”
Once big data is wrangled and secured (“Hackers would like nothing more than to engineer a single breach with access to all of it,” Halamachi said), it can be combined with analytics on the inference side to accelerate training of machine learning models, Snively said.
Noting that most machine learning models built by enterprises never make it to production, the AWS engineers pitched several new tools, including the SageMaker machine and deep learning stack introduced in November. Snively described SageMaker as a tool for taking the “muck” out of developing machine learning models and said it is also designed to free data scientists from IT chores like standing up a server for model development.
The cloud giant is seeing more experimentation among its customers as they seek to connect big data with machine learning development. “Voice [recognition] systems are here to stay,” Snively asserted, and developers are investigating “new ways of interacting with those systems.”
“It’s really about demystifying AI and machine learning” and getting beyond the “magic box” phase, he added.