AWS Looks to ‘Demystify’ Machine Learning
Amazon Web Services used a big data conference in the backyard of some of its largest government customers to showcase its AI and machine learning tools that are helping to funnel ever-larger volumes of data into its storage and computing infrastructure.
Making a pitch for better data management tools like metadata systems, AWS executives addressing a big data conference in Tysons Corner, Va., said the the public cloud giant aims to go beyond democratizing big data to “demystify” AI and machine learning.
The combination of organized data and analytics will accelerate the building and deployment of machine learning models, many that currently never make it to production. Those that are deployed often require up to 18 months to roll out, noted Ben Snively, a solution architect at AWS (NASDAQ: AMZN).
Open source tools for model development often advance a generation or two in the time it takes many enterprises to develop, train and launch a machine learning model, he added.
Snively asserted that the combination of big data and analytics along with AI and machine learning creates a “flywheel effect” in which organized, accessible data leads to faster insights, better products and—completing the cycle—more data.
(Hence, the cloud vendor forecasts as much as 180 zettabytes of widely varied and fast-moving data by 2025.)
As it seeks to demystify machine automation technologies and move beyond the current technology “hype phase,” AWS executives note that deployment of machine learning models and, eventually, full-blown platforms, remains hard. Among the reasons are “dirty” data that must be cleansed to foster access. The company estimates that 80 percent of data lakes currently lack metadata management systems that help determine data sources, formats and other attributes needed to wrangle big data.
That makes the heavy investments in data lakes “inefficient,” stressed Alan Halamachi, a senior manager for AWS solution architectures. “If data is not in a format where it can be widely consumed and accessible,” Halamachi stressed, machine learning developers will find themselves in “data jail.”
Once big data is wrangled and secured—“Hackers would like nothing more than to engineer a single breach with access to all of it,” Hamachi said—it can be combined with analytics on the inference side to accelerate training of machine learning models, Snively said.
Noting that most machine learning models built by enterprises never make it to production, the AWS engineers pitched several new tools including its SageMaker machine and deep learning stack introduced in November. Described as a tool for taking the “muck” out of developing machine learning models, Snively said Sagemaker is also designed to free data scientists from IT chores like standing up a server for model development.
The cloud giant is seeing more experimentation among its customers as they seek to connect big data with machine learning development. “Voice [recognition] systems are here to stay,” Snively asserted, and developers are investigating “new ways of interacting with those systems.”
“It’s really about demystifying AI and machine learning” and getting beyond the “magic box” phase, he added.
March 21, 2018
- PossibleNOW Announces Strategic Solution with RedPoint Global for GDPR Consent Management
- ScyllaDB Introduces Scylla Manager for Greater Centralized Control of Scylla Database Clusters
- Arcadia Data Launches Arcadia Enterprise
- Survey Reveals Organizations Continue to Struggle with Data Quality and Data Prep Issues
- Attivio Releases Open Source UI Toolkit for Rapid Development of AI-Powered Search Applications
March 20, 2018
March 19, 2018
- Yotpo Announces the Launch of A.I.-Powered ‘Insights’
- Kerstin Kleese van Dam Receives 32nd Town of Brookhaven Annual Women’s Recognition Award for Science
- Nimbus Data Launches the World’s Largest Solid State Drive
- NetApp Selects T6 Unified Wire Adapters For AFF A700 All-Flash And FAS9000 Hybrid Storage Systems
- IARPA to Host Proposers’ Day Conference for BETTER Program on March 29
- A3Cube Partners with Edgeware Computing to Increase Sales of Supercomputing Solutions for the Enterprise
March 15, 2018
- Lexalytics Launches Support for Unicode Emoji 11.0
- Proscia Inks Agreement to Bring Deep-Learning Technology to Dermatopathology
- AI Expo Global Introduces Speaker Line-up
- MathWorks Announces Release 2018a of the MATLAB and Simulink Product Families
- Fractal Analytics Acquires Behavioral Architecture Firm Final Mile
- Record Breaking Amount in Total Tape Capacity Shipments Announced by the LTO Program
- SURF Cooperative Makes iRODS the Data Management Solution for Dutch National Data Infrastructure
March 14, 2018
Most Read Features
- Mr. Robot Thrills Elastic Crowd with Real-World Hacks
- Winners and Losers from Gartner’s Data Science and ML Platform Report
- Which Programming Language Is Best for Big Data?
- Why 2018 Will Be The Year Of The Data Engineer
- 9 Must-Have Skills to Land Top Big Data Jobs in 2015
- The GDPR: An Artificial Intelligence Killer?
- Which Type of SSD is Best: SATA, SAS, or PCIe?
- Spark Streaming: What Is It and Who’s Using It?
- Weighing Open Source’s Worth for the Future of Big Data
- The Hybrid Database Capturing Perishable Insights at Yiguo
- More Features…
Most Read News In Brief
- Confluent Adds KSQL Support to Kafka Platform
- Esri Adds an Unstructured Location Data Capability
- IBM Launches New Cloud for Data Science and Engineering
- It’s March, Time For Bracket Analytics
- AWS Looks to ‘Demystify’ Machine Learning
- Elastic to Release Source Code for X-Pack
- AI, Big Data Are Meshing–and Disrupting
- Why Gartner Dropped Big Data Off the Hype Curve
- Inside MapR’s Support for Kubernetes
- DHS Funds Cyber Data Research
- More News In Brief…
Most Read This Just In
- Spectra Logic Announces New Tape Library Offerings
- NVIDIA to Host World’s Top AI Experts at GPU Technology Conference
- Waterline Data Launches New Platform to Address Big Data’s Growing Volume and Complexity
- MemSQL Establishes a New Baseline for Database Speed
- REDMINT NETWORK Launches Loud ML Deep Learning API for Linux in the US in Partnership with InfluxData
- Logz.io Launches Two Open Source Projects
- FICO Opens Xpress Mosel Programming Language to All
- MapR Achieves Microsoft Gold Cloud Platform Competency
- PSSC Labs to Build Custom Turn-Key Cluster Server for CyberSecurity Malaysia
- Splunk Agrees to Acquire Phantom
- More This Just In…