March 3, 2020

Kubernetes Gets an Automated ML Workflow

George Leopold

A stable version of an automation tool released this week aims to make life easier machine learning developers training and scaling models, then deploying ML workloads atop Kubernetes clusters.

Roughly two years after its open source release, Kubeflow 1.0 leverages the de facto standard cluster orchestrator to aid data scientists and ML developers in tapping cloud resources to run those workloads in production. Among the stable workflow applications released on Monday (March 2) are a central dashboard, Jupyter notebook controller and web application along with TensorFlow and PyTorch operators for distributed training.

Contributors from Google, IBM, Cisco Systems, Microsoft and data management specialist Arrikto said Jupyter notebooks can be used to streamline model development. Other tools can then be used to build application containers and leverage Kubernetes resources to train models. A server can then be deployed for ML inference.

Kubeflow is ultimately designed to make it easier for developers to utilize popular cloud-native Kubernetes resources. “Deploying Kubernetes optimized for ML and integrated with your cloud is no easy task,” community leaders noted in a blog post. Kubeflow 1.0 is designed to reduce those steps to a single command.

Kubeflow 1.0 applications.

“One of our key motivations was to leverage Kubernetes to simplify distributed training,” lead developers said. “Kubeflow provides Kubernetes custom resources that make distributed training with TensorFlow and PyTorch simple.”

IBM (NYSE: IBM), the second largest code contributor to Kubeflow after Google (NASDAQ: GOOGL), said it is evaluating the machine learning tool kit for production deployments on its cloud and Power platforms along with Red Hat OpenShift. Kubeflow also can be deployed on Amazon Web Services (NASDAQ: AMZN) and Google Cloud Platform.

Kubeflow’s design is based on the concept of a machine learning pipeline that includes all the steps in a given data science workflow. Developers start by obtaining data from a local or remote source, transforming data, loading it into an ML model running on a laptop, then initiating the training of that model on a larger cluster, using one or more data sets.

IBM was an early contributor to Kubeflow pipelines used for machine learning workflow orchestration, the company said in a separate post.

Google initially created Kubeflow to manage its internal machine learning pipelines written in Tensorflow and executed atop Kubernetes.

Kubeflow leverages the cluster orchestrator to provide a higher abstraction level for ML pipelines, thereby freeing data scientist to focus on adding value to machine learning models.

The emergence of Kubeflow closely tracks production deployment of Kubernetes, addressing many of the teething problems faced by early adopters as they struggle to get the machine learning models trained on cloud-native platforms.

Organizers said the Kubeflow project has so far attracted contributions from more than 30 companies.

Recent items:

Kubeflow Emerges for ML Workflow Automation

In Search of a Common Deep Learning Stack

Applications: Artificial Intelligence, Enterprise Analytics

Technologies: Cloud, Frameworks

Sectors: Financial Services, Manufacturing, Other, Retail

Vendors: Google Cloud Platform

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Kubernetes Gets an Automated ML Workflow

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 18, 2024

April 17, 2024

April 16, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Kubernetes Gets an Automated ML Workflow

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 18, 2024

April 17, 2024

April 16, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link