

Machine learning developers gained new abilities to develop and run their ML programs on the framework and hardware of their choice thanks to the OpenXLA Project, which today announced the availability of key open source components.
Data scientists and ML engineers often spend a lot of time optimizing their models to work on each hardware target. Whether they’re working in a framework like TensorFlow or PyTorch and targeting GPUs or TPUs, there has been no way to avoid this manual effort, which consumes precious time and makes it difficult to move applications at a later date.
This is the general problem targeted by the folks behind the OpenXLA Project, which was founded last fall and today includes Alibaba, Amazon Web Services, AMD, Apple, Arm, Cerebras Systems, Google, Graphcore, Hugging Face, Intel, Meta, and NVIDIA as its members.
By creating a unified machine learning compiler that works with a range of ML development frameworks and hardware platforms and runtimes, OpenXLA can accelerate the delivery of ML applications and provide greater code portability.
Today, the group announced the availability of three open source tools as part of the project. XLA is an ML compiler for CPUs, GPUs, and accelerators; StableHLO is an operation set for high-level operations (HLO) in ML that provides portability between frameworks and compilers; and IREE (Intermediate Representation Execution Environment) is an end-to-end MLIR (Multi-Level Intermediate Representation) compiler and runtime for mobile and edge deployments. All three are available for download from the OpenXLA GitHub site.
Initial frameworks supported by OpenXLA include TensorFlow, PyTorch, and JAX, a new Google framework. JAX is designed for transforming numerical functions, and is described as bringing together a modified version of autograd and TensorFlow while following the structure and workflow of NumPy. Initial hardware targets and optimizations include Intel CPUs, Nvidia GPUs, Google TPUs, AMD GPUs, Arm CPUs, AWS Trainium and Inferentia, Graphcore’s IPU, and the Cerebras Wafer-Scale Engine (WSE). OpenXLA’s “target-independent optimizer” targets algebraic functions, op/kernel fusion, weight update sharding, full-graph layout propagation, scheduling, and SPMD for parallelism.
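For readers who have not used JAX, the following is a minimal sketch of what "transforming numerical functions" can look like in practice. It is not taken from the OpenXLA announcement: the linear model, the `loss` function, and the sample data are made up for illustration, and the code assumes a recent JAX release is installed. It uses `jax.grad` to derive a gradient function and `jax.jit` to hand that function to the XLA compiler for whatever backend is available (CPU, GPU, or TPU).

```python
import jax
import jax.numpy as jnp


def loss(w, x, y):
    """Mean squared error of a toy linear model, written in NumPy style."""
    pred = jnp.dot(x, w)
    return jnp.mean((pred - y) ** 2)


# jax.grad transforms `loss` into a function returning d(loss)/dw.
# jax.jit asks XLA to compile that function for the default backend.
grad_loss = jax.jit(jax.grad(loss))

# Hypothetical sample data, chosen only to keep the example small.
x = jnp.array([[1.0, 2.0], [3.0, 4.0]])
y = jnp.array([1.0, 2.0])
w = jnp.zeros(2)

g = grad_loss(w, x, y)
print(g)

# The lowered form of the function can be inspected as text. In recent
# JAX releases this is the intermediate representation passed on to XLA.
ir_text = jax.jit(loss).lower(w, x, y).as_text()
print(ir_text[:200])
```

The same Python source should run unchanged on any backend JAX detects, which is the kind of portability the project is aiming for. The exact contents of the lowered text vary by JAX version.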
The OpenXLA compiler products can be used with a variety of ML use cases, from full-scale training of massive deep learning models, such as large language models (LLMs), to generative computer vision models like Stable Diffusion. They can also be used for inference; Waymo already uses OpenXLA for real-time inferencing on its self-driving cars, according to a post today on the Google open source blog.

The OpenXLA compiler ecosystem provides portability between ML development tools and hardware targets (Image source: OpenXLA Project)
OpenXLA members touted some of their early successes with the new compiler. Alibaba, for instance, says it was able to train a GPT2 model on Nvidia GPUs 72% faster using OpenXLA, and saw an 88% speedup for a Swin Transformer training task on GPUs.
Hugging Face, meanwhile, said it saw about a 100% speedup when it paired XLA with its text generation model written in TensorFlow. “OpenXLA promises standardized building blocks upon which we can build much needed interoperability, and we can’t wait to follow and contribute!” said Morgan Funtowicz, head of machine learning optimization for the Brooklyn, New York, company.
Facebook was able to “achieve significant performance improvements on important projects,” including using XLA on PyTorch models running on Cloud TPUs, said Soumith Chintala, the lead maintainer for PyTorch.
The chip startups are pleased with XLA, which reduces the risk for customers of adopting relatively new, unproven hardware. “Our IPU compiler pipeline has used XLA since it was made public,” said David Norman, Graphcore’s director of software design. “Thanks to XLA’s platform independence and stability, it provides an ideal frontend for bringing up novel silicon.”
“OpenXLA helps extend our user reach and accelerated time to solution by providing the Cerebras Wafer-Scale Engine with a common interface to higher level ML frameworks,” says Andy Hock, a vice president and head of product at Cerebras. “We are tremendously excited to see the OpenXLA ecosystem available for even broader community engagement, contribution, and use on GitHub.”
AMD and Arm, which are battling bigger chipmakers for pieces of the ML training and serving pies, are also happy members of the OpenXLA Project.
“We value projects with open governance, flexible and broad applicability, cutting edge features and top-notch performance and are looking forward to the continued collaboration to expand open source ecosystem for ML developers,” Alan Lee, AMD’s corporate vice president of software development, said in the blog.
“The OpenXLA Project marks an important milestone on the path to simplifying ML software development,” said Peter Greenhalgh, vice president of technology and fellow at Arm. “We are fully supportive of the OpenXLA mission and look forward to leveraging the OpenXLA stability and standardization across the Arm Neoverse hardware and software roadmaps.”
Curiously absent are IBM, which continues to innovate on chips with its Power10 processor, and Microsoft, the world’s second largest cloud provider behind AWS.