February 1, 2018

Ray’s New Library Targets High Speed Reinforcement Learning


Data scientists looking to push the ball forward in the field of reinforcement learning may want to check out RLlib, a new library released as open source last month by researchers affiliated with RISELab. According to its developers, the goal of RLlib is to let users break down the various components that go into a reinforcement learning system, thereby making them more scalable, easier to integrate, and easier to reuse.

Reinforcement learning is a type of machine learning that's gaining popularity as a way to quickly train programs to perform tasks optimally in a world awash in less-than-optimal training data. Instead of training a model on pristine, labeled data, as in supervised learning, a reinforcement learning agent learns from the data environment as it naturally exists, and uses a simple feedback mechanism (the reinforcement signal) to nudge the model toward the ideal solution.

The practical advantage of the reinforcement approach is that it seeks to strike a balance between exploring uncharted data (the territory where unsupervised learning algorithms flourish) and exploiting existing knowledge (where supervised learning typically excels). When this balance is struck, the runtime performance of the model can be optimized, at least for a specific context.

Reinforcement learning hinges on the delivery of a reward signal (Image courtesy Notfruit/Wikimedia Commons)
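For readers new to the mechanics, the loop the diagram describes can be sketched in a few lines of Python against the OpenAI Gym API. Gym is not part of RLlib, and the environment and random policy here are purely illustrative:

import gym

env = gym.make("CartPole-v0")
obs = env.reset()
total_reward = 0.0
done = False
while not done:
    # A real agent would pick actions from a learned policy;
    # sampling randomly here just illustrates the loop.
    action = env.action_space.sample()
    # The environment returns the next observation and a scalar
    # reward -- the reinforcement signal that nudges the policy.
    obs, reward, done, info = env.step(action)
    total_reward += reward
print("episode reward:", total_reward)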

According to the RLlib page on the Ray website, RLlib seeks to provide a scalable framework for building reinforcement models that are both performant and composable.

Here’s how the researchers behind Ray RLlib describe their work in a paper posted last month to arXiv, which is hosted by the Cornell University Library:

“Reinforcement learning (RL) algorithms involve the deep nesting of distinct components, where each component typically exhibits opportunities for distributed computation. Current RL libraries offer parallelism at the level of the entire program, coupling all the components together and making existing implementations difficult to extend, combine, and reuse. We argue for building composable RL components by encapsulating parallelism and resource requirements within individual components, which can be achieved by building on top of a flexible task-based programming model. We demonstrate this principle by building Ray RLlib on top of Ray and show that we can implement a wide range of state-of-the-art algorithms by composing and reusing a handful of standard components. This composability does not come at the cost of performance — in our experiments, RLlib matches or exceeds the performance of highly optimized reference implementations.”

The RLlib software runs atop Ray, the distributed execution framework from RISELab that lab director Michael Jordan said last year could displace Apache Spark. RLlib is the second library within the Ray project. The first was Ray Tune, a hyperparameter optimization framework for tuning neural networks.
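The task-based programming model the paper refers to is Ray's core API of remote functions. The following sketch uses Ray's @ray.remote decorator and ray.get only to illustrate that idea; it is not drawn from RLlib's internals, and the rollout function is a made-up stand-in:

import ray

ray.init()  # start the Ray runtime on the local machine

@ray.remote
def rollout(worker_id):
    # Stand-in for a component that collects experience in parallel;
    # a real RLlib worker would step an environment and return samples.
    return worker_id * 2

# Launch the tasks in parallel and gather the results.
futures = [rollout.remote(i) for i in range(4)]
print(ray.get(futures))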

RLlib integrates with Ray Tune, and its APIs support TensorFlow and PyTorch. The library incorporates a series of algorithms, including Proximal Policy Optimization (PPO), Asynchronous Advantage Actor-Critic (A3C), and Deep Q Networks (DQN).
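As a rough illustration of how the pieces fit together, training PPO through Ray Tune can look like the sketch below. This assumes the run_experiments entry point and the registered "PPO" trainable described in the Ray documentation of the time; the environment, stopping condition, and configuration values are illustrative and may differ between releases:

import ray
from ray.tune import run_experiments

ray.init()

# Ask Tune to run RLlib's PPO implementation on a Gym environment.
# num_workers controls how many parallel rollout workers collect
# experience; the stopping condition and values are illustrative.
run_experiments({
    "ppo_cartpole": {
        "run": "PPO",
        "env": "CartPole-v0",
        "stop": {"episode_reward_mean": 200},
        "config": {"num_workers": 2},
    },
})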

Ray runs on Ubuntu, Mac OS X, and Docker, and it can also be used with GPUs. You can find downloads for Ray, Ray RLlib, and Ray Tune at this GitHub page.

Related Items:

Meet Ray, the Real-Time Machine-Learning Replacement for Spark

The Next Data Revolution: Intelligent Real-Time Decisions

RISELab Replaces AMPLab with Secure, Real-Time Focus

 
