July 8, 2019

Program Synthesis Moves a Step Closer to Reality

George Leopold

As data scientists and software developers sort through the plethora of tools and APIs ranging from Python to Apache Spark, automation schemes are emerging to help programmers navigate those tools and the accompanying infrastructure that machine learning and other apps run on.

Among them is an emerging “programming-by-example” approach based on the Pandas library. The framework allows AI and other programmers to specify intent by “synthesizing” a program that comes up with the desired output based on inputs.

That goal is the basis of AutoPandas, a “program synthesis engine” for the popular data science library. Data scientist Ben Lorica noted in a recent blog post that investigators at the University of California at Berkeley’s RISELab recently unveiled new research on AutoPandas aimed squarely at making life a bit easier for harried software developers.

Founded in 2017, RISELab is the successor to AMPLab that produced popular open source technologies like Apache Spark and Apache Mesos. Ion Stoica, co-founder of Databricks, is the director of RISELab.

As AI and machine learning move from research to IT and customers service applications, Lorica noted that researchers driven by the emergence of capabilities like AutoML are building new tools that promise to automate various stages of the machine learning pipeline.

Program synthesis is defined as the task of automatically finding an intent-based program within a programming language. Neural-backed generators such as AutoPandas “are an extremely promising step toward practical program synthesis,” according to Lorica.

For example, programmers could specify a data input and output structure such as data frames. AutoPandas would then automatically synthesize a program that produces the desired output from the given input, Lorica explained.

According to a recent paper on program synthesis, the approach differs fundamentally from traditional compilers used to translate high-level code to a lower-level machine language. By contrast, program synthesizers perform searches to generate a program consistent with intent. That capability is considered nothing less than the “Holy Grail” of computer science, the authors noted.

RISELab’s automation tool uses “program generators” to capture API constraints, thereby winnowing the possible number of programs. It also uses neural network models to predict API calls along with the Ray distributed computing framework designed to scale programmer searches.

Recent items:

‘Data Scientist’ Title Evolving Into New Thing

RISELab Takes Flight at UC Berkeley

Applications: Artificial Intelligence, Research Analytics

Technologies: Frameworks

Sectors: Academia, Biosciences, Financial Services, Healthcare, Other

Vendors: Databricks

Tags: AI. RISELab, Apache Mesos, apache spark, AutoPandas, Ben Lorica, machine learning, Pandas, program synthesis, program synthesis engine, software development

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Program Synthesis Moves a Step Closer to Reality

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

May 14, 2024

May 13, 2024

May 10, 2024

May 9, 2024

May 8, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Top 6 Strategies for Reducing Data Warehouse Costs

Building an Operational Data Warehouse for Real-time Analytics

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

AI & Big Data Expo North America 2024

CDAO Canada Public Sector 2024

AI Hardware & Edge AI Summit Europe

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Program Synthesis Moves a Step Closer to Reality

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

May 14, 2024

May 13, 2024

May 10, 2024

May 9, 2024

May 8, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link