January 26, 2022

Stanford Researchers Detail New Method for Error Detection in Perception Data

Oliver Peckham

(metamorworks/Shutterstock)

Autonomous and semi-autonomous vehicles are increasingly common, with most relying primarily on AI-powered cameras that rapidly detect vehicles, people, and obstacles within the frame and use that information (among other information, like depth sensor data) to operate or augment the operation of a vehicle. The AI models used in this process, of course, are trained with training datasets. But, Stanford researchers explained in a recent blog post, there’s a problem: “Unfortunately, many datasets are rife with errors!” In that blog post, they outlined how their team—composed of Stanford researchers Daniel Kang, Nikos Arechiga, Sudeep Pillai, Peter Bailis, and Matei Zaharia—used new tools to detect errors in those datasets.

Any errors in these kinds of datasets can pose serious problems, because the AI models are evaluated for how they stack up against those training datasets. The researchers demonstrated the problem by citing a public autonomous vehicle dataset from an otherwise-unidentified “leading labeling vendor that has produced labels for many autonomous vehicle companies” where “over 70% of the validation scenes contain at least one missing object box!”

To detect these errors, the researchers developed an abstraction method called learned observation assertions (LOA). “LOA is an abstraction designed to find errors in ML deployment pipelines with as little manual specification of error types as possible,” they wrote. “LOA achieves this [by] allowing users to specify features over ML pipelines.”

The team created an example LOA system, called Fixy, to illustrate the process. “Fixy learns feature distributions that specify likely and unlikely values (e.g., that a speed of 30mph is likely but 300mph is unlikely),” reads the abstract of the paper. “It then uses these feature distributions to score labels for potential errors.”

The researchers demonstrated how Fixy was used to identify an unlabeled motorcycle in a training dataset.

“We can specify the following features over the data: box volume, object velocity, and a feature that selects only model-predicted boxes that don’t overlap with a human label,” the blog explained. “These features are computed deterministically with short code snippets from the human labels and ML model predictions. Fixy will then execute on the new data and produce a rank-ordered list of possible errors.”

The team evaluated Fixy against Lyft’s Level 5 perception dataset and a dataset from the Toyota Research Institute. “LOA was also able to find errors in every single validation scene that had an error, which shows the utility of using a tool like LOA,” they wrote. Further, LOA was able to find 75% of the total errors identified within a selected scene from the Toyota dataset.

To learn more, read the blog post here.

Applications: Artificial Intelligence

Technologies: Middleware

Tags: autonomous vehicles, error detection, machine learning, self-driving cars, Stanford

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Stanford Researchers Detail New Method for Error Detection in Perception Data

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

May 10, 2024

May 9, 2024

May 8, 2024

May 7, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Top 6 Strategies for Reducing Data Warehouse Costs

Building an Operational Data Warehouse for Real-time Analytics

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

AI & Big Data Expo North America 2024

CDAO Canada Public Sector 2024

AI Hardware & Edge AI Summit Europe

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Stanford Researchers Detail New Method for Error Detection in Perception Data

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

May 10, 2024

May 9, 2024

May 8, 2024

May 7, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link