April 5, 2021

Facts Can Change, and AI Can Help

Oliver Peckham

As understanding of an issue evolves, the best set of available facts often changes: for instance, mask-wearing wasn’t recommended early in the COVID-19 pandemic, but now, it’s common knowledge that mask-wearing is helpful to stop the spread of the disease. But what happens to the information published across the internet early in the pandemic which, while accurate when written, is now outdated and misleading? At MIT’s Computer Science and Artificial Intelligence Library (CSAIL), researchers are applying AI as a solution – and helping to improve the AI models themselves in the process.

“[Our AI models] can monitor updates to articles, identify significant changes, and suggest edits to other related articles,” explained Tal Schuster, lead author of the research and a PhD student at CSAIL. “Importantly, when articles are updated, our automatic fact verification models are sensitive to such edits and update their predictions accordingly.”

As a barometer for the most recent information, the team is using edits to major Wikipedia pages. This, of course, requires filtering out vast numbers of formatting-related and grammatical edits. “Automating this task isn’t easy,” Schuster said. “But manually checking each revision is impractical as there are more than six thousand edits every hour.”

Using a set of about 200 million of those revisions, the researchers applied deep learning to identify the 300,000 most likely to represent factual changes. About a third of these changes were annotated by hand for the first run.

“Achieving consistent high-quality results on this volume required a well-orchestrated effort,” says Alex Poulis, creator and senior director of TransPerfect DataForce, which assisted with the annotation. “We established a group of 70 annotators and industry-grade training and quality assurance processes, and we used our advanced annotation tools to maximize efficiency.”

Then, using that annotated data (collectively called the “Vitamin C” dataset), the process was automated, allowing the model to detect around 85% of fact-change revisions.

Building on this model, they introduced a second model for automatically suggesting revisions using sequence-to-sequence transformation. “Instead of teaching the model that the population of a certain city is this and this, we teach it to read the current sentence from Wikipedia and find the answer that it needs,” Schuster said.

The researchers describe their dataset generation in a paper, “Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence,” which will be presented at the NAACL Conference on Computational Linguistics this summer. Furthermore, the team made the Vitamin C dataset public to assist other researchers working in fact verification.

“We hope both humans and machines will benefit from the models we created,” said Schuster.

Applications: Research Analytics

Technologies: Middleware

Sectors: Academia

Tags: coronavirus, COVID-19, csail, fact checking, MIT

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Facts Can Change, and AI Can Help

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 17, 2024

April 16, 2024

April 15, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Facts Can Change, and AI Can Help

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 17, 2024

April 16, 2024

April 15, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link