May 25, 2021

Researchers Evaluate Neural Language Models, Find XLNet Excellent

Oliver Peckham

Lexical substitution is essentially the process that a thesaurus helps humans to perform: replacing words in a sentence without changing the meaning. Now, researchers from the Skolkovo Institute of Science and Technology in Moscow – Skoltech, for short – have completed a groundbreaking, large-scale study to examine how the most advanced neural language models perform when handling lexical substitution tasks.

While simple on paper – humans, of course, find it very easy in their native tongues – lexical substitution is quite complex for artificial intelligences. The substitution can take a variety of nuanced forms: you might replace a word with a hypernym (a word with a broader meaning, like substituting “seat” for “chair”) – or you might substitute a word with a synecdoche, like saying “wheels” to refer to a car. These nuances and abstractions further complicate the already challenging process of artificially deciphering and recreating human language.

Alexander Panchenko – an assistant professor of natural language processing at Skoltech – and colleagues from a variety of research institutions (including HSE University, Lomonosov Moscow State University and Samsung Research Center Russia) set out to evaluate language models for these abilities. Substitution is important for more than just creativity: for instance, it helps models understand the contextual meaning of a word, which, in turn, helps to correct misspellings, or even work toward automatically simplifying writing. So Panchenko’s team measured the models on two fronts: first, their ability to substitute words; and second, their ability to process the contextual meanings of homonyms (e.g. “bat” as in baseball and “bat” as in the animal).

Evaluated models included a variety of language and masked language models (LMs and MLMs), including context2vec, ELMo, BERT, RoBERTa, and XLNet. The battery of tests yielded state-of-the-art results from XLNet through tests on multiple datasets. Furthermore, the researchers observed that large pre-trained language models yielded better results than previous methods of substitution and that incorporating information about the target word substantially improved the quality of the results.

Beyond a straightforward ranking of the models in question, the researchers see a variety of applications for their results.

“First of all, our results in lexical substitution may be useful for language learning (replacing words with their simpler equivalents),” Panchenko said. “Second, it may be useful for augmentation of textual data for training neural networks, as similar augmentation methods are common in computer vision but not so common in text analysis. Another obvious application is writing assistance – automatic suggestion of synonyms and text reformulation.”

To read the paper, click here.

Applications: Artificial Intelligence, Research Analytics

Technologies: Middleware

Sectors: Academia

Tags: language model, lexical substitution, neural language model, neural network

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Researchers Evaluate Neural Language Models, Find XLNet Excellent

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 19, 2024

April 18, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Researchers Evaluate Neural Language Models, Find XLNet Excellent

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 19, 2024

April 18, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link