May 2, 2019

Intel’s Sentiment Algorithm Needs Less Training Data

George Leopold

Intel’s AI lab has released an open-source version of a sentiment analysis algorithm designed to boost natural language processing applications that currently lack the ability to scale across different domains such as restaurant or hotel reviews.

Sentiment analysis is used to glean subjective information from text. That task is made easier when labeled training data is available. The chip maker’s Aspect-Based Sentiment Analysis (ABSA) algorithm, released in April, also addresses the shortfall in annotated training data required for commercial NLP deployments.

ABSA refers to the natural language processing task of extracting aspects, or attributes, of a given domain. In a common example such as a restaurant review, the restaurant is the domain and specific aspects would include the menu, the quality of the food and the attentiveness of the wait staff.

The problem with current sentiment analysis approaches is that aspects within the same domain are semantically close (food, menu, desserts, etc.) while aspects from different domains are semantically different. The Intel researchers note that supervised learning algorithms can handle this domain sensitivity if labeled data is available for training. But labeled data tends to be sparse, and generating it is labor intensive.

Hence, ABSA is being promoted as a “lightly-supervised” alternative to standard sentiment analysis approaches, one that enables “a wide variety of users to generate a detailed sentiment report,” the researchers noted in a blog post released on Thursday (May 2).

The goal is to develop a sentiment analysis framework requiring little or no labeled training data, thereby making it faster and cheaper to deploy commercial NLP systems.

ABSA is touted as improving the ability to extract aspect terms and their “sentiment polarity,” that is, whether the service was excellent or lousy.

Intel used a standard training and inference approach in developing its sentiment analysis algorithm. Unlabeled text documents were used in the training phase for a particular “target” domain. The outputs were opinion and aspect “lexicons” from a specific domain. “The user can edit the domain-specific lexicons which makes this a lightly-supervised approach,” the researchers said.
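As a rough illustration of the training phase described above, the following Python sketch derives a candidate aspect lexicon from unlabeled text and then applies a manual edit. It is not Intel's implementation: the seed opinion words, the stopword list, the sample reviews and the simple "words that precede an opinion word" heuristic are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical seed opinion lexicon (assumption, not taken from Intel's release).
SEED_OPINIONS = {"terrific": "POS", "excellent": "POS", "tough": "NEG", "lousy": "NEG"}
# Small illustrative stopword list (assumption).
STOPWORDS = {"the", "a", "an", "was", "were", "is", "are", "and", "but", "very"}


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def extract_aspect_candidates(docs, opinions, window=3, min_count=2):
    """Collect words that appear shortly before an opinion word.

    A word becomes a candidate aspect if it occurs within `window` tokens
    before a known opinion word at least `min_count` times across `docs`.
    This co-occurrence heuristic is a simplified stand-in for the real
    lexicon-extraction step.
    """
    counts = Counter()
    for doc in docs:
        tokens = tokenize(doc)
        for i, tok in enumerate(tokens):
            if tok not in opinions:
                continue
            for cand in tokens[max(0, i - window):i]:
                if cand not in opinions and cand not in STOPWORDS:
                    counts[cand] += 1
    return {term for term, n in counts.items() if n >= min_count}


# Unlabeled documents from the target domain (made-up examples).
unlabeled_reviews = [
    "The steak was tough.",
    "The steak was excellent and the service was lousy.",
    "Service was terrific.",
]

aspect_lexicon = extract_aspect_candidates(unlabeled_reviews, SEED_OPINIONS)

# The "lightly-supervised" step: a user reviews the lexicon and edits it by hand.
edited_lexicon = aspect_lexicon | {"salad"}
```

The only human input in this sketch is the final edit of the lexicon. No sentence in the input carries a sentiment label.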

During the inference phase, the opinion and aspect lexicons generated during training were combined with an “unseen inference data set” of restaurant reviews. Together, they were used to generate a report compiling negative and positive sentiments about a product or service. (“The beet salad was terrific,” “the steak was tough.”)
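The inference step can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not Intel's code: the two lexicons, the sample reviews and the rule "pair an aspect with the first opinion word within three tokens after it" are all hypothetical choices made for the example.

```python
import re

# Hypothetical lexicons, standing in for the output of the training phase.
ASPECT_LEXICON = {"salad", "steak", "service"}
OPINION_LEXICON = {"terrific": "POS", "excellent": "POS", "tough": "NEG", "lousy": "NEG"}


def sentiment_report(reviews, aspects, opinions, window=3):
    """Count positive and negative mentions per aspect term.

    For each aspect term found in a sentence, the first opinion word within
    `window` tokens after it determines the polarity of that mention.
    """
    report = {}
    for review in reviews:
        for sentence in re.split(r"[.!?]+", review):
            tokens = re.findall(r"[a-z]+", sentence.lower())
            for i, tok in enumerate(tokens):
                if tok not in aspects:
                    continue
                for nxt in tokens[i + 1:i + 1 + window]:
                    if nxt in opinions:
                        entry = report.setdefault(tok, {"POS": 0, "NEG": 0})
                        entry[opinions[nxt]] += 1
                        break
    return report


# Unseen inference data (made-up reviews echoing the article's examples).
reviews = [
    "The beet salad was terrific. The steak was tough.",
    "The service was excellent!",
]

report = sentiment_report(reviews, ASPECT_LEXICON, OPINION_LEXICON)
for aspect, counts in sorted(report.items()):
    print(f"{aspect}: +{counts['POS']} / -{counts['NEG']}")
```

Because each count traces back to a specific aspect term and opinion word, a report built this way can show which words produced each score.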

“The algorithm does not require the training of a new model for each domain and can continuously learn from new data coming in,” Intel said.

Beyond reducing the need for labeled training data and spanning different domains, the ABSA algorithm also represents an NLP approach in which a model can explain how it arrived at a conclusion or a recommendation, the Intel researchers said.

Intel noted that emerging NLP applications illustrate the shift to “large pre-trained models with relatively small amounts of data instead of the traditional approach of training from-scratch with large amounts of data per task and then performing inference.”

That bodes well for commercial NLP applications, the researchers added. “We see that the field of computer vision has gone through a set of accelerated adoptions with the rise of transfer learning (e.g. ImageNet), which enabled the productization of the technology. We expect a similar development in the field of NLP.”
