word vectors trained from a large corpus of 5.8 million re-
views for similarity comparison. The word vectors are re-
garded as a form of prior knowledge learned from the past
data. For example, “photo” is a synonym of “picture.” Us-
ing the DP method, “picture” is extracted as an aspect from
the sentence “The picture is blurry,” but “photo” is not ex-
tracted from the sentence “The phone is good, but not its
photos.” One reason “photo” is not extracted is that, to ensure good extraction precision and recall, many useful but low-precision rules are not used. The proposed
semantic similarity-based recommendation makes use of the
extracted aspect “picture” to recommend “photo” based on
the semantic similarity of the two words.
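To make this concrete, below is a minimal Python sketch of the similarity-based recommendation step, assuming word vectors are available as a word-to-embedding dictionary and using a cosine-similarity cutoff; the function name sim_recommend and the threshold value are illustrative assumptions, not the paper's actual implementation.

    import numpy as np

    def cosine(u, v):
        # Cosine similarity between two word vectors.
        return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

    def sim_recommend(base_aspects, candidates, vectors, threshold=0.6):
        # Recommend a candidate term as an aspect if it is semantically
        # close to some already-extracted (high-precision) aspect.
        # vectors: dict mapping a word to its embedding (e.g., word2vec
        # trained on a large review corpus); threshold is a hypothetical
        # cutoff, not the paper's actual value.
        recommended = set()
        for cand in candidates:
            if cand not in vectors:
                continue
            for asp in base_aspects:
                if asp in vectors and cosine(vectors[cand], vectors[asp]) >= threshold:
                    recommended.add(cand)
                    break
        return recommended

    # Toy usage: "photo" is close to the extracted aspect "picture",
    # while "battery" is not.
    vecs = {"picture": np.array([0.9, 0.1, 0.2]),
            "photo":   np.array([0.85, 0.15, 0.25]),
            "battery": np.array([0.1, 0.9, 0.3])}
    print(sim_recommend({"picture"}, {"photo", "battery"}, vecs))  # {'photo'}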
However, “picture” cannot be used to recommend “bat-
tery” as an aspect because their semantic similarity value is
very small. To recommend “battery” (if it is not extracted),
we use the second form of recommendation, i.e., aspect as-
sociations or correlations. The idea is that many aspects are
correlated or co-occur across domains. For example, products with the aspect “picture” are also very likely to have the aspect “battery,” as pictures are usually taken by digital devices that need batteries. If such associations can be discovered, they can be used to recommend additional aspects. For this purpose, we employ association rules from
data mining (Agrawal and Srikant 1994), which fit our needs well. To mine these associations, we use the extraction results from reviews of many other products or domains in a lifelong learning fashion (Chen and Liu 2014).
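The sketch below illustrates this idea with a lightweight, single-antecedent rule miner over the aspect sets extracted from past domains; it is a stand-in for full Apriori-style mining (Agrawal and Srikant 1994), and the support and confidence thresholds are illustrative assumptions.

    from collections import Counter
    from itertools import permutations

    def mine_aspect_rules(domain_aspect_sets, min_support=3, min_conf=0.7):
        # Mine simple one-to-one rules "a => b" from the aspects extracted
        # in many past domains; thresholds here are illustrative only.
        item_count, pair_count = Counter(), Counter()
        for aspects in domain_aspect_sets:
            item_count.update(aspects)
            pair_count.update(permutations(aspects, 2))  # ordered pairs (a, b)
        rules = {}
        for (a, b), n in pair_count.items():
            if n >= min_support and n / item_count[a] >= min_conf:
                rules.setdefault(a, set()).add(b)
        return rules

    def ar_recommend(base_aspects, candidates, rules):
        # Recommend a candidate if some already-extracted aspect implies it.
        return {c for a in base_aspects for c in rules.get(a, ()) if c in candidates}

    # Toy usage: past camera/phone domains share "picture" and "battery".
    past = [{"picture", "battery", "screen"},
            {"picture", "battery", "zoom"},
            {"picture", "battery", "lens"}]
    rules = mine_aspect_rules(past)
    print(ar_recommend({"picture"}, {"battery", "price"}, rules))  # {'battery'}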
In our experiments, we use a popular aspect extraction
evaluation corpus from (Hu and Liu 2004) and a new corpus
from (Liu et al. 2015). To learn word vectors and aspect as-
sociations, we use two large collections of product reviews.
Experimental results show that the two forms of recommendation produce highly reliable aspects, and the approach that employs both outperforms state-of-the-art dependency rule-based methods markedly.
Related Work
There are two main approaches to aspect extraction: su-
pervised and unsupervised. The former is mainly based on
CRF (Jakob and Gurevych 2010; Choi and Cardie 2010;
Mitchell et al. 2013), while the latter is mainly based on
topic modeling (Mei et al. 2007; Titov and McDonald
2008; Li, Huang, and Zhu 2010; Brody and Elhadad 2010;
Wang, Lu, and Zhai 2010; Moghaddam and Ester 2011;
Mukherjee and Liu 2012), and syntactic rules designed us-
ing dependency relations (Zhuang, Jing, and Zhu 2006;
Wang and Wang 2008; Wu et al. 2009; Zhang et al. 2010;
Qiu et al. 2011).
Regarding the supervised approach, CRF-based methods need manually labeled training data, whereas our method is unsupervised. Regarding the unsupervised approach, topic modeling often only gives rough topics rather than precise aspects, as a topical term is not necessarily an aspect. For example, in a battery topic, a topic model may find topical terms such as “battery,” “life,” and “time,” which are related to battery life (Lin and He 2009; Zhao et al. 2010; Jo and Oh 2011; Fang and Huang 2012), but each individual word is not an aspect.
There are also frequency-based methods (Hu and Liu
2004; Popescu and Etzioni 2005; Zhu et al. 2009), word
alignment methods (Liu et al. 2013), label propagation
methods (Zhou, Wan, and Xiao 2013), and other methods.
This paper is most related to the DP method (Qiu et al.
2011), and aims to improve it. Since our method employs
word vectors learned from a large collection of product re-
views, it is also related to (Xu, Liu, and Zhao 2014), which
proposed a joint opinion relation detection method OCDNN.
Although they also used word vectors to represent words in neural network training, they used them as a feature representation for classification. The work (Pavlopoulos and Androutsopoulos 2014) explored word vectors trained on English Wikipedia to compute word similarities for use in a clustering algorithm. However, our work is quite different: we train word vectors using a large review corpus and use them to recommend aspects.
Our work is also related to topic modeling-based methods
in (Chen and Liu 2014; Chen, Mukherjee, and Liu 2014) as
they also used multiple past domains to help aspect extrac-
tion in a lifelong learning fashion. However, like other topic models, they can only find rough topics, whereas we can find more precise aspects with the help of multiple past domains.
In (Liu et al. 2015), a rule selection method is proposed to
improve DP, but it is a supervised method.
Overall Algorithm
This section introduces algorithm AER (Aspect Extraction
based on Recommendation), Algorithm 1, which consists of
two main steps: base extraction and recommendation.
Algorithm 1 AER(D^t, R^-, R^+, O)
Input: Target dataset D^t, high precision aspect extraction rules R^-, high recall aspect extraction rules R^+, seed opinion words O
Output: Extracted aspect set A
1: T^- ← DPextract(D^t, R^-, O);
2: T^+ ← DPextract(D^t, R^+, O);
3: T ← T^+ − T^-;
4: T^s ← Sim-recom(T^-, T);
5: T^a ← AR-recom(T^-, T);
6: A ← T^- ∪ T^s ∪ T^a.
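The following Python sketch mirrors only the data flow of Algorithm 1; the components DPextract, Sim-recom, and AR-recom are passed in as callables, since their internals are described separately, and all sets are assumed to be Python sets.

    def aer(target_docs, rules_hp, rules_hr, seed_opinions,
            dp_extract, sim_recom, ar_recom):
        # Top-level control flow of Algorithm 1 (AER).
        t_minus = dp_extract(target_docs, rules_hp, seed_opinions)  # line 1: high precision
        t_plus = dp_extract(target_docs, rules_hr, seed_opinions)   # line 2: high recall
        t = t_plus - t_minus                                        # line 3: candidate aspects
        t_s = sim_recom(t_minus, t)                                 # line 4: similarity-based
        t_a = ar_recom(t_minus, t)                                  # line 5: association-based
        return t_minus | t_s | t_a                                  # line 6: final aspect set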
Step 1 (base extraction, lines 1-2): Given the target document collection D^t for extraction and a set O of seed opinion words, this step first uses the DP method (DPextract) to extract an initial (or base) set T^- of aspects employing a set R^- of high precision rules (line 1). The high precision rules are selected from the full set of DP rules by evaluating their precisions individually on a development set. The set T^- of extracted aspects thus has very high precision but not high recall. Then, a set T^+ of aspects is extracted using a larger set R^+ of high recall rules (R^- ⊆ R^+), also with DPextract (line 2). The set T^+ of extracted aspects thus has very high recall but not high precision.
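The text above specifies only that R^- is chosen by per-rule precision on a development set; one plausible reading of that procedure is sketched below, where dp_extract_one (applying a single rule in isolation) and the min_precision cutoff are assumptions for illustration, not values from the paper.

    def select_high_precision_rules(rules, dev_docs, gold_aspects,
                                    dp_extract_one, min_precision=0.8):
        # Keep each DP rule whose individual extraction precision on the
        # development set clears the cutoff.
        selected = []
        for rule in rules:
            extracted = dp_extract_one(dev_docs, rule)  # aspects this rule alone extracts
            if not extracted:
                continue
            precision = len(extracted & gold_aspects) / len(extracted)
            if precision >= min_precision:
                selected.append(rule)
        return selected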
Step 2 (recommendation, lines 3-6): This step recommends more aspects using T^- as the base to improve the recall. To ensure recommendation quality, we require that the