Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pages 2124–2133, Berlin, Germany, August 7-12, 2016. © 2016 Association for Computational Linguistics
Neural Relation Extraction with Selective Attention over Instances
Yankai Lin¹, Shiqi Shen¹, Zhiyuan Liu¹,²∗, Huanbo Luan¹, Maosong Sun¹,²
¹ Department of Computer Science and Technology, State Key Lab on Intelligent Technology and Systems, National Lab for Information Science and Technology, Tsinghua University, Beijing, China
² Jiangsu Collaborative Innovation Center for Language Competence, Jiangsu, China
Abstract

Distant supervised relation extraction has been widely used to find novel relational facts from text. However, distant supervision inevitably suffers from the wrong labelling problem, and the resulting noisy data substantially hurt the performance of relation extraction. To alleviate this issue, we propose a sentence-level attention-based model for relation extraction. In this model, we employ convolutional neural networks to embed the semantics of sentences. Afterwards, we build sentence-level attention over multiple instances, which is expected to dynamically reduce the weights of noisy instances. Experimental results on real-world datasets show that our model can make full use of all informative sentences and effectively reduce the influence of wrongly labelled instances. Our model achieves significant and consistent improvements on relation extraction as compared with baselines. The source code of this paper can be obtained from https://github.com/thunlp/NRE.
1 Introduction

In recent years, various large-scale knowledge bases (KBs) such as Freebase (Bollacker et al., 2008), DBpedia (Auer et al., 2007) and YAGO (Suchanek et al., 2007) have been built and widely used in many natural language processing (NLP) tasks, including web search and question answering. These KBs mostly consist of relational facts in triple format, e.g., (Microsoft, founder, Bill Gates). Although existing KBs contain a massive amount of facts, they are still far from complete relative to the facts of the real world. To enrich KBs, many efforts have been invested in automatically finding unknown relational facts. Therefore, relation extraction (RE), the process of generating relational data from plain text, is a crucial task in NLP.

∗ Corresponding author: Zhiyuan Liu (liuzy@tsinghua.edu.cn).
Most existing supervised RE systems require a large amount of labelled relation-specific training data, which is very time consuming and labor intensive to obtain. (Mintz et al., 2009) proposes distant supervision to automatically generate training data by aligning KBs with text, under the assumption that if two entities have a relation in a KB, then all sentences that contain these two entities express this relation. For example, (Microsoft, founder, Bill Gates) is a relational fact in a KB, so distant supervision will regard every sentence that contains these two entities as an active instance for the relation founder. Although distant supervision is an effective strategy for automatically labelling training data, it suffers from the wrong labelling problem. For example, the sentence “Bill Gates ’s turn to philanthropy was linked to the antitrust problems Microsoft had in the U.S. and the European union.” does not express the relation founder but will still be regarded as an active instance. Hence, (Riedel et al., 2010; Hoffmann et al., 2011; Surdeanu et al., 2012) adopt multi-instance learning to alleviate the wrong labelling problem. The main weakness of these conventional methods is that most features are explicitly derived from NLP tools such as POS taggers, and the errors generated by these tools propagate through the methods.
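To make the labelling heuristic concrete, the following is a minimal sketch of distant supervision as described above; the toy KB, the corpus, and the helper names are illustrative assumptions, not code from any of the cited systems:

```python
# Toy KB of (head, tail) -> relation triples; illustrative only.
kb = {("Microsoft", "Bill_Gates"): "founder"}

corpus = [
    "Bill_Gates is the founder of Microsoft .",
    "Bill_Gates 's turn to philanthropy was linked to the antitrust "
    "problems Microsoft had in the U.S. and the European union .",
]

def distant_supervision(kb, corpus):
    """Label every sentence containing both entities with the KB relation.

    Note: the second sentence above gets labelled `founder` even though
    it does not express that relation -- the wrong labelling problem.
    """
    labelled = []
    for (head, tail), relation in kb.items():
        for sentence in corpus:
            tokens = sentence.split()
            if head in tokens and tail in tokens:
                labelled.append((sentence, head, tail, relation))
    return labelled

print(distant_supervision(kb, corpus))
```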
Figure 1: The architecture of the sentence-level attention-based CNN, where m_i indicates the original sentence for an entity pair and α_i is the weight given by sentence-level attention.

Some recent works (Socher et al., 2012; Zeng et al., 2014; dos Santos et al., 2015) attempt to use deep neural networks for relation classification without handcrafted features. These methods build classifiers based on sentence-level annotated data, and thus cannot be applied to large-scale KBs due to the lack of human-annotated training data. Therefore, (Zeng et al., 2015) incorporates multi-instance learning into a neural network model, which can build a relation extractor from distant supervision data. Although this method achieves significant improvement in relation extraction, it is still far from satisfactory: it assumes that at least one sentence mentioning the two entities expresses their relation, and selects only the most likely sentence for each entity pair in training and prediction. It is apparent that this strategy loses a large amount of the rich information contained in the neglected sentences.
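For reference, here is a minimal sketch of this at-least-one selection, under the simplifying assumption that how likely a sentence expresses the relation is scored by a dot product with a relation query vector (the scoring function here is illustrative, not the actual one of (Zeng et al., 2015)):

```python
import numpy as np

def at_least_one_select(sentence_vectors, relation_query):
    """Sketch of at-least-one multi-instance selection: keep only the
    sentence the model currently scores highest for the labelled relation.

    sentence_vectors: (n, dim) CNN embeddings of all sentences that
                      mention the entity pair.
    relation_query:   (dim,) illustrative stand-in for the model's
                      actual scoring function.
    """
    scores = sentence_vectors @ relation_query  # one score per sentence
    best = int(np.argmax(scores))               # index of most likely sentence
    return sentence_vectors[best]               # the rest are discarded
```

Everything except the argmax sentence is ignored, which is exactly the information loss that the selective attention proposed in this paper avoids.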
In this paper, we propose a sentence-level attention-based convolutional neural network (CNN) for distant supervised relation extraction. As illustrated in Fig. 1, we employ a CNN to embed the semantics of sentences. Afterwards, to utilize all informative sentences, we represent the relation as a semantic composition of sentence embeddings. To address the wrong labelling problem, we build sentence-level attention over multiple instances, which is expected to dynamically reduce the weights of noisy instances. Finally, we extract relations with the relation vector weighted by sentence-level attention. We evaluate our model on a real-world dataset in the task of relation extraction. The experimental results show that our model achieves significant and consistent improvements in relation extraction as compared with state-of-the-art methods.
The contributions of this paper can be summarized as follows:

• Compared to existing neural relation extraction models, our model can make full use of all informative sentences of each entity pair.

• To address the wrong labelling problem in distant supervision, we propose selective attention to de-emphasize noisy instances.

• In our experiments, we show that selective attention is beneficial to two kinds of CNN models in the task of relation extraction.
2 Related Work

Relation extraction is one of the most important tasks in NLP. Many efforts have been invested in relation extraction, especially in supervised relation extraction. Most of these methods need a great deal of annotated data, which is time consuming and labor intensive to produce. To address this issue, (Mintz et al., 2009) aligns plain text with Freebase via distant supervision. However, distant supervision inevitably suffers from the wrong labelling problem. To alleviate this problem, (Riedel et al., 2010) models distant supervision for relation extraction as a multi-instance single-label problem, and (Hoffmann et al., 2011; Surdeanu et al., 2012) adopt multi-instance multi-label learning for relation extraction. Multi-instance learning was originally proposed to address the issue of ambiguously-labelled training data when predicting the activity of drugs (Dietterich et al., 1997); it considers the reliability of the label of each instance. (Bunescu and Mooney, 2007) connects weak supervision with multi-instance learning and extends it to relation extraction. However, all these feature-based methods depend strongly on the quality of the features generated by NLP tools, and therefore suffer from the error propagation problem.
Recently, deep learning (Bengio, 2009) has been widely used in various areas, including computer vision and speech recognition. It has also been successfully applied to different NLP tasks such as part-of-speech tagging (Collobert et al., 2011), sentiment analysis (dos Santos and Gatti, 2014), parsing (Socher et al., 2013), and machine translation (Sutskever et al., 2014). Due to this recent success, many researchers have investigated the possibility of using neural networks to automatically learn features for relation extraction. (Socher et al., 2012) uses a recursive neural network for relation extraction: the sentences are first parsed, and each node in the parse tree is then represented as a vector. Moreover, (Zeng et al., 2014; dos Santos et al., 2015) adopt end-to-end convolutional neural networks for relation extraction. Besides, (Xie et al., 2016) attempts to incorporate the text information of entities for relation extraction.
Although these methods achieve great success, they still extract relations at the sentence level and suffer from a lack of sufficient training data. In addition, the multi-instance learning strategy of conventional methods cannot be easily applied to neural network models. Therefore, (Zeng et al., 2015) combines at-least-one multi-instance learning with a neural network model to extract relations from distant supervision data. However, they assume that only one sentence is active for each entity pair, and hence lose a large amount of the rich information contained in the neglected sentences. Different from their method, we propose sentence-level attention over multiple instances, which can utilize all informative sentences.
Attention-based models have recently attracted considerable interest from researchers. The selectivity of attention-based models allows them to learn alignments between different modalities. They have been applied to various areas such as image classification (Mnih et al., 2014), speech recognition (Chorowski et al., 2014), image caption generation (Xu et al., 2015) and machine translation (Bahdanau et al., 2014). To the best of our knowledge, this is the first effort to adopt an attention-based model in distant supervised relation extraction.
3 Methodology

Given a set of sentences {x_1, x_2, ···, x_n} and two corresponding entities, our model measures the probability of each relation r. In this section, we introduce our model in two main parts:

• Sentence Encoder. Given a sentence x and two target entities, a convolutional neural network (CNN) is used to construct a distributed representation x of the sentence.

• Selective Attention over Instances. Once the distributed vector representations of all sentences are learnt, we use sentence-level attention to select the sentences which really express the corresponding relation, as sketched below.
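The following is a minimal sketch of the second part. The dot-product scoring of sentences against a relation query vector is a simplifying assumption made for the sketch (the paper defines its own scoring function later); what it illustrates is the general mechanism of turning per-sentence scores into softmax weights and combining all sentence embeddings rather than discarding any:

```python
import numpy as np

def softmax(z):
    z = z - z.max()            # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def selective_attention(sentence_vectors, relation_query):
    """Sketch of selective attention over instances.

    sentence_vectors: (n, dim) embeddings of all sentences for one entity pair.
    relation_query:   (dim,) vector for the candidate relation; dot-product
                      scoring is an illustrative assumption.
    Returns the set-level representation and the weights alpha_i, so that
    noisy sentences are down-weighted instead of discarded.
    """
    scores = sentence_vectors @ relation_query  # per-sentence scores
    alphas = softmax(scores)                    # attention weights alpha_i
    set_vector = alphas @ sentence_vectors      # weighted sum of embeddings
    return set_vector, alphas
```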
3.1 Sentence Encoder
Figure 2: The architecture of the CNN/PCNN used for the sentence encoder, illustrated on the example sentence “Bill_Gates is the founder of Microsoft.” (word and position inputs, convolution layer, max pooling, and a non-linear layer producing the sentence vector).
As shown in Fig. 2, we transform the sentence x into its distributed representation x by a CNN. First, the words in the sentence are transformed into dense real-valued feature vectors. Next, a convolutional layer, a max-pooling layer and a non-linear transformation layer are used to construct the distributed representation of the sentence, i.e., x.
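For concreteness, a minimal numpy sketch of this pipeline follows. It assumes the per-word input vectors of Section 3.1.1 are already computed; the zero padding and the tanh non-linearity are illustrative choices for the sketch rather than the paper's exact hyperparameters:

```python
import numpy as np

def cnn_sentence_encoder(token_vectors, conv_W, conv_b):
    """Sketch of the sentence encoder: convolution over sliding windows of
    token vectors, max-pooling over time, then a non-linear transformation.

    token_vectors: (m, d) per-word input vectors (word embedding
                   concatenated with position embeddings, Sec. 3.1.1).
    conv_W:        (n_filters, window * d) convolution filters.
    conv_b:        (n_filters,) bias.
    """
    m, d = token_vectors.shape
    window = conv_W.shape[1] // d
    # Zero-pad so that every token position has a full window.
    padded = np.vstack([np.zeros(((window - 1) // 2, d)),
                        token_vectors,
                        np.zeros((window // 2, d))])
    # Convolution: apply all filters to each window of concatenated vectors.
    windows = np.stack([padded[i:i + window].ravel() for i in range(m)])
    feature_maps = windows @ conv_W.T + conv_b  # (m, n_filters)
    pooled = feature_maps.max(axis=0)           # max-pooling over time
    return np.tanh(pooled)                      # sentence representation x
```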
3.1.1 Input Representation

The inputs of the CNN are the raw words of the sentence x. We first transform the words into low-dimensional vectors: each input word is mapped to a vector via a word embedding matrix. In addition, to specify the position of each entity pair, we also use position embeddings for all words in the sentence.
Word Embeddings. Word embeddings aim to transform words into distributed representations which capture the syntactic and semantic meanings of the words. Given a sentence x consisting of m words x = {w_1, w_2, ···, w_m}, every word w_i is represented by a real-valued vector. Word representations are encoded by column vectors in an embedding matrix V ∈ R^(d_a × |V|), where V is a fixed-sized vocabulary.
Position Embeddings. In the task of relation extraction, the words close to the target entities are usually informative for determining the relation between the entities. Similar to (Zeng et al., 2014), we use position embeddings specified by entity pairs. They help the CNN keep track of how close each word is to the head or tail entity.
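A minimal sketch of this input layer is given below; the clipping of long distances at max_dist and the use of a single position embedding matrix for both entities are assumptions made for the sketch:

```python
import numpy as np

def input_representation(token_ids, head_pos, tail_pos,
                         word_emb, pos_emb, max_dist=30):
    """Sketch of the input layer: each token's vector is its word embedding
    concatenated with embeddings of its relative positions to both entities.

    token_ids:  vocabulary indices of the m words in the sentence.
    head_pos, tail_pos: token positions of the two target entities.
    word_emb:   (|V|, d_a) word embedding matrix (rows here for convenience;
                the paper stores word vectors as columns).
    pos_emb:    (2 * max_dist + 1, d_b) position embedding matrix; clipping
                distances to [-max_dist, max_dist] is an assumed choice.
    """
    def clip(dist):
        return int(np.clip(dist, -max_dist, max_dist)) + max_dist

    rows = []
    for i, w in enumerate(token_ids):
        rows.append(np.concatenate([
            word_emb[w],                  # word embedding
            pos_emb[clip(i - head_pos)],  # relative distance to head entity
            pos_emb[clip(i - tail_pos)],  # relative distance to tail entity
        ]))
    return np.stack(rows)                 # (m, d_a + 2 * d_b) input vectors
```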