利用文档级信息结合语义空间加强事件检测.docx资源-CSDN文库

版权申诉

134 浏览量 2022-12-15 14:20:29 上传评论收藏 255KB DOCX 举报

资源推荐

资源详情

资源评论

Event detection (ED) is a crucial task of event extraction (EE), which aims to identify event

triggers from text and classify them into corresponding event types. The event trigger is the word

or phrase that can clearly indicate the existence of an event in a sentence. According to the

automatic context extraction (ACE) 2005 dataset, which is widely applied to the ED task, there are

8 event types and 33 subtypes, such as “Attack”, “Transport”, “Meet” etc. Take the following

sentences as examples:

S1: He has died of his wounds after being shot.

S2: An American tank fired on the Palestine hotel.

S3: Another veteran war correspondent is being fired for his controversial conduct in Iraq.

An ideal ED model is expected to recognize two events: a “Die” event triggered by the

trigger word “died” and an “Attack” event triggered by “shot” in S1.

The difficulty of the ED task lies in the diversity and ambiguity of natural language

expression. On the one hand, there are a variety of expressions that belong to the same event type.

In S1, “shot” triggers an “Attack” event, and “fired” also triggers the same event type in S2. On

the other hand, the same trigger can denote different events. In S3, “fired” can trigger an “Attack”

event or an “End-Position” event. Because of the ambiguity, a traditional approach may mislabel

“fired” with “Attack” according to the word “war” with sentence-level information. However, in

the same document, other sentences like “NBC is terminating freelancer reporter Peter Arnett for

statements he made to the Iraqi media.” could provide the clue that “fired” triggers an “End-

Position” event. Up to 57% of the event triggers are ambiguous in the ACE 2005 dataset

[1]

. Thus,

how to solve the ambiguity of event trigger has become an important problem in ED task.

ED is a booming and challenging task in NLP. The dominant approaches for ED adopt deep

neural networks to learn effective features for the input sentences. Most existing methods either

generally focus on sentence-level context, or ignore the correlations between events, such as

semantic correlation information. Many methods

[2-3]

mainly exploit sentence-level features that

lack a summary of the document. Sometimes sentence-level information is insufficient to address

the ambiguity of event trigger, such as the event trigger “fired” in S3. Some document-level

models have been proposed to leverage global context

[4-6]

. However, these methods extract

features of the entire document, which are coarse-grained features for event classification.

Actually, by means of processing context more effectively, the model’s performance can be

improved.

The semantic correlations between different events exist objectively and pervasively, and

they are manifested in several aspects. Initially, different event types have some semantic

relevance. For instance, compared with the “Transport” event, the “Attack” event and the “Injure”

event are semantically closer. Belonging to the same parent event type, different subtypes have

certain semantic correlations. “Be-Born” and “Marry” belong to the same parent event type

“Life”, which can reveal more collective features. They are more likely to co-occur in the same

document. Furthermore, different event triggers have some semantic correlations in the same

document, such as event trigger “shot” and “died” in S1. The events mentioned in the same

document tend to be semantically coherent. As pointed out by Ref. [5], many events usually co-

occur in the same document. According to the ACE 2005 dataset, the top 5 event types that

accompany with “Attack” event in the same sentence are as follows: Attack, Die, Transport, Injure

and Meet. Eventually, there is similar semantics between the event trigger and its corresponding

event type. The event type word indicates the fundamental semantic information and reveals

common features, and the event trigger word has extended semantic information with a more

specific context. Suppose we replace the trigger word with its corresponding event type word, the

semantics of the whole sentence will not change much. Thus, how to model the semantic

correlation information between event types and event triggers becomes a challenge to be

overcome.

Existing methods generally use the one-hot label, which classifies the event type with the 0/1

label. Despite the simplicity, it regards multiple events in the same document as independent ones,

and therefore it is difficult to accurately represent the correlations between different event types.

In this paper, we propose document embedding networks with shared semantics space

(DENSS) to address the aforementioned problems. To learn the event correlations, we use

bidirectional encoder representations from transformers (BERT) to obtain event type

representations and map them into a semantic space, where the more relevant event types are, the

closer they stay. We apply BERT again to acquire the representation of each word with document-

level and sentence-level information via gated attention, project the representation of each event

trigger into the same semantic space, and choose the label of the closest event type.

In summary, the contributions of this paper are as follows: 1) We study the event

correlations problem and propose a novel ED framework, which utilizes BERT for capturing

document-level and sentence-level information. 2) We employ a shared semantic space to

represent event types and event triggers, which minimizes the distance between each event trigger

and its corresponding type. Experiment results on the ACE 2005 dataset verify the effectiveness of

our approach.

1. Approach

1.1 Task Description

The goal of ED consists of identifying event triggers (trigger identification) and classifying

them into corresponding event types (trigger classification). According to the ACE 2005 dataset,

an event is defined as a specific occurrence involving one or more participants. The event trigger

剩余14页未读，继续阅读

评论收藏

内容反馈

版权申诉

罗伯特之技术屋

粉丝: 3650
资源: 1万+

利用文档级信息结合语义空间加强事件检测.docx

毕业设计-在Ycbcr空间中的基于肤色的人脸检测.docx

人工智能中的语义分析技术及其应用.docx

多尺度语义信息融合的目标检测.docx

安全企业谈等保2.0一提升安全运营与检测能力保障关键信息基础设施安全.docx.docx

工件高度检测.docx

基于时空融合图网络学习的视频异常事件检测.docx

单片机水位检测.docx

语义增强的多模态虚假新闻检测.docx

金属液位检测.docx

音频标记一致性约束CRNN声音事件检测.docx

基于多尺度特征预测的异常事件检测.docx

结合语义和多层特征融合的行人检测.docx

企业专利信息特征及信息化处理-2019年文档.docx

clickhouse文档.docx

基于高斯建模和YoLo V3目标检测的遗留物检测方法.docx

网站SEO诊断优化方案框架域名空间信息域名注册时间,到期..docx

产品语义设计中的分类.docx

基于词相关性特征的多归属谱聚类突发事件检测.docx

开放语义云平台方案白皮书.docx

相关实用应用程序（Windows可用）

免费可用的ChatGPT网页版.zip

ChatGPT使用总结：150个ChatGPT提示词模板（完整版）

chromedriver-win64.zip

全国计算机二级WPSoffice精选350道选择题题库（含答案）.pdf

农村公交与异构无人机协同配送优化

李飞飞自传 我看见的世界 The World I see

哈尔滨工业大学-ChatGPT调研报告-2023.3.6-94页.pdf

4个亲测好用的ChatGPT4渠道

基于小波与卷积神经网络的多尺度时间序列分类.zip

最新资源

李飞飞自传我看见的世界 The World I see