arXiv:2106.14624v1 [cs.CL] 9 Jun 2021

Key Information Extraction From Documents: Evaluation And Generator ⋆

Oliver Bensch¹, Mirela Popa², and Constantin Spille³

¹ Maastricht University, 6200 MD Maastricht, The Netherlands
² Maastricht University, 6200 MD Maastricht, The Netherlands
³ KI Group GmbH

o.bensch@student.maastrichtuniversity.nl
mirela.popa@maastrichtuniversity.nl
c.spille@kigroup.de

Abstract. Extracting information from documents usually relies on natural language processing methods working on one-dimensional sequences of text. In some cases, for example, for the extraction of key information from semi-structured documents, such as invoice-documents, spatial and formatting information of text are crucial to understand the contextual meaning. Convolutional neural networks are already common in computer vision models to process and extract relationships in multidimensional data. Therefore, natural language processing models have already been combined with computer vision models in the past, to benefit from e.g. positional information and to improve performance of these key information extraction models. Existing models were either trained on unpublished data sets or on an annotated collection of receipts, which did not focus on PDF-like documents. Hence, in this research project a template-based document generator was created to compare state-of-the-art models for information extraction. An existing information extraction model “Chargrid” (Katti et al., 2019) was reconstructed and the impact of a bounding box regression decoder, as well as the impact of an NLP pre-processing step was evaluated for information extraction from documents. The results have shown that NLP based pre-processing is beneficial for model performance. However, the use of a bounding box regression decoder increases the model performance only for fields that do not follow a rectangular shape.

1 Introduction

Natural language processing (NLP) methods are widely used on one-dimensional sequences of text. In some cases, for example, in the extraction of key information of invoice documents, spatial information such as the position of text are crucial to understand the contextual meaning.

Convolutional neural networks (CNNs) are already common in computer vision (CV) models to process and extract relationships in multidimensional data.
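
To make the role of spatial information concrete, the following is a minimal sketch of a chargrid-style encoding, in which each character's bounding box on the page is filled with that character's integer index so that a CNN can process the page as a two-dimensional grid. It is an illustration under stated assumptions, not the implementation evaluated in this work: the names CharBox, build_vocab and build_chargrid, the coarse 3 x 6 grid, and the toy character boxes are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical input format: (character, x0, y0, x1, y1) in grid cells,
# with x1 and y1 exclusive. Real OCR output would need to be rescaled first.
CharBox = Tuple[str, int, int, int, int]


def build_vocab(boxes: List[CharBox]) -> Dict[str, int]:
    """Map each distinct character to a positive index; 0 is kept for background."""
    vocab: Dict[str, int] = {}
    for ch, *_ in boxes:
        vocab.setdefault(ch, len(vocab) + 1)
    return vocab


def build_chargrid(
    boxes: List[CharBox], height: int, width: int, vocab: Dict[str, int]
) -> List[List[int]]:
    """Fill the area covered by each character box with that character's index."""
    grid = [[0] * width for _ in range(height)]
    for ch, x0, y0, x1, y1 in boxes:
        idx = vocab[ch]
        # Clip the box to the grid so out-of-range coordinates are ignored.
        for y in range(max(0, y0), min(y1, height)):
            for x in range(max(0, x0), min(x1, width)):
                grid[y][x] = idx
    return grid


# Toy page: "No" at the top left, and the digit "7" in the same column on two rows.
boxes: List[CharBox] = [
    ("N", 0, 0, 1, 1),
    ("o", 1, 0, 2, 1),
    ("7", 5, 0, 6, 1),
    ("7", 5, 2, 6, 3),
]
vocab = build_vocab(boxes)
grid = build_chargrid(boxes, height=3, width=6, vocab=vocab)

for row in grid:
    print(row)
```

In a one-dimensional token sequence the two occurrences of "7" are only distinguished by their order, whereas in the grid they share a column, which is the kind of layout cue that convolutional layers can exploit.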

⋆ Supported by organization KI Group GmbH.
