Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment

Shuxin Yang^1,5, Xian Wu, Shen Ge, Xingwang Wu, S. Kevin Zhou^1,2, Li Xiao^1,5

The Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China
School of Biomedical Engineering & Suzhou Institute for Advanced Research Center for Medical Imaging, Robotics, and Analytic Computing & LEarning (MIRACLE) University of Science and Technology of China, Suzhou 215123, China
Tencent Medical AI Lab, Beijing, 100094, China
The First Affiliated Hospital of Anhui Medical University, Hefei, 230022, China
University of Chinese Academy of Sciences, Beijing, 100049, China

Abstract

In clinics, a radiology report is crucial for guiding a patient's treatment. Unfortunately, report writing imposes a heavy burden on radiologists. To reduce this burden, we present an automatic, multi-modal approach for report generation from chest x-rays. Our approach, motivated by the observation that the descriptions in radiology reports are highly correlated with the x-ray images, features two distinct modules: (i) Learned knowledge base. To absorb the knowledge embedded in the above-mentioned correlation, we automatically build a knowledge base based on textual embedding. (ii) Multi-modal alignment. To promote the semantic alignment among reports, disease labels, and images, we explicitly utilize textual embedding to guide the learning of the visual feature space. We evaluate the performance of the proposed model using metrics from both natural language generation and clinical efficacy on the public IU and MIMIC-CXR datasets. Our ablation study shows that each module contributes to improving the quality of generated reports. Furthermore, with the aid of both modules, our approach clearly outperforms state-of-the-art methods.

Introduction

The radiology report is crucial for assisting clinical decision making (Zhou, Rueckert, and Fichtinger 2019). It describes observations on images, such as a disease's degree, size, and location. However, writing reports is time-consuming and tedious for radiologists (Bruno, Walker, and Abujudeh 2015). With the advancement of deep learning and natural language processing, automatic radiology report generation has attracted growing research interest.

Many radiology report generation approaches follow the practice of image captioning models (Xu et al. 2015; Lu et al. 2017; Anderson et al. 2018). For example, (Jing, Xie, and Xing 2018; Yuan et al. 2019) employ the encoder-decoder architecture and propose a hierarchical generator as well as an attention mechanism to generate long reports. However, the radiology report generation task differs from the image captioning task. In image captioning, the model is required to cover the details of the input image, while for radiology report generation, the model is required to focus on the abnormal regions and infer potential diseases. Therefore, to generate a correct radiology report, the model needs to identify the abnormal regions and provide proper descriptions. To this end, medical background knowledge needs to be included in modeling.

Recently, some works have attempted to integrate medical knowledge into modeling: MKG (Zhang et al. 2020) and PPKED (Liu et al. 2021) incorporate manually pre-constructed knowledge graphs to enhance generation, and HRGR (Li et al. 2018) builds a template database based on prior knowledge by manually filtering a set of sentences in the training corpus. These methods achieve improved performance over image captioning models. However, they require the knowledge graph or template database to be built in advance, which is still laborious. In addition, when applying these models to images of other diseases, the knowledge graph or template database needs to be updated as well.

In this paper, we propose a knowledge base updating mechanism to store medical knowledge automatically. It learns a knowledge base from training data. First, we initialize a memory as a knowledge base and use CNN and BERT models to extract visual features and textual embeddings from the input images and the corresponding reference reports, respectively. Next, the knowledge base is updated by the report embeddings during the training phase. At the end of training, we fix the knowledge base as a model parameter and use it for report generation. To acquire the knowledge related to the input image, we propose a visual-knowledge attention module that queries the knowledge base with visual features. Finally, we employ the standard Transformer model, with the help of the visual features and the acquired knowledge, to generate radiology reports.
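As a rough illustration of querying a knowledge base with visual features, the following is a minimal NumPy sketch. It assumes plain scaled dot-product attention with no learned projections; the function name `query_knowledge_base`, the array shapes, and the random inputs are hypothetical and are not taken from the paper, whose actual module may differ.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def query_knowledge_base(visual_feats, knowledge_base):
    """Attend over knowledge slots using visual features as queries.

    visual_feats:   (n_regions, d) array, e.g. CNN grid features (assumed shape).
    knowledge_base: (n_slots, d) array, the fixed memory after training (assumed shape).
    Returns the acquired knowledge (n_regions, d) and the attention weights.
    """
    d = visual_feats.shape[-1]
    # Similarity between every image region and every knowledge slot.
    scores = visual_feats @ knowledge_base.T / np.sqrt(d)
    weights = softmax(scores, axis=-1)   # each row sums to 1
    acquired = weights @ knowledge_base  # weighted mix of knowledge slots
    return acquired, weights


# Hypothetical sizes: 49 image regions, 100 knowledge slots, 64-dim features.
rng = np.random.default_rng(0)
visual_feats = rng.normal(size=(49, 64))
knowledge_base = rng.normal(size=(100, 64))
acquired, weights = query_knowledge_base(visual_feats, knowledge_base)
```

In this sketch the acquired knowledge has one vector per image region, so it could be concatenated with or added to the visual features before being passed to a Transformer decoder; which combination the authors use is not stated in this excerpt.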

Since the critical clinical information usually comes from descriptions of abnormalities, and such sentences are rare and diverse in radiology datasets, we need to enable the knowledge base to focus on the knowledge of abnormalities. To this end, we propose a multi-modal alignment mechanism. It consists of visual-textual alignment and visual-label alignment. The intuition is that the reports and disease labels describe the same observations on the images, so the semantic features among images, reports, and disease labels should be consistent. Specifically, we adapt the triplet margin loss (Balntas et al. 2016) to align the visual features and

arXiv:2112.15011v1 [eess.IV] 30 Dec 2021
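The generic triplet margin loss mentioned in the last paragraph can be sketched as follows. This is the standard form, max(0, d(a, p) - d(a, n) + margin) with Euclidean distance, and not the authors' adapted version, which this excerpt does not show. Treating the visual feature as the anchor, the matching report embedding as the positive, and another report's embedding as the negative is an assumption made for illustration, as are all names and toy values.

```python
import numpy as np


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss, averaged over the batch.

    anchor, positive, negative: (batch, d) arrays.
    The loss is zero once each anchor is closer to its positive than to its
    negative by at least `margin` (Euclidean distance).
    """
    d_pos = np.linalg.norm(anchor - positive, axis=-1)
    d_neg = np.linalg.norm(anchor - negative, axis=-1)
    return np.maximum(d_pos - d_neg + margin, 0.0).mean()


# Toy 2-D embeddings (hypothetical): each visual feature coincides with its
# own report embedding and is far from the other report's embedding.
visual = np.array([[1.0, 0.0], [0.0, 1.0]])
matching_report = np.array([[1.0, 0.0], [0.0, 1.0]])
other_report = np.array([[0.0, 1.0], [1.0, 0.0]])

loss_aligned = triplet_margin_loss(visual, matching_report, other_report)
loss_swapped = triplet_margin_loss(visual, other_report, matching_report)
```

With these toy values the aligned pairing gives zero loss, while swapping positives and negatives gives a large loss, which is the behavior an alignment objective of this kind relies on.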
