利用GPT-4识别癌症表型..pdf资源-CSDN文库

版权申诉

59 浏览量 2024-04-12 15:53:18 上传评论收藏 2.91MB PDF 举报

资源推荐

资源详情

资源评论

Leveraging GPT-4 for Identifying Cancer Phenotypes in Electronic Health

Records: A Performance Comparison between GPT-4, GPT-3.5-turbo, Flan-

T5 and spaCy’s Rule-based & Machine Learning-based methods

Kriti Bhattarai, BA

1,2

, Inez Y. Oh, PhD

, Jonathan Moran Sierra, BS

, Jonathan Tang, MD

Philip R.O. Payne, PhD

1,2

, Zachary B. Abrams, PhD

, Albert M. Lai, PhD

1,2

Institute for Informatics, Data Science & Biostatistics, Washington University School of

Medicine, St. Louis, Missouri, USA

Department of Computer Science, Washington University in St. Louis, Missouri, USA

Medical Scientist Training Program, Washington University School of Medicine, St. Louis,

Missouri, USA

Department of Internal Medicine, Washington University School of Medicine, St. Louis,

Missouri, USA

Corresponding Author:

Kriti Bhattarai

Department of Computer Science

Institute for Informatics, Data Science & Biostatistics

Washington University in St. Louis

660 S. Euclid Ave, 6

Floor

St. Louis, MO, 63110, USA

kriti.bhattarai@wustl.edu

Keywords: generative pre-trained transformer (GPT), natural language processing, large

language models, clinical phenotype extraction, electronic health records

Word Count:3862

The copyright holder for this preprint (whichthis version posted April 6, 2024. ; https://doi.org/10.1101/2023.09.27.559788doi: bioRxiv preprint

ABSTRACT

Objective: Accurately identifying clinical phenotypes from Electronic Health Records (EHRs)

provides additional insights into patients’ health, especially when such information is unavailable

in structured data. This study evaluates the application of OpenAI's Generative Pre-trained

Transformer (GPT)-4 model to identify clinical phenotypes from EHR text in non-small cell lung

cancer (NSCLC) patients. The goal was to identify disease stages, treatments and progression

utilizing GPT-4, and compare its performance against GPT-3.5-turbo, Flan-T5-xl, Flan-T5-xxl,

and two rule-based and machine learning-based methods, namely, scispaCy and medspaCy.

Materials and Methods: Phenotypes such as initial cancer stage, initial treatment, evidence of

cancer recurrence, and affected organs during recurrence were identified from 13,646 records for

63 NSCLC patients from Washington University in St. Louis, Missouri. The performance of the

GPT-4 model is evaluated against GPT-3.5-turbo, Flan-T5-xxl, Flan-T5-xl, medspaCy and

scispaCy by comparing precision, recall, and micro- F1 scores.

Results: GPT-4 achieved higher F1 score, precision, and recall compared to Flan-T5-xl, Flan-

T5-xxl, medspaCy and scispaCy’s models. GPT-3.5-turbo performed similarly to that of GPT-4.

GPT and Flan-T5 models were not constrained by explicit rule requirements for contextual

pattern recognition. SpaCy models relied on predefined patterns, leading to their suboptimal

performance.

Discussion and Conclusion: GPT-4 improves clinical phenotype identification due to its robust

pre-training and remarkable pattern recognition capability on the embedded tokens. It

demonstrates data-driven effectiveness even with limited context in the input. While rule-based

models remain useful for some tasks, GPT models offer improved contextual understanding of

the text, and robust clinical phenotype extraction.

The copyright holder for this preprint (whichthis version posted April 6, 2024. ; https://doi.org/10.1101/2023.09.27.559788doi: bioRxiv preprint

BACKGROUND AND SIGNIFICANCE

Introduction

Extracting clinical phenotypes from unstructured Electronic Health Records (EHRs) is a

critical task in natural language processing (NLP). Accurately identifying relevant phenotypes

from unstructured text utilizing NLP techniques provides additional insights into patients’ health,

especially when such information is unavailable in structured data. NLP extraction techniques

facilitate this process by mapping unstructured text to a structured representation, making it

easier to evaluate patients’ disease progression, treatment modalities, and treatment

effectiveness. This is particularly evident when analyzing data from non-small cell lung cancer

patients, where unstructured text is abundant. Accurately identifying disease stage, treatments

and progression from cancer text will contribute to continued research efforts aimed at

improving treatment strategies for non-small lung cancer patients, assessing disease progression,

and improving lung cancer-related outcomes.

Background

Clinical phenotype extraction is an ongoing research area where the type of extraction

tasks and target phenotypes vary across different clinical domains. Rule-based, machine

learning-based, and deep-learning models have been applied to phenotype extraction.

1-7

While

rule-based models extract phenotypes based on pre-defined patterns, most machine learning and

deep-learning approaches are trained on sentences or documents labeled with the relevant

phenotypes and the model subsequently classifies texts into these phenotypes.

5,8

SpaCy models,

including MedspaCy

and scispaCy

are two recent and frequently used hybrid frameworks that

utilize statistical and machine-learning named entity recognition methods in conjunction with

rule-based NLP to identify clinical phenotypes. There are studies that have utilized medspaCy

The copyright holder for this preprint (whichthis version posted April 6, 2024. ; https://doi.org/10.1101/2023.09.27.559788doi: bioRxiv preprint

and scispaCy to identify specific sections within EHR text for NER, extract phenotypes from

relation extraction documents, and generate text embeddings.

10-14

Although extracting clinical phenotypes is essential, several gaps remain in the literature.

There is no effective model for direct extraction, as most of these models require additional

training and fine-tuning.

15,16

Moreover, current methods often lack robustness, leading to

suboptimal performance.

15- 19

In addition, limited availability of labeled, publicly accessible

cancer EHR text leaves an important domain underexplored for NLP.

Pre-trained transformer-based language models have recently been studied for tasks such

as question answering, text generation, and machine translation.

20,21

Despite the success of

transformer-based language model in such tasks, their application in the context of clinical

phenotype extraction remain underexplored, opening numerous avenues of research. Recent

research has demonstrated the use of large language models (LLMs) for entity extraction.

22-25

However, it is essential to investigate these recent transformer-based methods in specific clinical

domains and compare their performance to previously recognized machine learning and rule-

based models to generate insights into their potential benefits for clinical phenotype extraction.

OBJECTIVES

The aim of this study was to investigate the most recent transformer-based language

models as they remain underexplored for cancer phenotype extraction from real-world EHR text.

We evaluated the application of OpenAI’s Generative Pre-Trained Transformer (GPT)-4 model

for clinical phenotype extraction in an EHR retrospective study focusing on non-small cell lung

cancer patients as a specific case study. We used GPT-4 to identify individual words or tokens in

a data sequence as distinct phenotypes. Specifically, we measure the prevalence of specific lung

cancer phenotypes, including cancer stage, treatment modalities, cancer recurrence instance, and

The copyright holder for this preprint (whichthis version posted April 6, 2024. ; https://doi.org/10.1101/2023.09.27.559788doi: bioRxiv preprint

organs affected by cancer recurrence. These phenotypes are important for informing treatment

decisions and assessing disease progression in non-small cell lung cancer patients.

We built the model framework using a clinical text dataset from Washington University

in St. Louis, Missouri, for a patient population diagnosed with non-small cell lung cancer. To

evaluate the effectiveness of GPT-4, we compared its results against 2 subject matter experts’

manual annotation. We also conducted a comparative analysis with GPT-3.5-turbo

, Flan-T5

(Flan-T5-xl, Flan-T5-xxl), and spaCy (medspaCy, scispaCy), currently frequently used rule-

based and machine learning approaches in clinical phenotype extraction. While Flan-T5 models

are LLMs, spacy models are two recent and hybrid frameworks that utilize statistical and

machine-learning methods in conjunction with rule-based NLP to identify clinical phenotypes.

We selected these baseline models based on their inherent capacity for rapid extraction, and their

ability to generate results without requiring training or additional fine-tuning.

Our comparison between scispaCy, medspaCy, Flan-T5-xl, Flan-T5-xxl, GPT-3.5-turbo

and GPT-4 aims to highlight the strengths and weaknesses of each approach for cancer

phenotype extraction from unstructured clinical text, providing valuable insights into their

effectiveness and potential use for cases in cancer phenotype extraction from EHR. In evaluating

these current approaches for phenotype extraction, we also note their limitations.

MATERIALS AND METHODS

To extract a detailed representation of specific lung cancer phenotypes, we used GPT-4,

available through Microsoft’s Azure OpenAI Service. We compared and evaluated the

performance of the current models by comparing true positives (recall) and false positives at the

patient-level. The following subsections discuss the datasets, annotation methods, and

methodologies used for extracted information, baseline comparison techniques, and evaluation

The copyright holder for this preprint (whichthis version posted April 6, 2024. ; https://doi.org/10.1101/2023.09.27.559788doi: bioRxiv preprint

剩余25页未读，继续阅读

评论收藏

内容反馈

版权申诉

百态老人

粉丝: 1995
资源: 2万+

利用 GPT-4 识别癌症表型..pdf

最新资源

利用 GPT-4 识别癌症表型..pdf

文心一言、GPT3.5及GPT-4的应用测评对比.rar

新版AI系统源码ChatGPT网站源码支持GPT-4支持AI绘画.rar

微软 GPT-4 报告 人工通用智能的星星之火GPT-4的早期实验.pdf

微软研究院：人工通用智能的星星之火-GPT-4的早期实验.pdf

OpenAI _ GPT-3新模型Davinci，将AI写作提升到新水平！网友惊呼：GPT-4要来了？.pdf

GPT-4技术报告（英）-2023-98页.pdf

openwrt-x86-64-uefi-gpt-squashfs.img

ChatGPT的原理分析：如何利用GPT-5辅助开发游戏.docx

GPT-4和GPT-3客户端 g4f.webview.0.2.8.0.exe

Multimodal-GPT-add-baize.zip

Developing Apps with GPT-4 and ChatGPT-使用GPT-4和ChatGPT开发应用程序-by

AI：GPT-4有什么不同.zip

ChatGPT使用方法：利用GPT-4设计社区联欢晚会.pdf

openwrt-koolshare-mod-v2.31-r10822-50aa0525d1-x86-64-uefi-gpt-squashfs.img.gz

海外ChatGPT、GPT-4如何赋能应用.pdf

PyPI 官网下载 | g-mlp-gpt-0.0.11.tar.gz

GPT-3学习简单笔记.md

《微软-GPT-4-报告》-154页-英文PDF-文件.rar

相关实用应用程序（Windows可用）

免费可用的ChatGPT网页版.zip

ChatGPT使用总结：150个ChatGPT提示词模板（完整版）

李飞飞自传 我看见的世界 The World I see

chromedriver-win64.zip

全国计算机二级WPSoffice精选350道选择题题库（含答案）.pdf

哈尔滨工业大学-ChatGPT调研报告-2023.3.6-94页.pdf

智联招聘：2024年大学生就业力调研报告.pdf

4个亲测好用的ChatGPT4渠道

数字电子时钟课程设计数字电子时钟课程设计

农村公交与异构无人机协同配送优化

最新资源

微软 GPT-4 报告人工通用智能的星星之火GPT-4的早期实验.pdf

李飞飞自传我看见的世界 The World I see