REVIEW ARTICLE
OPEN
Machine learning for medical imaging: methodological failures and recommendations for the future
Gaël Varoquaux 1,2,3 ✉ and Veronika Cheplygina 4 ✉
Research in computer analysis of medical images bears many promises to improve patients' health. However, a number of systematic challenges are slowing down the progress of the field, from limitations of the data, such as biases, to research incentives, such as optimizing for publication. In this paper, we review roadblocks to developing and assessing methods. Building our analysis on evidence from the literature and data challenges, we show that, at every step, potential biases can creep in. On a positive note, we also discuss ongoing efforts to counteract these problems. Finally, we provide recommendations on how to further address these problems in the future.
npj Digital Medicine (2022) 5:48; https://doi.org/10.1038/s41746-022-00592-y
INTRODUCTION
Machine learning, the cornerstone of today's artificial intelligence (AI) revolution, brings new promises to clinical practice with medical images [1–3]. For example, machine learning has been shown to perform on par with medical experts in diagnosing various conditions from medical images [4]. Software applications are starting to be certified for clinical use [5,6]. Machine learning may be the key to realizing the vision of AI in medicine sketched several decades ago [7].
The stakes are high, and there is a staggering amount of research on machine learning for medical images. But this growth does not inherently lead to clinical progress. The higher volume of research could be aligned with academic incentives rather than with the needs of clinicians and patients. For example, there can be an oversupply of papers showing state-of-the-art performance on benchmark data, but no practical improvement for the clinical problem. On the topic of machine learning for COVID-19, Roberts et al. [8] reviewed 62 published studies, but found none with potential for clinical use.
In this paper, we explore avenues to improve the clinical impact of machine learning in medical imaging. After sketching the situation and documenting uneven progress (Section "It's not all about larger datasets"), we study a number of failures frequent in medical-imaging papers at different steps of the "publishing lifecycle": what data to use (Section "Data, an imperfect window on the clinic"), which methods to use and how to evaluate them (Section "Evaluations that miss the target"), and how to publish the results (Section "Publishing, distorted incentives"). In each section, we first discuss the problems, supported by evidence from previous research as well as our own analyses of recent papers. We then discuss a number of steps to improve the situation, sometimes borrowed from related communities. We hope that these ideas will help shape research practices that are even more effective at addressing real-world medical challenges.
IT’S NOT ALL ABOUT LARGER DATASETS
The availability of large labeled datasets has enabled solving difficult machine learning problems, such as natural image recognition in computer vision, where datasets can contain millions of images. As a result, there is widespread hope that similar progress will happen in medical applications: algorithm research should eventually solve any clinical problem posed as a discrimination task. However, medical datasets are typically smaller, on the order of hundreds or thousands of subjects: ref. [9] lists sixteen "large open source medical imaging datasets", with sizes ranging from 267 to 65,000 subjects. Note that in medical imaging we count the number of subjects, but a subject may have multiple images, for example, taken at different points in time. For simplicity, we assume here a diagnosis task with one image or scan per subject.
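As an aside, when subjects do contribute multiple images, evaluation must split the data by subject rather than by image; otherwise images of the same subject can end up in both the training and the test set and inflate the measured accuracy. The sketch below is illustrative only (it is not from the paper; the data are synthetic, and scikit-learn's GroupShuffleSplit is just one suitable tool for such a grouped split):

```python
# Illustrative sketch (not from the paper): splitting image-level data by
# subject, so that no subject appears in both train and test. Synthetic data.
import numpy as np
from sklearn.model_selection import GroupShuffleSplit

rng = np.random.default_rng(0)
n_subjects, images_per_subject = 50, 3
n_images = n_subjects * images_per_subject
X = rng.normal(size=(n_images, 10))                             # fake image features
y = rng.integers(0, 2, n_subjects).repeat(images_per_subject)   # one label per subject
groups = np.arange(n_subjects).repeat(images_per_subject)       # subject IDs

splitter = GroupShuffleSplit(n_splits=1, test_size=0.2, random_state=0)
train_idx, test_idx = next(splitter.split(X, y, groups=groups))

# All images of a given subject land on one side of the split only.
assert set(groups[train_idx]).isdisjoint(groups[test_idx])
```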
Few clinical questions come as well-posed discrimination tasks that can be naturally framed as machine-learning tasks. But even for these, larger datasets have to date not led to the progress hoped for. One example is early diagnosis of Alzheimer's disease (AD), a growing health burden due to the aging population. Early diagnosis would open the door to early-stage interventions, which are the most likely to be effective. Substantial efforts have gone into acquiring large brain-imaging cohorts of aging individuals at risk of developing AD, on which early biomarkers can be developed using machine learning [10]. As a result, there have been steady increases in the typical sample size of studies applying machine learning to develop computer-aided diagnosis of AD or its precursor, mild cognitive impairment. This growth is clearly visible in publications, as shown in Fig. 1a, a meta-analysis compiling 478 studies from six systematic reviews [4,11–15].
However, the increase in data size (with the largest datasets containing over a thousand subjects) did not come with better diagnostic accuracy, in particular for the most clinically relevant question: distinguishing pathological from stable evolution in patients with symptoms of prodromal Alzheimer's (Fig. 1b). Rather, studies with larger sample sizes tend to report worse prediction accuracy. This is worrisome, as these larger studies are closer to real-life settings. On the other hand, research efforts across time did lead to improvements even on large, heterogeneous cohorts (Fig. 1c): studies published later show improvements even at large sample sizes (statistical analysis in the Supplementary Information).
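To make this kind of analysis concrete: the two trends can be probed with a simple meta-regression of reported accuracy on (log) sample size and publication year across the compiled studies. Below is a minimal sketch, assuming a hypothetical table studies.csv with columns accuracy, n_subjects, and year; it is illustrative only, not the paper's actual supplementary analysis:

```python
# Hedged meta-regression sketch; file and column names are hypothetical.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

studies = pd.read_csv("studies.csv")              # accuracy, n_subjects, year
studies["log_n"] = np.log10(studies["n_subjects"])

# A negative coefficient on log_n would mean that larger studies report
# lower accuracy; a positive coefficient on year, that later studies do better.
model = smf.ols("accuracy ~ log_n + year", data=studies).fit()
print(model.summary())
```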
Current medical-imaging datasets are much smaller than those
that brought breakthroughs in computer vision. Although a one-
1 INRIA, Versailles, France. 2 McGill University, Montreal, Canada. 3 Mila, Montreal, Canada. 4 IT University of Copenhagen, Copenhagen, Denmark. ✉ email: gael.varoquaux@inria.fr; vech@itu.dk