Foundation models for generalist medical artificial intelligence

doi:10.1038/s41586-023-05881-4

Open AccessJournal Article10.1038/s41586-023-05881-4

Foundation models for generalist medical artificial intelligence

Michael Moor, +6 more

- 01 Apr 2023

- Visual education

- Vol. 616, Iss: 7956, pp 259-265

804

TL;DR: Generalist medical AI (GMAI) as mentioned in this paper is a new paradigm for medical AI, which is capable of carrying out a diverse set of tasks using very little or no task-specific labelled data.

Abstract: The exceptionally rapid development of highly flexible, reusable artificial intelligence (AI) models is likely to usher in newfound capabilities in medicine. We propose a new paradigm for medical AI, which we refer to as generalist medical AI (GMAI). GMAI models will be capable of carrying out a diverse set of tasks using very little or no task-specific labelled data. Built through self-supervision on large, diverse datasets, GMAI will flexibly interpret different combinations of medical modalities, including data from imaging, electronic health records, laboratory results, genomics, graphs or medical text. Models will in turn produce expressive outputs such as free-text explanations, spoken recommendations or image annotations that demonstrate advanced medical reasoning abilities. Here we identify a set of high-impact potential applications for GMAI and lay out specific technical capabilities and training datasets necessary to enable them. We expect that GMAI-enabled applications will challenge current strategies for regulating and validating AI devices for medicine and will shift practices associated with the collection of large medical datasets. This review discusses generalist medical artificial intelligence, identifying potential applications and setting out specific technical capabilities and training datasets necessary to enable them, as well as highlighting challenges to its implementation.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arxiv.2409.19540

LoRKD: Low-Rank Knowledge Decomposition for Medical Foundation Models

Haolin Li, +7 more

- 28 Sep 2024

TL;DR: This paper proposes LoRKD, a novel framework for decomposing large-scale medical foundation models into lightweight expert models, enhancing specialization and reducing resource consumption, achieving state-of-the-art performance and superior transferability on downstream tasks.

...read moreread less

Journal Article•10.1016/j.patcog.2024.110706

View-unaligned clustering with graph regularization

Xinyu Cao, +2 more

- 01 Jun 2024

- Pattern Recognition

Journal Article•10.48550/arxiv.2407.13813

A review of handcrafted and deep radiomics in neurological diseases: transitioning from oncology to clinical neuroimaging

Elizaveta Lavrova, +5 more

- 18 Jul 2024

- arXiv.org

TL;DR: This review explores the application of handcrafted and deep radiomics in neurological diseases, highlighting its potential for improved diagnostic precision and treatment quality, but also emphasizing the need for collaborative efforts to overcome implementation challenges.

...read moreread less

Other•10.1002/9781394240197.ch5

Diagnostics

Ronald M. Razmi

- 05 Jan 2024

TL;DR: Diagnostics involve radiology, pathology, dermatology, ophthalmology, and cardiology. Images and visual inspections are key to diagnosis in these specialties.

...read moreread less

Journal Article•10.48550/arxiv.2408.14595

Surprisingly Fragile: Assessing and Addressing Prompt Instability in Multimodal Foundation Models

I. M. Stewart, +4 more

- 26 Aug 2024

TL;DR: Multimodal foundation models exhibit prompt instability, leading to performance drops, but can be mitigated with additional training on augmented data, improving accuracy and stability across modalities and domains.

...read moreread less

...

Expand

References

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

•Journal Article•10.1038/S41586-021-03819-2

Highly accurate protein structure prediction with AlphaFold

John M. Jumper, +33 more

- 15 Jul 2021

- Nature

TL;DR: For example, AlphaFold as mentioned in this paper predicts protein structures with an accuracy competitive with experimental structures in the majority of cases using a novel deep learning architecture. But the accuracy is limited by the fact that no homologous structure is available.

...read moreread less

28.2K

•Proceedings Article

Language Models are Few-Shot Learners

Tom B. Brown, +30 more

- 28 May 2020

TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.

...read moreread less

25.2K

•Journal Article•10.1371/JOURNAL.PMED.1001779

UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age

Cathie Sudlow, +18 more

- 31 Mar 2015

- PLOS Medicine

TL;DR: The UK Biobank is described, a large population-based prospective study, established to allow investigation of the genetic and non-genetic determinants of the diseases of middle and old age.

...read moreread less

10.3K