Medical domain knowledge in domain-agnostic generative AI

The text-guided diffusion model GLIDE (Guided Language to Image Diffusion for Generation and Editing) is the state of the art in text-to-image generative artificial intelligence (AI). GLIDE has rich representations, but medical applications of this model have not been systematically explored. If GLI...

Full description

Saved in:
Bibliographic Details
Main Authors: Kather, Jakob Nikolas (Author) , Ghaffari Laleh, Narmin (Author) , Foersch, Sebastian (Author) , Truhn, Daniel (Author)
Format: Article (Journal)
Language:English
Published: 11 July 2022
In: npj digital medicine
Year: 2022, Volume: 5, Pages: 1-5
ISSN:2398-6352
DOI:10.1038/s41746-022-00634-5
Online Access:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1038/s41746-022-00634-5
Verlag, lizenzpflichtig, Volltext: https://www.nature.com/articles/s41746-022-00634-5
Get full text
Author Notes:Jakob Nikolas Kather, Narmin Ghaffari Laleh, Sebastian Foersch and Daniel Truhn
Description
Summary:The text-guided diffusion model GLIDE (Guided Language to Image Diffusion for Generation and Editing) is the state of the art in text-to-image generative artificial intelligence (AI). GLIDE has rich representations, but medical applications of this model have not been systematically explored. If GLIDE had useful medical knowledge, it could be used for medical image analysis tasks, a domain in which AI systems are still highly engineered towards a single use-case. Here we show that the publicly available GLIDE model has reasonably strong representations of key topics in cancer research and oncology, in particular the general style of histopathology images and multiple facets of diseases, pathological processes and laboratory assays. However, GLIDE seems to lack useful representations of the style and content of radiology data. Our findings demonstrate that domain-agnostic generative AI models can learn relevant medical concepts without explicit training. Thus, GLIDE and similar models might be useful for medical image processing tasks in the future - particularly with additional domain-specific fine-tuning.
Item Description:Gesehen am 28.02.2023
Physical Description:Online Resource
ISSN:2398-6352
DOI:10.1038/s41746-022-00634-5