Unconditional latent diffusion models memorize patient imaging data

Generative artificial intelligence models facilitate open-data sharing by proposing synthetic data as surrogates of real patient data. Despite the promise for healthcare, some of these models are susceptible to patient data memorization, where models generate patient data copies instead of novel syn...

Bibliographic Details
Main Authors: Dar, Salman Ul Hassan (Author), Seyfarth, Marvin (Author), Ayx, Isabelle (Author), Papavassiliu, Theano (Author), Schönberg, Stefan (Author), Siepmann, Robert Malte (Author), Laqua, Fabian Christopher (Author), Kahmann, Jannik (Author), Frey, Norbert (Author), Baeßler, Bettina (Author), Försch, Sebastian (Author), Truhn, Daniel (Author), Kather, Jakob Nikolas (Author), Engelhardt, Sandy (Author)
Format: Article (Journal)
Language: English
Published: 11 August 2025
In: Nature biomedical engineering
Year: 2025, Pages: 1-15
ISSN: 2157-846X
DOI: 10.1038/s41551-025-01468-8
Online Access: Publisher, license required, full text: https://doi.org/10.1038/s41551-025-01468-8
Publisher, license required, full text: http://www.nature.com/articles/s41551-025-01468-8
Author Notes: Salman Ul Hassan Dar, Marvin Seyfarth, Isabelle Ayx, Theano Papavassiliu, Stefan O. Schoenberg, Robert Malte Siepmann, Fabian Christopher Laqua, Jannik Kahmann, Norbert Frey, Bettina Baeßler, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather & Sandy Engelhardt

MARC

LEADER 00000caa a22000002c 4500
001 1938340957
003 DE-627
005 20251015132543.0
007 cr uuu---uuuuu
008 251014s2025 xx |||||o 00| ||eng c
024 7 |a 10.1038/s41551-025-01468-8  |2 doi 
035 |a (DE-627)1938340957 
035 |a (DE-599)KXP1938340957 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 33  |2 sdnb 
100 1 |a Dar, Salman Ul Hassan  |d 1990-  |e VerfasserIn  |0 (DE-588)1309366284  |0 (DE-627)187005489X  |4 aut 
245 1 0 |a Unconditional latent diffusion models memorize patient imaging data  |c Salman Ul Hassan Dar, Marvin Seyfarth, Isabelle Ayx, Theano Papavassiliu, Stefan O. Schoenberg, Robert Malte Siepmann, Fabian Christopher Laqua, Jannik Kahmann, Norbert Frey, Bettina Baeßler, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather & Sandy Engelhardt 
264 1 |c 11 August 2025 
300 |a 15 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Gesehen am 14.10.2025 
520 |a Generative artificial intelligence models facilitate open-data sharing by proposing synthetic data as surrogates of real patient data. Despite the promise for healthcare, some of these models are susceptible to patient data memorization, where models generate patient data copies instead of novel synthetic samples, resulting in patient re-identification. Here we assess memorization in unconditional latent diffusion models by training them on a variety of datasets for synthetic data generation and detecting memorization with a self-supervised copy detection approach. We show a high degree of patient data memorization across all datasets, with approximately 37.2% of patient data detected as memorized and 68.7% of synthetic samples identified as patient data copies. Latent diffusion models are more susceptible to memorization than autoencoders and generative adversarial networks, and they outperform non-diffusion models in synthesis quality. Augmentation strategies during training, small architecture size and increasing datasets can reduce memorization, while overtraining the models can enhance it. These results emphasize the importance of carefully training generative models on private medical imaging datasets and examining the synthetic data to ensure patient privacy. 
650 4 |a Computational science 
650 4 |a Medical imaging 
650 4 |a Medical research 
650 4 |a Scientific community 
700 1 |a Seyfarth, Marvin  |e VerfasserIn  |0 (DE-588)1379014085  |0 (DE-627)1938452534  |4 aut 
700 1 |a Ayx, Isabelle  |d 1988-  |e VerfasserIn  |0 (DE-588)1052154514  |0 (DE-627)787934178  |0 (DE-576)407912959  |4 aut 
700 1 |a Papavassiliu, Theano  |d 1970-  |e VerfasserIn  |0 (DE-588)1028853440  |0 (DE-627)731667417  |0 (DE-576)376292849  |4 aut 
700 1 |a Schönberg, Stefan  |d 1969-  |e VerfasserIn  |0 (DE-588)131557912  |0 (DE-627)510700624  |0 (DE-576)298584891  |4 aut 
700 1 |a Siepmann, Robert Malte  |e VerfasserIn  |4 aut 
700 1 |a Laqua, Fabian Christopher  |e VerfasserIn  |4 aut 
700 1 |a Kahmann, Jannik  |e VerfasserIn  |0 (DE-588)1317496469  |0 (DE-627)1879403870  |4 aut 
700 1 |a Frey, Norbert  |e VerfasserIn  |0 (DE-588)141244976  |0 (DE-627)625824075  |0 (DE-576)322969514  |4 aut 
700 1 |a Baeßler, Bettina  |d 1983-  |e VerfasserIn  |0 (DE-588)101507930X  |0 (DE-627)671485032  |0 (DE-576)352370599  |4 aut 
700 1 |a Försch, Sebastian  |d 1985-  |e VerfasserIn  |0 (DE-588)1018553894  |0 (DE-627)682860832  |0 (DE-576)356024814  |4 aut 
700 1 |a Truhn, Daniel  |e VerfasserIn  |0 (DE-588)1047348306  |0 (DE-627)778145913  |0 (DE-576)400927314  |4 aut 
700 1 |a Kather, Jakob Nikolas  |d 1989-  |e VerfasserIn  |0 (DE-588)1064064914  |0 (DE-627)812897587  |0 (DE-576)423589091  |4 aut 
700 1 |a Engelhardt, Sandy  |d 1987-  |e VerfasserIn  |0 (DE-588)1122674465  |0 (DE-627)876003080  |0 (DE-576)481436049  |4 aut 
773 0 8 |i Enthalten in  |t Nature biomedical engineering  |d London : Nature Research, 2016  |g (2025), Seite 1-15  |h Online-Ressource  |w (DE-627)875699383  |w (DE-600)2878897-7  |w (DE-576)481341943  |x 2157-846X  |7 nnas  |a Unconditional latent diffusion models memorize patient imaging data 
773 1 8 |g year:2025  |g pages:1-15  |g extent:15  |a Unconditional latent diffusion models memorize patient imaging data 
856 4 0 |u https://doi.org/10.1038/s41551-025-01468-8  |x Verlag  |x Resolving-System  |z lizenzpflichtig  |3 Volltext 
856 4 0 |u http://www.nature.com/articles/s41551-025-01468-8  |x Verlag  |z lizenzpflichtig  |3 Volltext 
951 |a AR 
992 |a 20251014 
993 |a Article 
994 |a 2025 
998 |g 1122674465  |a Engelhardt, Sandy  |m 1122674465:Engelhardt, Sandy  |d 910000  |d 910100  |e 910000PE1122674465  |e 910100PE1122674465  |k 0/910000/  |k 1/910000/910100/  |p 14  |y j 
998 |g 1064064914  |a Kather, Jakob Nikolas  |m 1064064914:Kather, Jakob Nikolas  |d 910000  |d 910100  |e 910000PK1064064914  |e 910100PK1064064914  |k 0/910000/  |k 1/910000/910100/  |p 13 
998 |g 141244976  |a Frey, Norbert  |m 141244976:Frey, Norbert  |d 910000  |d 910100  |e 910000PF141244976  |e 910100PF141244976  |k 0/910000/  |k 1/910000/910100/  |p 9 
998 |g 1317496469  |a Kahmann, Jannik  |m 1317496469:Kahmann, Jannik  |d 60000  |e 60000PK1317496469  |k 0/60000/  |p 8 
998 |g 131557912  |a Schönberg, Stefan  |m 131557912:Schönberg, Stefan  |d 60000  |d 62900  |e 60000PS131557912  |e 62900PS131557912  |k 0/60000/  |k 1/60000/62900/  |p 5 
998 |g 1028853440  |a Papavassiliu, Theano  |m 1028853440:Papavassiliu, Theano  |d 60000  |d 61000  |e 60000PP1028853440  |e 61000PP1028853440  |k 0/60000/  |k 1/60000/61000/  |p 4 
998 |g 1052154514  |a Ayx, Isabelle  |m 1052154514:Ayx, Isabelle  |d 60000  |d 62900  |d 60000  |e 60000PA1052154514  |e 62900PA1052154514  |e 60000PA1052154514  |k 0/60000/  |k 1/60000/62900/  |k 0/60000/  |p 3 
998 |g 1379014085  |a Seyfarth, Marvin  |m 1379014085:Seyfarth, Marvin  |d 910000  |d 910100  |e 910000PS1379014085  |e 910100PS1379014085  |k 0/910000/  |k 1/910000/910100/  |p 2 
998 |g 1309366284  |a Dar, Salman ul Hassan  |m 1309366284:Dar, Salman ul Hassan  |d 910000  |d 910100  |e 910000PD1309366284  |e 910100PD1309366284  |k 0/910000/  |k 1/910000/910100/  |p 1  |x j 
999 |a KXP-PPN1938340957  |e 4786853364 
BIB |a Y 
SER |a journal 
JSO |a {"title":[{"title_sort":"Unconditional latent diffusion models memorize patient imaging data","title":"Unconditional latent diffusion models memorize patient imaging data"}],"language":["eng"],"person":[{"role":"aut","given":"Salman Ul Hassan","display":"Dar, Salman Ul Hassan","family":"Dar"},{"display":"Seyfarth, Marvin","given":"Marvin","family":"Seyfarth","role":"aut"},{"role":"aut","family":"Ayx","given":"Isabelle","display":"Ayx, Isabelle"},{"family":"Papavassiliu","display":"Papavassiliu, Theano","given":"Theano","role":"aut"},{"family":"Schönberg","given":"Stefan","display":"Schönberg, Stefan","role":"aut"},{"given":"Robert Malte","display":"Siepmann, Robert Malte","family":"Siepmann","role":"aut"},{"family":"Laqua","given":"Fabian Christopher","display":"Laqua, Fabian Christopher","role":"aut"},{"role":"aut","given":"Jannik","display":"Kahmann, Jannik","family":"Kahmann"},{"role":"aut","family":"Frey","display":"Frey, Norbert","given":"Norbert"},{"family":"Baeßler","given":"Bettina","display":"Baeßler, Bettina","role":"aut"},{"role":"aut","given":"Sebastian","display":"Försch, Sebastian","family":"Försch"},{"given":"Daniel","display":"Truhn, Daniel","family":"Truhn","role":"aut"},{"role":"aut","given":"Jakob Nikolas","display":"Kather, Jakob Nikolas","family":"Kather"},{"role":"aut","family":"Engelhardt","given":"Sandy","display":"Engelhardt, Sandy"}],"name":{"displayForm":["Salman Ul Hassan Dar, Marvin Seyfarth, Isabelle Ayx, Theano Papavassiliu, Stefan O. 
Schoenberg, Robert Malte Siepmann, Fabian Christopher Laqua, Jannik Kahmann, Norbert Frey, Bettina Baeßler, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather & Sandy Engelhardt"]},"origin":[{"dateIssuedKey":"2025","dateIssuedDisp":"11 August 2025"}],"recId":"1938340957","note":["Gesehen am 14.10.2025"],"relHost":[{"pubHistory":["Volume: 1 (2016)-"],"type":{"bibl":"periodical","media":"Online-Ressource"},"language":["eng"],"disp":"Unconditional latent diffusion models memorize patient imaging dataNature biomedical engineering","recId":"875699383","note":["Gesehen am 29.12.16"],"id":{"eki":["875699383"],"zdb":["2878897-7"],"issn":["2157-846X"]},"part":{"pages":"1-15","year":"2025","extent":"15","text":"(2025), Seite 1-15"},"physDesc":[{"extent":"Online-Ressource"}],"title":[{"title":"Nature biomedical engineering","title_sort":"Nature biomedical engineering"}],"origin":[{"publisher":"Nature Research","dateIssuedDisp":"2016-","dateIssuedKey":"2016","publisherPlace":"London ; New York NY ; Tokyo"}]}],"physDesc":[{"extent":"15 S."}],"id":{"eki":["1938340957"],"doi":["10.1038/s41551-025-01468-8"]},"type":{"bibl":"article-journal","media":"Online-Ressource"}} 
SRT |a DARSALMANUUNCONDITIO1120