Unconditional latent diffusion models memorize patient imaging data
Generative artificial intelligence models facilitate open-data sharing by proposing synthetic data as surrogates of real patient data. Despite the promise for healthcare, some of these models are susceptible to patient data memorization, where models generate patient data copies instead of novel syn...
Saved in:
| Main Authors: | , , , , , , , , , , , , , |
|---|---|
| Format: | Article (Journal) |
| Language: | English |
| Published: |
11 August 2025
|
| In: |
Nature biomedical engineering
Year: 2025, Pages: 1-15 |
| ISSN: | 2157-846X |
| DOI: | 10.1038/s41551-025-01468-8 |
| Online Access: | Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1038/s41551-025-01468-8 Verlag, lizenzpflichtig, Volltext: http://www.nature.com/articles/s41551-025-01468-8 |
| Author Notes: | Salman Ul Hassan Dar, Marvin Seyfarth, Isabelle Ayx, Theano Papavassiliu, Stefan O. Schoenberg, Robert Malte Siepmann, Fabian Christopher Laqua, Jannik Kahmann, Norbert Frey, Bettina Baeßler, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather & Sandy Engelhardt |
MARC
| LEADER | 00000caa a22000002c 4500 | ||
|---|---|---|---|
| 001 | 1938340957 | ||
| 003 | DE-627 | ||
| 005 | 20251015132543.0 | ||
| 007 | cr uuu---uuuuu | ||
| 008 | 251014s2025 xx |||||o 00| ||eng c | ||
| 024 | 7 | |a 10.1038/s41551-025-01468-8 |2 doi | |
| 035 | |a (DE-627)1938340957 | ||
| 035 | |a (DE-599)KXP1938340957 | ||
| 040 | |a DE-627 |b ger |c DE-627 |e rda | ||
| 041 | |a eng | ||
| 084 | |a 33 |2 sdnb | ||
| 100 | 1 | |a Dar, Salman Ul Hassan |d 1990- |e VerfasserIn |0 (DE-588)1309366284 |0 (DE-627)187005489X |4 aut | |
| 245 | 1 | 0 | |a Unconditional latent diffusion models memorize patient imaging data |c Salman Ul Hassan Dar, Marvin Seyfarth, Isabelle Ayx, Theano Papavassiliu, Stefan O. Schoenberg, Robert Malte Siepmann, Fabian Christopher Laqua, Jannik Kahmann, Norbert Frey, Bettina Baeßler, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather & Sandy Engelhardt |
| 264 | 1 | |c 11 August 2025 | |
| 300 | |a 15 | ||
| 336 | |a Text |b txt |2 rdacontent | ||
| 337 | |a Computermedien |b c |2 rdamedia | ||
| 338 | |a Online-Ressource |b cr |2 rdacarrier | ||
| 500 | |a Gesehen am 14.10.2025 | ||
| 520 | |a Generative artificial intelligence models facilitate open-data sharing by proposing synthetic data as surrogates of real patient data. Despite the promise for healthcare, some of these models are susceptible to patient data memorization, where models generate patient data copies instead of novel synthetic samples, resulting in patient re-identification. Here we assess memorization in unconditional latent diffusion models by training them on a variety of datasets for synthetic data generation and detecting memorization with a self-supervised copy detection approach. We show a high degree of patient data memorization across all datasets, with approximately 37.2% of patient data detected as memorized and 68.7% of synthetic samples identified as patient data copies. Latent diffusion models are more susceptible to memorization than autoencoders and generative adversarial networks, and they outperform non-diffusion models in synthesis quality. Augmentation strategies during training, small architecture size and increasing datasets can reduce memorization, while overtraining the models can enhance it. These results emphasize the importance of carefully training generative models on private medical imaging datasets and examining the synthetic data to ensure patient privacy. | ||
| 650 | 4 | |a Computational science | |
| 650 | 4 | |a Medical imaging | |
| 650 | 4 | |a Medical research | |
| 650 | 4 | |a Scientific community | |
| 700 | 1 | |a Seyfarth, Marvin |e VerfasserIn |0 (DE-588)1379014085 |0 (DE-627)1938452534 |4 aut | |
| 700 | 1 | |a Ayx, Isabelle |d 1988- |e VerfasserIn |0 (DE-588)1052154514 |0 (DE-627)787934178 |0 (DE-576)407912959 |4 aut | |
| 700 | 1 | |a Papavassiliu, Theano |d 1970- |e VerfasserIn |0 (DE-588)1028853440 |0 (DE-627)731667417 |0 (DE-576)376292849 |4 aut | |
| 700 | 1 | |a Schönberg, Stefan |d 1969- |e VerfasserIn |0 (DE-588)131557912 |0 (DE-627)510700624 |0 (DE-576)298584891 |4 aut | |
| 700 | 1 | |a Siepmann, Robert Malte |e VerfasserIn |4 aut | |
| 700 | 1 | |a Laqua, Fabian Christopher |e VerfasserIn |4 aut | |
| 700 | 1 | |a Kahmann, Jannik |e VerfasserIn |0 (DE-588)1317496469 |0 (DE-627)1879403870 |4 aut | |
| 700 | 1 | |a Frey, Norbert |e VerfasserIn |0 (DE-588)141244976 |0 (DE-627)625824075 |0 (DE-576)322969514 |4 aut | |
| 700 | 1 | |a Baeßler, Bettina |d 1983- |e VerfasserIn |0 (DE-588)101507930X |0 (DE-627)671485032 |0 (DE-576)352370599 |4 aut | |
| 700 | 1 | |a Försch, Sebastian |d 1985- |e VerfasserIn |0 (DE-588)1018553894 |0 (DE-627)682860832 |0 (DE-576)356024814 |4 aut | |
| 700 | 1 | |a Truhn, Daniel |e VerfasserIn |0 (DE-588)1047348306 |0 (DE-627)778145913 |0 (DE-576)400927314 |4 aut | |
| 700 | 1 | |a Kather, Jakob Nikolas |d 1989- |e VerfasserIn |0 (DE-588)1064064914 |0 (DE-627)812897587 |0 (DE-576)423589091 |4 aut | |
| 700 | 1 | |a Engelhardt, Sandy |d 1987- |e VerfasserIn |0 (DE-588)1122674465 |0 (DE-627)876003080 |0 (DE-576)481436049 |4 aut | |
| 773 | 0 | 8 | |i Enthalten in |t Nature biomedical engineering |d London : Nature Research, 2016 |g (2025), Seite 1-15 |h Online-Ressource |w (DE-627)875699383 |w (DE-600)2878897-7 |w (DE-576)481341943 |x 2157-846X |7 nnas |a Unconditional latent diffusion models memorize patient imaging data |
| 773 | 1 | 8 | |g year:2025 |g pages:1-15 |g extent:15 |a Unconditional latent diffusion models memorize patient imaging data |
| 856 | 4 | 0 | |u https://doi.org/10.1038/s41551-025-01468-8 |x Verlag |x Resolving-System |z lizenzpflichtig |3 Volltext |
| 856 | 4 | 0 | |u http://www.nature.com/articles/s41551-025-01468-8 |x Verlag |z lizenzpflichtig |3 Volltext |
| 951 | |a AR | ||
| 992 | |a 20251014 | ||
| 993 | |a Article | ||
| 994 | |a 2025 | ||
| 998 | |g 1122674465 |a Engelhardt, Sandy |m 1122674465:Engelhardt, Sandy |d 910000 |d 910100 |e 910000PE1122674465 |e 910100PE1122674465 |k 0/910000/ |k 1/910000/910100/ |p 14 |y j | ||
| 998 | |g 1064064914 |a Kather, Jakob Nikolas |m 1064064914:Kather, Jakob Nikolas |d 910000 |d 910100 |e 910000PK1064064914 |e 910100PK1064064914 |k 0/910000/ |k 1/910000/910100/ |p 13 | ||
| 998 | |g 141244976 |a Frey, Norbert |m 141244976:Frey, Norbert |d 910000 |d 910100 |e 910000PF141244976 |e 910100PF141244976 |k 0/910000/ |k 1/910000/910100/ |p 9 | ||
| 998 | |g 1317496469 |a Kahmann, Jannik |m 1317496469:Kahmann, Jannik |d 60000 |e 60000PK1317496469 |k 0/60000/ |p 8 | ||
| 998 | |g 131557912 |a Schönberg, Stefan |m 131557912:Schönberg, Stefan |d 60000 |d 62900 |e 60000PS131557912 |e 62900PS131557912 |k 0/60000/ |k 1/60000/62900/ |p 5 | ||
| 998 | |g 1028853440 |a Papavassiliu, Theano |m 1028853440:Papavassiliu, Theano |d 60000 |d 61000 |e 60000PP1028853440 |e 61000PP1028853440 |k 0/60000/ |k 1/60000/61000/ |p 4 | ||
| 998 | |g 1052154514 |a Ayx, Isabelle |m 1052154514:Ayx, Isabelle |d 60000 |d 62900 |d 60000 |e 60000PA1052154514 |e 62900PA1052154514 |e 60000PA1052154514 |k 0/60000/ |k 1/60000/62900/ |k 0/60000/ |p 3 | ||
| 998 | |g 1379014085 |a Seyfarth, Marvin |m 1379014085:Seyfarth, Marvin |d 910000 |d 910100 |e 910000PS1379014085 |e 910100PS1379014085 |k 0/910000/ |k 1/910000/910100/ |p 2 | ||
| 998 | |g 1309366284 |a Dar, Salman ul Hassan |m 1309366284:Dar, Salman ul Hassan |d 910000 |d 910100 |e 910000PD1309366284 |e 910100PD1309366284 |k 0/910000/ |k 1/910000/910100/ |p 1 |x j | ||
| 999 | |a KXP-PPN1938340957 |e 4786853364 | ||
| BIB | |a Y | ||
| SER | |a journal | ||
| JSO | |a {"title":[{"title_sort":"Unconditional latent diffusion models memorize patient imaging data","title":"Unconditional latent diffusion models memorize patient imaging data"}],"language":["eng"],"person":[{"role":"aut","given":"Salman Ul Hassan","display":"Dar, Salman Ul Hassan","family":"Dar"},{"display":"Seyfarth, Marvin","given":"Marvin","family":"Seyfarth","role":"aut"},{"role":"aut","family":"Ayx","given":"Isabelle","display":"Ayx, Isabelle"},{"family":"Papavassiliu","display":"Papavassiliu, Theano","given":"Theano","role":"aut"},{"family":"Schönberg","given":"Stefan","display":"Schönberg, Stefan","role":"aut"},{"given":"Robert Malte","display":"Siepmann, Robert Malte","family":"Siepmann","role":"aut"},{"family":"Laqua","given":"Fabian Christopher","display":"Laqua, Fabian Christopher","role":"aut"},{"role":"aut","given":"Jannik","display":"Kahmann, Jannik","family":"Kahmann"},{"role":"aut","family":"Frey","display":"Frey, Norbert","given":"Norbert"},{"family":"Baeßler","given":"Bettina","display":"Baeßler, Bettina","role":"aut"},{"role":"aut","given":"Sebastian","display":"Försch, Sebastian","family":"Försch"},{"given":"Daniel","display":"Truhn, Daniel","family":"Truhn","role":"aut"},{"role":"aut","given":"Jakob Nikolas","display":"Kather, Jakob Nikolas","family":"Kather"},{"role":"aut","family":"Engelhardt","given":"Sandy","display":"Engelhardt, Sandy"}],"name":{"displayForm":["Salman Ul Hassan Dar, Marvin Seyfarth, Isabelle Ayx, Theano Papavassiliu, Stefan O. Schoenberg, Robert Malte Siepmann, Fabian Christopher Laqua, Jannik Kahmann, Norbert Frey, Bettina Baeßler, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather & Sandy Engelhardt"]},"origin":[{"dateIssuedKey":"2025","dateIssuedDisp":"11 August 2025"}],"recId":"1938340957","note":["Gesehen am 14.10.2025"],"relHost":[{"pubHistory":["Volume: 1 (2016)-"],"type":{"bibl":"periodical","media":"Online-Ressource"},"language":["eng"],"disp":"Unconditional latent diffusion models memorize patient imaging dataNature biomedical engineering","recId":"875699383","note":["Gesehen am 29.12.16"],"id":{"eki":["875699383"],"zdb":["2878897-7"],"issn":["2157-846X"]},"part":{"pages":"1-15","year":"2025","extent":"15","text":"(2025), Seite 1-15"},"physDesc":[{"extent":"Online-Ressource"}],"title":[{"title":"Nature biomedical engineering","title_sort":"Nature biomedical engineering"}],"origin":[{"publisher":"Nature Research","dateIssuedDisp":"2016-","dateIssuedKey":"2016","publisherPlace":"London ; New York NY ; Tokyo"}]}],"physDesc":[{"extent":"15 S."}],"id":{"eki":["1938340957"],"doi":["10.1038/s41551-025-01468-8"]},"type":{"bibl":"article-journal","media":"Online-Ressource"}} | ||
| SRT | |a DARSALMANUUNCONDITIO1120 | ||