Privacy-preserving data quality assessment for federated health data networks

Background: Assessing data quality in federated health data systems presents unique challenges, particularly when data custodians cannot expose raw data due to privacy regulations. Traditional quality assessment approaches often require centralised access, which conflicts with the principles of data...

Full description

Saved in:
Bibliographic Details
Main Authors: Tomášik, Radovan (Author) , Kussel, Tobias (Author) , Dudová, Zdenka (Author) , Kacová, Radoslava (Author) , Hrstka, Roman (Author) , Lablans, Martin (Author) , Holub, Petr (Author)
Format: Article (Journal)
Language:English
Published: 27 January 2026
In: BMC medical informatics and decision making
Year: 2026, Volume: 26, Pages: 1-11
ISSN:1472-6947
DOI:10.1186/s12911-025-03328-6
Online Access:Verlag, kostenfrei, Volltext: https://doi.org/10.1186/s12911-025-03328-6
Get full text
Author Notes:Radovan Tomášik, Tobias Kussel, Zdenka Dudová, Radoslava Kacová, Roman Hrstka, Martin Lablans and Petr Holub
Description
Summary:Background: Assessing data quality in federated health data systems presents unique challenges, particularly when data custodians cannot expose raw data due to privacy regulations. Traditional quality assessment approaches often require centralised access, which conflicts with the principles of data sovereignty and confidentiality. Methods: In this study, we evaluate the utility of federated data quality assessment with differential privacy techniques to safeguard sensitive health data. The aim is to develop tooling and demonstrate a proof-of-concept implementation over a synthetic dataset of observational medical data. Results: We present a privacy-preserving framework for evaluating data quality in federated environments using differential privacy. Our approach enables individual data providers to compute local quality metrics and share only aggregated, privacy-protected results. We implement a proof-of-concept that supports predefined quality checks across different data models and demonstrate how meaningful insights into data quality can be obtained without compromising sensitive information. Conclusion: This work demonstrates that differential privacy can be effectively applied to enable federated quality assessment in health data networks without compromising individual privacy. By implementing a proof-of-concept system over synthetic health data, we show that it is possible to obtain meaningful quality metrics in a decentralised setting.
Item Description:Veröffentlicht: 27. Januar 2026
Gesehen am 26.03.2026
Physical Description:Online Resource
ISSN:1472-6947
DOI:10.1186/s12911-025-03328-6