TY  - JOUR
AU  - Avraam, Demetris
AU  - Wilson, Rebecca C
AU  - Aguirre Chan, Noemi
AU  - Banerjee, Soumya
AU  - Bishop, Tom R P
AU  - Butters, Olly
AU  - Cadman, Tim
AU  - Cederkvist, Luise
AU  - Duijts, Liesbeth
AU  - Escribà Montagut, Xavier
AU  - Garner, Hugh
AU  - Gonçalves, Gonçalo
AU  - González, Juan R
AU  - Haakma, Sido
AU  - Hartlev, Mette
AU  - Hasenauer, Jan
AU  - Huth, Manuel
AU  - Hyde, Eleanor
AU  - Jaddoe, Vincent W V
AU  - Marcon, Yannick
AU  - Mayrhofer, Michaela Th
AU  - Molnar-Gabor, Fruzsina
AU  - Morgan, Andrei Scott
AU  - Murtagh, Madeleine
AU  - Nestor, Marc
AU  - Nybo Andersen, Anne-Marie
AU  - Parker, Simon
AU  - Pinot de Moira, Angela
AU  - Schwarz, Florian
AU  - Strandberg-Larsen, Katrine
AU  - Swertz, Morris A
AU  - Welten, Marieke
AU  - Wheater, Stuart
AU  - Burton, Paul
TI  - DataSHIELD: mitigating disclosure risk in a multi-site federated analysis platform.
JO  - Bioinformatics Advances
VL  - 5
IS  - 1
SN  - 2635-0041
CY  - Oxford
PB  - Oxford University Press
M1  - DKFZ-2025-00734
SP  - vbaf046
PY  - 2025
AB  - The validity of epidemiologic findings can be increased using triangulation, i.e. comparison of findings across contexts, and by having sufficiently large amounts of relevant data to analyse. However, access to data is often constrained by practical considerations and by ethico-legal and data governance restrictions. Gaining access to such data can be time-consuming due to the governance requirements associated with data access requests to institutions in different jurisdictions. DataSHIELD is a software solution that enables remote analysis without the need for data transfer (federated analysis). DataSHIELD is a scientifically mature, open-source data access and analysis platform aligned with the 'Five Safes' framework, the international framework governing safe research access to data. It allows real-time analysis while mitigating disclosure risk through an active multi-layer system of disclosure-preventing mechanisms. This combination of real-time remote statistical analysis, disclosure prevention mechanisms, and federation capabilities makes DataSHIELD a solution for addressing many of the technical and regulatory challenges in performing the large-scale statistical analysis of health and biomedical data. This paper describes the key components that comprise the disclosure protection system of DataSHIELD. These broadly fall into three classes: (i) system protection elements, (ii) analysis protection elements, and (iii) governance protection elements. Information about the DataSHIELD software is available at https://datashield.org/ and https://github.com/datashield.
LB  - PUB:(DE-HGF)16
C6  - pmid:40191546
C2  - pmc:PMC11968321
DO  - 10.1093/bioadv/vbaf046
UR  - https://inrepo02.dkfz.de/record/300281
ER  -