Journal Article DKFZ-2019-02245

A Generic Method and Implementation to Evaluate and Improve Data Quality in Distributed Research Networks.

2019
Thieme52258 Stuttgart

Methods of information in medicine 58(2-03), 086 - 093 [10.1055/s-0039-1693685]

Abstract: With the increasing personalization of clinical therapies, translational research is ever more dependent on multisite research collaborations to obtain sufficient data and biomaterial. Distributed research networks rely on the availability of high-quality data stored in local databases operated by their member institutions. However, reusing data documented by independent health providers for the purpose of care, rather than research ('secondary use'), reveals high variability in data formats, as well as poor data quality, across network sites. The aim of this work is to provide a process for assessing data quality with regard to completeness and syntactic accuracy across independently operated data warehouses, using common definitions stored in a central (network-wide) metadata repository (MDR). To assess data quality across multiple sites, we employ a framework of so-called bridgeheads. These are federated data warehouses that allow the sites to participate in a research network. A central MDR stores the definitions of the commonly agreed data elements and their permissible values. We present the design for a generator of quality reports within a bridgehead, which allows the data in the local data warehouse to be validated against a research network's central MDR. A standardized quality report can be produced at each network site, providing a means to compare data quality across sites and to channel feedback to the local data source systems and local documentation personnel. A reference implementation of this concept has been successfully used at 10 sites across the German Cancer Consortium. We have shown that comparable data quality assessment across different partners of a distributed research network is feasible when a central metadata repository is combined with locally installed assessment processes. To achieve this, we designed a quality report and the process for generating such a report. The final step was the implementation in a German research network.
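
Illustrative sketch (not the authors' implementation): the abstract describes validating local data against data element definitions and permissible values held in a central MDR, and reporting completeness and syntactic accuracy. The Python below shows one way such a check could look. The `DataElement` class, the `quality_report` function, the example element names, and the two metric definitions (completeness as the share of non-empty values, syntactic accuracy as the share of non-empty values that conform to the definition) are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DataElement:
    """Hypothetical stand-in for one data element definition from an MDR."""
    name: str
    permitted_values: Optional[frozenset] = None  # enumerated value domain
    pattern: Optional[str] = None                 # regex for described value domains

    def is_valid(self, value: str) -> bool:
        if self.permitted_values is not None:
            return value in self.permitted_values
        if self.pattern is not None:
            return re.fullmatch(self.pattern, value) is not None
        return True  # no constraint defined for this element


def quality_report(records, elements):
    """Return completeness and syntactic accuracy per data element."""
    report = {}
    for el in elements:
        values = [r.get(el.name) for r in records]
        # A value counts as present if it is neither missing nor empty.
        present = [v for v in values if v not in (None, "")]
        valid = [v for v in present if el.is_valid(v)]
        report[el.name] = {
            "completeness": len(present) / len(values) if values else None,
            "syntactic_accuracy": len(valid) / len(present) if present else None,
        }
    return report


# Assumed example definitions and records; not taken from the article.
elements = [
    DataElement("sex", permitted_values=frozenset({"male", "female", "other", "unknown"})),
    DataElement("diagnosis_date", pattern=r"\d{4}-\d{2}-\d{2}"),
]
records = [
    {"sex": "male", "diagnosis_date": "2018-05-01"},
    {"sex": "m", "diagnosis_date": ""},
    {"sex": "female", "diagnosis_date": "01.05.2018"},
    {"sex": None, "diagnosis_date": "2017-11-30"},
]
report = quality_report(records, elements)
```

Because every site would evaluate its local records against the same shared definitions, the resulting per-element figures are comparable across sites, which is the property the abstract emphasizes.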


Contributing Institute(s):
  1. Verbundinformationssysteme (E260)
  2. DKTK Heidelberg (L101)
  3. Medizinische Informatik in der Translationalen Onkologie (E240)
Research Program(s):
  1. 315 - Imaging and radiooncology (POF3-315)

Appears in the scientific report 2019
Database coverage:
Medline ; Clarivate Analytics Master Journal List ; Current Contents - Clinical Medicine ; IF < 5 ; JCR ; SCOPUS ; Science Citation Index ; Science Citation Index Expanded ; Web of Science Core Collection

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publications database

 Record created 2019-09-20, last modified 2024-02-29


