[Data-driven intensive care: a lack of comprehensive datasets].

Med Klin Intensivmed Notfmed

Medizinische Klinik mit Schwerpunkt Nephrologie und internistische Intensivmedizin, Charité - Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Deutschland.

Published: June 2024

Intensive care units are a data-rich environment with the potential to generate datasets on the scale of big data, which could be used to train powerful machine learning (ML) models. However, the currently available datasets are too small and too homogeneous because each is limited to an individual hospital. This lack of extensive and varied datasets is a primary reason for the limited generalizability, and the resulting low clinical utility, of current ML models, which are often based on single-center data and suffer from poor external validity. There is an urgent need to develop large-scale, multicenter, multinational datasets. Ensuring data protection and minimizing re-identification risks are central challenges in this process. The "Amsterdam University Medical Center database (AmsterdamUMCdb)" and the "Salzburg Intensive Care database (SICdb)" demonstrate that open-access datasets are possible in Europe while complying with the General Data Protection Regulation (GDPR). Another challenge in building intensive care datasets is the absence of semantic definitions in the source data and the heterogeneity of data formats. Establishing binding industry standards for semantic definitions is crucial to ensure seamless semantic interoperability between datasets.


Source
http://dx.doi.org/10.1007/s00063-024-01141-z
