Publications by authors named "F Prasser"

Aims: Data availability remains a critical challenge in modern, data-driven medical research. Due to the sensitive nature of patient health records, they are rightfully subject to stringent privacy protection measures. One way to overcome these restrictions is to preserve patient privacy by using anonymization and synthetization strategies.

View Article and Find Full Text PDF

Background: Clinical data warehouses provide harmonized access to healthcare data for medical researchers. Informatics for Integrating Biology and the Bedside (i2b2) is a well-established open-source solution with the major benefit that data representations can be tailored to support specific use cases. These data representations can be defined and improved via an iterative approach together with domain experts and the medical researchers using the platform.

View Article and Find Full Text PDF

Introduction: Data provenance, which documents the origin, history, and transformations of data, can enhance the reproducibility of processing workflows and help to address errors and quality issues. In this work, we focus on tracking and utilizing provenance information as part of quality management in Extract-Transform-Load (ETL) processes used to build clinical data warehouses.

Methods: We designed and implemented a framework that automatically tracks how data flows through an ETL process and detects errors and quality problems during processing.

View Article and Find Full Text PDF
Article Synopsis
  • Large-scale health data is complex and offers research opportunities, but existing analysis tools struggle with user-friendliness and comprehensiveness due to the nature of longitudinal data.
  • The paper introduces HERALD, a user-friendly query language designed to transform longitudinal health data into simpler cross-sectional tables, featuring a natural language syntax and integration with i2b2.
  • HERALD allows for versatile query types, processes patient-specific data efficiently, and includes an open-source implementation with a web interface for easy statistical analysis, making it a valuable resource for data scientists and researchers.
View Article and Find Full Text PDF