Merging Genomics Public Datasets with Clinical Cancer Registry Data - Lessons Learned.

Stud Health Technol Inform

GECKO Institute for Medicine, Informatics and Economics, Heilbronn University.

Published: May 2021

Publicly available datasets - for example via cBioPortal for Cancer Genomics - could be a valuable source for benchmarks and comparisons with local patient records. However, such an approach is only valid if patient cohorts are comparable to each other and if the documentation is complete and sufficient. In this paper, records from exocrine pancreatic cancer patients documented in a local cancer registry are compared with two public datasets to calculate overall survival. Several data preprocessing steps were necessary to ensure comparability of the different datasets and a common database schema was created. Our assumption that the public datasets could be used to augment the data of the local cancer registry could not be validated, since the analysis on overall survival showed a significant difference. We discuss several reasons and explanations for this finding. So far, comparing different datasets with each other and drawing medical conclusions on such comparisons should be conducted with great caution.

Download full-text PDF

Source
http://dx.doi.org/10.3233/SHTI210063DOI Listing

Publication Analysis

Top Keywords

public datasets
12
cancer registry
12
local cancer
8
datasets
6
cancer
5
merging genomics
4
genomics public
4
datasets clinical
4
clinical cancer
4
registry data
4

Similar Publications

Background: Proprotein convertase subtilisin/kexin type 9 (PCSK9) monoclonal antibodies (mAbs) have demonstrated promising effects in lowering cardiovascular incidents among patients with acute coronary syndrome. However, their influence on early platelet reactivity after primary percutaneous coronary intervention (PPCI) remains unclear.

Objectives: This research sought to investigate the effects of entirely human anti-PCSK9 antibodies on platelet function as measured by thrombelastography and 12-month postoperative results in patients receiving PPCI and treated with ticagrelor-based dual antiplatelet therapy.

View Article and Find Full Text PDF

Good practices in artificial intelligence (AI) model validation are key for achieving trustworthy AI. Within the cancer imaging domain, attracting the attention of clinical and technical AI enthusiasts, this work discusses current gaps in AI validation strategies, examining existing practices that are common or variable across technical groups (TGs) and clinical groups (CGs). The work is based on a set of structured questions encompassing several AI validation topics, addressed to professionals working in AI for medical imaging.

View Article and Find Full Text PDF

Objective: To evaluate the feasibility of utilizing artificial intelligence (AI)-predicted biparametric MRI (bpMRI) image features for predicting the aggressiveness of prostate cancer (PCa).

Materials And Methods: A total of 878 PCa patients from 4 hospitals were retrospectively collected, all of whom had pathological results after radical prostatectomy (RP). A pre-trained AI algorithm was used to select suspected PCa lesions and extract lesion features for model development.

View Article and Find Full Text PDF

Background And Objective: Psilocybin is currently being extensively studied as a potential therapeutic agent for multiple psychiatric disorders. Here, a systematic literature review of all published pharmacokinetic data on the pharmacologically active metabolite of psilocybin, psilocin, is presented.

Methods: The review includes clinical studies that reported pharmacokinetic data and/or parameters after psilocybin administration in humans.

View Article and Find Full Text PDF

CXR-LLaVA: a multimodal large language model for interpreting chest X-ray images.

Eur Radiol

January 2025

Department of Radiology, Seoul National University College of Medicine, Seoul National University Hospital, Seoul, Republic of Korea.

Objective: This study aimed to develop an open-source multimodal large language model (CXR-LLaVA) for interpreting chest X-ray images (CXRs), leveraging recent advances in large language models (LLMs) to potentially replicate the image interpretation skills of human radiologists.

Materials And Methods: For training, we collected 592,580 publicly available CXRs, of which 374,881 had labels for certain radiographic abnormalities (Dataset 1) and 217,699 provided free-text radiology reports (Dataset 2). After pre-training a vision transformer with Dataset 1, we integrated it with an LLM influenced by the LLaVA network.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!