Unlabelled: Cancer registries are collections of curated data about malignant tumor diseases. The amount of data processed by cancer registries increases every year, making manual registration more and more tedious.
Objective: We sought to develop an automatic analysis pipeline that would be able to identify and preprocess registry input for incident prostate adenocarcinomas in a French regional cancer registry.
Methods: Notifications from different sources submitted to the Bas-Rhin cancer registry were used here: pathology data and, ICD 10 diagnosis codes from hospital discharge data and healthcare insurance data. We trained a Support Vector Machine model (machine learning) to predict whether patient's data must be considered or not as a prostate adenocarcinoma incident case that should therefore be registered. The final registration of all identified cases was manually confirmed by a specialized technician. Text mining tools (regular expressions) were used to extract clinical and biological data from non-structured pathology reports.
Results: We performed two successive analyses. First, we used 982 cases manually labeled by registrars from the 2014 dataset to predict the registration of 785 cases submitted in 2015. Then, we repeated the procedure using the 2089 cases labeled by registrars from the 2014 and 2015 datasets to predict the registration of 926 cases submitted in the 2016 data. The algorithm identified 663 cases of prostate adenocarcinoma in 2015, and 610 in 2016. From these findings, 663 and 531 cases were respectively added to the registry; and 641 and 512 cases were confirmed by the specialized technician. This registration process has achieved a precision level above 96 %. The algorithm obtained an overall precision of 99 % (99.5 % in 2015 and 98.5 % in 2016) and a recall of 97 % (97.8 % in 2015 and 96.9 % in 2016). When the information was found in pathology report, text mining was more than 90 % accuracy for major indicators: PSA test, Gleason score, and incidence date). For both PSA and tumor side, information was not detected in the majority of cases."
Conclusion: Machine learning was able to identify new cases of prostate cancer, and text mining was able to prefill the data about incident cases. Machine-learning-based automation of the registration process could reduce delays in data production and allow investigators to devote more time to complex tasks and analysis.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.ijmedinf.2020.104139 | DOI Listing |
Alzheimers Dement
December 2024
The University of Texas Health Science Center at Houston, Houston, TX, USA.
Background: Developing drugs for treating Alzheimer's disease (AD) has been extremely challenging and costly due to limited knowledge on underlying biological mechanisms and therapeutic targets. Repurposing drugs or their combination has shown potential in accelerating drug development due to the reduced drug toxicity while targeting multiple pathologies.
Method: To address the challenge in AD drug development, we developed a multi-task machine learning pipeline to integrate a comprehensive knowledge graph on biological/pharmacological interactions and multi-level evidence on drug efficacy, to identify repurposable drugs and their combination candidates RESULT: Using the drug embedding from the heterogeneous graph representation model, we ranked drug candidates based on evidence from post-treatment transcriptomic patterns, mechanistic efficacy in preclinical models, population-based treatment effect, and Phase 2/3 clinical trials.
Background: In Alzheimer's Disease (AD) trials, clinical scales are used to assess treatment effect in patients. Minimizing statistical uncertainty of trial outcomes is an important consideration to increase statistical power. Machine learning models can leverage baseline data to create AI-generated digital twins - individualized predictions (or prognostic scores) of how each patient's clinical outcomes may change during a trial assuming they received placebo.
View Article and Find Full Text PDFAlzheimers Dement
December 2024
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA.
Background: The prohibitive costs of drug development for Alzheimer's Disease (AD) emphasize the need for alternative in silico drug repositioning strategies. Graph learning algorithms, capable of learning intrinsic features from complex network structures, can leverage existing databases of biological interactions to improve predictions in drug efficacy. We developed a novel machine learning framework, the PreSiBOGNN, that integrates muti-modal information to predict cognitive improvement at the subject level for precision medicine in AD.
View Article and Find Full Text PDFAlzheimers Dement
December 2024
Imperial College London, London, United Kingdom; UK Dementia Research Institute, Care Research and Technology Centre, London, United Kingdom.
Background: Close to 23% of unplanned hospital admissions for people living with dementia (PLWD) are due to potentially preventable causes such as severe urinary tract infections (UTIs), falls, and respiratory problems. These affect the well-being of PLWD, cause stress to carers and increase pressure on healthcare services.
Method: We use routinely collected in-home sensory data to monitor nocturnal activity and sleep data.
Alzheimers Dement
December 2024
Department of Psychology & Language Sciences, University College London, London, United Kingdom.
Background: Dysphagia is an important feature of neurodegenerative diseases and potentially life-threatening in primary progressive aphasia (PPA), but remains poorly characterised in these syndromes. We hypothesised that dysphagia would be more prevalent in nonfluent/agrammatic variant (nfv)PPA than other PPA syndromes, predicted by accompanying motor features and associated with atrophy affecting regions implicated in swallowing control.
Methods: In a retrospective case-control study at our tertiary referral centre, we recruited 56 patients with PPA (21 nfvPPA, 22 semantic variant (sv)PPA, 13 logopenic variant (lv)PPA).
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!