A machine learning tool for identifying non-metastatic colorectal cancer in primary care.

Eur J Cancer

Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Solna, Sweden; Regional Cancer Centre Stockholm-Gotland, Region Stockholm, Stockholm, Sweden; Department of Medical Sciences, Division of Clinical Diabetology and Metabolism, Uppsala University, Uppsala, Sweden.

Published: March 2023

Background: Primary health care (PHC) is often the first point of contact when diagnosing colorectal cancer (CRC). Human limitations in processing large amounts of information warrant the use of machine learning as a diagnostic prediction tool for CRC.

Aim: To develop a predictive model for identifying non-metastatic CRC (NMCRC) among PHC patients using diagnostic data analysed with machine learning.

Design And Setting: A case-control study containing data on PHC visits for 542 patients >18 years old diagnosed with NMCRC in the Västra Götaland Region, Sweden, during 2011, and 2,139 matched controls.

Method: Stochastic gradient boosting (SGB) was used to construct a model for predicting the presence of NMCRC based on diagnostic codes from PHC consultations during the year before the date of cancer diagnosis and the total number of consultations. Variables with a normalised relative influence (NRI) >1% were considered having an important contribution to the model. Risks of having NMCRC were calculated using odds ratios of marginal effects.

Results: Of the 361 variables used as predictors in the stochastic gradient boosting model, 184 had non-zero influence, with 16 variables having NRI >1% and a combined NRI of 63.3%. Variables representing anaemia and bleeding had a combined NRI of 27.6%. The model had a sensitivity of 73.3% and a specificity of 83.5%. Change in bowel habit had the highest odds ratios of marginal effects at 28.8.

Conclusion: Machine learning is useful for identifying variables of importance for predicting NMCRC in PHC. Malignant diagnoses may be hidden behind benign symptoms such as haemorrhoids.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ejca.2023.01.011DOI Listing

Publication Analysis

Top Keywords

machine learning
12
identifying non-metastatic
8
colorectal cancer
8
nmcrc phc
8
stochastic gradient
8
gradient boosting
8
odds ratios
8
ratios marginal
8
combined nri
8
phc
5

Similar Publications

T-helper 17 (Th17) cells significantly influence the onset and advancement of malignancies. This study endeavor focused on delineating molecular classifications and developing a prognostic signature grounded in Th17 cell differentiation-related genes (TCDRGs) using machine learning algorithms in head and neck squamous cell carcinoma (HNSCC). A consensus clustering approach was applied to The Cancer Genome Atlas-HNSCC cohort based on TCDRGs, followed by an examination of differential gene expression using the limma package.

View Article and Find Full Text PDF

Ultrasensitive Detection of Circulating Plasma Cells Using Surface-Enhanced Raman Spectroscopy and Machine Learning for Multiple Myeloma Monitoring.

Anal Chem

January 2025

Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education, Fujian Provincial Key Laboratory of Photonics Technology, Fujian Normal University, Fuzhou, Fujian 350117, China.

Multiple myeloma is a hematologic malignancy characterized by the proliferation of abnormal plasma cells in the bone marrow. Despite therapeutic advancements, there remains a critical need for reliable, noninvasive methods to monitor multiple myeloma. Circulating plasma cells (CPCs) in peripheral blood are robust and independent prognostic markers, but their detection is challenging due to their low abundance.

View Article and Find Full Text PDF

Background: Sepsis, a critical global health challenge, accounted for approximately 20% of worldwide deaths in 2017. Although the Sequential Organ Failure Assessment (SOFA) score standardizes the diagnosis of organ dysfunction, early sepsis detection remains challenging due to its insidious symptoms. Current diagnostic methods, including clinical assessments and laboratory tests, frequently lack the speed and specificity needed for timely intervention, particularly in vulnerable populations such as older adults, intensive care unit (ICU) patients, and those with compromised immune systems.

View Article and Find Full Text PDF

Purpose: Adaptive radiotherapy accounts for interfractional anatomic changes. We hypothesize that changes in the gross tumor volumes identified during daily scans could be analyzed using delta-radiomics to predict disease progression events. We evaluated whether an auxiliary data set could improve prediction performance.

View Article and Find Full Text PDF

Purpose: Establishing an accurate prognosis remains challenging in older patients with cancer because of the population's heterogeneity and the current predictive models' reduced ability to capture the complex interactions between oncologic and geriatric predictors. We aim to develop and externally validate a new predictive score (the Geriatric Cancer Scoring System [GCSS]) to refine individualized prognosis for older patients with cancer during the first year after a geriatric assessment (GA).

Materials And Methods: Data were collected from two French prospective multicenter cohorts of patients with cancer 70 years and older, referred for GA: ELCAPA (training set January 2007-March 2016) and ONCODAGE (validation set August 2008-March 2010).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!