Background & Aims: Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territory-wide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL).

Methods: This was a territory-wide, retrospective, observational, cohort study of patients with CVH in Hong Kong in 2000-2018 identified from HADCL based on viral markers, diagnosis codes, and antiviral treatment for chronic hepatitis B and/or C. The cohort was randomly split into training and validation cohorts in a 7:3 ratio. Five popular machine learning methods, namely, logistic regression, ridge regression, AdaBoost, decision tree, and random forest, were performed and compared to find the best prediction model.

Results: A total of 124,006 patients with CVH with complete data were included to build the models. In the training cohort (n = 86,804; 6,821 HCC), ridge regression (area under the receiver operating characteristic curve [AUROC] 0.842), decision tree (0.952), and random forest (0.992) performed the best. In the validation cohort (n = 37,202; 2,875 HCC), ridge regression (AUROC 0.844) and random forest (0.837) maintained their accuracy, which was significantly higher than those of HCC risk scores: CU-HCC (0.672), GAG-HCC (0.745), REACH-B (0.671), PAGE-B (0.748), and REAL-B (0.712) scores. The low cut-off (0.07) of HCC ridge score (HCC-RS) achieved 90.0% sensitivity and 98.6% negative predictive value (NPV) in the validation cohort. The high cut-off (0.15) of HCC-RS achieved high specificity (90.0%) and NPV (95.6%); 31.1% of patients remained indeterminate.

Conclusions: HCC-RS from the ridge regression machine learning model accurately predicted HCC in patients with CVH. These machine learning models may be developed as built-in functional keys or calculators in electronic health systems to reduce cancer mortality.

Lay Summary: Novel machine learning models generated accurate risk scores for hepatocellular carcinoma (HCC) in patients with chronic viral hepatitis. HCC ridge score was consistently more accurate than existing HCC risk scores. These models may be incorporated into electronic medical health systems to develop appropriate cancer surveillance strategies and reduce cancer death.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8844233PMC
http://dx.doi.org/10.1016/j.jhepr.2022.100441DOI Listing

Publication Analysis

Top Keywords

machine learning
24
learning models
16
risk scores
16
ridge regression
16
hcc ridge
16
novel machine
12
hepatocellular carcinoma
12
patients chronic
12
chronic viral
12
viral hepatitis
12

Similar Publications

Triple-negative breast cancer (TNBC) remains a significant global health challenge, emphasizing the need for precise identification of patients with specific therapeutic targets and those at high risk of metastasis. This study aimed to identify novel therapeutic targets for personalized treatment of TNBC patients by elucidating their roles in cell cycle regulation. Using weighted gene co-expression network analysis (WGCNA), we identified 83 hub genes by integrating gene expression profiles with clinical pathological grades.

View Article and Find Full Text PDF

The rising incidence of pancreatic diseases, including acute and chronic pancreatitis and various pancreatic neoplasms, poses a significant global health challenge. Pancreatic ductal adenocarcinoma (PDAC) for example, has a high mortality rate due to late-stage diagnosis and its inaccessible location. Advances in imaging technologies, though improving diagnostic capabilities, still necessitate biopsy confirmation.

View Article and Find Full Text PDF

Proteins' flexibility is a feature in communicating changes in cell signaling instigated by binding with secondary messengers, such as calcium ions, associated with the coordination of muscle contraction, neurotransmitter release, and gene expression. When binding with the disordered parts of a protein, calcium ions must balance their charge states with the shape of calcium-binding proteins and their versatile pool of partners depending on the circumstances they transmit. Accurately determining the ionic charges of those ions is essential for understanding their role in such processes.

View Article and Find Full Text PDF

Background: Alzheimer's disease (AD) is a progressive neurodegenerative disorder affecting millions worldwide, leading to cognitive and functional decline. Early detection and intervention are crucial for enhancing the quality of life of patients and their families. Remote Monitoring Technologies (RMTs) offer a promising solution for early detection by tracking changes in behavioral and cognitive functions, such as memory, language, and problem-solving skills.

View Article and Find Full Text PDF

Purpose: To develop and validate a prostate-specific membrane antigen (PSMA) PET/CT based multimodal deep learning model for predicting pathological lymph node invasion (LNI) in prostate cancer (PCa) patients identified as candidates for extended pelvic lymph node dissection (ePLND) by preoperative nomograms.

Methods: [Ga]Ga-PSMA-617 PET/CT scan of 116 eligible PCa patients (82 in the training cohort and 34 in the test cohort) who underwent radical prostatectomy with ePLND were analyzed in our study. The Med3D deep learning network was utilized to extract discriminative features from the entire prostate volume of interest on the PET/CT images.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!