AI Article Synopsis

  • - This study aimed to improve early detection of prevalent and deadly cancers like primary liver cancer, colorectal adenocarcinoma, and lung adenocarcinoma by developing a machine learning model using cell-free DNA fragmentomics.
  • - A total of 1,214 participants were included, with most being in early cancer stages, and the model achieved a high Area Under the Curve (AUC) of 0.983, showcasing strong sensitivity and specificity for differentiating cancer patients from healthy individuals.
  • - The model's accuracy for predicting cancer origin was robust at 93.1%, maintaining high sensitivity even with reduced sequencing depth, indicating a promising approach for multi-cancer early detection.

Article Abstract

Early detection can benefit cancer patients with more effective treatments and better prognosis, but existing early screening tests are limited, especially for multi-cancer detection. This study investigated the most prevalent and lethal cancer types, including primary liver cancer (PLC), colorectal adenocarcinoma (CRC), and lung adenocarcinoma (LUAD). Leveraging the emerging cell-free DNA (cfDNA) fragmentomics, we developed a robust machine learning model for multi-cancer early detection. 1,214 participants, including 381 PLC, 298 CRC, 292 LUAD patients, and 243 healthy volunteers, were enrolled. The majority of patients (N = 971) were at early stages (stage 0, N = 34; stage I, N = 799). The participants were randomly divided into a training cohort and a test cohort in a 1:1 ratio while maintaining the ratio for the major histology subtypes. An ensemble stacked machine learning approach was developed using multiple plasma cfDNA fragmentomic features. The model was trained solely in the training cohort and then evaluated in the test cohort. Our model showed an Area Under the Curve (AUC) of 0.983 for differentiating cancer patients from healthy individuals. At 95.0% specificity, the sensitivity of detecting all cancer reached 95.5%, while 100%, 94.6%, and 90.4% for PLC, CRC, and LUAD, individually. The cancer origin model demonstrated an overall 93.1% accuracy for predicting cancer origin in the test cohort (97.4%, 94.3%, and 85.6% for PLC, CRC, and LUAD, respectively). Our model sensitivity is consistently high for early-stage and small-size tumors. Furthermore, its detection and origin classification power remained superior when reducing sequencing depth to 1× (cancer detection: ≥ 91.5% sensitivity at 95.0% specificity; cancer origin: ≥ 91.6% accuracy). In conclusion, we have incorporated plasma cfDNA fragmentomics into the ensemble stacked model and established an ultrasensitive assay for multi-cancer early detection, shedding light on developing cancer early screening in clinical practice.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9188251PMC
http://dx.doi.org/10.1186/s12943-022-01594-wDOI Listing

Publication Analysis

Top Keywords

early detection
16
multi-cancer early
12
test cohort
12
cancer origin
12
cancer
10
cell-free dna
8
cancer patients
8
early screening
8
cfdna fragmentomics
8
machine learning
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!