AI Article Synopsis

  • The study addresses the challenge of differentiating between pathogenic cancer mutations and mutations from clonal hematopoiesis in cell-free DNA (cfDNA), which is vital for accurate diagnoses and treatments.
  • Researchers developed a specialized machine learning model using a reference catalog of 25,000 single nucleotide variants linked to tumor or clonal hematopoiesis origins.
  • Their model achieved a 95% accuracy in classifying these variants, showcasing the effectiveness of advanced data analysis and machine learning techniques in improving liquid biopsy methods and personalized medicine.

Article Abstract

Motivation: Distinguishing between pathogenic cancer-associated mutations and other somatic variants present in cell-free DNA (cfDNA) is one of the challenges in the field of liquid biopsy. This distinction is critical, since the misclassification of mutations stemming from clonal hematopoiesis (CH) as tumor-derived and vice versa could result in inaccurate diagnoses and inappropriate therapeutic interventions for patients.

Results: We addressed this by developing a specialized machine learning technique to differentiate tumor- or CH-related mutations in cfDNA. We established a comprehensive in-house reference catalog, comprising approximately 25,000 single nucleotide variants (SNVs), each linked to either tumor or CH origin. This reference serves as a foundation for training a deep learning model, which is structured on the semi-supervised generative adversarial network (SSGAN) architecture. By analyzing genomic coordinates and nucleotide composition of cfDNA variants, our model attains 95 % area under the curve (AUC) in classifying uncharacterized variants as CH or tumor-derived. In conclusion, our research emphasizes the potential of genomic feature prediction, using cfDNA data, to stand as a robust alternative to conventional multi-analyte sequencing methods. This approach not only enhances the accuracy of distinguishing CH from tumor mutations in liquid biopsy data, but also highlights the potential of advanced data analysis techniques and machine learning in genomics and personalized medicine. : https://github.com/FPalizban/SSGAN.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11530920PMC
http://dx.doi.org/10.1016/j.heliyon.2024.e39379DOI Listing

Publication Analysis

Top Keywords

cell-free dna
8
liquid biopsy
8
machine learning
8
predicting somatic
4
somatic mutation
4
mutation origins
4
origins cell-free
4
dna semi-supervised
4
semi-supervised gan
4
gan models
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!