Using AI-predicted protein structures as a reference to predict loss-of-function activity in tumor suppressor breast cancer genes.

Comput Struct Biotechnol J

Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, United States.

Published: December 2024

AI Article Synopsis

  • The classification of missense variants in tumor suppressor genes related to breast cancer is complicated due to their rarity, leading to uncertainty in clinical decisions.
  • This study utilizes generative AI to create high-resolution protein structures, which are then analyzed for protein stability changes to assess loss-of-function (LOF) activity in specific domains of the protein.
  • Results show that AI-generated structures of the BRCA1-C terminal and DNA-binding domains significantly improve LOF activity prediction compared to experimental structures, indicating the potential of AI in this area of genetic research.

Article Abstract

Background: The loss-of-function (LOF) classification of most missense variants in tumor suppressor breast cancer genes , and remains unclassified and confounds clinical actionability. Classifying these variants is challenging due to their rarity, leading clinicians to rely on predictive methods. Protein stability changes are associated with function, making stability predictors valuable. Stability predictions upon missense variant perturbations require high-resolution protein structures. However, the availability of these high-resolution structures is lacking. This study explores using generative AI to predict high-resolution protein structures, which can then be analyzed with protein stability prediction methods to assess LOF activity in ordered regions of the protein. This study also determines the appropriate protein stability and dedicated missense prediction methods in dbNSFP v4.7 database to predict LOF activity in ordered regions of these four genes. Functional classifications from homology recombination DNA repair (HDR) assays and variant classifications from the ClinVar database provide a reliable dataset for evaluating the performance of these prediction methods.

Results: Complex AlphaFold2 structures of the BRCA1-C terminal (BRCT) domain and the DNA-binding (DB) domain of analyzed using protein stability tool FoldX predicts LOF activity from missense variants significantly better than experimentally-derived structures in ordered regions. The BRCT domain achieved an Area Under the Curve (AUC)= 0.861 (95 % CI:0.858-0.863) and AUC= 0.842 (95 % CI:0.840-0.845), while the DB domain achieved an AUC= 0.836 (95 % CI:0.8322-0.841), compared to AUC= 0.847 (95 % CI:0.844-0.850) and AUC= 0.835 (95 % CI:0.832-0.837) from the BRCT domain, and AUC= 0.830 (95 % CI:0.821-0.8320) from the DB domain from experimentally-derived structures. Protein stability does not predict LOF activity from missense variants better than dedicated missense predictors. Overall, we find that AlphaMissense ranks highly, with an average AUC= 0.890 (95 % CI 0.886-0.895) from ordered regions across these four cancer genes, compared to all other missense predictors present in the dbNSFP database.

Conclusions: The study reveals that generative AI protein predicted structures can outperform experimentally-derived structures in evaluating LOF activity from predicted protein stability in ordered regions of genes BRCA1, BRCA2, PALB2 and RAD51C. The study also highlights the predictive performance of AlphaMissense as the premier missense prediction method to predict LOF activity from missense variants in these four tumor suppressor breast cancer genes. The code for this study can be downloaded for free on GitHub (https://github.com/rohandavidg/CarePred).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11490748PMC
http://dx.doi.org/10.1016/j.csbj.2024.10.008DOI Listing

Publication Analysis

Top Keywords

protein stability
24
lof activity
24
ordered regions
20
cancer genes
16
missense variants
16
protein structures
12
tumor suppressor
12
suppressor breast
12
breast cancer
12
predict lof
12

Similar Publications

Targeting CHEK1: Ginsenosides-Rh2 and Cu2O@G-Rh2 nanoparticles in thyroid cancer.

Cell Biol Toxicol

January 2025

Department of Radiology, Shengjing Hospital of China Medical University, No. 36 Sanhao Street, Heping District, Shenyang, 110004, Liaoning Province, China.

Thyroid cancer (THCA) is an increasingly common malignant tumor of the endocrine system, with its incidence rising steadily in recent years. For patients who experience recurrence or metastasis, treatment options are relatively limited, and the prognosis is poor. Therefore, exploring new therapeutic strategies has become particularly urgent.

View Article and Find Full Text PDF

Background: Except host and environmental factors influencing individual human cytokine responses, pre-analytical handling procedures and detection methods also affect cytokine levels.

Methods: In this study, we used cytometric bead array (CBA) and chemiluminescence (ECL). These two methods were used to test serum and plasma samples from 50 healthy adult volunteers and 50 rheumatoid arthritis (RA) patients' cytokine levels.

View Article and Find Full Text PDF

Plant pathogens pose significant threats to global cereal crop production, particularly for essential crops like rice and wheat, which are fundamental to global food security and provide nearly 40% of the global caloric intake. As the global population continues to rise, increasing agricultural production to meet food demands becomes even more critical. However, the production of these vital crops is constantly threatened by phytopathological diseases, especially those caused by fungal pathogens such as , the causative agent of rice blast disease, , responsible for head blight (FHB) in wheat, and , the source of Septoria tritici blotch (STB).

View Article and Find Full Text PDF

Background: Classical-like Ehlers Danlos Syndrome type 1 (clEDS1) is a very rare form of Ehlers Danlos Syndrome (EDS) caused by tenascin-X (TNX) deficiency, with only 56 individuals reported. TNX is an extracellular matrix protein needed for collagen stability. Previous publications propose that individuals with clEDS1 might be at risk for gastrointestinal (GI) tract perforations and/or tracheal ruptures.

View Article and Find Full Text PDF

Myotonic Dystrophy type 2 (DM2) is a multisystem disease affecting many tissues, including skeletal muscle, heart, and brain. DM2 is caused by unstable expansion of CCTG repeats in an intron 1 of a gene coding for cellular nuclear binding protein (CNBP). The expanded CCTG repeats cause DM2 pathology due to the accumulation of RNA CCUG repeats, which affect RNA processing in patients' cells.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!