Next-generation sequencing methods have not only allowed an understanding of genome sequence variation during the evolution of organisms but have also provided invaluable information about genetic variants in inherited disease and the emergence of resistance to drugs in cancers and infectious disease. A challenge is to distinguish mutations that are drivers of disease or drug resistance, from passengers that are neutral or even selectively advantageous to the organism. This requires an understanding of impacts of missense mutations in gene expression and regulation, and on the disruption of protein function by modulating protein stability or disturbing interactions with proteins, nucleic acids, small molecule ligands, and other biological molecules. Experimental approaches to understanding differences between wild-type and mutant proteins are most accurate but are also time-consuming and costly. Computational tools used to predict the impacts of mutations can provide useful information more quickly. Here, we focus on two widely used structure-based approaches, originally developed in the Blundell lab: site-directed mutator (SDM), a statistical approach to analyze amino acid substitutions, and mutation cutoff scanning matrix (mCSM), which uses graph-based signatures to represent the wild-type structural environment and machine learning to predict the effect of mutations on protein stability. Here, we describe DUET that uses machine learning to combine the two approaches. We discuss briefly the development of mCSM for understanding the impacts of mutations on interfaces with other proteins, nucleic acids, and ligands, and we exemplify the wide application of these approaches to understand human genetic disorders and drug resistance mutations relevant to cancer and mycobacterial infections. STATEMENT FOR A BROADER AUDIENCE: Genetic or somatic changes in genes can lead to mutations in human proteins, which give rise to genetic disorders or cancer, or to genes of pathogens leading to drug resistance. Computer software described here, using statistical approaches or machine learning, uses the information from genome sequencing of humans and pathogens, together with experimental or modeled 3D structures of gene products, the proteins, to predict impacts of mutations in genetic disease, cancer and drug resistance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6933854PMC
http://dx.doi.org/10.1002/pro.3774DOI Listing

Publication Analysis

Top Keywords

impacts mutations
16
machine learning
16
drug resistance
16
mutations
9
mutations protein
8
sdm statistical
8
statistical approach
8
understanding impacts
8
protein stability
8
proteins nucleic
8

Similar Publications

Purpose: Patients with partial or complete DPD deficiency have decreased capacity to degrade fluorouracil and are at risk of developing toxicity, which can be even life-threatening.

Case: A 43-year-old man with moderately differentiated rectal adenocarcinoma on capecitabine presented to the emergency department with complaints of nausea, vomiting, diarrhea, weakness, and lower abdominal pain for several days. Laboratory findings include grade 4 neutropenia (ANC 10) and thrombocytopenia (platelets 36,000).

View Article and Find Full Text PDF

The abnormally viscous and thick mucus is a hallmark of cystic fibrosis (CF). How the mutated CF gene causes abnormal mucus remains an unanswered question of paramount interest. Mucus is produced by the hydration of gel-forming mucin macromolecules that are stored in intracellular granules prior to release.

View Article and Find Full Text PDF

Polymyxins, critical last-resort antibiotics, impact the distribution of membrane-bound divalent cations in the outer membrane of Gram-negative bacteria. We employed atomistic molecular dynamics simulations to model the effect of displacing these ions. Two polymyxin-sensitive and two polymyxin-resistant models of the outer membrane of were investigated.

View Article and Find Full Text PDF

Naa15 Haploinsufficiency and De Novo Missense Variants Associate With Neurodevelopmental Disorders and Interfere With Neurogenesis and Neuron Development.

Autism Res

January 2025

Center for Medical Genetics and Hunan key Laboratory of Medical Genetics, MOE Key Laboratory of Rare Pediatric Disease, School of Life Sciences, Central South University, Changsha, Hunan, China.

Neurodevelopmental disorders (NDDs) encompass a group of conditions that impact brain development and function, exhibiting significant genetic and clinical heterogeneity. NAA15, the auxiliary subunit of the N-terminal acetyltransferase complex, has garnered attention due to its association with NDDs. However, the precise role of NAA15 in cortical development and its contribution to NDDs remain elusive.

View Article and Find Full Text PDF

Arrhythmogenic calmodulin variants D131E and Q135P disrupt interaction with the L-type voltage-gated Ca channel (Ca1.2) and reduce Ca-dependent inactivation.

Acta Physiol (Oxf)

February 2025

Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, UK.

Aim: Long QT syndrome (LQTS) and catecholaminergic polymorphism ventricular tachycardia (CPVT) are inherited cardiac disorders often caused by mutations in ion channels. These arrhythmia syndromes have recently been associated with calmodulin (CaM) variants. Here, we investigate the impact of the arrhythmogenic variants D131E and Q135P on CaM's structure-function relationship.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!