MutBLESS: A tool to identify disease-prone sites in cancer using deep learning.

Biochim Biophys Acta Mol Basis Dis

Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India.

Published: August 2023

Understanding the molecular basis and impact of mutations at different stages of cancer is a long-standing challenge in cancer biology. Identifying driver mutations experimentally is expensive and time-intensive. In the present study, we collected data on experimentally known driver mutations in 22 cancer types and classified them into six categories: breast cancer (BRCA), acute myeloid leukaemia (LAML), endometrial carcinoma (EC), stomach cancer (STAD), skin cancer (SKCM), and other cancer types. The dataset contains 5747 disease-prone and 5514 neutral sites in 516 proteins. Analysis of the amino acid distribution around mutant sites revealed that the motifs AAA and LR are preferred at disease-prone sites, whereas QPP and QF are dominant at neutral sites. Further, we developed a deep neural network method to predict disease-prone sites from amino acid sequence-based features such as physicochemical properties, secondary structure, tri-peptide motifs and conservation scores. On a test dataset, we obtained an average AUC of 0.97 across five cancer types (BRCA, LAML, EC, STAD and SKCM) and 0.72 for all other cancer types together. The method performed well at identifying cancer-specific mutations, with an average sensitivity, specificity, and accuracy of 96.56 %, 97.39 %, and 97.64 %, respectively. We developed a web server for identifying cancer-prone sites, available at https://web.iitm.ac.in/bioinfo2/MutBLESS/index.html. We suggest that our method can serve as an effective way to identify disease-prone sites and assist in developing therapeutic strategies.
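
The motif comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how motifs were extracted, so the sketch assumes a motif is any window of k residues that covers the mutant site. The function names (`motifs_at_site`, `motif_counts`, `motif_preference`) and the toy sequences are hypothetical.

```python
from collections import Counter


def motifs_at_site(sequence, position, k=3):
    """Return every k-residue window of `sequence` covering the 0-based `position`.

    Assumption: a "motif" is any contiguous window that includes the site.
    """
    windows = []
    for start in range(position - k + 1, position + 1):
        end = start + k
        if start >= 0 and end <= len(sequence):
            windows.append(sequence[start:end])
    return windows


def motif_counts(sites, k=3):
    """Count k-residue motifs over a list of (sequence, position) pairs."""
    counts = Counter()
    for sequence, position in sites:
        counts.update(motifs_at_site(sequence, position, k))
    return counts


def motif_preference(disease_sites, neutral_sites, k=3):
    """Difference in relative motif frequency (disease-prone minus neutral).

    Positive values mean the motif is relatively more common at
    disease-prone sites; negative values favour neutral sites.
    """
    disease = motif_counts(disease_sites, k)
    neutral = motif_counts(neutral_sites, k)
    disease_total = sum(disease.values()) or 1
    neutral_total = sum(neutral.values()) or 1
    return {
        motif: disease[motif] / disease_total - neutral[motif] / neutral_total
        for motif in set(disease) | set(neutral)
    }


# Toy data invented for illustration only (not taken from the study).
disease_sites = [("MKAAALR", 3), ("GAAAT", 2)]
neutral_sites = [("MQPPQF", 2), ("TQPPS", 2)]

preference = motif_preference(disease_sites, neutral_sites)
for motif, score in sorted(preference.items(), key=lambda item: -item[1]):
    print(f"{motif}\t{score:+.3f}")
```

A simple frequency difference keeps the sketch short. A real analysis would more likely use a statistical enrichment test and a much larger set of sites.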

Source
http://dx.doi.org/10.1016/j.bbadis.2023.166721

Publication Analysis

Top Keywords

disease-prone sites (16)
cancer types (16)
cancer (10)
identify disease-prone (8)
sites (8)
driver mutations (8)
neutral sites (8)
amino acid (8)
mutbless tool (4)
tool identify (4)

Similar Publications

(McRae) is a hemibiotrophic oomycete fungus that infects tender nuts, growing buds, and crown regions, resulting in fruit, bud, and crown rot diseases in arecanut ( L.), respectively. Among them, fruit rot disease (FRD) causes serious economic losses that are borne by the growers, making it the greatest yield-limiting factor in arecanut crops.

Predicting potential residues associated with lung cancer using deep neural network.

Mutat Res

July 2021

Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, 600036, India; School of Computing, Tokyo Tech World Research Hub Initiative (WRHI), Institute of Innovative Research, Tokyo Institute of Technology, Midori-ku, Kanagawa, 226-8503, Yokohama, Japan.

Lung cancer is a prominent type of cancer that leads to a high mortality rate worldwide. The major lung cancers, lung adenocarcinoma (LUAD) and lung squamous carcinoma (LUSC), occur mainly due to somatic driver mutations in proteins, and screening for such mutations is often costly and time-intensive. Hence, in the present study, we systematically analyzed the preferred residues, residue pairs and motifs of 4172 disease-prone sites in 195 proteins and compared them with 4137 neutral sites.

Neutrophils are implicated in the pathogenesis of atherosclerosis but are seldom detected in atherosclerotic plaques. We investigated whether neutrophil-derived microvesicles may influence arterial pathophysiology. Here we report that levels of circulating neutrophil microvesicles are enhanced by exposure to a high-fat diet, a known risk factor for atherosclerosis.

The study of drug-protein interactions has gained significant attention lately because the majority of drugs interact with proteins, thereby altering their structure and, in turn, their function. Rivastigmine tartrate (RT) is a drug used in the treatment of mild to moderate Alzheimer's disease. This study aimed to characterize the interaction between human transferrin (hTf) and RT using spectroscopy, isothermal titration calorimetry (ITC), and molecular docking.
