Covalent ligands have attracted increasing attention due to their unique advantages, such as long residence time, high selectivity, and strong binding affinity. They also show promise for targets where previous efforts to identify noncovalent small molecule inhibitors have failed. However, our limited knowledge of covalent binding sites has hindered the discovery of novel ligands. Therefore, developing in silico methods to identify covalent binding sites is highly desirable. Here, we propose DeepCoSI, the first structure-based deep graph learning model to identify ligandable covalent sites in the protein. By integrating the characterization of the binding pocket and the interactions between each cysteine and the surrounding environment, DeepCoSI achieves state-of-the-art predictive performances. The validation on two external test sets which mimic the real application scenarios shows that DeepCoSI has strong ability to distinguish ligandable sites from the others. Finally, we profiled the entire set of protein structures in the RCSB Protein Data Bank (PDB) with DeepCoSI to evaluate the ligandability of each cysteine for covalent ligand design, and made the predicted data publicly available on website.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9343084 | PMC |
http://dx.doi.org/10.34133/2022/9873564 | DOI Listing |
Mol Divers
January 2025
Key Laboratory of Prevention and Treatment of Cardiovascular and Cerebrovascular Diseases Ministry of Education, Jiangxi Province Key Laboratory of Biomaterials and Biofabrication for Tissue Engineering, Gannan Medical University, Ganzhou, 341000, Jiangxi, China.
Identifying drug-target binding affinity (DTA) plays a critical role in early-stage drug discovery. Despite the availability of various existing methods, there are still two limitations. Firstly, sequence-based methods often extract features from fixed length protein sequences, requiring truncation or padding, which can result in information loss or the introduction of unwanted noise.
View Article and Find Full Text PDFComput Biol Med
January 2025
Drug Design and Discovery Lab, Helmy Institute of Medical Sciences, Zewail City of Science, Technology and Innovation, Giza, 12578, Egypt; Biomedical Sciences Program, University of Science and Technology, Zewail City of Science, Technology and Innovation, Giza, 12578, Egypt. Electronic address:
Epidermal growth factor receptor (EGFR) is amongst the earliest targeted kinases by small-molecule inhibitors for the management of EGFR-positive cancer types. While a few inhibitors are granted FDA approval for clinical use, discovery of new inhibitors is still of merit to enhance ligand-binding stability and subsequent enzyme inhibition. Thus, a structure-based design approach was adopted to devise a new series of twenty-nine N3-substituted quinazolin-4-ones as type I ATP-competitive inhibitors targeting the deep hydrophobic pocket of EGFR.
View Article and Find Full Text PDFNat Commun
January 2025
School of Computer Science and Engineering, Central South University, Changsha, 410083, China.
Computational methods for predicting protein function are of great significance in understanding biological mechanisms and treating complex diseases. However, existing computational approaches of protein function prediction lack interpretability, making it difficult to understand the relations between protein structures and functions. In this study, we propose a deep learning-based solution, named DPFunc, for accurate protein function prediction with domain-guided structure information.
View Article and Find Full Text PDFBrief Bioinform
November 2024
AI Lab, Research Center for Industries of the Future, Westlake University, Zhejiang 310058, China.
The rational design of Ribonucleic acid (RNA) molecules is crucial for advancing therapeutic applications, synthetic biology, and understanding the fundamental principles of life. Traditional RNA design methods have predominantly focused on secondary structure-based sequence design, often neglecting the intricate and essential tertiary interactions. We introduce R3Design, a tertiary structure-based RNA sequence design method that shifts the paradigm to prioritize tertiary structure in the RNA sequence design.
View Article and Find Full Text PDFTrends Microbiol
December 2024
Department of Plant Pathology and Microbiology, Institute of Environmental Science, The Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel. Electronic address:
Bacteria colonize every niche on Earth and play key roles in many environmental and host-associated processes. The sequencing revolution revealed the remarkable bacterial genetic and proteomic diversity and the genomic content of cultured and uncultured bacteria. However, deciphering functions of novel proteins remains a high barrier, often preventing the deep understanding of microbial life and its interaction with the surrounding environment.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!