Publications by authors named "Dong Sheng Cao"

Article Synopsis
  • A new optimized parameter fitting algorithm, using the Nelder-Mead method, was developed and integrated into a user-friendly online application called CPhaMAS, which facilitates pharmacokinetic data analysis through three main modules.
  • Evaluation results showed that CPhaMAS has improved accuracy in parameter estimation, as indicated by lower mean relative errors in various models, compared to existing software like WinNonlin, while also being easy to use without programming knowledge.
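
The "mean relative error" mentioned in the synopsis can be illustrated with a minimal sketch. It assumes the usual definition, the average of |estimate - reference| / |reference| over the fitted parameters; the synopsis does not spell out the exact formula, and the function name and numbers below are hypothetical and not taken from the CPhaMAS evaluation.

```python
from typing import Sequence


def mean_relative_error(estimated: Sequence[float], reference: Sequence[float]) -> float:
    """Average of |estimate - reference| / |reference| across parameters."""
    if len(estimated) != len(reference) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(e - r) / abs(r) for e, r in zip(estimated, reference))
    return total / len(reference)


# Hypothetical reference and fitted parameter values, for illustration only.
true_params = [10.0, 0.20, 2.0]
fitted_params = [10.5, 0.19, 2.0]
mre = mean_relative_error(fitted_params, true_params)
print(f"mean relative error: {mre:.4f}")
```

A lower value means the fitted parameters sit closer to the reference values, which is how the comparison between tools is read here.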

Liver microsomal stability, a crucial aspect of metabolic stability, significantly impacts practical drug discovery. However, current models for predicting liver microsomal stability are based on limited molecular information from a single species. To address this limitation, we constructed the largest public database of compounds from three common species: human, rat, and mouse.

Drug discovery and development constitute a laborious and costly undertaking. The success of a drug hinges not only on good efficacy but also on acceptable absorption, distribution, metabolism, elimination, and toxicity (ADMET) properties. Overall, up to 50% of drug development failures have been attributed to undesirable ADMET profiles.

Motivation: Spatial clustering is essential yet challenging in spatial transcriptomics data analysis for unraveling tissue microenvironment and biological function. Graph neural networks are a promising way to combine gene expression profiles and spatial location information in spatial transcriptomics to generate latent representations. However, choosing an appropriate graph deep learning module and graph neural network requires further investigation.

Patents play a crucial role in drug research and development, providing early access to unpublished data and offering unique insights. Identifying key compounds in patents is essential to finding novel lead compounds. This study collected a comprehensive data set comprising 1555 patents, encompassing 1000 key compounds, to explore innovative approaches for predicting these key compounds.

Detecting drug-drug interactions (DDIs) is an essential step in drug development and drug administration. Given the shortcomings of current experimental methods, the machine learning (ML) approach has become a reliable alternative, attracting extensive attention from the academic and industrial fields. With the rapid development of computational science and the growing popularity of cross-disciplinary research, a large number of DDI prediction studies based on ML methods have been published in recent years.

Recent advances and achievements of artificial intelligence (AI) as well as deep and graph learning models have established their usefulness in biomedical applications, especially in drug-drug interactions (DDIs). A DDI is a change in the effect of one drug due to the presence of another drug in the human body, and DDIs play an essential role in drug discovery and clinical research. Predicting DDIs through traditional clinical trials and experiments is an expensive and time-consuming process.

Adverse drug events (ADEs) are common in clinical practice and can cause significant harm to patients and increase resource use. Natural language processing (NLP) has been applied to automate ADE detection, but NLP systems become less adaptable when drug entities are missing or multiple medications are specified in clinical narratives. Additionally, no Chinese-language NLP system has been developed for ADE detection due to the complexity of Chinese semantics, despite >10 million cases of drug-related adverse events occurring annually in China.

Triple-negative breast cancer (TNBC) is a particularly invasive subtype of breast cancer and usually has a poor prognosis due to the lack of effective therapeutic targets. Approximately 25% of TNBC patients carry a breast cancer susceptibility gene 1/2 (BRCA1/2) mutation. Clinically, PARP1 inhibitors have been approved for the treatment of patients with BRCA1/2-mutated breast cancer through the mechanism of synthetic lethality.

Identification and validation of bioactive small-molecule targets is a significant challenge in drug discovery. In recent years, various in-silico approaches have been proposed to expedite time- and resource-consuming experiments for target detection. Herein, we developed several chemogenomic models for target prediction based on multi-scale information of chemical structures and protein sequences.

Advances in spatially resolved transcriptomics (ST) technologies help biologists comprehensively understand organ function and tissue microenvironment. Accurate spatial domain identification is the foundation for delineating genome heterogeneity and cellular interaction. Motivated by this perspective, this paper constructs a graph deep learning (GDL) based spatial clustering approach.

The n-octanol/buffer solution distribution coefficient at pH = 7.4 (log D7.4) is an indicator of lipophilicity, and it influences a wide variety of absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties and druggability of compounds. In log D7.4 prediction, graph neural networks (GNNs) can uncover subtle structure-property relationships (SPRs) by automatically extracting features from molecular graphs that facilitate the learning of SPRs, but their performances are often limited by the small size of available datasets.

Malignant melanoma (MM) is a highly life-threatening tumor that causes the majority of cutaneous cancer-related deaths. Ribosomal protein S6 kinase 2 (RSK2), a downstream effector of the MAPK pathway, was previously identified as a therapeutic target in melanoma. AE007 was discovered as a targeted RSK2 inhibitor, and subsequent results showed that AE007 inhibits RSK2 by directly binding to its protein kinase domain.

Identification of potential targets for known bioactive compounds and novel synthetic analogs is of considerable significance. In silico target fishing (TF) has become an alternative strategy because of the expensive and laborious wet-lab experiments, explosive growth of bioactivity data and rapid development of high-throughput technologies. However, these TF methods are based on different algorithms, molecular representations and training datasets, which may lead to different results when predicting the same query molecules.

Machine learning-based scoring functions (MLSFs) have become a very favorable alternative to classical scoring functions because of their potential superior screening performance. However, the information of negative data used to construct MLSFs was rarely reported in the literature, and meanwhile the putative inactive molecules recorded in existing databases usually have obvious bias from active molecules. Here we proposed an easy-to-use method named AMLSF that combines active learning using negative molecular selection strategies with MLSF, which can iteratively improve the quality of inactive sets and thus reduce the false positive rate of virtual screening.

Traditional Chinese Medicine (TCM) has been widely used in the treatment of various diseases for millennia. In the modernization of TCM, ingredient databases are playing an increasingly important role. However, most existing TCM ingredient databases do not provide a simplification function for extracting the key ingredients in each herb or formula, which hinders research on the mechanisms of action of the ingredients in TCM databases.

Accurate prediction of pharmacological properties of small molecules is becoming increasingly important in drug discovery. Traditional feature-engineering approaches heavily rely on handcrafted descriptors and/or fingerprints, which need extensive human expert knowledge. With the rapid progress of artificial intelligence technology, data-driven deep learning methods have shown unparalleled advantages over feature-engineering-based methods.

Article Synopsis
  • The study utilized seven machine learning algorithms and various molecular representations, achieving a balanced accuracy of up to 72.6% and an AUC of 76.8% with the best model, indicating effective classification of hematotoxicity.
  • Advanced techniques like SHAP and matched molecular pair analysis were employed to identify crucial structural features and inform safer drug design processes, highlighting the study's potential as a valuable resource for assessing hematotoxicity in new drugs.
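
For readers unfamiliar with the balanced accuracy figure quoted in the synopsis, here is a minimal sketch of the standard binary definition: the mean of sensitivity and specificity. The labels and predictions are made-up toy values, not data from the hematotoxicity study, and the function name is an assumption for illustration.

```python
from typing import Sequence


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of sensitivity (recall on 1s) and specificity (recall on 0s)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Toy labels: 1 = toxic, 0 = non-toxic (hypothetical values).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 1]
score = balanced_accuracy(y_true, y_pred)
print(f"balanced accuracy: {score:.3f}")
```

Unlike plain accuracy, this metric weights both classes equally, which matters when toxic and non-toxic compounds are unevenly represented.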

Drug-drug interactions (DDIs) often cause serious adverse reactions and thus result in inestimable economic and social loss. Comprehensive DDI evaluation has become a major challenge in pharmaceutical research because experimental assessment is time-consuming and costly, so effective in silico methods are needed to predict and evaluate DDIs accurately and efficiently. In this study, based on a large number of substrates and inhibitors related to five important CYP450 isozymes (CYP1A2, CYP2C9, CYP2C19, CYP2D6 and CYP3A4), a series of high-performance predictive models for metabolic DDIs were constructed using two machine learning methods (random forest and XGBoost) and four different types of descriptors (MOE_2D, CATS, ECFP4 and MACCS).

Structural information for chemical compounds is often described by pictorial images in most scientific documents, which cannot be easily understood and manipulated by computers. This dilemma makes optical chemical structure recognition (OCSR) an essential tool for automatically mining knowledge from an enormous amount of literature. However, existing OCSR methods fall far short of our expectations for realistic requirements due to their poor recovery accuracy.

Objectives: Photobiomodulation (PBM) is widely used in clinical therapy and is an effective approach to resist bacterial infection of cutaneous wounds and modulate the wound healing process. Because lasers have several drawbacks, Red & Blue LED light (RBLL) may be a more viable light source. This study aimed to evaluate and compare the therapeutic effect of RBLL on different multi-drug resistant (MDR) bacteria in vitro and on a male Sprague-Dawley (SD) rat model of refractory MDR-infected wounds in vivo.

Article Synopsis
  • Understanding chemical-gene interactions (CGIs) is essential for drug screening, and while wet lab experiments are tedious and costly, computational methods offer a more efficient approach for large-scale analysis.
  • The study introduces BioNet, a deep biological network model that uses a graph encoder-decoder architecture to predict interactions between chemicals and genes, leveraging a large dataset that includes over 79,000 entities and more than 34 million relations.
  • BioNet demonstrates strong predictive performance, achieving an area under the ROC curve of 0.952, and its findings have been validated against external data, particularly in relation to cancer and COVID-19 interactions.
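
If the ROC score reported above is read as the area under the ROC curve (AUC), which is an assumption, the quantity can be sketched as the probability that a randomly chosen positive example is scored higher than a randomly chosen negative one, with ties counted as one half. The scores below are toy values, not BioNet outputs, and the function name is hypothetical.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC via pairwise comparison of positive and negative scores."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present in labels")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Toy example: 1 = known interaction, 0 = no interaction (hypothetical).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
auc = roc_auc(labels, scores)
print(f"AUC: {auc:.3f}")
```

The pairwise loop is quadratic in the number of examples; it is meant to make the definition concrete, not to scale to millions of relations.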

In the process of drug discovery, the optimization of lead compounds has always been a challenge faced by pharmaceutical chemists. Matched molecular pair analysis (MMPA), a promising tool to efficiently extract and summarize the relationship between structural transformation and property change, is suitable for local structural optimization tasks. In particular, the integration of MMPA with QSAR modeling can further strengthen the utility of MMPA in molecular optimization navigation.

Article Synopsis
  • DprE1 is a key enzyme in the cell wall biosynthesis of Mycobacterium, making it a target for new tuberculosis (TB) treatments.
  • The study used advanced molecular modeling techniques to identify two promising compounds, B2 and H3, that can inhibit DprE1 and kill Mycobacterium smegmatis in the lab.
  • Notably, compound H3 was found to effectively inhibit Mycobacterium tuberculosis with minimal harm to mouse cells, highlighting its potential as a new anti-TB drug.

Computational approaches, rather than traditional experimental tests, are drawing increasing attention for exploring the relationship between sweetness and chemical structure. In this work, we proposed a novel multi-layer sweetness evaluation system based on machine learning methods. It can be used to evaluate the sweetness properties of compounds from different chemical spaces and categories, including natural, artificial, carbohydrate, non-carbohydrate, nutritive and non-nutritive ones, making it suitable for different application scenarios.
