The viability of cells and the integrity of the genome depend on the detection and repair of damaged DNA through intricate mechanisms. Cancer treatment employs chemotherapy or radiation therapy to eliminate neoplastic cells by causing substantial damage to their DNA. In many cases, improved DNA repair mechanisms lead to resistance to these medicines; therefore, it is essential to expand efforts to develop drugs that can sensitise cells to these treatments by inhibiting the DNA repair process. Multiple studies have demonstrated a correlation between the overexpression of Apurinic/Apyrimidinic Endonuclease (APE1), the primary mammalian enzyme responsible for excising apurinic or apyrimidinic sites in DNA, and the resistance of cells to cancer therapies; in contrast, APE1 downregulation increases cellular susceptibility to DNA-damaging agents. Thus, the effectiveness of existing therapies can be improved by promoting the targeted sensitization of cancer cells while protecting healthy cells. The current study aims to employ explainable artificial intelligence (XAI) to enhance the accuracy and reliability of machine learning models for the prediction of APE1 inhibitors. Various ML-based regression models are employed to predict the pIC50 value of different medicines. Bayesian optimization and the Permutation Feature Importance (PFI) approach are employed to determine the best hyperparameters of machine learning models and to discover the most significant features for recognizing drug candidates that target APE1 enzymes, respectively. To acquire comprehensive elucidations for the predictive models in our research, two XAI methodologies, namely SHAP and LIME, are used. The SHAP analysis reveals that the features 'C1SP2' and 'ASP-2' are essential in influencing the model's predictions. The SHAP values demonstrate variability for features such as 'maxHBint2' and 'GATS1s,' signifying that their impact is dependent on specific instances within the dataset. The LIME study corroborates these findings, demonstrating that 'C1SP2' and 'ASP-2' are the most significant positive contributors, whereas features like 'SHCHnX,' 'nHdCH2,' and 'GATS1s' result in a decrease in the predicted values. Due to the limited sample size of the APE1 dataset, direct training on this dataset posed challenges in model generalization and reliability. To overcome this limitation, the BACE-1 dataset is leveraged for model training, enabling the ML models to learn from a more extensive and diverse chemical space. Among the tested algorithms, XGBoost demonstrated superior predictive performance, achieving R = 0.890, MAE = 0.186, and RMSE = 0.245, significantly surpassing state-of-the-art methods, such as LightGBM and QSAR-ML, which attained R scores of 0.798 and 0.630, respectively. These results highlight the robustness of our approach, demonstrating its enhanced generalization capability and superior predictive accuracy compared to existing methodologies.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s11030-025-11133-6DOI Listing

Publication Analysis

Top Keywords

machine learning
12
learning models
12
prediction ape1
8
ape1 inhibitors
8
dna repair
8
'c1sp2' 'asp-2'
8
superior predictive
8
ape1
6
models
6
cells
6

Similar Publications

Background: White tea, an agriculturally distinctive product, exhibits significant aroma variations across different regions. Nevertheless, the mechanisms driving these differences, and distinguishing methods suitable for specific origins, have been scarcely reported. In this study, we analyzed the aroma characteristics and volatile components of 100 white tea samples from ten regions, utilizing sensory evaluation, headspace solid-phase microextraction-gas chromatography-mass spectrometry and chemometrics, then established a discrimination model.

View Article and Find Full Text PDF

Background: Breast cancer, a highly prevalent global cancer, poses significant challenges, especially in advanced stages. Prognostic models are crucial to enhance patient outcomes. Tertiary lymphoid structures (TLS) within the tumor microenvironment have been associated with better prognostic outcomes.

View Article and Find Full Text PDF

Wearable sensors have emerged as a transformative technology, enabling real-time monitoring and advanced functionality in various fields, including healthcare, human-machine interaction, and environmental sensing. This review provides a comprehensive overview of the latest advancements in wearable sensor technologies, focusing on innovations in sensor design, material flexibility, and integration with machine learning. We explore the feasibility of wearable electronics in achieving high-performance, flexible devices and discuss their potential to enhance human-machine interactions through intelligent data processing and decision-making.

View Article and Find Full Text PDF

Background: Numerous studies have reported that dysregulation of fatty acid metabolic pathways is associated with the pathogenesis of vitiligo, in which arachidonic acid metabolism (AAM) plays an important role. However, the molecular mechanisms of AAM in the pathogenesis of vitiligo have not been clarified. Therefore, we aimed to identify the biomarkers and molecular mechanisms associated with AAM in vitiligo using bioinformatics methods.

View Article and Find Full Text PDF

Distributed and networked analysis of volumetric image data for remote collaboration of microscopy image analysis.

J Med Imaging (Bellingham)

March 2025

Purdue University, School of Electrical and Computer Engineering, Video and Image Processing Laboratory, West Lafayette, Indiana, United States.

Purpose: The advancement of high-content optical microscopy has enabled the acquisition of very large three-dimensional (3D) image datasets. The analysis of these image volumes requires more computational resources than a biologist may have access to in typical desktop or laptop computers. This is especially true if machine learning tools are being used for image analysis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!