An ensemble-based drug-target interaction prediction approach using multiple feature information with data balancing.

J Biol Eng

Computer Science & Engineering Department, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.

Published: August 2022

Background: Recently, drug repositioning has received considerable attention for its advantage to pharmaceutical industries in drug development. Artificial intelligence techniques have greatly enhanced drug reproduction by discovering therapeutic drug profiles, side effects, and new target proteins. However, as the number of drugs increases, their targets and enormous interactions produce imbalanced data that might not be preferable as an input to a prediction model immediately.

Methods: This paper proposes a novel scheme for predicting drug-target interactions (DTIs) based on drug chemical structures and protein sequences. The drug Morgan fingerprint, drug constitutional descriptors, protein amino acid composition, and protein dipeptide composition were employed to extract the drugs and protein's characteristics. Then, the proposed approach for extracting negative samples using a support vector machine one-class classifier was developed to tackle the imbalanced data problem feature sets from the drug-target dataset. Negative and positive samplings were constructed and fed into different prediction algorithms to identify DTIs. A 10-fold CV validation test procedure was applied to assess the predictability of the proposed method, in addition to the study of the effectiveness of the chemical and physical features in the evaluation and discovery of the drug-target interactions.

Results: Our experimental model outperformed existing techniques concerning the curve for receiver operating characteristic (AUC), accuracy, precision, recall F-score, mean square error, and MCC. The results obtained by the AdaBoost classifier enhanced prediction accuracy by 2.74%, precision by 1.98%, AUC by 1.14%, F-score by 3.53%, and MCC by 4.54% over existing methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9361677PMC
http://dx.doi.org/10.1186/s13036-022-00296-7DOI Listing

Publication Analysis

Top Keywords

imbalanced data
8
drug
7
ensemble-based drug-target
4
drug-target interaction
4
prediction
4
interaction prediction
4
prediction approach
4
approach multiple
4
multiple feature
4
feature data
4

Similar Publications

Background: The implementation of large language models (LLMs), such as BART (Bidirectional and Auto-Regressive Transformers) and GPT-4, has revolutionized the extraction of insights from unstructured text. These advancements have expanded into health care, allowing analysis of social media for public health insights. However, the detection of drug discontinuation events (DDEs) remains underexplored.

View Article and Find Full Text PDF

Purpose Of Review: The densely populated Asia Pacific region is home to 600 million children, and suffers from a significant burden of morbidity and mortality due to infections associated with antimicrobial resistance (AMR). We aimed to identify the drivers, challenges and potential opportunities to alter the burden of AMR within the region.

Recent Findings: Despite the high AMR burden borne by the Asia Pacific region, there are limited (and geographically imbalanced) published data to delineate the contemporary epidemiology of serious multidrug-resistant bacterial infections in children.

View Article and Find Full Text PDF

Background: The Department of Rehabilitation Medicine is key to improving patients' quality of life. Driven by chronic diseases and an aging population, there is a need to enhance the efficiency and resource allocation of outpatient facilities. This study aims to analyze the treatment preferences of outpatient rehabilitation patients by using data and a grading tool to establish predictive models.

View Article and Find Full Text PDF

ST-CIRL: a reinforcement learning-based feature selection approach for enhanced anxiety classification.

Physiol Meas

January 2025

Department of Electronics and Communication , Delhi Technological University Department of Electronics and Communication, Delhi Technological university, Bawana, New Delhi-42, New Delhi, Delhi, 110042, INDIA.

A physiological signal-based Human-Computer Interaction (HCI) system provides a communication link between human emotional states and external devices. Accurately classifying these signals is vital for effective interaction, which requires extracting and selecting the most discriminative features to differentiate between various emotional states. This paper introduces the SMOTETomek-Correlated Interactive Reinforcement Learning (ST-CIRL) framework for anxiety classification, which leverages meta-descriptive statistics to enhance the state representation in the reinforcement learning process.

View Article and Find Full Text PDF

Obstructive sleep apnea (OSA) patients have varying degrees of cognitive impairment, but the specific pathogenic mechanism is still unclear. Meanwhile, poor compliance with continuous positive airway pressure (CPAP) in OSA prompts better solutions. This study aimed to identify differentially expressed genes between the non-obese OSA patients and healthy controls, and to explore potential biomarkers associated with cognitive impairment.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!