VSMP: a novel variable selection and modeling method based on the prediction.

J Chem Inf Comput Sci

State Key Laboratory of Pollution Control and Resources Reuse, Department of Environmental Science & Engineering, Nanjing University, Nanjing 210093, P. R. China.

Published: October 2003

The use of numerous descriptors that are indicative of molecular structure and topology is becoming more common in quantitative structure-activity relationship (QSAR). How to choose the adequate descriptors for QSAR studies is important but difficult because there are no absolute rules to govern this choice. A variety of variable selection techniques including stepwise, partial least squares/principal component analysis (PLS/PCA), neural network, and evolutionary algorithm such as genetic algorithm have been applied to this common problem. All-subsets regression (ASR) is capable of finding out the best variable subset from among a large pool. In this paper, a novel variable selection and modeling method based on the prediction, for short VSMP, has been developed. Here two controllable parameters, the interrelation coefficient between the pairs of the independent variables (r(int)) and the correlation coefficient (q(2)) obtained using the leave-one-out (LOO) cross-validation technique, are introduced into the ASR to improve its performances. This technique differs from the other variable selection procedures related to the ASR by two main features: (1) The search of various optimal subset search is controlled by the statistic q(2) or root-mean-square error (RMSEP) in the LOO cross-validation step rather than the correlation coefficient obtained in the modeling step (r(2)). (2) The searching speed of all optimal subsets is expedited by the statistic r(int) together with q(2). A comparison of the results of the VSMP applied to the Selwood data set (n = 31 compounds, m = 53 descriptors) with those obtained from alternative algorithms shows the good performance of the technique.

Download full-text PDF

Source
http://dx.doi.org/10.1021/ci020377jDOI Listing

Publication Analysis

Top Keywords

variable selection
16
novel variable
8
selection modeling
8
modeling method
8
method based
8
based prediction
8
correlation coefficient
8
loo cross-validation
8
variable
5
vsmp novel
4

Similar Publications

Background: Paracoccidioidomycosis (PCM) is a systemic mycosis endemic and limited to Latin America. Brazil is responsible for more than 80% of diagnosed cases in the world. Since PCM is not a notifiable disease, there are still no accurate data on its incidence in Brazil.

View Article and Find Full Text PDF

Background: Sensory disorders of the inferior alveolar nerve, often arising from dental procedures, markedly impact the quality of life of patients. This article proposes a scoping review to analyze emerging trends in pharmacological treatment for these disorders, addressing scientific gaps and clinical practices.

Material And Methods: The review followed the PRISMA-ScR protocol, conducting data searches across various databases, including PubMed and Cochrane, until March 2024.

View Article and Find Full Text PDF

Purpose: Standard therapy for breast cancer after breast-conserving surgery is radiation therapy (RT) plus hormone therapy (HT). For patients with a low-risk of recurrence, there is an interest in deescalating therapy.

Methods And Materials: A retrospective study was carried out for patients treated at the Swedish Cancer Institute from 2000 to 2015, aged 70 years or older, with pT1N0 or pT1NX estrogen receptor-positive and ERBB2-negative unifocal breast cancer without positive surgical margins, high nuclear grade, or lymphovascular invasion.

View Article and Find Full Text PDF

Introduction: We assessed the prevalence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection and associated socio-occupational factors among delivery riders from a Brazilian city at two time points during the pandemic.

Methodology: Surveys for antibody and viral RNA testing were conducted from November 2020 to January 2021, and from March to May 2021 in a group of 117 delivery riders. A questionnaire on socio-occupational characteristics and coronavirus disease 2019 (COVID-19) preventive measures was completed.

View Article and Find Full Text PDF

Objective: To analyze the sociostructural determinants associated with mental health problems during the lockdown period among populations residing in Brazil, Chile, Ecuador, Mexico, Peru, and Spain who lived with minors or dependents, approached from a gender perspective.

Methods: A cross-sectional study was conducted in six participating countries via an adapted, self-managed online survey. People living with minors and/or dependents were selected.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!