Rank order entropy: why one metric is not enough.

J Chem Inf Model

Department of Chemistry, Rensselaer Polytechnic Institute, Troy, New York 12180, United States.

Published: September 2011

The use of Quantitative Structure-Activity Relationship models to address problems in drug discovery has a mixed history, generally resulting from the misapplication of QSAR models that were either poorly constructed or used outside of their domains of applicability. This situation has motivated the development of a variety of model performance metrics (r(2), PRESS r(2), F-tests, etc.) designed to increase user confidence in the validity of QSAR predictions. In a typical workflow scenario, QSAR models are created and validated on training sets of molecules using metrics such as Leave-One-Out or many-fold cross-validation methods that attempt to assess their internal consistency. However, few current validation methods are designed to directly address the stability of QSAR predictions in response to changes in the information content of the training set. Since the main purpose of QSAR is to quickly and accurately estimate a property of interest for an untested set of molecules, it makes sense to have a means at hand to correctly set user expectations of model performance. In fact, the numerical value of a molecular prediction is often less important to the end user than knowing the rank order of that set of molecules according to their predicted end point values. Consequently, a means for characterizing the stability of predicted rank order is an important component of predictive QSAR. Unfortunately, none of the many validation metrics currently available directly measure the stability of rank order prediction, making the development of an additional metric that can quantify model stability a high priority. To address this need, this work examines the stabilities of QSAR rank order models created from representative data sets, descriptor sets, and modeling methods that were then assessed using Kendall Tau as a rank order metric, upon which the Shannon entropy was evaluated as a means of quantifying rank-order stability. Random removal of data from the training set, also known as Data Truncation Analysis (DTA), was used as a means for systematically reducing the information content of each training set while examining both rank order performance and rank order stability in the face of training set data loss. The premise for DTA ROE model evaluation is that the response of a model to incremental loss of training information will be indicative of the quality and sufficiency of its training set, learning method, and descriptor types to cover a particular domain of applicability. This process is termed a "rank order entropy" evaluation or ROE. By analogy with information theory, an unstable rank order model displays a high level of implicit entropy, while a QSAR rank order model which remains nearly unchanged during training set reductions would show low entropy. In this work, the ROE metric was applied to 71 data sets of different sizes and was found to reveal more information about the behavior of the models than traditional metrics alone. Stable, or consistently performing models, did not necessarily predict rank order well. Models that performed well in rank order did not necessarily perform well in traditional metrics. In the end, it was shown that ROE metrics suggested that some QSAR models that are typically used should be discarded. ROE evaluation helps to discern which combinations of data set, descriptor set, and modeling methods lead to usable models in prioritization schemes and provides confidence in the use of a particular model within a specific domain of applicability.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3428235PMC
http://dx.doi.org/10.1021/ci200170kDOI Listing

Publication Analysis

Top Keywords

rank order
48
training set
24
rank
12
qsar models
12
order
12
set
11
models
9
qsar
9
model
8
model performance
8

Similar Publications

Background: Bladder cancer (BCa) is one of the most common malignancies worldwide, and its prognostication and treatment remains challenging. The fast growth of various cancer cells requires reprogramming of its energy metabolism using aerobic glycolysis as a major energy source. However, the prognostic and therapeutic value of glycolysis-related genes in BCa remains to be determined.

View Article and Find Full Text PDF

Objective: This study aimed to identify the top 10 international research priorities for musculoskeletal health of people with generalized joint hypermobility.

Methods: A 3-round Delphi method utilizing an online survey was implemented. Three participant stakeholder groups were eligible for inclusion: (1) people with lived experience of joint hypermobility or their carers, (2) healthcare professionals, and (3) researchers with experience working with individuals with hypermobility.

View Article and Find Full Text PDF

Short and mid-term research priorities for Veterans with multiple sclerosis: A modified Delphi process engaging Veterans, researchers, and operational partners.

Mult Scler Relat Disord

January 2025

Multiple Sclerosis Center of Excellence West, Veterans Affairs, USA; Rehabilitation Care Service, VA Puget Sound Health Care System, 1660 S Columbian Way, Seattle, Washington, 98108, USA; Department of Rehabilitation Medicine, University of Washington, 325 9th Avenue, Seattle, Washington, 98104, USA. Electronic address:

Background/objective: Identifying research priorities of Veterans, MS researchers, and key stakeholders is critical to advance high-quality, evidence-based, and Veteran-specific MS care.

Methods: We used a modified Delphi approach to identify research priorities for Veterans with MS. Electronic surveys were distributed to Veterans with MS (n = 50,975), MS researchers (n = 191), VA healthcare providers (1,337), and funding agency representatives (n = 6) asking about their 2-3 most important research questions that would benefit Veterans with MS for researchers to answer in the next 5-10 years.

View Article and Find Full Text PDF

Background: In 1962, the idea emerged that medical students' tolerance of uncertainty could determine their specialty choice. While some studies supported this claim, others refuted it, often using independently developed instruments. We explored whether the reported link between specialty choice and uncertainty tolerance is more myth than evidence by employing established instruments to investigate whether specialty choice could be explained by variance in uncertainty tolerance.

View Article and Find Full Text PDF

Between- and Within-Cluster Spearman Rank Correlations.

Stat Med

February 2025

Department of Biostatistics, Vanderbilt University, Nashville, Tennessee, USA.

Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!