Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy.

Med Care

*Dalla Lana School of Public Health, University of Toronto, Toronto, ON †Department of Epidemiology, Biostatistics and Occupational Health, McGill University ‡Direction de la Santé Publique de Montréal §Department of Medicine, McGill University, Montreal, QC, Canada.

Published: August 2017

Objective: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a "learning health system." Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstrates a lack of awareness of the need for a stratified sampling design and corresponding statistical methods. We propose a method for validating the measurement of diagnostic groups that have: (1) different prevalences of diagnostic codes within the group; and (2) low prevalence.

Methods: We describe an estimation method whereby: (1) low-prevalence diagnostic codes are oversampled, and the positive predictive value (PPV) of the diagnostic group is estimated as a weighted average of the PPV of each diagnostic code; and (2) claims that fall within a low-prevalence diagnostic group are oversampled relative to claims that are not, and bias-adjusted estimators of sensitivity and specificity are generated.

Application: We illustrate our proposed method using an example from population health surveillance in which diagnostic groups are applied to physician claims to identify cases of acute respiratory illness.

Conclusions: Failure to account for the prevalence of each diagnostic code within a diagnostic group leads to the underestimation of the PPV, because low-prevalence diagnostic codes are more likely to be false positives. Failure to adjust for oversampling of claims that fall within the low-prevalence diagnostic group relative to those that do not leads to the overestimation of sensitivity and underestimation of specificity.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5510703PMC
http://dx.doi.org/10.1097/MLR.0000000000000324DOI Listing

Publication Analysis

Top Keywords

diagnostic codes
16
low-prevalence diagnostic
16
diagnostic group
16
diagnostic groups
12
diagnostic
12
health care
8
care utilization
8
utilization data
8
ppv diagnostic
8
diagnostic code
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!