[Application of detecting and taking overdispersion into account in Poisson regression model].

Rev Epidemiol Sante Publique

Unité d'Evaluation Médicale, Pôle Pharmacie et Santé Publique, CHU de Poitiers, Université de Poitiers, Pavillon Camille-Guérin, 2 Rue de la Milétrie, BP 577, 86021 Poitiers Cedex, France.

Published: August 2009

Background: Researchers often use the Poisson regression model to analyze count data. Overdispersion can occur when a Poisson regression model is used, resulting in an underestimation of variance of the regression model parameters. Our objective was to take overdispersion into account and assess its impact with an illustration based on the data of a study investigating the relationship between use of the Internet to seek health information and number of primary care consultations.

Methods: Three methods, overdispersed Poisson, a robust estimator, and negative binomial regression, were performed to take overdispersion into account in explaining variation in the number (Y) of primary care consultations. We tested overdispersion in the Poisson regression model using the ratio of the sum of Pearson residuals over the number of degrees of freedom (chi(2)/df). We then fitted the three models and compared parameter estimation to the estimations given by Poisson regression model.

Results: Variance of the number of primary care consultations (Var[Y]=21.03) was greater than the mean (E[Y]=5.93) and the chi(2)/df ratio was 3.26, which confirmed overdispersion. Standard errors of the parameters varied greatly between the Poisson regression model and the three other regression models. Interpretation of estimates from two variables (using the Internet to seek health information and single parent family) would have changed according to the model retained, with significant levels of 0.06 and 0.002 (Poisson), 0.29 and 0.09 (overdispersed Poisson), 0.29 and 0.13 (use of a robust estimator) and 0.45 and 0.13 (negative binomial) respectively.

Conclusion: Different methods exist to solve the problem of underestimating variance in the Poisson regression model when overdispersion is present. The negative binomial regression model seems to be particularly accurate because of its theorical distribution ; in addition this regression is easy to perform with ordinary statistical software packages.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.respe.2009.02.209DOI Listing

Publication Analysis

Top Keywords

poisson regression
28
regression model
28
overdispersion account
12
regression
12
number primary
12
primary care
12
negative binomial
12
poisson
10
model
8
internet seek
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!