The relationship between the mean and variance is an implicit assumption of parametric modeling. While many distributions in the exponential family have a theoretical mean-variance relationship, it is often the case that the data under investigation are correlated, thus varying from the relation. We present a generalized method of moments estimation technique for modeling certain correlated data by adjusting the mean-variance relationship parameters based on a canonical parameterization. The proposed mean-variance form describes overdispersion using two parameters and implements an adjusted canonical parameter which makes this approach feasible for all distributions in the exponential family. Test statistics and confidence intervals are used to measure the deviations from the mean-variance relation parameters. We use the modified relation as a means of fitting generalized quasi-likelihood models to correlated data. The performance of the proposed modified generalized quasi-likelihood model is demonstrated through a simulation study and the importance of accounting for overdispersion is highlighted through the evaluation of adolescent obesity data collected from a U.S. longitudinal study.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7233292PMC
http://dx.doi.org/10.1177/0962280219884980DOI Listing

Publication Analysis

Top Keywords

generalized quasi-likelihood
12
quasi-likelihood model
8
distributions exponential
8
exponential family
8
mean-variance relationship
8
correlated data
8
modification generalized
4
model analysis
4
analysis add
4
add health
4

Similar Publications

edgeR is an R/Bioconductor software package for differential analyses of sequencing data in the form of read counts for genes or genomic features. Over the past 15 years, edgeR has been a popular choice for statistical analysis of data from sequencing technologies such as RNA-seq or ChIP-seq. edgeR pioneered the use of the negative binomial distribution to model read count data with replicates and the use of generalized linear models to analyze complex experimental designs.

View Article and Find Full Text PDF

Background: Sleep disturbances are a major public health concern in the United States, leading to adverse health outcomes. In the working population, job strain has been identified as an important risk factor for sleep disturbances, but evidence from the United States remained limited. This study aimed to examine longitudinal associations between job strain and sleep disturbances in the United States, with a focus on the alternative formulations of job strain.

View Article and Find Full Text PDF

Background: The burden of the COVID-19 pandemic in terms of morbidity and mortality differentially affected populations. Between and within populations, behavior change was likewise heterogeneous. Factors influencing precautionary behavior adoption during COVID-19 have been associated with multidimensional aspects of risk perception; however, the influence of lived experiences during other recent outbreaks on behavior change during COVID-19 has been less studied.

View Article and Find Full Text PDF

Hierarchical selection of genetic and gene by environment interaction effects in high-dimensional mixed models.

Stat Methods Med Res

December 2024

Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada.

Interactions between genes and environmental factors may play a key role in the etiology of many common disorders. Several regularized generalized linear models have been proposed for hierarchical selection of gene by environment interaction effects, where a gene-environment interaction effect is selected only if the corresponding genetic main effect is also selected in the model. However, none of these methods allow to include random effects to account for population structure, subject relatedness and shared environmental exposure.

View Article and Find Full Text PDF

In the classical theory of locally optimal designs, which is developed within the framework of the center+error model, the most efficient design is the one based on MGLE, the maximum Gaussian likelihood estimator. However, practical scenarios often lack of complete information as to the governing probability model for the response measure and deviate from Gaussianity and homoscedasticity assumptions, in which, MqLE, the maximum quasi-likelihood estimator, has been advocated in the literature. In this work, we examine the locally optimal design based on the novel oracle-SLSE, the second-order least-square estimator, in the case where the underlying probability model is incompletely specified.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!