We extend the discussion of Lee et al. and others on methods for performing secondary analyses of case-control sampled data and carry out an extensive investigation of efficiency and robustness. We find that, with the exception of the 'analyse-the-controls-only' strategy for populations in which cases are rare, ad hoc methods in common usage often lead to extremely misleading conclusions and that it is not possible to tell in advance when this will happen. Weighted likelihood and semi-parametric maximum likelihood (SPML) methods are justified theoretically. We find that SPML can be as much as twice as efficient as the weighted method, but it is subject to bias in estimating parameters of interest when the nuisance models it requires have been mis-specified. The weighted method needs no nuisance models and is therefore robust in this regard, but without sophisticated modelling of the kind SPML involves we cannot tell when it is going to be very inefficient. Practitioners should routinely use both methods and will often have to weigh up the practical consequences of severe inefficiency and lack of robustness in the context of their enquiries.
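To illustrate the contrast drawn in the abstract between ad hoc pooling and a weighted analysis, the following is a minimal simulation sketch, not the authors' implementation. It assumes a linear secondary model fitted by inverse-sampling-probability weighted least squares, a stand-in for the weighted likelihood idea. The population size, the coefficients, the disease model, and the helper name `weighted_line_fit` are all hypothetical choices made for illustration, and the sampling fractions are assumed known.

```python
import math
import random


def weighted_line_fit(xs, ys, ws):
    """Weighted least squares for y = a + b*x; returns (intercept, slope)."""
    sw = sum(ws)
    mx = sum(w * x for w, x in zip(ws, xs)) / sw
    my = sum(w * y for w, y in zip(ws, ys)) / sw
    sxx = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope


# Hypothetical population: secondary outcome y depends on covariate x
# with true intercept 1.0 and true slope 0.5 (illustrative values only).
rng = random.Random(2283)
population = []
for _ in range(20000):
    x = rng.gauss(0.0, 1.0)
    y = 1.0 + 0.5 * x + rng.gauss(0.0, 1.0)
    # Disease risk depends on y, so case-control sampling distorts
    # the distribution of y in the sample.
    p = 1.0 / (1.0 + math.exp(-(-4.0 + y)))
    d = 1 if rng.random() < p else 0
    population.append((x, y, d))

# Case-control sample: every case plus an equal number of random controls.
cases = [r for r in population if r[2] == 1]
all_controls = [r for r in population if r[2] == 0]
controls = rng.sample(all_controls, len(cases))
sample = cases + controls

xs = [r[0] for r in sample]
ys = [r[1] for r in sample]

# Ad hoc analysis: pool cases and controls and ignore the sampling design.
naive = weighted_line_fit(xs, ys, [1.0] * len(sample))

# Weighted analysis: weight each subject by the inverse of its sampling
# fraction (1 for cases, which were all sampled; N0/n0 for controls).
control_weight = len(all_controls) / len(controls)
ws = [1.0 if r[2] == 1 else control_weight for r in sample]
weighted = weighted_line_fit(xs, ys, ws)

# Controls-only analysis, which the abstract singles out as acceptable
# when cases are rare in the population.
controls_only = weighted_line_fit(
    [r[0] for r in controls], [r[1] for r in controls], [1.0] * len(controls)
)

print("naive         (intercept, slope):", naive)
print("weighted      (intercept, slope):", weighted)
print("controls only (intercept, slope):", controls_only)
```

In this set-up the pooled fit should drift away from the true coefficients because cases are over-represented, while the weighted fit should stay close to them. The sketch says nothing about efficiency relative to SPML, which would require specifying the nuisance models the abstract refers to.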
DOI: http://dx.doi.org/10.1002/sim.2283
Biostatistics
November 2024
Department of Population Medicine, Harvard Pilgrim Healthcare Institute and Harvard Medical School, 401 Park Drive, Boston, MA, 02215, United States.
Maximum likelihood inference can often become computationally intensive when performing joint modeling of longitudinal and time-to-event data, due to the intractable integrals in the joint likelihood function. The computational challenges escalate further when modeling HIV-1 viral load data, owing to the nonlinear trajectories and the presence of left-censored data resulting from the assay's lower limit of quantification. In this paper, for a joint model comprising a nonlinear mixed-effect model and a Cox Proportional Hazards model, we develop a computationally efficient Stochastic EM (StEM) algorithm for parameter estimation.
Commun Stat Theory Methods
August 2023
Division of Biostatistics and Bioinformatics, Department of Environmental and Public Health Sciences, University of Cincinnati, Cincinnati, Ohio, 45267, USA.
Phase IV clinical trials are designed to monitor long-term side effects of medical treatment. For instance, childhood cancer survivors treated with chest radiation and/or anthracycline are often at risk of developing cardiotoxicity during their adulthood. Often the primary focus of a study could be on estimating the cumulative incidence of a particular outcome of interest such as cardiotoxicity.
Sci Total Environ
June 2024
Mahavir Cancer Sansthan and Research Centre, Patna, Bihar 801505, India.
Recently, a substantial increase in gallbladder cancer (GBC) cases has been reported in Bihar, India. The region's groundwater can naturally contain harmful concentrations of arsenic, which appears to be epidemiologically linked to the unusually high incidence. However, the root causes remain largely unexplored.
Entropy (Basel)
March 2024
Department of Statistics, Texas A&M University, College Station, TX 77843, USA.
We commonly encounter the problem of identifying an optimally weight-adjusted version of the empirical distribution of observed data, adhering to predefined constraints on the weights. Such constraints often manifest as restrictions on the moments, tail behavior, shapes, number of modes, etc., of the resulting weight-adjusted empirical distribution.
Int J Behav Nutr Phys Act
January 2024
Department of Epidemiological Methods and Etiological Research, Leibniz Institute for Prevention Research and Epidemiology - BIPS, Achterstrasse 30, 28359, Bremen, Germany.