Background: Bias from data missing not at random (MNAR) is a persistent concern in health-related research. A bias analysis quantitatively assesses how conclusions change under different assumptions about missingness using bias parameters that govern the magnitude and direction of the bias. Probabilistic bias analysis specifies a prior distribution for these parameters, explicitly incorporating available information and uncertainty about their true values.
View Article and Find Full Text PDFAuxiliary variables are used in multiple imputation (MI) to reduce bias and increase efficiency. These variables may often themselves be incomplete. We explored how missing data in auxiliary variables influenced estimates obtained from MI.
View Article and Find Full Text PDFIn clinical studies, multi-state model (MSM) analysis is often used to describe the sequence of events that patients experience, enabling better understanding of disease progression. A complicating factor in many MSM studies is that the exact event times may not be known. Motivated by a real dataset of patients who received stem cell transplants, we considered the setting in which some event times were exactly observed and some were missing.
View Article and Find Full Text PDFBackground: There are many ways in which selection bias might impact COVID-19 research. Here we focus on selection for receiving a polymerase-chain-reaction (PCR) SARS-CoV-2 test and how known changes to selection pressures over time may bias research into COVID-19 infection.
Methods: Using UK Biobank (N = 420,231; 55% female; mean age = 66.
Many partial identification problems can be characterized by the optimal value of a function over a set where both the function and set need to be estimated by empirical data. Despite some progress for convex problems, statistical inference in this general setting remains to be developed. To address this, we derive an asymptotically valid confidence interval for the optimal value through an appropriate relaxation of the estimated set.
View Article and Find Full Text PDFBackground: Failure to appropriately account for unmeasured confounding may lead to erroneous conclusions. Quantitative bias analysis (QBA) can be used to quantify the potential impact of unmeasured confounding or how much unmeasured confounding would be needed to change a study's conclusions. Currently, QBA methods are not routinely implemented, partly due to a lack of knowledge about accessible software.
View Article and Find Full Text PDFBackground: Non-random selection of analytic subsamples could introduce selection bias in observational studies. We explored the potential presence and impact of selection in studies of SARS-CoV-2 infection and COVID-19 prognosis.
Methods: We tested the association of a broad range of characteristics with selection into COVID-19 analytic subsamples in the Avon Longitudinal Study of Parents and Children (ALSPAC) and UK Biobank (UKB).
Background: Longitudinal data analysis can improve our understanding of the influences on health trajectories across the life-course. There are a variety of statistical models which can be used, and their fitting and interpretation can be complex, particularly where there is a nonlinear trajectory. Our aim was to provide an accessible guide along with applied examples to using four sophisticated modelling procedures for describing nonlinear growth trajectories.
View Article and Find Full Text PDFLongitudinal data are necessary to reveal changes within an individual as he or she ages. However, rarely will a single cohort study capture data throughout a person's entire life span. Here we describe in detail the steps needed to develop life-course trajectories from cohort studies that cover different and overlapping periods of life.
View Article and Find Full Text PDFA key assumption in Mendelian randomisation is that the relationship between the genetic instruments and the outcome is fully mediated by the exposure, known as the exclusion restriction assumption. However, in epidemiological studies, the exposure is often a coarsened approximation to some latent continuous trait. For example, latent liability to schizophrenia can be thought of as underlying the binary diagnosis measure.
View Article and Find Full Text PDFIn patient follow-up studies, events of interest may take place between periodic clinical assessments and so the exact time of onset is not observed. Such events are known as "bounded" or "interval-censored." Methods for handling such events can be categorized as either (i) applying multiple imputation (MI) strategies or (ii) taking a full likelihood-based (LB) approach.
View Article and Find Full Text PDFStat Methods Med Res
December 2020
Multiple imputation has become one of the most popular approaches for handling missing data in statistical analyses. Part of this success is due to Rubin's simple combination rules. These give frequentist valid inferences when the imputation and analysis procedures are so-called congenial and the embedding model is correctly specified, but otherwise may not.
View Article and Find Full Text PDFBackground: Children with neurodevelopmental disorders are at increased risk of developing depression. Irritability predicts depression in the general population and is common in children with neurodevelopmental disorders. Thus, it is possible that irritability in children with neurodevelopmental disorders contributes to the link with later depression.
View Article and Find Full Text PDFParticipants in epidemiologic and genetic studies are rarely true random samples of the populations they are intended to represent, and both known and unknown factors can influence participation in a study (known as selection into a study). The circumstances in which selection causes bias in an instrumental variable (IV) analysis are not widely understood by practitioners of IV analyses. We use directed acyclic graphs (DAGs) to depict assumptions about the selection mechanism (factors affecting selection) and show how DAGs can be used to determine when a two-stage least squares IV analysis is biased by different selection mechanisms.
View Article and Find Full Text PDFBackground: Missing data are unavoidable in epidemiological research, potentially leading to bias and loss of precision. Multiple imputation (MI) is widely advocated as an improvement over complete case analysis (CCA). However, contrary to widespread belief, CCA is preferable to MI in some situations.
View Article and Find Full Text PDFBackground: Hypertension (high blood pressure) is a common long-term health condition. Patient involvement in treating and monitoring hypertension is essential. Control of hypertension improves population cardiovascular outcomes.
View Article and Find Full Text PDFInt J Obes (Lond)
September 2018
Objective: To examine associations of parental socioeconomic position with early-life offspring body mass index (BMI) trajectories in a middle-income country.
Subjects: Overall, 12,385 Belarusian children born 1996-97 and enrolled in a randomised breastfeeding promotion trial at birth, with 3-14 measurements of BMI from birth to 7 years.
Methods: Cohort analysis in which exposures were parental education (common secondary or less; advanced secondary or partial university; completed university) and occupation (manual; non-manual) at birth, and the outcome was BMI z-score trajectories estimated using multilevel linear spline models, controlling for trial arm, location, parental BMI, maternal smoking status and number of older siblings.
The linear mixed model with an added integrated Ornstein-Uhlenbeck (IOU) process (linear mixed IOU model) allows for serial correlation and estimation of the degree of derivative tracking. It is rarely used, partly due to the lack of available software. We implemented the linear mixed IOU model in Stata and using simulations we assessed the feasibility of fitting the model by restricted maximum likelihood when applied to balanced and unbalanced data.
View Article and Find Full Text PDFEstimating the parameters of a regression model of interest is complicated by missing data on the variables in that model. Multiple imputation is commonly used to handle these missing data. Joint model multiple imputation and full-conditional specification multiple imputation are known to yield imputed data with the same asymptotic distribution when the conditional models of full-conditional specification are compatible with that joint model.
View Article and Find Full Text PDFPurpose: To investigate whether premorbid maternal and childhood psychological problems are risk factors for chronic disabling fatigue at age 13 years among children in the Avon Longitudinal Study of Parents and Children birth cohort.
Methods: Chronic disabling fatigue was defined as fatigue of at least 3-month, and up to 5-year, duration that prevented school attendance or hobbies/sport/leisure activities, and for which other causes were not identified. Maternal psychological factors were symptoms of anxiety and depression assessed up to eight times between pregnancy and age 6 years.
Background: There is increasing emphasis in medical research on modelling growth across the life course and identifying factors associated with growth. Here, we demonstrate multilevel models for childhood growth either as a smooth function (using fractional polynomials) or a set of connected linear phases (using linear splines).
Methods: We related parental social class to height from birth to 10 years of age in 5,588 girls from the Avon Longitudinal Study of Parents and Children (ALSPAC).
Background: Chained equations imputation is widely used in medical research. It uses a set of conditional models, so is more flexible than joint modelling imputation for the imputation of different types of variables (e.g.
View Article and Find Full Text PDFIntroduction: School-based smoking prevention programmes can be effective, but evidence on cost-effectiveness is lacking. We conducted a cost-effectiveness analysis of a school-based "peer-led" intervention.
Methods: We evaluated the ASSIST (A Stop Smoking In Schools Trial) programme in a cluster randomized controlled trial.
Angew Chem Int Ed Engl
January 2008
Amino acids, the building blocks of proteins, also serve as precursors to a wide range of other naturally occurring substances including alkaloids, antibiotics, and, the subject of this Review, heterocyclic peptides. Simple alpha-amino acids are converted into complex arrays of heteroaromatic rings that display interesting and potent biological activity. The thiopeptide antibiotics, with their complex molecular architectures, are wonderful examples.
View Article and Find Full Text PDF