Biometrics
Department of Mathematics, Texas Christian University, Fort Worth, Texas.
Published: December 2019
Errors-in-variables models in high-dimensional settings pose two challenges in application. First, the number of observed covariates is larger than the sample size, while only a small number of covariates are true predictors under an assumption of model sparsity. Second, the presence of measurement error can result in severely biased parameter estimates, and also affects the ability of penalized methods such as the lasso to recover the true sparsity pattern. A new estimation procedure called SIMulation-SELection-EXtrapolation (SIMSELEX) is proposed. This procedure makes double use of lasso methodology. First, the lasso is used to estimate sparse solutions in the simulation step, after which a group lasso is implemented to do variable selection. The SIMSELEX estimator is shown to perform well in variable selection, and has significantly lower estimation error than naive estimators that ignore measurement error. SIMSELEX can be applied in a variety of errors-in-variables settings, including linear models, generalized linear models, and Cox survival models. It is furthermore shown in the Supporting Information how SIMSELEX can be applied to spline-based regression models. A simulation study is conducted to compare the SIMSELEX estimators to existing methods in the linear and logistic model settings, and to evaluate performance compared to naive methods in the Cox and spline models. Finally, the method is used to analyze a microarray dataset that contains gene expression measurements of favorable histology Wilms tumors.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1111/biom.13112 | DOI Listing |
Annu Int Conf IEEE Eng Med Biol Soc
July 2024
Functional cortico-muscular couplings are commonly assessed through cortico-muscular coherence (CMC) analysis, a measure of linear dependency between electroencephalogram (EEG) and electromyogram (EMG) signals. However, the presence of noise in EEG and EMG signals may exceed the strength of synchronous components, posing challenges in reliably detecting CMC. This study introduces an approach based on weighted errors-in-variables (EIV) modelling to extract relevant versions of cortical and muscular signals governing movement control from noisy EEG and EMG signals, aiming to enhance co-herence estimation.
View Article and Find Full Text PDFSci Rep
February 2025
School of Mathematics and Statistics, Xinyang Normal University, Xinyang, 464000, People's Republic of China.
The non-Gaussian characteristic of the external disturbance poses a great challenge for system modeling and identification. This paper develops a robust recursive estimation algorithm for the errors-in-variables nonlinear system with the impulsive noise. The algorithm is formulated by minimizing the continuous logarithmic mixed p-norm criterion, and is capable of giving a robust estimation against the impulsive noise through an adjustable weight gain.
View Article and Find Full Text PDFBiometrics
October 2024
Department of Statistics, Colorado State University, Fort Collins, CO 80523, United States.
Motivated by the challenges in analyzing gut microbiome and metagenomic data, this work aims to tackle the issue of measurement errors in high-dimensional regression models that involve compositional covariates. This paper marks a pioneering effort in conducting statistical inference on high-dimensional compositional data affected by mismeasured or contaminated data. We introduce a calibration approach tailored for the linear log-contrast model.
View Article and Find Full Text PDFStat Med
August 2024
Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.
Analysis of competing risks data has been an important topic in survival analysis due to the need to account for the dependence among the competing events. Also, event times are often recorded on discrete time scales, rendering the models tailored for discrete-time nature useful in the practice of survival analysis. In this work, we focus on regression analysis with discrete-time competing risks data, and consider the errors-in-variables issue where the covariates are prone to measurement errors.
View Article and Find Full Text PDFBiostatistics
October 2024
Department of Statistics and Data Science, University of Pennsylvania, Academic Research Building 311, 265 South 37th Street Philadelphia, PA 19104, United States.
CRISPR genome engineering and single-cell RNA sequencing have accelerated biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present considerable statistical challenges.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!
© LitMetric 2025. All rights reserved.