Subsampling is a practical strategy for analyzing vast survival data, which are progressively encountered across diverse research domains. While the optimal subsampling method has been applied to inferences for Cox models and parametric accelerated failure time (AFT) models, its application to semi-parametric AFT models with rank-based estimation have received limited attention. The challenges arise from the non-smooth estimating function for regression coefficients and the seemingly zero contribution from censored observations in estimating functions in the commonly seen form. To address these challenges, we develop optimal subsampling probabilities for both event and censored observations by expressing the estimating functions through a well-defined stochastic process. Meanwhile, we apply an induced smoothing procedure to the non-smooth estimating functions. As the optimal subsampling probabilities depend on the unknown regression coefficients, we employ a two-step procedure to obtain a feasible estimation method. An additional benefit of the method is its ability to resolve the issue of underestimation of the variance when the subsample size approaches the full sample size. We validate the performance of our estimators through a simulation study and apply the methods to analyze the survival time of lymphoma patients in the surveillance, epidemiology, and end results program.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/sim.10200 | DOI Listing |
NPJ Precis Oncol
January 2025
Department of Thoracic Surgery, Peking University People's Hospital, Beijing, 100044, China.
Next-generation sequencing (NGS) offers a promising approach for differentiating multiple primary lung cancers (MPLC) from intrapulmonary metastasis (IPM), though panel selection and clonal interpretation remain challenging. Whole-exome sequencing (WES) data from 80 lung cancer samples were utilized to simulate MPLC and IPM, with various sequenced panels constructed through gene subsampling. Two clonal interpretation approaches primarily applied in clinical practice, MoleA (based on shared mutation comparison) and MoleB (based on probability calculation), were subsequently evaluated.
View Article and Find Full Text PDFEnviron Sci Technol
January 2025
Rollins School of Public Health, Emory U, Atlanta, Ga 30322, United States.
Repeated measurements of household air pollution may provide better estimates of average exposure but can add to costs and participant burden. In a randomized trial of gas versus biomass cookstoves in four countries, we took supplemental personal 24-h measurements on a 10% subsample for mothers and infants, interspersed between protocol samples. Mothers had up to five postrandomization protocol measurements over 16 months, while infants had three measurements over one year.
View Article and Find Full Text PDFJ Neural Eng
January 2025
Center for Complex Systems and Brain Sciences, Universidad Nacional de San Martin Escuela de Ciencia Y Tecnologia, 25 de Mayo y Francia, San Martín, Buenos Aires, 1650, ARGENTINA.
Objective Magnetic resonance imaging (MRI), functional MRI (fMRI) and other neuroimaging techniques are routinely used in medical diagnosis, cognitive neuroscience or recently in brain decoding. They produce three- or four-dimensional scans reflecting the geometry of brain tissue or activity, which is highly correlated temporally and spatially. While there exist numerous theoretically guided methods for analyzing correlations in one-dimensional data, they often cannot be readily generalized to the multidimensional geometrically embedded setting.
View Article and Find Full Text PDFBMC Res Notes
December 2024
Department of Biological Sciences, University of Arkansas, Fayetteville, AR, USA.
Objective: Extracting DNA is essential in wildlife genetic studies, and numerous methods are available. However, the process is costly and time-consuming for non-model organisms, including most wildlife species. Therefore, we optimized a cost-efficient protocol to extract DNA from the muscle tissue of White-tailed Deer using the DNAdvance kit (Beckman Coulter), a magnetic-bead-based approach.
View Article and Find Full Text PDFJ Inflamm Res
December 2024
Department of Cardiology, College of Medicine, Southwest Jiaotong University, Chengdu Cardiovascular Disease Research Institute, The Third People's Hospital of Chengdu, Chengdu, Sichuan, People's Republic of China.
Background: Increased levels of remnant cholesterol (RC) and inflammation are linked to higher risks of atherosclerotic cardiovascular disease. Whether a combination of C-reactive protein (CRP) and RC improves the predictive ability for evaluating the severity of coronary artery lesions remains unknown.
Methods: A total of 1675 patients with coronary artery disease were stratified according to the Synergy Between Percutaneous Coronary Intervention (SYNTAX) score (SYNTAX score ≤22 versus SYNTAX score >22).
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!