The paper addresses a common problem in the analysis of high-dimensional high-throughput "omics" data, which is parameter estimation across multiple variables in a set of data where the number of variables is much larger than the sample size. Among the problems posed by this type of data are that variable-specific estimators of variances are not reliable and variable-wise tests statistics have low power, both due to a lack of degrees of freedom. In addition, it has been observed in this type of data that the variance increases as a function of the mean. We introduce a non-parametric adaptive regularization procedure that is innovative in that : (i) it employs a novel "similarity statistic"-based clustering technique to generate local-pooled or regularized shrinkage estimators of population parameters, (ii) the regularization is done jointly on population moments, benefiting from C. Stein's result on inadmissibility, which implies that usual sample variance estimator is improved by a shrinkage estimator using information contained in the sample mean. From these joint regularized shrinkage estimators, we derived regularized t-like statistics and show in simulation studies that they offer more statistical power in hypothesis testing than their standard sample counterparts, or regular common value-shrinkage estimators, or when the information contained in the sample mean is simply ignored. Finally, we show that these estimators feature interesting properties of variance stabilization and normalization that can be used for preprocessing high-dimensional multivariate data. The method is available as an R package, called 'MVR' ('Mean-Variance Regularization'), downloadable from the CRAN website.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3375876 | PMC |
http://dx.doi.org/10.1016/j.csda.2012.01.012 | DOI Listing |
J Educ Health Promot
November 2024
Department of Medical Education, Medical Education Research Center, Education Development Center, Isfahan University of Medical Sciences, Isfahan, Iran.
Background: Since assessment of basic thinking skills is crucial in identifying an individual's cognitive capacities and comprehending their strengths and weaknesses, developing a suitable evaluating instrument holds significant importance in scheduling the training of basic thinking skills. This study aims to develop and subsequently validate a self-assessment questionnaire of basic thinking skills for medical sciences students.
Materials And Methods: The present study of designing and psychometrically testing the self-assessment questionnaire of basic thinking skills among medical sciences students was conducted between 2022 and 2023 at Isfahan University.
Orthop J Sports Med
January 2025
Department of Orthopedics, Affiliated Zhongshan Hospital of Dalian University, Dalian, PR China.
Background: Although previous studies have investigated the risk factors for rotator cuff syndrome (RCS), there remains controversy due to uncontrolled and uncertain confounding factors in their analyses.
Purpose: To perform Mendelian randomization (MR) analysis using single-nucleotide polymorphisms to investigate the causal relationship between RCS and 4 risk factors: type 2 diabetes mellitus (T2DM), high blood pressure (HBP), body mass index (BMI), and low high-density lipoprotein cholesterol (HDL-C).
Study Design: Descriptive epidemiology study.
Sci Data
January 2025
Section of Intensive Plant Food Systems, Albrecht Daniel Thaer-Institute of Agricultural and Horticultural Sciences, Humboldt Universität zu Berlin, Berlin, Germany.
Multi-environmental trials (MET) with temporal and spatial variance are crucial for understanding genotype-environment-management (GxExM) interactions in crops. Here, we present a MET dataset for winter wheat in Germany. The dataset encompasses MET spanning six years (2015-2020), six locations and nine crop management scenarios (consisting of combinations for three treatments, unbalanced in each location and year) comparing 228 cultivars released between 1963 and 2016, amounting to a total of 526,751 data points covering 24 traits.
View Article and Find Full Text PDFMedicine (Baltimore)
November 2024
Obstetrics and Gynecology Hospital of Fudan University, Shanghai, China.
Consensus remains elusive regarding the relationship between C-reactive protein (CRP) levels and endometrial cancer (EC). Our study sought to elucidate the causal association between CRP and EC, aiming to contribute to the understanding of this complex interplay. We primarily utilized the random-effects inverse variance-weighted method.
View Article and Find Full Text PDFBackground: Platelets are correlated with myeloid leukemia (ML), but to date, there have been no studies confirming the causal relationship between them.
Methods: Platelet count (PLT), mean platelet volume (MPV), plateletcrit (PCT), and platelet distribution width (PDW) data were obtained from the GWAS catalog database as exposure factors. Acute myeloid leukemia (AML) and chronic myeloid leukemia (CML) data were obtained from the FinnGen database as outcome indicators.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!