A fast kernel independence test for cluster-correlated data.

Sci Rep

Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA.

Published: December 2022

Cluster-correlated data receives a lot of attention in biomedical and longitudinal studies and it is of interest to assess the generalized dependence between two multivariate variables under the cluster-correlated structure. The Hilbert-Schmidt independence criterion (HSIC) is a powerful kernel-based test statistic that captures various dependence between two random vectors and can be applied to an arbitrary non-Euclidean domain. However, the existing HSIC is not directly applicable to cluster-correlated data. Therefore, we propose a HSIC-based test of independence for cluster-correlated data. The new test statistic combines kernel information so that the dependence structure in each cluster is fully considered and exhibits good performance under high dimensions. Moreover, a rapid p value approximation makes the new test fast applicable to large datasets. Numerical studies show that the new approach performs well in both synthetic and real world data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9755291PMC
http://dx.doi.org/10.1038/s41598-022-26278-9DOI Listing

Publication Analysis

Top Keywords

cluster-correlated data
16
test statistic
8
test
5
cluster-correlated
5
data
5
fast kernel
4
kernel independence
4
independence test
4
test cluster-correlated
4
data cluster-correlated
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!