Nearest-neighbor Projected-Distance Regression (NPDR) is a feature selection technique that uses nearest-neighbors in high dimensional data to detect complex multivariate effects including epistasis. NPDR uses a regression formalism that allows statistical significance testing and efficient control for multiple testing. In addition, the regression formalism provides a mechanism for NPDR to adjust for population structure, which we apply to a GWAS of systemic lupus erythematosus (SLE). We also test NPDR on benchmark simulated genetic variant data with epistatic effects, main effects, imbalanced data for case-control design and continuous outcomes. NPDR identifies potential interactions in an epistasis network that influences the SLE disorder.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387719PMC
http://dx.doi.org/10.3389/fgene.2020.00784DOI Listing

Publication Analysis

Top Keywords

population structure
8
regression formalism
8
npdr
5
nearest-neighbor projected
4
projected distance
4
regression
4
distance regression
4
regression epistasis
4
epistasis detection
4
detection gwas
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!