A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data.

PLoS Comput Biol

Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America.

Published: October 2017

Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5659802PMC
http://dx.doi.org/10.1371/journal.pcbi.1005788DOI Listing

Publication Analysis

Top Keywords

quadratically regularized
12
genetic variants
12
regularized functional
8
canonical correlation
8
correlation analysis
8
association analysis
8
high dimensional
8
pleiotropic analysis
8
ten statistics
8
analysis
7

Similar Publications

Recovering the relaxation spectrum, a fundamental rheological characteristic of polymers, from experiment data requires special identification methods since it is a difficult ill-posed inverse problem. Recently, a new approach relating the identification index directly with a completely unknown real relaxation spectrum has been proposed. The integral square error of the relaxation spectrum model was applied.

View Article and Find Full Text PDF

Background: We aimed to investigate the association between maternal caffeine intake during pregnancy and asthma in children by 10 years of age.

Methods: We considered 5585 mother-child pairs enrolled in a population-based birth cohort. Consumption of regular and decaffeinated coffee, black and green tea, and cola beverages before and during pregnancy was obtained through face-to-face interviews within 72 h after giving birth, and total caffeine intake (mg/day) was estimated.

View Article and Find Full Text PDF

Motivation: In cine MRI, the measurements within each timeframe alone are too noisy for image reconstruction. Some information must be 'borrowed' from other time frames and the reconstruction algorithm is a slow iterative procedure.

Goals: We set up a constrained objective function, which uses the measurements at other time frames to regularize the image reconstruction.

View Article and Find Full Text PDF

This study aimed to develop, characterize, and validate an encapsulant based on beeswax (BW) for rumen-protected fat (RPF) using the melting emulsification technique. Buriti oil (BO) was used as the core material, and BW was used as the encapsulating material at three different proportions of BW:BO (9:1, 4:1, and 2:1 g/g ratio respectively). RPF microspheres (BWBO9:1, BWBO4:1, and BWBO2:1) were characterized and tested in six 3-year-old castrated male Santa Ines sheep (average body weight of 56.

View Article and Find Full Text PDF

The well-posedness of the initial-boundary value problem for higher-order quadratic nonlinear Schrödinger equations on the half-line is studied by utilizing the Fokas solution formula for the corresponding linear problem. Using this formula, linear estimates are derived in Bourgain spaces for initial data in spatial Sobolev spaces on the half-line and boundary data in temporal Sobolev spaces suggested by the time regularity of the linear initial value problem. Then, the needed bilinear estimates are derived and used for showing that the iteration map defined via the Fokas solution formula is a contraction in appropriate solution spaces.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!