Publications by authors named "Matthew Aguirre"

Gene regulatory networks (GRNs) govern many core developmental and biological processes underlying human complex traits. Even with broad-scale efforts to characterize the effects of molecular perturbations and interpret gene coexpression, it remains challenging to infer the architecture of gene regulation in a precise and efficient manner. Key properties of GRNs, like hierarchical structure, modular organization, and sparsity, provide both challenges and opportunities for this objective.

View Article and Find Full Text PDF
Article Synopsis
  • - The text discusses the challenges of detecting complex genetic interactions (epistasis) that influence human traits, pointing out that traditional regression methods struggle with high-order interactions in large genomic datasets due to computational limitations and inadequacies in modeling biological interactions properly.
  • - It introduces the epiTree pipeline, built on a framework called Predictability, Computability, Stability (PCS), which utilizes tree-based models to identify higher-order interactions in genomic data by selecting relevant variants based on tissue-specific gene expression and employing iterative random forests.
  • - The efficacy of the epiTree pipeline is validated through two case studies from the UK Biobank, demonstrating its ability to reveal both known and novel genetic interactions in predicting traits like red hair and multiple sclerosis, thus potentially
View Article and Find Full Text PDF

Mapping the functional human genome and impact of genetic variants is often limited to European-descendent population samples. To aid in overcoming this limitation, we measured gene expression using RNA sequencing in lymphoblastoid cell lines (LCLs) from 599 individuals from six African populations to identify novel transcripts including those not represented in the hg38 reference genome. We used whole genomes from the 1000 Genomes Project and 164 Maasai individuals to identify 8,881 expression and 6,949 splicing quantitative trait loci (eQTLs/sQTLs), and 2,611 structural variants associated with gene expression (SV-eQTLs).

View Article and Find Full Text PDF
Article Synopsis
  • Congenital heart disease (CHD) has a strong genetic component, yet previous research has struggled to pinpoint inherited risks due to limited analysis of common variants in small groups of people.
  • A large study involving 55,342 participants reanalyzed genetic data, identifying 16 new genetic locations associated with different types of CHD, including 12 rare variants with notable effects.
  • The findings indicate that while each type of CHD is heritable, they appear to have distinct genetic risks, underscoring the complexity of CHD genetics.
View Article and Find Full Text PDF

Background: A large proportion of genetic risk remains unexplained for structural heart disease involving the interventricular septum (IVS) including hypertrophic cardiomyopathy and ventricular septal defects. This study sought to develop a reproducible proxy of IVS structure from standard medical imaging, discover novel genetic determinants of IVS structure, and relate these loci to diseases of the IVS, hypertrophic cardiomyopathy, and ventricular septal defect.

Methods: We estimated the cross-sectional area of the IVS from the 4-chamber view of cardiac magnetic resonance imaging in 32ā€‰219 individuals from the UK Biobank which was used as the basis of genome wide association studies and Mendelian randomization.

View Article and Find Full Text PDF

Polygenic risk scores (PRSs) quantify the contribution of multiple genetic loci to an individual's likelihood of a complex trait or disease. However, existing PRSs estimate this likelihood with common genetic variants, excluding the impact of rare variants. Here, we report on a method to identify rare variants associated with outlier gene expression and integrate their impact into PRS predictions for body mass index (BMI), obesity, and bariatric surgery.

View Article and Find Full Text PDF

Whole-genome sequencing studies applied to large populations or biobanks with extensive phenotyping raise new analytic challenges. The need to consider many variants at a locus or group of genes simultaneously and the potential to study many correlated phenotypes with shared genetic architecture provide opportunities for discovery not addressed by the traditional one variant, one phenotype association study. Here, we introduce a Bayesian model comparison approach called MRP (multiple rare variants and phenotypes) for rare-variant association studies that considers correlation, scale, and direction of genetic effects across a group of genetic variants, phenotypes, and studies, requiring only summary statistic data.

View Article and Find Full Text PDF

Objective: The purpose of this study was to understand the experiences of historically underrepresented graduate students, more than half of whom were enrolled in science, technology, engineering, and mathematics (STEM) disciplines, during the COVID-19 pandemic. This focus group study represents an initial stage in developing an intervention for historically underrepresented graduate students and their families.

Background: Underrepresentation of graduate students of color in STEM has been attributed to a myriad of factors, including a lack of support systems.

View Article and Find Full Text PDF

Polygenic risk models have led to significant advances in understanding complex diseases and their clinical presentation. While polygenic risk scores (PRS) can effectively predict outcomes, they do not generally account for disease subtypes or pathways which underlie within-trait diversity. Here, we introduce a latent factor model of genetic risk based on components from Decomposition of Genetic Associations (DeGAs), which we call the DeGAs polygenic risk score (dPRS).

View Article and Find Full Text PDF

Clinical laboratory tests are a critical component of the continuum of care. We evaluate the genetic basis of 35 blood and urine laboratory measurements in the UK Biobank (nā€‰=ā€‰363,228 individuals). We identify 1,857 loci associated with at least one trait, containing 3,374 fine-mapped associations and additional sets of large-effect (>0.

View Article and Find Full Text PDF

The clinical evaluation of a genetic syndrome relies upon recognition of a characteristic pattern of signs or symptoms to guide targeted genetic testing for confirmation of the diagnosis. However, individuals displaying a single phenotype of a complex syndrome may not meet criteria for clinical diagnosis or genetic testing. Here, we present a phenome-wide association study (PheWAS) approach to systematically explore the phenotypic expressivity of common and rare alleles in genes associated with four well-described syndromic diseases (Alagille (AS), Marfan (MS), DiGeorge (DS), and Noonan (NS) syndromes) in the general population.

View Article and Find Full Text PDF

The UK Biobank is a very large, prospective population-based cohort study across the United Kingdom. It provides unprecedented opportunities for researchers to investigate the relationship between genotypic information and phenotypes of interest. Multiple regression methods, compared with genome-wide association studies (GWAS), have already been showed to greatly improve the prediction performance for a variety of phenotypes.

View Article and Find Full Text PDF

Population-scale biobanks that combine genetic data and high-dimensional phenotyping for a large number of participants provide an exciting opportunity to perform genome-wide association studies (GWAS) to identify genetic variants associated with diverse quantitative traits and diseases. A major challenge for GWAS in population biobanks is ascertaining disease cases from heterogeneous data sources such as hospital records, digital questionnaire responses, or interviews. In this study, we use genetic parameters, including genetic correlation, to evaluate whether GWAS performed using cases in the UK Biobank ascertained from hospital records, questionnaire responses, and family history of disease implicate similar disease genetics across a range of effect sizes.

View Article and Find Full Text PDF

Population-based biobanks with genomic and dense phenotype data provide opportunities for generating effective therapeutic hypotheses and understanding the genomic role in disease predisposition. To characterize latent components of genetic associations, we apply truncated singular value decomposition (DeGAs) to matrices of summary statistics derived from genome-wide association analyses across 2,138 phenotypes measured in 337,199 White British individuals in the UK Biobank study. We systematically identify key components of genetic associations and the contributions of variants, genes, and phenotypes to each component.

View Article and Find Full Text PDF

Copy-number variations (CNVs) represent a significant proportion of the genetic differences between individuals and many CNVs associate causally with syndromic disease and clinical outcomes. Here, we characterize the landscape of copy-number variation and their phenome-wide effects in a sample of 472,228 array-genotyped individuals from the UK Biobank. In addition to population-level selection effects against genic loci conferring high mortality, we describe genetic burden from potentially pathogenic and previously uncharacterized CNV loci across more than 3,000 quantitative and dichotomous traits, with separate analyses for common and rare classes of variation.

View Article and Find Full Text PDF

Summary: Large biobanks linking phenotype to genotype have led to an explosion of genetic association studies across a wide range of phenotypes. Sharing the knowledge generated by these resources with the scientific community remains a challenge due to patient privacy and the vast amount of data. Here, we present Global Biobank Engine (GBE), a web-based tool that enables exploration of the relationship between genotype and phenotype in biobank cohorts, such as the UK Biobank.

View Article and Find Full Text PDF