Importance: Germline gene panel testing is recommended for men with advanced prostate cancer (PCa) or a family history of cancer. While evidence is limited for some genes currently included in panel testing, gene panels are also likely to be incomplete and missing genes that influence PCa risk and aggressive disease.
Objective: To identify genes associated with aggressive PCa.
Little is known regarding the potential relationship between clonal hematopoiesis (CH) of indeterminate potential (CHIP), which is the expansion of hematopoietic stem cells with somatic mutations, and risk of prostate cancer, the fifth leading cause of cancer death of men worldwide. We evaluated the association of age-related CHIP with overall and aggressive prostate cancer risk in two large whole-exome sequencing studies of 75 047 European ancestry men, including 7663 prostate cancer cases, 2770 of which had aggressive disease, and 3266 men carrying CHIP variants. We found that CHIP, defined by over 50 CHIP genes individually and in aggregate, was not significantly associated with overall (aggregate HR = 0.
View Article and Find Full Text PDFCarriers of germline biallelic pathogenic variants in the MUTYH gene have a high risk of colorectal cancer. We test 5649 colorectal cancers to evaluate the discriminatory potential of a tumor mutational signature specific to MUTYH for identifying biallelic carriers and classifying variants of uncertain clinical significance (VUS). Using a tumor and matched germline targeted multi-gene panel approach, our classifier identifies all biallelic MUTYH carriers and all known non-carriers in an independent test set of 3019 colorectal cancers (accuracy = 100% (95% confidence interval 99.
View Article and Find Full Text PDFHere we describe MyGene2, Geno2MP, VariantMatcher, and Franklin; databases that provide variant-level information and phenotypic features to researchers, clinicians, healthcare providers and patients. Following the footsteps of the Matchmaker Exchange project that connects exome, genome, and phenotype databases at the gene level, these databases have as one goal to facilitate connection to one another using Data Connect, a standard for discovery and search of biomedical data from the Global Alliance for Genomics and Health (GA4GH).
View Article and Find Full Text PDFGeneMatcher (genematcher.org) is a tool designed to connect individuals with an interest in the same gene. Now used around the world to create collaborations and generate the evidence needed to support novel disease gene identification, GeneMatcher is a founding member of the Matchmaker Exchange (MME; matchmakerexchange.
View Article and Find Full Text PDFPurpose: Mendelian disease genomic research has undergone a massive transformation over the past decade. With increasing availability of exome and genome sequencing, the role of Mendelian research has expanded beyond data collection, sequencing, and analysis to worldwide data sharing and collaboration.
Methods: Over the past 10 years, the National Institutes of Health-supported Centers for Mendelian Genomics (CMGs) have played a major role in this research and clinical evolution.
TET3 encodes an essential dioxygenase involved in epigenetic regulation through DNA demethylation. TET3 deficiency, or Beck-Fahrner syndrome (BEFAHRS; MIM: 618798), is a recently described neurodevelopmental disorder of the DNA demethylation machinery with a nonspecific phenotype resembling other chromatin-modifying disorders, but inconsistent variant types and inheritance patterns pose diagnostic challenges. Given TET3's direct role in regulating 5-methylcytosine and recent identification of syndrome-specific DNA methylation profiles, we analyzed genome-wide DNA methylation in whole blood of TET3-deficient individuals and identified an episignature that distinguishes affected and unaffected individuals and those with mono-allelic and bi-allelic pathogenic variants.
View Article and Find Full Text PDFBackground: With the advent of whole exome (ES) and genome sequencing (GS) as tools for disease gene discovery, rare variant filtering, prioritization and data sharing have become essential components of the search for disease genes and variants potentially contributing to disease phenotypes. The computational storage, data manipulation, and bioinformatic interpretation of thousands to millions of variants identified in ES and GS, respectively, is a challenging task. To aid in that endeavor, we constructed PhenoDB, GeneMatcher and VariantMatcher.
View Article and Find Full Text PDFBackground: There is an urgent need to identify factors specifically associated with aggressive prostate cancer (PCa) risk. We investigated whether rare pathogenic, likely pathogenic, or deleterious (P/LP/D) germline variants in DNA repair genes are associated with aggressive PCa risk in a case-case study of aggressive vs nonaggressive disease.
Methods: Participants were 5545 European-ancestry men, including 2775 nonaggressive and 2770 aggressive PCa cases, which included 467 metastatic cases (16.
Identifying genes and variants contributing to rare disease phenotypes and Mendelian conditions informs biology and medicine, yet potential phenotypic consequences for variation of >75% of the ~20,000 annotated genes in the human genome are lacking. Technical advances to assess rare variation genome-wide, particularly exome sequencing (ES), enabled establishment in the United States of the National Institutes of Health (NIH)-supported Centers for Mendelian Genomics (CMGs) and have facilitated collaborative studies resulting in novel "disease gene" discoveries. Pedigree-based genomic studies and rare variant analyses in families with suspected Mendelian conditions have led to the elucidation of hundreds of novel disease genes and highlighted the impact of de novo mutational events, somatic variation underlying nononcologic traits, incompletely penetrant alleles, phenotypes with high locus heterogeneity, and multilocus pathogenic variation.
View Article and Find Full Text PDFTo further dissect the genetic architecture of colorectal cancer (CRC), we performed whole-genome sequencing of 1,439 cases and 720 controls, imputed discovered sequence variants and Haplotype Reference Consortium panel variants into genome-wide association study data, and tested for association in 34,869 cases and 29,051 controls. Findings were followed up in an additional 23,262 cases and 38,296 controls. We discovered a strongly protective 0.
View Article and Find Full Text PDFThe breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas.
View Article and Find Full Text PDFRecent technological advancements have permitted high-throughput measurement of the human genome, epigenome, metabolome, transcriptome, and proteome at the population level. We hypothesized that subsets of genes identified from omic studies might have closely related biological functions and thus might interact directly at the network level. Therefore, we conducted an integrative analysis of multi-omic datasets of non-small cell lung cancer (NSCLC) to search for association patterns beyond the genome and transcriptome.
View Article and Find Full Text PDFKabuki syndrome is a monogenic disorder caused by loss of function variants in either of two genes encoding histone-modifying enzymes. We performed targeted sequencing in a cohort of 27 probands with a clinical diagnosis of Kabuki syndrome. Of these, 12 had causative variants in the two known Kabuki syndrome genes.
View Article and Find Full Text PDFBreast cancer risk is influenced by rare coding variants in susceptibility genes, such as BRCA1, and many common, mostly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. Here we report the results of a genome-wide association study of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry.
View Article and Find Full Text PDFMost common breast cancer susceptibility variants have been identified through genome-wide association studies (GWAS) of predominantly estrogen receptor (ER)-positive disease. We conducted a GWAS using 21,468 ER-negative cases and 100,594 controls combined with 18,908 BRCA1 mutation carriers (9,414 with breast cancer), all of European origin. We identified independent associations at P < 5 × 10 with ten variants at nine new loci.
View Article and Find Full Text PDFThere has been extensive debate about both the necessity of orthogonal confirmation of next-generation sequencing (NGS) results in Clinical Laboratory Improvement Amendments-approved laboratories and return of research NGS results to participants enrolled in research studies. In eMERGE-PGx, subjects underwent research NGS using PGRNseq and orthogonal targeted genotyping in clinical laboratories, which prompted a comparison of genotyping results between platforms. Concordance (percentage agreement) was reported for 4077 samples tested across nine combinations of research and clinical laboratories.
View Article and Find Full Text PDFTo identify common alleles associated with different histotypes of epithelial ovarian cancer (EOC), we pooled data from multiple genome-wide genotyping projects totaling 25,509 EOC cases and 40,941 controls. We identified nine new susceptibility loci for different EOC histotypes: six for serous EOC histotypes (3q28, 4q32.3, 8q21.
View Article and Find Full Text PDFThis unit describes a technique for generating exome-enriched sequencing libraries using DNA extracted from formalin-fixed paraffin-embedded (FFPE) samples. Utilizing commercially available kits, we present a low-input FFPE workflow starting with 50 ng of DNA. This procedure includes a repair step to address damage caused by FFPE preservation that improves sequence quality.
View Article and Find Full Text PDFOrofacial clefts (OFCs), which include non-syndromic cleft lip with or without cleft palate (CL/P), are among the most common birth defects in humans, affecting approximately 1 in 700 newborns. CL/P is phenotypically heterogeneous and has a complex etiology caused by genetic and environmental factors. Previous genome-wide association studies (GWASs) have identified at least 15 risk loci for CL/P.
View Article and Find Full Text PDFCleft palate (CP) is a common birth defect occurring in 1 in 2,500 live births. Approximately half of infants with CP have a syndromic form, exhibiting other physical and cognitive disabilities. The other half have nonsyndromic CP, and to date, few genes associated with risk for nonsyndromic CP have been characterized.
View Article and Find Full Text PDFImportance: Large-scale DNA sequencing identifies incidental rare variants in established Mendelian disease genes, but the frequency of related clinical phenotypes in unselected patient populations is not well established. Phenotype data from electronic medical records (EMRs) may provide a resource to assess the clinical relevance of rare variants.
Objective: To determine the clinical phenotypes from EMRs for individuals with variants designated as pathogenic by expert review in arrhythmia susceptibility genes.