Atrial fibrillation (AF) is the most common heart rhythm abnormality and is a leading cause of heart failure and stroke. This large-scale meta-analysis of genome-wide association studies increased the power to detect single-nucleotide variant associations and found more than 350 AF-associated genetic loci. We identified candidate genes related to muscle contractility, cardiac muscle development and cell-cell communication at 139 loci.
View Article and Find Full Text PDFTo address the concern that polygenic hazard scores for prostate cancer (PCa) might not distinguish between indolent and aggressive disease, we performed analyses using a 601-variant polygenic score (PHS601). We hypothesized that among men who eventually developed PCa, those with higher PHS were more likely to develop aggressive disease. We analyzed genetic and phenotypic data from a diverse, national cohort of men diagnosed with PCa (Million Veteran Program, n = 69,901, 6413 metastatic).
View Article and Find Full Text PDFBackground: Coronary artery disease is a leading cause of morbidity and mortality in the United States. Coronary artery disease can lead to major complications including myocardial infarction (MI). The association of dietary cholesterol with coronary artery disease remains inconsistent.
View Article and Find Full Text PDFHeart failure (HF) has limited therapeutic options. In this study, we differentiated the pathophysiological underpinnings of the HF subtypes-HF with reduced ejection fraction (HFrEF) and HF with preserved ejection fraction (HFpEF)-and uncovered subtype-specific therapeutic strategies. We investigated the causal roles of the human proteome and transcriptome using Mendelian randomization on more than 420,000 participants from the Million Veteran Program (27,799 HFrEF and 27,579 HFpEF cases).
View Article and Find Full Text PDFObjective: Electronic health record (EHR) systems contain a wealth of clinical data stored as both codified data and free-text narrative notes (NLP). The complexity of EHR presents challenges in feature representation, information extraction, and uncertainty quantification. To address these challenges, we proposed an efficient Aggregated naRrative Codified Health (ARCH) records analysis to generate a large-scale knowledge graph (KG) for a comprehensive set of EHR codified and narrative features.
View Article and Find Full Text PDFBackground: Cancer-associated thrombosis (CAT) is a leading cause of death in patients diagnosed with cancer. However, pharmacologic thromboprophylaxis use in cancer patients must be carefully evaluated due to a 2-fold increased risk of experiencing a major bleeding event within this population. The electronic health record CAT (EHR-CAT) risk assessment model (RAM) was recently developed, and reports improved performance over the widely used Khorana score.
View Article and Find Full Text PDFPrior research suggests metformin has anti-cancer effects, yet data are limited. We examined the association between diabetes treatment (metformin versus sulfonylurea) and risk of incident diabetes-related and non- diabetes-related cancers in US veterans. This retrospective cohort study included US veterans, without cancer, aged ≥ 55 years, who were new users of metformin or sulfonylureas for diabetes between 2001 to 2012.
View Article and Find Full Text PDFMotivation: The increasing availability of Electronic Health Record (EHR) systems has created enormous potential for translational research. Recent developments in representation learning techniques have led to effective large-scale representations of EHR concepts along with knowledge graphs that empower downstream EHR studies. However, most existing methods require training with patient-level data, limiting their abilities to expand the training with multi-institutional EHR data.
View Article and Find Full Text PDFTo broaden our understanding of bradyarrhythmias and conduction disease, we performed common variant genome-wide association analyses in up to 1.3 million individuals and rare variant burden testing in 460,000 individuals for sinus node dysfunction (SND), distal conduction disease (DCD) and pacemaker (PM) implantation. We identified 13, 31 and 21 common variant loci for SND, DCD and PM, respectively.
View Article and Find Full Text PDFObjective: Event capture in clinical trials is resource-intensive, and electronic medical records (EMRs) offer a potential solution. This study develops algorithms for EMR-based death and hospitalization capture and compares them with traditional event capture methods.
Materials And Methods: We compared the effectiveness of EMR-based event capture and site-captured events adjudicated by a clinical endpoint committee in the multi-center INfluenza Vaccine to Effectively Stop cardio Thoracic Events and Decompensated heart failure (INVESTED) trial for participants from the Veterans Affairs healthcare system.
Iron homoeostasis is tightly regulated, with hepcidin and soluble transferrin receptor (sTfR) playing significant roles. However, the genetic determinants of these traits and the biomedical consequences of iron homoeostasis variation are unclear. In a meta-analysis of 12 cohorts involving 91,675 participants, we found 43 genomic loci associated with either hepcidin or sTfR concentration, of which 15 previously unreported.
View Article and Find Full Text PDFBackground: Veterans of the 1990-1991 Gulf War have experienced excess health problems, most prominently the multisymptom condition Gulf War illness (GWI). The Department of Veterans Affairs (VA) Cooperative Studies Program #2006 "Genomics of Gulf War Illness in Veterans" project was established to address important questions concerning pathobiological and genetic aspects of GWI. The current study evaluated patterns of chronic ill health/GWI in the VA Million Veteran Program (MVP) Gulf War veteran cohort in relation to wartime exposures and key features of deployment, 27-30 years after Gulf War service.
View Article and Find Full Text PDFPolygenic scores (PGSs) are a promising tool for estimating individual-level genetic risk of disease based on the results of genome-wide association studies (GWASs). However, their promise has yet to be fully realized because most currently available PGSs were built with genetic data from predominantly European-ancestry populations, and PGS performance declines when scores are applied to target populations different from the populations from which they were derived. Thus, there is a great need to improve PGS performance in currently under-studied populations.
View Article and Find Full Text PDFLarge-scale genome-wide association studies of schizophrenia have uncovered hundreds of associated loci but with extremely limited representation of African diaspora populations. We surveyed electronic health records of 200,000 individuals of African ancestry in the Million Veteran and All of Us Research Programs, and, coupled with genotype-level data from four case-control studies, realized a combined sample size of 13,012 affected and 54,266 unaffected persons. Three genome-wide significant signals - near , , and - are the first to be independently identified in populations of predominantly African ancestry.
View Article and Find Full Text PDFImportance: The X chromosome has remained enigmatic in Alzheimer disease (AD), yet it makes up 5% of the genome and carries a high proportion of genes expressed in the brain, making it particularly appealing as a potential source of unexplored genetic variation in AD.
Objectives: To perform the first large-scale X chromosome-wide association study (XWAS) of AD.
Design, Setting, And Participants: This was a meta-analysis of genetic association studies in case-control, family-based, population-based, and longitudinal AD-related cohorts from the US Alzheimer's Disease Genetics Consortium, the Alzheimer's Disease Sequencing Project, the UK Biobank, the Finnish health registry, and the US Million Veterans Program.
Large-scale cohort and epidemiological studies suggest that posttraumatic stress disorder (PTSD) confers risk for late-onset Alzheimer's disease (AD) and related dementias (ADRD); however, the basis for this association remains unclear. Several prior studies of military Veterans have reported that carriers of the apolipoprotein E () ε4 gene variant are at heightened risk for the development of PTSD following combat exposure, suggesting that PTSD and ADRD may share some genetic risk. This cohort study was designed to further examine the hypothesis that ADRD genetic risk also confers risk for PTSD.
View Article and Find Full Text PDFBackground: Meat consumption could increase the risk of type 2 diabetes. However, evidence is largely based on studies of European and North American populations, with heterogeneous analysis strategies and a greater focus on red meat than on poultry. We aimed to investigate the associations of unprocessed red meat, processed meat, and poultry consumption with type 2 diabetes using data from worldwide cohorts and harmonised analytical approaches.
View Article and Find Full Text PDFAims: Elevated lipoprotein(a) [Lp(a)] is a causal risk factor for atherosclerotic cardiovascular disease, but the mechanisms of risk are debated. Studies have found inconsistent associations between Lp(a) and measurements of atherosclerosis. We aimed to assess the relationship between Lp(a), low-density lipoprotein cholesterol (LDL-C), and coronary artery plaque severity.
View Article and Find Full Text PDFBackground: Vitamin D may prevent the development of hypertension through down-regulation of renin-angiotensin system. However, epidemiologic studies assessing the interrelation of vitamin D-related biomarkers with hypertension are sparse.
Methods: We examined the prospective associations between vitamin D-related biomarkers and the risk of hypertension in a nested case-control study.
Purpose: Stage in multiple myeloma (MM) is an essential measure of disease risk, but its measurement in large databases is often lacking. We aimed to develop and validate a natural language processing (NLP) algorithm to extract oncologists' documentation of stage in the national Veterans Affairs (VA) Healthcare System.
Methods: Using nationwide electronic health record (EHR) and cancer registry data from the VA Corporate Data Warehouse, we developed and validated a rule-based NLP algorithm to extract oncologist-determined MM stage.