The COVID-19 pandemic exposed a global deficiency of systematic, data-driven guidance to identify high-risk individuals. Here, we illustrate the utility of routinely recorded medical history to predict the risk for 1741 diseases across clinical specialties and support the rapid response to emerging health threats such as COVID-19. We developed a neural network to learn from health records of 502,489 UK Biobank participants.
View Article and Find Full Text PDFUnderstanding the genetic basis of routinely-acquired blood tests can provide insights into several aspects of human physiology. We report a genome-wide association study of 42 quantitative blood test traits defined using Electronic Healthcare Records (EHRs) of ~50,000 British Bangladeshi and British Pakistani adults. We demonstrate a causal variant within the PIEZO1 locus which was associated with alterations in red cell traits and glycated haemoglobin.
View Article and Find Full Text PDFBroad-capture proteomic platforms now enable simultaneous assessment of thousands of plasma proteins, but most of these are not actively secreted and their origins are largely unknown. Here we integrate genomic with deep phenomic information to identify modifiable and non-modifiable factors associated with 4,775 plasma proteins in ~8,000 mostly healthy individuals. We create a data-driven map of biological influences on the human plasma proteome and demonstrate segregation of proteins into clusters based on major explanatory factors.
View Article and Find Full Text PDFLiver X receptor-α (LXRα) regulates cellular cholesterol abundance and potently activates hepatic lipogenesis. Here we show that at least 1 in 450 people in the UK Biobank carry functionally impaired mutations in LXRα, which is associated with biochemical evidence of hepatic dysfunction. On a western diet, male and female mice homozygous for a dominant negative mutation in LXRα have elevated liver cholesterol, diffuse cholesterol crystal accumulation and develop severe hepatitis and fibrosis, despite reduced liver triglyceride and no steatosis.
View Article and Find Full Text PDFBackground: Variation in thyroid function parameters within the normal range has been observationally associated with adverse health outcomes. Whether those associations reflect causal effects is largely unknown.
Methods: We systematically tested associations between genetic differences in thyrotropin (TSH) and free thyroxine (FT4) within the normal range and more than 1100 diseases and more than 6000 molecular traits (metabolites and proteins) in three large population-based cohorts.
For many diseases there are delays in diagnosis due to a lack of objective biomarkers for disease onset. Here, in 41,931 individuals from the United Kingdom Biobank Pharma Proteomics Project, we integrated measurements of ~3,000 plasma proteins with clinical information to derive sparse prediction models for the 10-year incidence of 218 common and rare diseases (81-6,038 cases). We then compared prediction models developed using proteomic data with models developed using either basic clinical information alone or clinical information combined with data from 37 clinical assays.
View Article and Find Full Text PDFEarly evidence that patients with (multiple) pre-existing diseases are at highest risk for severe COVID-19 has been instrumental in the pandemic to allocate critical care resources and later vaccination schemes. However, systematic studies exploring the breadth of medical diagnoses, including common, but non-fatal diseases are scarce, but may help to understand severe COVID-19 among patients at supposedly low risk. Here, we systematically harmonized >12 million primary care and hospitalisation health records from ~500,000 UK Biobank participants into 1448 collated disease terms to systematically identify diseases predisposing to severe COVID-19 (requiring hospitalisation or death) and its post-acute sequalae, Long COVID.
View Article and Find Full Text PDFBackground: Early evidence that patients with (multiple) pre-existing diseases are at highest risk for severe COVID-19 has been instrumental in the pandemic to allocate critical care resources and later vaccination schemes. However, systematic studies exploring the breadth of medical diagnoses are scarce but may help to understand severe COVID-19 among patients at supposedly low risk.
Methods: We systematically harmonized >12 million primary care and hospitalisation health records from ~500,000 UK Biobank participants into 1448 collated disease terms to systematically identify diseases predisposing to severe COVID-19 (requiring hospitalisation or death) and its post-acute sequalae, Long COVID.
Background: Broad-capture proteomic technologies have the potential to improve disease prediction, enabling targeted prevention and management, but studies have so far been limited to very few selected diseases and have not evaluated predictive performance across multiple conditions. We aimed to evaluate the potential of serum proteins to improve risk prediction over and above health-derived information and polygenic risk scores across a diverse set of 24 outcomes.
Methods: We designed multiple case-cohorts nested in the EPIC-Norfolk prospective study, from participants with available serum samples and genome-wide genotype data, with more than 32 974 person-years of follow-up.
The COVID-19 pandemic exposed a global deficiency of systematic, data-driven guidance to identify high-risk individuals. Here, we illustrate the utility of routinely recorded medical history to predict the risk for 1883 diseases across clinical specialties and support the rapid response to emerging health threats such as COVID-19. We developed a neural network to learn from health records of 502,460 UK Biobank.
View Article and Find Full Text PDFSurviving long periods without food has shaped human evolution. In ancient and modern societies, prolonged fasting was/is practiced by billions of people globally for religious purposes, used to treat diseases such as epilepsy, and recently gained popularity as weight loss intervention, but we still have a very limited understanding of the systemic adaptions in humans to extreme caloric restriction of different durations. Here we show that a 7-day water-only fast leads to an average weight loss of 5.
View Article and Find Full Text PDFIdentifying circulating proteins associated with cognitive function may point to biomarkers and molecular process of cognitive impairment. Few studies have investigated the association between circulating proteins and cognitive function. We identify 246 protein measures quantified by the SomaScan assay as associated with cognitive function (p < 4.
View Article and Find Full Text PDFAims/hypothesis: The identification of people who are at high risk of developing type 2 diabetes is a key part of population-level prevention strategies. Previous studies have evaluated the predictive utility of omics measurements, such as metabolites, proteins or polygenic scores, but have considered these separately. The improvement that combined omics biomarkers can provide over and above current clinical standard models is unclear.
View Article and Find Full Text PDFRaynaud's phenomenon (RP) is a common vasospastic disorder that causes severe pain and ulcers, but despite its high reported heritability, no causal genes have been robustly identified. We conducted a genome-wide association study including 5,147 RP cases and 439,294 controls, based on diagnoses from electronic health records, and identified three unreported genomic regions associated with the risk of RP (p < 5 × 10). We prioritized ADRA2A (rs7090046, odds ratio (OR) per allele: 1.
View Article and Find Full Text PDFBackground: Understanding the role of circulating proteins in prostate cancer risk can reveal key biological pathways and identify novel targets for cancer prevention.
Methods: We investigated the association of 2,002 genetically predicted circulating protein levels with risk of prostate cancer overall, and of aggressive and early onset disease, using -pQTL Mendelian randomization (MR) and colocalization. Findings for proteins with support from both MR, after correction for multiple-testing, and colocalization were replicated using two independent cancer GWAS, one of European and one of African ancestry.
Obesity (Silver Spring)
November 2023
Metabolome reflects the interplay of genome and exposome at molecular level and thus can provide deep insights into the pathogenesis of a complex disease like major depression. To identify metabolites associated with depression we performed a metabolome-wide association analysis in 13,596 participants from five European population-based cohorts characterized for depression, and circulating metabolites using ultra high-performance liquid chromatography/tandem accurate mass spectrometry (UHPLC/MS/MS) based Metabolon platform. We tested 806 metabolites covering a wide range of biochemical processes including those involved in lipid, amino-acid, energy, carbohydrate, xenobiotic and vitamin metabolism for their association with depression.
View Article and Find Full Text PDFWe conduct a large-scale meta-analysis of heart failure genome-wide association studies (GWAS) consisting of over 90,000 heart failure cases and more than 1 million control individuals of European ancestry to uncover novel genetic determinants for heart failure. Using the GWAS results and blood protein quantitative loci, we perform Mendelian randomization and colocalization analyses on human proteins to provide putative causal evidence for the role of druggable proteins in the genesis of heart failure. We identify 39 genome-wide significant heart failure risk variants, of which 18 are previously unreported.
View Article and Find Full Text PDFA linear ion trap (LIT) is an affordable, robust mass spectrometer that provides fast scanning speed and high sensitivity, where its primary disadvantage is inferior mass accuracy compared to more commonly used time-of-flight or orbitrap (OT) mass analyzers. Previous efforts to utilize the LIT for low-input proteomics analysis still rely on either built-in OTs for collecting precursor data or OT-based library generation. Here, we demonstrate the potential versatility of the LIT for low-input proteomics as a stand-alone mass analyzer for all mass spectrometry (MS) measurements, including library generation.
View Article and Find Full Text PDFVenous thromboembolism (VTE) is a common, multi-causal disease with potentially serious short- and long-term complications. In clinical practice, there is a need for improved plasma biomarker-based tools for VTE diagnosis and risk prediction. Here we show, using proteomics profiling to screen plasma from patients with suspected acute VTE, and several case-control studies for VTE, how Complement Factor H Related 5 protein (CFHR5), a regulator of the alternative pathway of complement activation, is a VTE-associated plasma biomarker.
View Article and Find Full Text PDF