Understanding the genetic basis of routinely-acquired blood tests can provide insights into several aspects of human physiology. We report a genome-wide association study of 42 quantitative blood test traits defined using Electronic Healthcare Records (EHRs) of ~50,000 British Bangladeshi and British Pakistani adults. We demonstrate a causal variant within the PIEZO1 locus which was associated with alterations in red cell traits and glycated haemoglobin.
View Article and Find Full Text PDFBackground: Despite the growing interest in the use of human genomic data for drug target identification and validation, the extent to which the spectrum of human disease has been addressed by genome-wide association studies (GWAS), or by drug development, and the degree to which these efforts overlap remain unclear.
Methods: In this study we harmonize and integrate different data sources to create a sample space of all the human drug targets and diseases and identify points of convergence or divergence of GWAS and drug development efforts.
Results: We show that only 612 of 11,158 diseases listed in Human Disease Ontology have an approved drug treatment in at least one region of the world.
Broad-capture proteomic platforms now enable simultaneous assessment of thousands of plasma proteins, but most of these are not actively secreted and their origins are largely unknown. Here we integrate genomic with deep phenomic information to identify modifiable and non-modifiable factors associated with 4,775 plasma proteins in ~8,000 mostly healthy individuals. We create a data-driven map of biological influences on the human plasma proteome and demonstrate segregation of proteins into clusters based on major explanatory factors.
View Article and Find Full Text PDFLiver X receptor-α (LXRα) regulates cellular cholesterol abundance and potently activates hepatic lipogenesis. Here we show that at least 1 in 450 people in the UK Biobank carry functionally impaired mutations in LXRα, which is associated with biochemical evidence of hepatic dysfunction. On a western diet, male and female mice homozygous for a dominant negative mutation in LXRα have elevated liver cholesterol, diffuse cholesterol crystal accumulation and develop severe hepatitis and fibrosis, despite reduced liver triglyceride and no steatosis.
View Article and Find Full Text PDFUnderstanding the genetic basis of neuro-related proteins is essential for dissecting the molecular basis of human behavioural traits and the disease aetiology of neuropsychiatric disorders. Here the SCALLOP Consortium conducted a genome-wide association meta-analysis of over 12,000 individuals for 184 neuro-related proteins in human plasma. The analysis identified 125 cis-regulatory protein quantitative trait loci (cis-pQTL) and 164 trans-pQTL.
View Article and Find Full Text PDFBackground: Variation in thyroid function parameters within the normal range has been observationally associated with adverse health outcomes. Whether those associations reflect causal effects is largely unknown.
Methods: We systematically tested associations between genetic differences in thyrotropin (TSH) and free thyroxine (FT4) within the normal range and more than 1100 diseases and more than 6000 molecular traits (metabolites and proteins) in three large population-based cohorts.
A novel algorithm, AlphaMissense, has been shown to have an improved ability to predict the pathogenicity of rare missense genetic variants. However, it is not known whether AlphaMissense improves the ability of gene-based testing to identify disease-influencing genes. Using whole-exome sequencing data from the UK Biobank, we compared gene-based association analysis strategies including sets of deleterious variants: predicted loss-of-function (pLoF) variants only, pLoF plus AlphaMissense pathogenic variants, pLoF with missense variants predicted to be deleterious by any of five commonly utilized annotation methods (Missense (1/5)) or only variants predicted to be deleterious by all five methods (Missense (5/5)).
View Article and Find Full Text PDFFor many diseases there are delays in diagnosis due to a lack of objective biomarkers for disease onset. Here, in 41,931 individuals from the United Kingdom Biobank Pharma Proteomics Project, we integrated measurements of ~3,000 plasma proteins with clinical information to derive sparse prediction models for the 10-year incidence of 218 common and rare diseases (81-6,038 cases). We then compared prediction models developed using proteomic data with models developed using either basic clinical information alone or clinical information combined with data from 37 clinical assays.
View Article and Find Full Text PDFEarly evidence that patients with (multiple) pre-existing diseases are at highest risk for severe COVID-19 has been instrumental in the pandemic to allocate critical care resources and later vaccination schemes. However, systematic studies exploring the breadth of medical diagnoses, including common, but non-fatal diseases are scarce, but may help to understand severe COVID-19 among patients at supposedly low risk. Here, we systematically harmonized >12 million primary care and hospitalisation health records from ~500,000 UK Biobank participants into 1448 collated disease terms to systematically identify diseases predisposing to severe COVID-19 (requiring hospitalisation or death) and its post-acute sequalae, Long COVID.
View Article and Find Full Text PDFBackground: Early evidence that patients with (multiple) pre-existing diseases are at highest risk for severe COVID-19 has been instrumental in the pandemic to allocate critical care resources and later vaccination schemes. However, systematic studies exploring the breadth of medical diagnoses are scarce but may help to understand severe COVID-19 among patients at supposedly low risk.
Methods: We systematically harmonized >12 million primary care and hospitalisation health records from ~500,000 UK Biobank participants into 1448 collated disease terms to systematically identify diseases predisposing to severe COVID-19 (requiring hospitalisation or death) and its post-acute sequalae, Long COVID.
Background: Broad-capture proteomic technologies have the potential to improve disease prediction, enabling targeted prevention and management, but studies have so far been limited to very few selected diseases and have not evaluated predictive performance across multiple conditions. We aimed to evaluate the potential of serum proteins to improve risk prediction over and above health-derived information and polygenic risk scores across a diverse set of 24 outcomes.
Methods: We designed multiple case-cohorts nested in the EPIC-Norfolk prospective study, from participants with available serum samples and genome-wide genotype data, with more than 32 974 person-years of follow-up.
The COVID-19 pandemic exposed a global deficiency of systematic, data-driven guidance to identify high-risk individuals. Here, we illustrate the utility of routinely recorded medical history to predict the risk for 1883 diseases across clinical specialties and support the rapid response to emerging health threats such as COVID-19. We developed a neural network to learn from health records of 502,460 UK Biobank.
View Article and Find Full Text PDFSurviving long periods without food has shaped human evolution. In ancient and modern societies, prolonged fasting was/is practiced by billions of people globally for religious purposes, used to treat diseases such as epilepsy, and recently gained popularity as weight loss intervention, but we still have a very limited understanding of the systemic adaptions in humans to extreme caloric restriction of different durations. Here we show that a 7-day water-only fast leads to an average weight loss of 5.
View Article and Find Full Text PDF