We characterized the longitudinal serum protein signatures of women 6 and 10 years after gestational diabetes mellitus (GDM) to identify factors associated with the development of type 2 diabetes mellitus (T2D) and prediabetes in this at-risk post-GDM population, aiming to discover potential biomarkers for early diagnosis and prevention of T2D. Our study identified 75 T2D-associated serum proteins and 23 prediabetes-associated proteins, some of which were validated in an independent T2D cohort. Machine learning (ML) performed on the longitudinal proteomics highlighted protein signatures associated with progression to post-GDM diabetes.
The emergence of biobank-level datasets offers new opportunities to discover novel biomarkers and develop predictive algorithms for human disease. Here, we present an ensemble machine-learning framework (machine learning with phenotype associations, MILTON) utilizing a range of biomarkers to predict 3,213 diseases in the UK Biobank. Leveraging the UK Biobank's longitudinal health record data, MILTON predicts incident disease cases undiagnosed at time of recruitment, largely outperforming available polygenic risk scores.
The ongoing expansion of human genomic datasets propels therapeutic target identification; however, extracting gene-disease associations from gene annotations remains challenging. Here, we introduce Mantis-ML 2.0, a framework integrating AstraZeneca's Biological Insights Knowledge Graph and numerous tabular datasets, to assess gene-disease probabilities throughout the phenome.
Genome-wide association studies (GWASs) have established the contribution of common and low-frequency variants to metabolic blood measurements in the UK Biobank (UKB). To complement existing GWAS findings, we assessed the contribution of rare protein-coding variants in relation to 355 metabolic blood measurements, including 325 predominantly lipid-related nuclear magnetic resonance (NMR)-derived blood metabolite measurements (Nightingale Health Plc) and 30 clinical blood biomarkers, using 412,393 exome sequences from four genetically diverse ancestries in the UKB. Gene-level collapsing analyses were conducted to evaluate a diverse range of rare-variant architectures for the metabolic blood measurements.
Background: Terrorism and armed conflict cause blast and ballistic casualties that are unusual in civilian practice. The immediate surgical response to mass casualty events, with civilians injured by these mechanisms, has not been systematically characterised. Standardising an approach to reacting to these events is challenging but is essential to optimise preparation for them.
Large-scale phenome-wide association studies performed using densely phenotyped cohorts, such as the UK Biobank (UKB), reveal many statistically robust gene-phenotype relationships for both clinical and continuous traits. Here, we present Gene-SCOUT, a tool used to identify genes with similar continuous trait fingerprints to a gene of interest. A fingerprint reflects the continuous traits identified to be statistically associated with a gene of interest based on multiple underlying rare variant genetic architectures.
Elucidating functionality in non-coding regions is a key challenge in human genomics. It has been shown that intolerance to variation of coding and proximal non-coding sequence is a strong predictor of human disease relevance. Here, we integrate intolerance to variation, functional genomic annotations and primary genomic sequence to build JARVIS: a comprehensive deep learning model to prioritize non-coding regions, outperforming other human lineage-specific scores.