Publications by Julius Jacobsen | LitMetric

Publications by authors named "Julius Jacobsen"

Page 1 of 2

Efficient reinterpretation of rare disease cases using Exomiser.

Letizia Vestito Julius O B Jacobsen Susan Walker Valentina Cipriani Nomi L Harris

NPJ Genom Med

December 2024

Whole genome sequencing has transformed rare disease research; however, 50-80% of rare disease patients remain undiagnosed after such testing. Regular reanalysis can identify new diagnoses, especially in newly discovered disease-gene associations, but efficient tools are required to support clinical interpretation. Exomiser, a phenotype-driven variant prioritisation tool, fulfils this role; within the 100,000 Genomes Project (100kGP), diagnoses were identified after reanalysis in 463 (2%) of 24,015 unsolved patients after previous analysis for variants in known disease genes.

View Article and Find Full Text PDF

Leveraging clinical intuition to improve accuracy of phenotype-driven prioritization.

Martha A Beckwith Daniel Danis Yasemin Bridges Julius O B Jacobsen Damian Smedley

Genet Med

October 2024

Article Synopsis

Clinical intuition plays a crucial role in differential diagnosis, but current algorithms for rare genetic diseases overlook this aspect and assume equal chances for all possible Mendelian diseases.
The new ClintLR algorithm enhances the existing LIRICAL algorithm by adjusting the pretest probabilities of related diseases based on clinical intuition.
Simulation results indicate that ClintLR significantly improves the ranking of accurate diagnoses in genetic sequencing, making it a valuable tool available for free online.

View Article and Find Full Text PDF

A corpus of GA4GH phenopackets: Case-level phenotyping for genomic diagnostics and discovery.

Daniel Danis Michael J Bamshad Yasemin Bridges Andrés Caballero-Oteyza Pilar Cacheiro Julius O B Jacobsen

HGG Adv

October 2024

Article Synopsis

The GA4GH Phenopacket Schema, released in 2022 and approved as a standard by ISO, allows the sharing of clinical and genomic data, including phenotypic descriptions and genetic information, to aid in genomic diagnostics.
Phenopacket Store Version 0.1.19 offers a collection of 6668 phenopackets linked to various diseases and genes, making it a crucial resource for testing algorithms and software in genomic research.
This collection represents the first extensive case-level, standardized phenotypic information sourced from medical literature, supporting advancements in diagnostic genomics and machine learning applications.

View Article and Find Full Text PDF

The Unified Phenotype Ontology (uPheno): A framework for cross-species integrative phenomics.

Nicolas Matentzoglu Susan M Bello Ray Stefancsik Sarah M Alghamdi Anna V Anagnostopoulos Julius O B Jacobsen

bioRxiv

September 2024

Article Synopsis

Phenotypic data helps us understand how genomic variations affect living organisms and is vital for clinical applications like diagnosing diseases and developing treatments.
The field of phenomics aims to unify and analyze the vast amounts of phenotypic data collected over time, but faces challenges due to inconsistent methods and vocabularies used to record this information.
The Unified Phenotype Ontology (uPheno) framework offers a solution by providing a standardized system for organizing phenotype terms, allowing for better integration of data across different species and improving research on genotype-phenotype associations.

View Article and Find Full Text PDF

Systematic benchmarking demonstrates large language models have not reached the diagnostic accuracy of traditional rare-disease decision support tools.

Justin T Reese Leonardo Chimirri Yasemin Bridges Daniel Danis J Harry Caufield Julius Ob Jacobsen

medRxiv

November 2024

Article Synopsis

- Large language models (LLMs) are being tested for their ability to help diagnose genetic diseases, but their evaluation is complicated due to how they generate unstructured responses.
- Researchers benchmarked LLMs against 5,213 case reports using established phenotypic criteria and compared their performance to a traditional diagnostic tool, Exomiser.
- The best-performing LLM correctly diagnosed cases 23.6% of the time, while Exomiser achieved 35.5%, indicating that while LLMs are improving, they still lag behind conventional bioinformatics methods and need further research for effective integration into diagnostic processes.

View Article and Find Full Text PDF

Towards a standard benchmark for variant and gene prioritisation algorithms: PhEval - Phenotypic inference Evaluation framework.

Yasemin Bridges Vinicius de Souza Katherina G Cortes Melissa Haendel Nomi L Harris Julius Ob Jacobsen

bioRxiv

June 2024

Background: Computational approaches to support rare disease diagnosis are challenging to build, requiring the integration of complex data types such as ontologies, gene-to-phenotype associations, and cross-species data into variant and gene prioritisation algorithms (VGPAs). However, the performance of VGPAs has been difficult to measure and is impacted by many factors, for example, ontology structure, annotation completeness or changes to the underlying algorithm. Assertions of the capabilities of VGPAs are often not reproducible, in part because there is no standardised, empirical framework and openly available patient data to assess the efficacy of VGPAs - ultimately hindering the development of effective prioritisation tools.

View Article and Find Full Text PDF

A corpus of GA4GH Phenopackets: case-level phenotyping for genomic diagnostics and discovery.

Daniel Danis Michael J Bamshad Yasemin Bridges Pilar Cacheiro Leigh C Carmody Julius O B Jacobsen

medRxiv

May 2024

Article Synopsis

View Article and Find Full Text PDF

Critical assessment of variant prioritization methods for rare disease diagnosis within the rare genomes project.

Sarah L Stenton Melanie C O'Leary Gabrielle Lemire Grace E VanNoy Stephanie DiTroia Julius O B Jacobsen

Hum Genomics

April 2024

Background: A major obstacle faced by families with rare diseases is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years and causal variants are identified in under 50%, even when capturing variants genome-wide. To aid in the interpretation and prioritization of the vast number of variants detected, computational methods are proliferating.

View Article and Find Full Text PDF

Improving prenatal diagnosis through standards and aggregation.

Michael H Duyzend Pilar Cacheiro Julius O B Jacobsen Jessica Giordano Harrison Brand

Prenat Diagn

April 2024

Advances in sequencing and imaging technologies enable enhanced assessment in the prenatal space, with a goal to diagnose and predict the natural history of disease, to direct targeted therapies, and to implement clinical management, including transfer of care, election of supportive care, and selection of surgical interventions. The current lack of standardization and aggregation stymies variant interpretation and gene discovery, which hinders the provision of prenatal precision medicine, leaving clinicians and patients without an accurate diagnosis. With large amounts of data generated, it is imperative to establish standards for data collection, processing, and aggregation.

View Article and Find Full Text PDF

Rare disease gene association discovery from burden analysis of the 100,000 Genomes Project data.

Valentina Cipriani Letizia Vestito Emma F Magavern Julius Ob Jacobsen Gavin Arno

medRxiv

December 2023

To discover rare disease-gene associations, we developed a gene burden analytical framework and applied it to rare, protein-coding variants from whole genome sequencing of 35,008 cases with rare diseases and their family members recruited to the 100,000 Genomes Project (100KGP). Following triaging of the results, 88 novel associations were identified including 38 with existing experimental evidence. We have published the confirmation of one of these associations, hereditary ataxia with , and independent confirmatory evidence has recently been published for four more.

View Article and Find Full Text PDF

The Monarch Initiative in 2024: an analytic platform integrating phenotypes, genes and diseases across species.

Tim E Putman Kevin Schaper Nicolas Matentzoglu Vincent P Rubinetti Faisal S Alquaddoomi Julius O B Jacobsen

Nucleic Acids Res

January 2024

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research.

View Article and Find Full Text PDF

The Human Phenotype Ontology in 2024: phenotypes around the world.

Michael A Gargano Nicolas Matentzoglu Ben Coleman Eunice B Addo-Lartey Anna V Anagnostopoulos Julius O B Jacobsen

Nucleic Acids Res

January 2024

The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English.

View Article and Find Full Text PDF

Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project.

Sarah L Stenton Melanie O'Leary Gabrielle Lemire Grace E VanNoy Stephanie DiTroia Julius O B Jacobsen

medRxiv

August 2023

Background: A major obstacle faced by rare disease families is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years, and causal variants are identified in under 50%. The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing (GS) for diagnosis and gene discovery.

View Article and Find Full Text PDF

Phenopacket-tools: Building and validating GA4GH Phenopackets.

Daniel Danis Julius O B Jacobsen Alex H Wagner Tudor Groza Martha A Beckwith

PLoS One

May 2023

Article Synopsis

* Phenopacket-tools is an open-source Java library that makes it easier to build, convert, and validate these phenopackets by providing user-friendly tools and predefined components.
* The library supports developers in standardizing the collection and sharing of clinical data to enhance genomic diagnostics, research, and precision medicine, with detailed documentation and tutorial resources available online.

View Article and Find Full Text PDF

GA4GH Phenopackets: A Practical Introduction.

Markus S Ladewig Julius O B Jacobsen Alex H Wagner Daniel Danis Baha El Kassaby

Adv Genet (Hoboken)

March 2023

The Global Alliance for Genomics and Health (GA4GH) is developing a suite of coordinated standards for genomics for healthcare. The Phenopacket is a new GA4GH standard for sharing disease and phenotype information that characterizes an individual person, linking that individual to detailed phenotypic descriptions, genetic information, diagnoses, and treatments. A detailed example is presented that illustrates how to use the schema to represent the clinical course of a patient with retinoblastoma, including demographic information, the clinical diagnosis, phenotypic features and clinical measurements, an examination of the extirpated tumor, therapies, and the results of genomic analysis.

View Article and Find Full Text PDF

The GA4GH Phenopacket schema defines a computable representation of clinical data.

Julius O B Jacobsen Michael Baudis Gareth S Baynam Jacques S Beckmann Sergi Beltran

Nat Biotechnol

June 2022

View Article and Find Full Text PDF

Evaluation of phenotype-driven gene prioritization methods for Mendelian diseases.

Julius O B Jacobsen Catherine Kelly Valentina Cipriani Peter N Robinson Damian Smedley

Brief Bioinform

September 2022

Yuan et al. recently described an independent evaluation of several phenotype-driven gene prioritization methods for Mendelian disease on two separate, clinical datasets. Although they attempted to use default settings for each tool, we describe three key differences from those we currently recommend for our Exomiser and PhenIX tools.

View Article and Find Full Text PDF

SvAnna: efficient and accurate pathogenicity prediction of coding and regulatory structural variants in long-read genome sequencing.

Daniel Danis Julius O B Jacobsen Parithi Balachandran Qihui Zhu Feyza Yilmaz

Genome Med

April 2022

Structural variants (SVs) are implicated in the etiology of Mendelian diseases but have been systematically underascertained owing to sequencing technology limitations. Long-read sequencing enables comprehensive detection of SVs, but approaches for prioritization of candidate SVs are needed. Structural variant Annotation and analysis (SvAnna) assesses all classes of SVs and their intersection with transcripts and regulatory sequences, relating predicted effects on gene function with clinical phenotype data.

View Article and Find Full Text PDF

The Clinical Variant Analysis Tool: Analyzing the evidence supporting reported genomic variation in clinical practice.

Hui-Lin Chin Nour Gazzaz Stephanie Huynh Iulia Handra Lynn Warnock Julius O B Jacobsen

Genet Med

July 2022

Purpose: Genomic test results, regardless of laboratory variant classification, require clinical practitioners to judge the applicability of a variant for medical decisions. Teaching and standardizing clinical interpretation of genomic variation calls for a methodology or tool.

Methods: To generate such a tool, we distilled the Clinical Genome Resource framework of causality and the American College of Medical Genetics/Association of Molecular Pathology and Quest Diagnostic Laboratory scoring of variant deleteriousness into the Clinical Variant Analysis Tool (CVAT).

View Article and Find Full Text PDF

Phenotype-driven approaches to enhance variant prioritization and diagnosis of rare disease.

Julius O B Jacobsen Catherine Kelly Valentina Cipriani Genomics England Research Consortium Christopher J Mungall

Hum Mutat

August 2022

Rare disease diagnostics and disease gene discovery have been revolutionized by whole-exome and genome sequencing but identifying the causative variant(s) from the millions in each individual remains challenging. The use of deep phenotyping of patients and reference genotype-phenotype knowledge, alongside variant data such as allele frequency, segregation, and predicted pathogenicity, has proved an effective strategy to tackle this issue. Here we review the numerous tools that have been developed to automate this approach and demonstrate the power of such an approach on several thousand diagnosed cases from the 100,000 Genomes Project.

View Article and Find Full Text PDF

The RD-Connect Genome-Phenome Analysis Platform: Accelerating diagnosis, research, and gene discovery for rare diseases.

Steven Laurie Davide Piscia Leslie Matalonga Alberto Corvó Marcos Fernández-Callejo Julius O B Jacobsen

Hum Mutat

June 2022

Rare disease patients are more likely to receive a rapid molecular diagnosis nowadays thanks to the wide adoption of next-generation sequencing. However, many cases remain undiagnosed even after exome or genome analysis, because the methods used missed the molecular cause in a known gene, or a novel causative gene could not be identified and/or confirmed. To address these challenges, the RD-Connect Genome-Phenome Analysis Platform (GPAP) facilitates the collation, discovery, sharing, and analysis of standardized genome-phenome data within a collaborative environment.

View Article and Find Full Text PDF

GA4GH: International policies and standards for data sharing across genomic research and healthcare.

Heidi L Rehm Angela J H Page Lindsay Smith Jeremy B Adams Gil Alterovitz Julius O Jacobsen

Cell Genom

November 2021

The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution.

View Article and Find Full Text PDF

100,000 Genomes Pilot on Rare-Disease Diagnosis in Health Care - Preliminary Report.

N Engl J Med

November 2021

Background: The U.K. 100,000 Genomes Project is in the process of investigating the role of genome sequencing in patients with undiagnosed rare diseases after usual care and the alignment of this research with health care implementation in the U.

View Article and Find Full Text PDF

Interpretable prioritization of splice variants in diagnostic next-generation sequencing.

Daniel Danis Julius O B Jacobsen Leigh C Carmody Michael A Gargano Julie A McMurry

Am J Hum Genet

November 2021

View Article and Find Full Text PDF

Interpretable prioritization of splice variants in diagnostic next-generation sequencing.

Daniel Danis Julius O B Jacobsen Leigh C Carmody Michael A Gargano Julie A McMurry

Am J Hum Genet

September 2021

A critical challenge in genetic diagnostics is the computational assessment of candidate splice variants, specifically the interpretation of nucleotide changes located outside of the highly conserved dinucleotide sequences at the 5' and 3' ends of introns. To address this gap, we developed the Super Quick Information-content Random-forest Learning of Splice variants (SQUIRLS) algorithm. SQUIRLS generates a small set of interpretable features for machine learning by calculating the information-content of wild-type and variant sequences of canonical and cryptic splice sites, assessing changes in candidate splicing regulatory sequences, and incorporating characteristics of the sequence such as exon length, disruptions of the AG exclusion zone, and conservation.

View Article and Find Full Text PDF