Using genetic data to infer evolutionary distances between molecular sequence pairs based on a Markov substitution model is a common procedure in phylogenetics, in particular for selecting a good starting tree to improve upon. Many evolutionary patterns can be accurately modelled using substitution models that are available in closed form, including the popular general time reversible model (GTR) for DNA data. For more complex biological phenomena, such as variations in lineage-specific evolutionary rates over time (heterotachy), other approaches such as the GTR with rate variation (GTR ) are required, but do not admit analytical solutions and do not automatically allow for likelihood calculations crucial for Bayesian analysis. In this paper, we derive a hybrid approach between these two methods, incorporating -distributed rate variation and heterotachy into a hierarchical Bayesian GTR-style framework. Our approach is differentiable and amenable to both stochastic gradient descent for optimisation and Hamiltonian Markov chain Monte Carlo for Bayesian inference. We show the utility of our approach by studying hypotheses regarding the origins of the eukaryotic cell within the context of a universal tree of life and find evidence for a two-domain theory.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s11538-024-01403-z | DOI Listing |
Bull Math Biol
January 2025
Section of Epidemiology, University of Copenhagen, Copenhagen, Denmark.
Using genetic data to infer evolutionary distances between molecular sequence pairs based on a Markov substitution model is a common procedure in phylogenetics, in particular for selecting a good starting tree to improve upon. Many evolutionary patterns can be accurately modelled using substitution models that are available in closed form, including the popular general time reversible model (GTR) for DNA data. For more complex biological phenomena, such as variations in lineage-specific evolutionary rates over time (heterotachy), other approaches such as the GTR with rate variation (GTR ) are required, but do not admit analytical solutions and do not automatically allow for likelihood calculations crucial for Bayesian analysis.
View Article and Find Full Text PDFR Soc Open Sci
January 2025
Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, USA.
The 2019 emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its rapid spread created a public health emergency of international concern. However, the impact of the pandemic in Sub-Saharan Africa, as documented in cases, hospitalizations and deaths, appears far lower than in the Americas, Europe and Asia. Characterization of the transmission dynamics is critical for understanding how SARS-CoV-2 spreads and the true scale of the pandemic.
View Article and Find Full Text PDFStat Med
February 2025
Division of Biostatistics, College of Public Health, The Ohio State University, Ohio, USA.
Leveraging external data information to supplement randomized clinical trials has been a popular topic in recent years, especially for medical device and drug discovery. In rare diseases, it is very challenging to recruit patients and run a large-scale randomized trial. To take advantage of real-world data from historical trials on the same disease, we can run a small hybrid trial and borrow historical controls to increase the power.
View Article and Find Full Text PDFMitochondrial DNA B Resour
January 2025
East China Sea Ecological Center of the Ministry of Natural Resources, MNR, Shanghai, China.
In this study, the complete mitochondrial genome of was sequenced by Illumina high-throughput sequencing and its characteristics were analyzed. The mitogenome of is 16,635 bp long, and it encodes the standard set of 13 PCGs, 22 tRNA genes, and two rRNA genes. The mitogenome has a GC content of 29.
View Article and Find Full Text PDFVirus Evol
December 2024
Department of Epidemiology and Population Health, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305, United States.
Despite the increasing burden of dengue in Kenya and Africa, the introduction and expansion of the virus in the region remain poorly understood. The objective of this study is to examine the genetic diversity and evolutionary histories of dengue virus (DENV) serotypes 1 and 3 in Kenya and contextualize their circulation within circulation dynamics in the broader African region. Viral RNA was extracted from samples collected from a cohort of febrile patients recruited at clinical sites in Kenya from 2013 to 2022.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!