We present a novel algorithm, implemented in the software ARGinfer, for probabilistic inference of the Ancestral Recombination Graph under the Coalescent with Recombination. Our Markov Chain Monte Carlo algorithm takes advantage of the Succinct Tree Sequence data structure that has allowed great advances in simulation and point estimation, but not yet probabilistic inference. Unlike previous methods, which employ the Sequentially Markov Coalescent approximation, ARGinfer uses the Coalescent with Recombination, allowing more accurate inference of key evolutionary parameters. We show using simulations that ARGinfer can accurately estimate many properties of the evolutionary history of the sample, including the topology and branch lengths of the genealogical tree at each sequence site, and the times and locations of mutation and recombination events. ARGinfer approximates posterior probability distributions for these and other quantities, providing interpretable assessments of uncertainty that we show to be well calibrated. ARGinfer is currently limited to tens of DNA sequences of several hundreds of kilobases, but has scope for further computational improvements to increase its applicability.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8936483 | PMC |
http://dx.doi.org/10.1371/journal.pcbi.1009960 | DOI Listing |
Genetics
January 2025
Max Planck Research Group Behavioural Genomics, Max Planck Institute for Evolutionary Biology, August-Thienemann-Straße 2, 24306 Plön, Germany.
Multiple methods of demography inference are based on the ancestral recombination graph. This powerful approach uses observed mutations to model local genealogies changing along chromosomes by historical recombination events. However, inference of underlying genealogies is difficult in regions with high recombination rate relative to mutation rate due to the lack of mutations representing genealogies.
View Article and Find Full Text PDFJAMA Netw Open
January 2025
Mental Illness Research, Education and Clinical Center, Crescenz Veterans Affairs Medical Center, Philadelphia, Pennsylvania.
Importance: Recently, the US Food and Drug Administration gave premarketing approval to an algorithm based on its purported ability to identify individuals at genetic risk for opioid use disorder (OUD). However, the clinical utility of the candidate genetic variants included in the algorithm has not been independently demonstrated.
Objective: To assess the utility of 15 genetic variants from an algorithm intended to predict OUD risk.
JAMA Oncol
January 2025
Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, New York.
Importance: Although differences in the prevalence of key cancer-specific somatic mutations as a function of genetic ancestry among patients with cancer has been well-established, few studies have addressed the practical clinical implications of these differences for the growing number of biomarker-driven treatments.
Objective: To determine if the approval of precision oncology therapies has benefited patients with cancer from various ancestral backgrounds equally over time.
Design, Setting, And Participants: A retrospective analysis of samples from patients with solid cancers who underwent clinical sequencing using the integrated mutation profiling of actionable cancer targets (MSK-IMPACT) assay between January 2014 and December 2022 was carried out.
Gigascience
January 2025
Centre for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou 310058, China.
Background: A thorough analysis of genome evolution is fundamental for biodiversity understanding. The iconic monotremes (platypus and echidna) feature extraordinary biology. However, they also exhibit rearrangements in several chromosomes, especially in the sex chromosome chain.
View Article and Find Full Text PDFPLoS Genet
January 2025
Melbourne Integrative Genomics, School of Mathematics & Statistics, University of Melbourne, Victoria, Australia.
Inference of evolutionary and demographic parameters from a sample of genome sequences often proceeds by first inferring identical-by-descent (IBD) genome segments. By exploiting efficient data encoding based on the ancestral recombination graph (ARG), we obtain three major advantages over current approaches: (i) no need to impose a length threshold on IBD segments, (ii) IBD can be defined without the hard-to-verify requirement of no recombination, and (iii) computation time can be reduced with little loss of statistical efficiency using only the IBD segments from a set of sequence pairs that scales linearly with sample size. We first demonstrate powerful inferences when true IBD information is available from simulated data.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!