Motivation: We explore the problem of constructing near-perfect phylogenies on bi-allelic haplotypes, where the deviation from perfect phylogeny is entirely due to homoplasy events. We present polynomial-time algorithms for restricted versions of the problem. We show that these algorithms can be extended to genotype data, in which case the problem is called the near-perfect phylogeny haplotyping (NPPH) problem. We present a near-optimal algorithm for the H1-NPPH problem, which is to determine if a given set of genotypes admit a phylogeny with a single homoplasy event. The time-complexity of our algorithm for the H1-NPPH problem is O(m2(n + m)), where n is the number of genotypes and m is the number of SNP sites. This is a significant improvement over the earlier O(n4) algorithm. We also introduce generalized versions of the problem. The H(1, q)-NPPH problem is to determine if a given set of genotypes admit a phylogeny with q homoplasy events, so that all the homoplasy events occur in a single site. We present an O(m(q+1)(n + m)) algorithm for the H(1,q)-NPPH problem.
Results: We present results on simulated data, which demonstrate that the accuracy of our algorithm for the H1-NPPH problem is comparable to that of the existing methods, while being orders of magnitude faster.
Availability: The implementation of our algorithm for the H1-NPPH problem is available upon request.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/bioinformatics/btl262 | DOI Listing |
Ann Bot
November 2024
Key Laboratory of Southern Subtropical Plant Diversity, Fairy Lake Botanical Garden Shenzhen & Chinese Academy of Sciences, Shenzhen, 518004, China.
Background And Aims: Cycads are a key lineage to understand the early evolution of seed plants and their response to past environmental changes. However, tracing the evolutionary trajectory of cycad species is challenging when the robust relationships at inter- or infrageneric level are not well resolved.
Methods: Here, using 2901 single-copy nuclear genes, we explored the species relationships and gene flow within the second largest genus of cycads, i.
Cladistics
August 2024
State Key Laboratory of Plant Diversity and Prominent Crops, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
A major question in evolutionary biology is how often the same developmental events, mechanisms and genes are reused in the recurrent evolution of similar phenotypes. If this happens frequently, it would suggest that evolution is often constrained by developmental genetic mechanisms. To help address this question, we used adherens junction staining and laser ablation to analyze the development underlying several features of nematode male tails have evolved recurrently.
View Article and Find Full Text PDFMol Ecol
December 2023
Faculty of Biosciences and Aquaculture, Nord University, Bodø, Norway.
Interspecific hybridization events are on the rise in natural systems due to climate change disrupting species barriers. Across taxa, microsatellites have long been the molecular markers of choice to identify admixed individuals. However, with the advent of high-throughput sequencing easing the generation of genome-wide datasets, incorrect reports of hybridization resulting from microsatellite technical artefacts have been uncovered in a growing number of taxa.
View Article and Find Full Text PDFFEBS Lett
October 2023
Institut de Recherche en Infectiologie de Montpellier (IRIM), University of Montpellier, France.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!