Using in silico amplified fragment length polymorphism (AFLP) fingerprints, we explore the relationship between sequence similarity and phylogeny accuracy to test when, in terms of genetic divergence, the quality of AFLP data becomes too low to be informative for a reliable phylogenetic reconstruction. We generated DNA sequences with known phylogenies using balanced and unbalanced trees with recent, uniform and ancient radiations, and average branch lengths (from the most internal node to the tip) ranging from 0.02 to 0.4 substitutions per site. The resulting sequences were used to emulate the AFLP procedure. Trees were estimated by maximum parsimony (MP), neighbor-joining (NJ), and minimum evolution (ME) methods from both DNA sequences and virtual AFLP fingerprints. The estimated trees were compared with the reference trees using a score that measures overall differences in both topology and relative branch length. As expected, the accuracy of AFLP-based phylogenies decreased dramatically in the more divergent data sets. Above a divergence of approximately 0.05, AFLP-based phylogenies were largely inaccurate irrespective of the distinct topology, radiation model, or phylogenetic method used. This value represents an upper bound of expected tree accuracy for data sets with a simple divergence history; AFLP data sets with a similar divergence but with unbalanced topologies and short ancestral branches produced much less accurate trees. The lack of homology of AFLP bands quickly increases with divergence and reaches its maximum value (100%) at a divergence of only 0.4. Low guanine-cytosine (GC) contents increase the number of nonhomologous bands in AFLP data sets and lead to less reliable trees. However, the effect of the lack of band homology on tree accuracy is surprisingly small relative to the negative impact due to the low information content of AFLP characters. Tree-building methods based on genetic distance displayed similar trends and outperformed parsimony at low but not at high divergences. However, the impact of using alternative phylogenetic methods on tree accuracy was generally small relative to the uncertainty arising from factors such as divergence, nonhomology of bands, or the low information content of AFLP characters. Nevertheless, our data suggest that under certain circumstances, AFLPs may be suitable to reconstruct deeper phylogenies than usually accepted.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/molbev/msp315 | DOI Listing |
Cancer Genet
December 2024
Department of Otolaryngology, University of Minnesota, MMC396, 420 Delaware St SE, Minneapolis, MN 55455, USA.
Objective: Studies of squamous cell carcinoma of the head and neck (HNSCC) have demonstrated the importance of nuclear receptors and their associated coregulators in the development and treatment of HNSCC. We sought to characterize members of the nuclear receptor super family through interrogation of RNA-Seq and microarray data.
Materials And Methods: TCGA RNA-Seq data within the cBioportal platform comparing HNSCC samples (n = 515 patients with RNA-Seq data) to normal tissue (n = 82 patients) was interrogated for significant differences in nuclear receptor expression.
Curr Opin Plant Biol
January 2025
Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA.
Plant diseases constantly threaten crops and food systems, while global connectivity further increases the risks of spreading existing and exotic pathogens. Here, we first explore how an integrative approach involving plant pathway knowledgegraphs, differential gene expression data, and biochemical data informing Raman spectroscopy could be used to detect plant pathways responding to pathogen attacks. The Plant Reactome (https://plantreactome.
View Article and Find Full Text PDFLifetime Data Anal
January 2025
Institut Camille Jordan, UMR 5208, Université Claude Bernard Lyon 1, Bat. Braconnier, 43, blvd du 11 novembre 1918, F - 69622, Villeurbanne Cedex, France.
Based on the expectile loss function and the adaptive LASSO penalty, the paper proposes and studies the estimation methods for the accelerated failure time (AFT) model. In this approach, we need to estimate the survival function of the censoring variable by the Kaplan-Meier estimator. The AFT model parameters are first estimated by the expectile method and afterwards, when the number of explanatory variables can be large, by the adaptive LASSO expectile method which directly carries out the automatic selection of variables.
View Article and Find Full Text PDFJ Chem Inf Model
January 2025
Department of Urology, Ji'an Third People's Hospital, Ji'an 343000, Jiangxi, China.
As combination therapy becomes more common in clinical applications, predicting adverse effects of combination medications is a challenging task. However, there are three limitations of the existing prediction models. First, they rely on a single view of the drug and cannot fully utilize multiview information, resulting in limited performance when capturing complex structures.
View Article and Find Full Text PDFMol Ecol Resour
January 2025
Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
Reduced representation sequencing (RRS) has proven to be a cost-effective solution for sequencing subsets of the genome in non-model species for large-scale studies. However, the targeted nature of RRS approaches commonly introduces large amounts of missing data, leading to reduced statistical power and biased estimates in downstream analyses. Genotype imputation, the statistical inference of missing sites across the genome, is a powerful alternative to overcome the caveats associated with missing sites.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!