AI Article Synopsis

  • Reduced-representation genome sequencing techniques like RADseq lower sequencing costs and computational demands while aiding in genetic variation analysis and phylogeny reconstruction.
  • RADseq data challenge standard phylogenomic methods because genome coverage is incomplete and large amounts of data are missing.
  • The study tests AAF, an assembly- and alignment-free method developed for whole genomes, on RADseq data and proposes two read-selection procedures that improved phylogenetic accuracy in simulations and in two real data sets; the modified pipeline, phyloRAD, is available on GitHub.

Article Abstract

Reduced-representation genome sequencing such as RADseq aids the analysis of genomes by reducing the quantity of data, thereby lowering both sequencing costs and computational burdens. RADseq was initially designed for studying genetic variation across genomes at the population level, but has also proved to be suitable for interspecific phylogeny reconstruction. RADseq data pose challenges for standard phylogenomic methods, however, due to incomplete coverage of the genome and large amounts of missing data. Alignment-free methods are both efficient and accurate for phylogenetic reconstructions with whole genomes and are especially practical for nonmodel organisms; nonetheless, alignment-free methods have not been applied with reduced genome sequencing data. Here, we test a full-genome assembly- and alignment-free method, AAF, in application to RADseq data and propose two procedures for reads selection to remove reads from restriction sites that were not found in taxa being compared. We validate these methods using both simulations and real data sets. Reads selection improved the accuracy of phylogenetic construction in every simulated scenario and the two real data sets, making AAF as good as or better than a comparable alignment-based method, even though AAF had much lower computational burdens. We also investigated the sources of missing data in RADseq and their effects on phylogeny reconstruction using AAF. The AAF pipeline modified for RADseq or other reduced-representation sequencing data, phyloRAD, is available on GitHub (https://github.com/fanhuan/phyloRAD).
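
The general idea of an alignment-free comparison with read selection can be sketched in a few lines of Python. This is a toy illustration only, not the phyloRAD implementation: the function names, the "shares at least one k-mer" selection rule, the distance formula, and the example reads are all assumptions made for this sketch, and the two selection procedures proposed in the paper may work differently.

```python
import math


def kmers(seq, k):
    """Return the set of all length-k substrings of one read."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def read_kmers(reads, k):
    """Return the union of k-mers over all reads of one taxon."""
    found = set()
    for read in reads:
        found |= kmers(read, k)
    return found


def select_reads(reads, other_reads, k):
    """Hypothetical selection rule: keep a read only if it shares at least
    one k-mer with the other taxon, as a stand-in for dropping reads from
    restriction sites that are absent in the taxon being compared."""
    other = read_kmers(other_reads, k)
    return [read for read in reads if kmers(read, k) & other]


def kmer_distance(reads_a, reads_b, k):
    """Assumed distance for this sketch: ln(total / shared) / k, where
    total is the smaller of the two k-mer set sizes."""
    a, b = read_kmers(reads_a, k), read_kmers(reads_b, k)
    shared = len(a & b)
    total = min(len(a), len(b))
    if shared == 0:
        return math.inf  # nothing in common, distance undefined/infinite
    return math.log(total / shared) / k


# Made-up reads: the first read of each taxon comes from a shared locus,
# the second from a locus present in only one taxon.
taxon_a = ["ACGTACGTAC", "TTTTGGGGCC"]
taxon_b = ["ACGTACGTAA", "CCCCAAAATT"]
k = 5

raw = kmer_distance(taxon_a, taxon_b, k)
kept_a = select_reads(taxon_a, taxon_b, k)
kept_b = select_reads(taxon_b, taxon_a, k)
filtered = kmer_distance(kept_a, kept_b, k)
print(raw, filtered)
```

In this toy case the unshared loci inflate the raw distance, and removing them before counting k-mers brings the estimate down to reflect only the locus both taxa actually have.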

Source
http://dx.doi.org/10.1111/1755-0998.12921

Publication Analysis

Top Keywords

genome sequencing: 12
sequencing data: 12
data: 10
reduced-representation genome: 8
computational burdens: 8
phylogeny reconstruction: 8
radseq data: 8
missing data: 8
alignment-free methods: 8
method aaf: 8
