: a python pipeline for generating recombination-filtered multi-locus datasets.

Bioinform Adv

Department of Evolution, Ecology & Organismal Biology, University of California, Riverside, CA 92521, United States.

Published: November 2023

Summary: (Exclusion of Recombined DNA) is a dependency-free Python pipeline that implements the four-gamete test to automatically filter out recombined DNA blocks from thousands of DNA sequence loci. This procedure helps all loci better meet the "no intralocus recombination" assumption common to many coalescent-based analyses in population genomic, phylogeographic, and shallow-scale phylogenomic studies. The user-friendly pipeline contains five standalone applications-four file conversion scripts and one main script that performs the recombination filtering procedures. The pipeline outputs recombination-filtered data in a variety of common formats and a tab-delimited table that displays descriptive statistics for all loci and the analysis results. A novel feature of this software is that the user can select whether to output the longest nonrecombined sequence blocks from recombined loci (current best practice) or randomly select nonrecombined blocks from loci (a newer approach). We tested with six published phylogenomic datasets that ranged in size from 27 to 2237 loci and came in a variety of input file formats. In all trials the data could be easily analyzed in only seconds for the smaller datasets and <30 min for the largest using a simple laptop computer.

Availability And Implementation: was written in Python 3 under the MIT license. The program applications, user manual (including step-by-step tutorials), and sample data are freely available at https://github.com/Sammccarthypotter/ExRec.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10715780PMC
http://dx.doi.org/10.1093/bioadv/vbad174DOI Listing

Publication Analysis

Top Keywords

python pipeline
8
recombined dna
8
loci
6
pipeline generating
4
generating recombination-filtered
4
recombination-filtered multi-locus
4
multi-locus datasets
4
datasets summary
4
summary exclusion
4
exclusion recombined
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!