Summary: In viral genomic research and surveillance, inter-sample contamination can affect variant detection, analysis of within-host evolution, outbreak reconstruction, and detection of superinfections and recombination events. While sample barcoding methods exist to track inter-sample contamination, they are not always used and can only detect contamination in the experimental pipeline from the point they are added. The underlying genomic information in a sample, however, carries information about inter-sample contamination occurring at any stage. Here, we present Polyphonia, a tool for detecting inter-sample contamination directly from deep sequencing data without the need for additional controls, using intrahost variant frequencies. We apply Polyphonia to 1102 SARS-CoV-2 samples sequenced at the Broad Institute and already tracked using molecular barcoding for comparison.

Availability And Implementation: Polyphonia is available as a standalone Docker image and is also included as part of viral-ngs, available in Dockstore. Full documentation, source code, and instructions for use are available at https://github.com/broadinstitute/polyphonia.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btae698DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11652266PMC

Publication Analysis

Top Keywords

inter-sample contamination
20
detecting inter-sample
8
viral genomic
8
sequencing data
8
contamination
6
inter-sample
5
polyphonia
4
polyphonia detecting
4
contamination viral
4
genomic sequencing
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!