Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID.

Cassandra B Jabara, Corbin D Jones, Jeffrey Roach, Jeffrey A Anderson, Ronald Swanstrom

Proc Natl Acad Sci U S A

Department of Biology, Lineberger Comprehensive Cancer Center, University of North Carolina Center for AIDS Research, Carolina Center for Genome Sciences, Chapel Hill, NC 27599, USA.

Published: December 2011

Viruses can create complex genetic populations within a host, and deep sequencing technologies allow extensive sampling of these populations. Limitations of these technologies, however, potentially bias this sampling, particularly when a PCR step precedes the sequencing protocol. Typically, an unknown number of templates are used in initiating the PCR amplification, and this can lead to unrecognized sequence resampling creating apparent homogeneity; also, PCR-mediated recombination can disrupt linkage, and differential amplification can skew allele frequency. Finally, misincorporation of nucleotides during PCR and errors during the sequencing protocol can inflate diversity. We have solved these problems by including a random sequence tag in the initial primer such that each template receives a unique Primer ID. After sequencing, repeated identification of a Primer ID reveals sequence resampling. These resampled sequences are then used to create an accurate consensus sequence for each template, correcting for recombination, allelic skewing, and misincorporation/sequencing errors. The resulting population of consensus sequences directly represents the initial sampled templates. We applied this approach to the HIV-1 protease (pro) gene to view the distribution of sequence variation of a complex viral population within a host. We identified major and minor polymorphisms at coding and noncoding positions. In addition, we observed dynamic genetic changes within the population during intermittent drug exposure, including the emergence of multiple resistant alleles. These results provide an unprecedented view of a complex viral population in the absence of PCR resampling.
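The consensus step described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `consensus_by_primer_id`, the `min_reads` threshold, the strict-majority rule, and the use of `N` for unresolved positions are all assumptions made for illustration, and reads sharing a Primer ID are assumed to be already trimmed and aligned to the same length.

```python
from collections import Counter, defaultdict


def consensus_by_primer_id(reads, min_reads=3):
    """Collapse resampled reads into one consensus sequence per Primer ID.

    reads: iterable of (primer_id, sequence) pairs. Sequences that share a
    Primer ID are assumed to be aligned and of equal length (an assumption
    of this sketch, not a statement about the published method).
    min_reads: hypothetical threshold; groups with fewer reads are skipped.
    """
    # Group reads by their random tag: each group traces back to one template.
    groups = defaultdict(list)
    for primer_id, seq in reads:
        groups[primer_id].append(seq)

    consensus = {}
    for primer_id, seqs in groups.items():
        if len(seqs) < min_reads:
            continue  # too few resampled reads to form a reliable consensus
        length = len(seqs[0])
        if any(len(s) != length for s in seqs):
            continue  # skip groups that violate the equal-length assumption
        bases = []
        for column in zip(*seqs):
            base, count = Counter(column).most_common(1)[0]
            # Keep a base only if a strict majority of reads agree on it;
            # otherwise mark the position as unresolved.
            bases.append(base if count * 2 > len(column) else "N")
        consensus[primer_id] = "".join(bases)
    return consensus


if __name__ == "__main__":
    example_reads = [
        ("AAAC", "ACGT"),
        ("AAAC", "ACGT"),
        ("AAAC", "ACTT"),  # one read carries a PCR or sequencing error
        ("GGTA", "ACGA"),
        ("GGTA", "ACGA"),
    ]
    print(consensus_by_primer_id(example_reads))
```

In this toy input, the single discordant read in the `AAAC` group is outvoted by the two agreeing reads, which is how repeated sampling of one template can correct misincorporation and sequencing errors. The `GGTA` group is dropped because it falls below the assumed `min_reads` threshold.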

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3250168
DOI: http://dx.doi.org/10.1073/pnas.1110064108
