Deciphering human immunodeficiency virus type 1 transmission and early envelope diversification by single-genome amplification and sequencing

Jesus F. Salazar-Gonzalez, Elizabeth Bailes, Kimmy T. Pham, Maria G. Salazar, M. Brad Guffey, Brandon F. Keele, Cynthia A. Derdeyn, Paul Farmer, Eric Hunter, Susan Allen, Olivier Manigart, Joseph Mulenga, Jeffrey A. Anderson, Ronald Swanstrom, Barton F. Haynes, Gayathri S. Athreya, Bette T. M. Korber, Paul M. Sharp, George M. Shaw, Beatrice H. Hahn

Research output: Contribution to journalArticlepeer-review

Abstract

Accurate identification of the transmitted virus and sequences evolving from it could be instrumental in elucidating the transmission of human immunodeficiency virus type 1 (HIV-1) and in developing vaccines, drugs, or microbicides to prevent infection. Here we describe an experimental approach to analyze HIV-1 env genes as intact genetic units amplified from plasma virion RNA by single-genome amplification (SGA), followed by direct sequencing of uncloned DNA amplicons. We show that this strategy precludes in vitro artifacts caused by Taq-induced nucleotide substitutions and template switching, provides an accurate representation of the env quasispecies in vivo, and has an overall error rate (including nucleotide misincorporation, insertion, and deletion) of less than 8 x 10(-5). Applying this method to the analysis of virus in plasma from 12 Zambian subjects from whom samples were obtained within 3 months of seroconversion, we show that transmitted or early founder viruses can be identified and that molecular pathways and rates of early env diversification can be defined. Specifically, we show that 8 of the 12 subjects were each infected by a single virus, while 4 others acquired more than one virus; that the rate of virus evolution in one subject during an 80-day period spanning seroconversion was 1.7 x 10(-5) substitutions per site per day; and that evidence of strong immunologic selection can be seen in Env and overlapping Rev sequences based on nonrandom accumulation of nonsynonymous mutations. We also compared the results of the SGA approach with those of more-conventional bulk PCR amplification methods performed on the same patient samples and found that the latter is associated with excessive rates of Taq-induced recombination, nucleotide misincorporation, template resampling, and cloning bias. These findings indicate that HIV-1 env genes, other viral genes, and even full-length viral genomes responsible for productive clinical infection can be identified by SGA analysis of plasma virus sampled at intervals typical in large-scale vaccine trials and that pathways of viral diversification and immune escape can be determined accurately.

Original languageEnglish
Pages (from-to)3952-3970
Number of pages19
JournalJournal of Virology
Volume82
Issue number8
DOIs
Publication statusPublished - Apr 2008

Cite this