Abstract
HIV-1 incidence among youth, especially men who have sex with men (MSM), is increasing in the United States. We aimed to better understand the patterns of adolescent HIV-1 acquisition, to help guide future prevention interventions. We conducted a study combining epidemiologic and HIV-1 pol sequence data from a retrospective cohort of HIV-infected adults and adolescents in Seattle, WA between 2000 and 2013. Adolescents were defined as 13–24 years of age at the time of first HIV-1 care. Maximum-likelihood phylogenetic trees were reconstructed to identify putative viral transmission clusters of two or more individuals, followed by multivariable regression tests of associations between clustering and demographic and clinical parameters. The dataset included 3,102 sequences from 1,953 individuals; 72 putative transmission clusters were identified, representing 168 individuals (8.6%). MSM and MSM/intravenous drug use (IDU) were positively associated with clustering, with aOR 3.18 (95% CI: 1.34–7.55) and 2.59 (95% CI: 1.04–6.49), respectively. African American race was negatively associated with clustering (aOR 0.54 95% CI: 0.32–0.91). Twenty-five clusters contained one adolescent and five clusters contained two adolescents. Other individuals who clustered with adolescents were predominantly male (95%), white (85%), and either MSM (66%) or MSM/IDU (16%), with a greater mean age (34 years vs. 22 years; p < .01). In this Seattle cohort, HIV-1 transmission linkages were identified between white male adolescents and older MSM adults. Interventions aimed at age-discrepant pairs may reduce HIV-1 infections in adolescent males.
Introduction
S
Methods
Study population
We performed a retrospective analysis of HIV-1-infected individuals identified from the University of Washington (UW) HIV Cohort described previously 12 or Seattle Children's Hospital (SCH) with one or more HIV-1 pol genotypes collected between February 2000 and March 2013. Individuals with perinatal or transfusion acquisition were excluded. Adolescents were defined as 13–24 years of age at time of entry into HIV-related care. Demographic and clinical characteristics recorded at first HIV-1 genotyping were obtained from the UW HIV Information System (UWHIS) and SCH. Individual data assessed in this study included the following: age at first recorded HIV-1 care, age at HIV-1 pol genotyping, sex, race, CD4+ T cell count and plasma viral load at first HIV-1 care, housing status (homeless or not) at date of first HIV-1 genotyping, and HIV-1 risk group, including MSM, heterosexual (HET), IDU, MSM who also reported intravenous drug use (MSM/IDU). Sequences have been deposited in GenBank with accession numbers: KY034691 to KY037792.
Sequence analysis
Partial HIV-1 pol gene sequences (length = ∼1,080 nucleotides (with 2 substantially shorter sequences, 600 and 800 bp, also included) from amino acid positions 6 to 99 of protease and 1–253 of reverse transcriptase, relative to HXB2 numbering) were obtained from the University of Washington Clinical Virology Laboratory. Sequences were aligned using MUSCLE
13
within Geneious v8,
14
followed by manual adjustment. Maximum-likelihood phylogenetic trees were reconstructed using FastTree 2.1.5
15,16
rooted with HXB2, assuming the GTR model of nucleotide substitution with gamma-distributed substitution rate heterogeneity, also within Geneious v8. Node support was estimated using FastTree SH-like (likelihood based) approach.
15,16
Phylogenies were reconstructed using datasets including: (1) only the first genotype collected from each individual, to avoid erroneously excluding an individual from a cluster due to viral evolution away from an earlier variant and (2) separately, all available sequences from each individual, to increase the detection of transmission linkages from individuals who transmitted later in their infection. To the alignments, we added 500 randomly selected U.S. HIV-1 subtype B pol sequences (from
Cluster identification
Phylogenetic clusters were defined as: (1) sequences with a shared ancestral node containing only sequences from our Seattle-based dataset; (2) node support values 16 >0.95; and (3) maximum pairwise genetic distances ≤0.015 nucleotide substitutions/site, using ClusterPicker. 18 These node support and distance thresholds are conservative for molecular epidemiological studies of pol. 19,20 To allow for identification of more clusters, we identified clusters observed in either of the two phylogenies (i.e., using “only first sequences obtained from each individual” or “all sequences obtained”). We were unable to identify the direction of transmission (i.e., the source and recipient individuals) due to unknown dates of seroconversion.
Statistical analysis
To identify predictors of cluster membership, univariable logistic regression analysis was performed using each covariate followed by multivariable analysis using characteristics with p < .2, regardless of effect size in the univariable analysis, except adolescent age, the primary covariable of interest.
To identify risk factors associated with HIV-1 infection of adolescents, we focused on clusters that contained at least one adolescent, termed “adolescent clusters.” The adolescent's risk factors were compared to data from others within the cluster. The “other” individual(s) (regardless of adolescent or adult status) were interpreted to be the “exposure” individual. The demographic and epidemiological traits of this exposure individual were assumed to be proxies for the trait frequency in the transmission network for which the clusters are representative. 21 Chi-squared tests and unpaired two-tailed t-tests were used to compare frequencies of categorical and continuous variables, respectively. We also compared (1) individuals within adolescent clusters to individuals within nonadolescent clusters to assess differences between these two high transmission groups and (2) adolescents within clusters to adolescents not in clusters to assess differences between adolescents in high and low transmission groups.
Results
The dataset for our analyses included 3,102 pol sequences from 1,953 individuals; 374 (19%) were adolescents, with a median age of 22 years (IQR: 20–23; mean: 21.48) (Table 1, Supplementary Table S1; Supplementary Data are available online at
Chi-squared test used for categorical variables and unpaired two-tailed t-test used for comparing means of continuous variables.
Multivariable model included sex, age group, race, and HIV-1 risk group as covariates.
All transgender were male-to-female, this group included too few individuals to include in either regression model.
Adolescents were defined as ages 13–24 years at the time first engaged in HIV-1 care.
“Other” races include Asian, Black African, Hawaiian, Pacific Islander, Multiracial, and American Indian.
IDU, intravenous drug use; MSM, men who have sex with men.
Bold represents p value of less than or equal to 0.05.
Clusters defined by FastTree likelihood-based node support ≥0.95 and maximum pairwise genetic distance <1.5%.
Adolescents excluded patients with perinatally-acquired or transfusion-acquired HIV who were 13–24 years of age at time engaged in HIV-1 care.
Characteristics for “Others” in adolescent clusters were counted once per adolescent; because five adolescent clusters had two adolescents, the demographic characteristics for 18 individuals (10 adolescents and 8 adults) in the “Others” column were counted twice.
Comparison between adolescents in adolescent clusters and “Others” in adolescent clusters.
Chi-squared test used for categorical variables and unpaired two-tailed t-test used for comparing means of continuous variables.
Comparison between those in nonadolescent cluster and total (adolescents and adults) in adolescent cluster.
Comparison between adolescents not in a cluster and adolescents in a cluster.
Number of patients (%), unless otherwise specified.
Bold represents p value of less than or equal to 0.05.
Discussion
Phylogenetic analyses revealed that Seattle adolescents in putative HIV-1 transmission clusters likely acquired HIV-1 through sexual pairing with older adults. The observed age differential of 12 years supports previous epidemiologic research reporting intergenerational sex to be a risk factor for HIV-1 acquisition in young MSM. 22 –25 This is also consistent with a national study that found that 36% of MSM 13–24 years had a potential transmission partner who was >5 years older. 26 Studies have shown that young MSM in age-discrepant pairs are more likely to engage in unprotected sex and receptive anal intercourse with older partners and report drug use and sexually transmitted infections, all of which increase their risk of acquiring and transmitting HIV-1. 5,6,22,27,28 Qualitative research has suggested that the participation in these behaviors may stem from differences in financial or social status. 22,29 The higher baseline CD4 counts in adolescents, who clustered in phylogenetic analysis compared with those adolescents who did not, suggest one of two possibilities: (1) that the clustered adolescents were the recipients rather than the source of HIV-1 infection or (2) that the clustered adolescents were earlier in their course of disease. We think the former explanation more probable since individuals at younger ages generally have less time since sexual debut.
Limitations to our study, similar to all studies with retrospective sampling of pol genotypes, relate to uncertainties regarding the proportion of the HIV-1-infected population studied, and our inability to identify the transmission networks for all infected adolescents (10% of adolescents in our study were in a phylogenetic cluster), and particularly for young women. The UW and SCH HIV clinics provide the majority of free and/or government-sponsored HIV care in the Seattle metropolitan area. Thus, it is possible that many Seattle adolescents were infected by individuals residing in other geographic regions or by individuals who receive care funded by private insurance or who do not seek or receive HIV care. The majority of Seattle's HIV-infected individuals who receive care outside of the UW and SCH networks obtain care in private practice settings financed by private insurance. These individuals are likely of older age, higher socioeconomic status, and maintain viral suppression. The unavailability of these individuals’ sequences could bias our results toward an underestimate of the age differentials within adolescent clusters. The lack of data related to homeless status may have limited our sensitivity to assess associations with cluster membership (either generally or among adolescents). Finally, HIV-1 transmission risk factors were reported by the individuals themselves, which may have led to a reporting bias against stigmatized behaviors such as IDU and MSM. Since we found that both MSM and MSM/IDU were significantly associated with clustering, omission of these behaviors would likely have biased our findings toward the null.
In conclusion, our phylogenetic study of HIV-1 pol sequences in Seattle finds that a significant number of adolescent HIV-1 transmission events are associated with age-discrepant sexual pairing, specifically among white MSM. This suggests that the MSM community may benefit from new prevention strategies aimed at age-discrepant pairs.
Footnotes
Acknowledgments
Authors’ contributions: E.W.: data collection, data curation, study design, data analysis, data interpretation, and writing; J.T.H.: study design, data analysis, data interpretation, and writing; S.v.R.: data collection and database implementation; M.K.: data collection and database implementation; K.T.: data analysis, data interpretation, and writing; G.P.: generated all the sequence data; and L.F.: data collection, study design, data interpretation, and writing. This work was supported by an International Maternal Pediatric Adolescent AIDS Clinical Trials Group (IMPAACT) Virology Developmental Laboratory award (U01 AI068632), the Clinical Epidemiology and Health Services Research Core of the UW/FHCRC Center for AIDS Research (P30 AI027757), the Clinical Research and Retrovirology Core of the Seattle Centers for AIDS Research (P30 AI 027757), and the IMPAACT Statistical and Data Management Center (U01 AI068616) (to L.F.). Overall support for IMPAACT was provided by the National Institute of Allergy and Infectious Diseases (NIAID) of the National Institutes of Health (NIH) under award numbers UM1AI068632 (IMPAACT LOC), UM1AI068616 (IMPAACT SDMC), and UM1AI106716 (IMPAACT LC), with cofunding from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) and the National Institute of Mental Health (NIMH) and strategic funds from Seattle Children's Research Institute (to L.F.). Additional support was from NIAID R01 AI108490 (to J.T.H.) and by a developmental grant from the University of Washington Center for AIDS Research (CFAR), an NIH funded program under award number P30AI027757 that is supported by the following NIH Institutes and Centers (NIAID, NCI, NIMH, NIDA, NICHD, NHLBI, NIA, NIGMS, and NIDDK). EW received salary support from the Ruth L. Kirschstein National Research Service Award (NRSA) NIH grant T32HP10002. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.
Author Disclosure Statement
No competing financial interests exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
