Abstract
The most potent and broad HIV envelope (Env)-specific antibodies often when reverted to their inferred germline versions representing the naive B cell receptor, fail to bind Env, suggesting that the initial responding B cell population not only exclusively comprises a naive population, but also a pre-existing cross-reactive antigen-experienced B cell pool that expands following Env exposure. Previously we isolated gp120-reactive monoclonal antibodies (mAbs) from participants in HVTN 105, an HIV vaccine trial. Using deep sequencing, focused on immunoglobulin G (IgG), IgA, and IgM, VH-lineage tracking, we identified four of these mAb lineages in pre-immune peripheral blood. We also looked through the ∼7 month postvaccination bone marrow, and interestingly, several of these lineages that were found in prevaccination blood were still persistent in the postvaccination bone marrow, including the CD138+ long-lived plasma cell compartment. The majority of the pre-immune lineage members included IgM, however, IgG and IgA members were also prevalent and exhibited somatic hypermutation. These results suggest that vaccine-induced gp120-specific antibody lineages originate from both naive and cross-reactive memory B cells.
Introduction
Despite increased access to antiretrovirals for treatment and prevention, HIV-1 is still a major health burden, with ∼1.5 million new infections and ∼680,000 HIV-related deaths globally in 2020. 1 Thus, the development of a safe and an effective preventive HIV vaccine remains a global priority. HIV-1 Envelope (Env) glycoprotein is the primary target of the humoral response to HIV-1 infection and the antibody (Ab) response has been correlated with vaccine-mediated protection from HIV infection. 2 –7 Although Env is highly immunogenic and readily induces Abs, Ab-mediated protection from HIV appears to be conferred, if at all, by only a minor subset of Ab clones from the overall Env-specific immunoglobulin (Ig) repertoire. Approximately 10%–50% of people living with HIV (PLWH) within several years of infection develop Abs with tremendous breadth able to neutralize diverse isolates, although typically not their own contemporary isolates, 8,9 indicating that such broadly neutralizing antibodies are capable of being induced, and that if recapitulated by vaccination would likely confer protection from infection.
Numerous HIV-1 broadly neutralizing monoclonal antibodies (bnmAbs) have been isolated from PLWH and are the most well characterized of HIV-1-specific mAbs, including extensive deep-sequencing analysis of their clonal lineages. These bnmAbs typically have extensive somatic hypermutation and common molecular features in the heavy- and light-chain variable regions. Some of these bnmAbs share very specific structural features such as gene usage and complementarity-determining region 3 (CDR3) lengths, particularly observed among those targeting the CD4 binding site, referred to as VRC01-class bnmAbs. 10,11 Many of these bnmAbs when reverted to their presumed germline or naive sequence fail to bind HIV Env, 12 –14 raising the possibility that they may arise from naive B cells that responded initially to a different antigen, acquiring cross-reactivity with HIV Env sufficient to respond to HIV infection, and then subsequently developed increasing affinity to HIV Env as part of their extensive somatic hypermutation process in response to chronic viremia.
Recently, the development of “germline-targeting” immunogens has showed their potential for binding predicted germline versions of bnmAbs, activating transgenic B cells and generating Ab responses in B cell receptor (BCR) transgenic mouse models. 15 –22 Havenar-Daughton et al. have used the eOD-GT8 immunogen successfully to isolate genuine human naive B cells ex vivo, showing some promise for the “germline-targeting” vaccine development concept. 15 Subsequently, the eOD-GT8 60mer immunogen is in the early stages of clinical testing to determine the potential of germline targeting approaches for stimulating the human B cell repertoire. 23
Several HIV-1 vaccine efficacy trials have been completed so far although no vaccine has been licensed to date. 24 RV144 is the only immunogen-based preventive HIV vaccine trial that has demonstrated moderate protection so far; however, this regimen did not induce a substantial neutralizing Ab response. 25 More recently, two phase 2b antibody-mediated prevention trials with the VRC01 bnmAb infusion in high-risk populations have found that the VRC01 was able to block acquisition of only ∼30% of the circulating strains, however, it was 75% effective at preventing acquisition of VRC01-sensitive strains (in vitro IC80 < 1 μg/mL). 26 –28 Although VRC01 could not prevent the overall HIV-1 acquisition more effectively than placebo, these “proof-of-concept” studies indicated the potential of bnmAbs to prevent HIV infection. All these studies highlight the need to understand the mechanisms of bnAb formation to develop an effective HIV-1 vaccine. 26,27
Several findings from PLWH support that one of the possible developmental pathways for HIV-1 Env antibodies involve polyreactive B cells that cross-react with self and/or microbial antigens, particularly those that target the gp41 component of Env. 29 –31 In addition, Williams et al. reported that following HIV vaccination, a gp41-reactive mAb lineage, which cross-reacts to gut residing Escherichia coli RNA polymerase, developed. They further suggested that such cross-reactive populations may result in an immunodominance that minimizes the development of other Env-specific B cell responses to vaccination. 32 In general, HIV-1 Env vaccine containing both gp120 and gp41 primarily induces an intestinal microbiota-derived cross-reactive gp41-dominant response along with a low frequency of gp120 response, 32 whereas immunization with gp120-only results in a comparatively higher frequency of gp120-specific memory B cell response than the response that is induced from gp140 or gp160 vaccination. 33,34
Moreover, several lines of evidence indicate that the precursors of Env-specific mAb lineages can be identified in the naive B cell repertoire 15 and some antibodies with bnAb activity can also arise from naive B cells. 35
Previously we isolated 66 gp120 reactive mAbs 36 from the peripheral blood B cells of HVTN 105 trial participants, 37 a phase 1 trial testing AIDSVAX® B/E (gp120 protein) that was used in RV144, combined with a DNA immunogen. To determine the origin of these mAb producing B cells, we used deep sequencing-based VH lineage tracking and identified several of these mAb lineages in peripheral blood before vaccination (pre-immune). The majority of the pre-immune gp120-reactive lineages included IgM, however, IgG and IgA members were also predominant before vaccination. In addition, the majority of the pre-immune lineage members exhibited somatic hypermutation before vaccination, suggesting that the vaccine-induced gp120-specific Ab lineages originated from both naive B cells and cross-reactive memory B cells.
Materials and Methods
Study participants
Blood samples for this study were obtained from 21 participants at the University of Rochester, who participated in the HVTN 105 phase 1 randomized, blinded, multisite HIV vaccine clinical trial (

Characterization of pre-immune B cell repertoire in blood.
Peripheral blood mononuclear cells (PBMCs) were obtained before the vaccination and 7 days after the final vaccination. Bone marrow and tonsil biopsy samples were collected 7 months after the final vaccination.
HIV-specific antibody secreting cells ELISpot
The frequency of HIV-specific antibody secreting cells in total PBMCs was determined by ELISpot similar to as previously described. 36,38,39 Briefly, sterile 96-well polyvinylidene fluoride membrane plates (MilliporeSigma, USA) were coated overnight at 4°C with 50 μL of 5 μg/mL of HIV-1 Env gp140 SF162.B (NIH AIDS Reagent Program) or 1 μg/mL of anti-human IgG or IgM (Jackson ImmunoResearch, West Grove, PA, USA) in PBS. Plates were blocked with RPMI1640 (Corning, VA, USA) media with 10% fetal bovine serum (Atlanta Biologicals, GA, USA) for 2 h at 37°C. Then PBMCs were added in a final volume of 200 μL per well in triplicate. The plates were incubated for ∼40 h at 37°C in 5% CO2 and then washed with PBS containing 0.1% Tween 20.
Bound antibodies were detected with 50 μL of 1 μg/mL of alkaline phosphatase-conjugated anti-human IgA and horseradish peroxidase-conjugated anti-human IgG (diluted in PBS containing 0.1% Tween 20 and 1% BSA) Ab (Jackson ImmunoResearch) for 2 h at 37°C in 5% CO2 and then developed with VECTOR Blue, Alkaline Phosphatase Substrate Kit III (Vector Laboratories, Burlingame, CA). The spots per well were counted using the CTL immunospot reader (Cellular Technologies Ltd., Shaker Heights, OH, USA).
VH next-generation sequencing
For the Ig VH sequencing library preparation, PBMCs were collected before the start of the vaccination trial (at M0). gp120+ve and gp120-ve B cell fractions were isolated by magnetic selection using biotinylated HIV Env protein gp120 MN.B (ARP-12570, NIH AIDS Reagent Program) and
The Ig heavy-chain libraries were prepared as described in Piepenbrink et al., 202141 with modifications. Briefly, total RNA was isolated using the RNeasy Mini Kit (Qiagen, Germany) and treated with DNase I (Turbo DNA-free Kit; Invitrogen, Lithuania). Approximately 1/9 part of this RNA was used for cDNA synthesis in a 20 μL reaction with the qScript cDNA Synthesis Kit (Quantabio, MA, USA). Two rounds of PCRs were carried out to generate the libraries with a cocktail of VH1–6 forward primers and IgA, IgG, and IgM reverse primers 36 with the modifications adapted for the Illumina Nextera approach. Following PCR, amplicons were analyzed in 1% agarose gel and bands corresponding to ∼600 bp were purified with the QiAquick Gel Extraction Kit (Qiagen, USA). All the reactions were carried out in triplicates and pooled together during the gel purification.
The individual libraries were further purified using the ProNex size-selective purification system (Promega, WI, USA) to select products between the 500 and 700 bp range. Final products were submitted to the University of Alabama at Birmingham Genomics Core Laboratory, where quality control and quantification were performed using qRT-PCR. Finally, the libraries were pooled together in equimolar ratio and sequenced on an Illumina MiSeq system (Illumina, Inc., CA, USA) using 2 × 300 bp paired-end kits (Illumina MiSeq Reagent Kit v3, 600-cycle; Illumina Inc.).
Sequence analysis of the repertoire
Sequence analysis was performed using an in-house custom analysis pipeline described previously.
36,42,43
Briefly, all sequences were aligned with
Expanded clones were determined (Supplementary Fig. S1) by plotting size-ranked lineages (% partitions of total sequences) stacked on along the y-axis, and the normalized size per lineage (% of total sequences) plotted on the x-axis. The “knee” of the resulting curve indicates the lineages above which are considered “expanded.” This knee is determined by drawing a line from the top of the curve to the plot origin, and then identifying the longest perpendicular line to the clonality curve.
For targeted resolution of pre-immune lineage members of gp120-specific mAb-based lineages and the subsequent lineage tree construction, mAb sequences were parsed against the libraries, and sequences having the same V and J gene usage, HCDR3 length, and ≥80% HCDR3 similarities to the mAb sequence were identified and underwent further filtering using Partis, a multihidden Markov model-based method, 51 to confirm the clonal relationship of lineage members. For lineage tree construction, within a lineage, instances of only a single identical nucleotide sequence were removed, with the exception of sequences that match any M0 or M13 or inferred node sequence. The resulting sequences were analyzed using PHYLIP's dnapars tool (version 3.69s). 52 The output file was then parsed using in-house custom scripts, collapsing any duplicate sequences into an individual node, and was visualized using Cytoscape. 53
mAb generation
Mature mAbs were cloned out from plasmablasts isolated 7 days after the final vaccination as previously described in Basu et al. 36 Pre-immune heavy chain representatives were selected on the basis of highest amino acid similarity from the respective germlines. Predicted germline versions were designed with germline V, J, and pre-immune CDR3 sequences for the heavy chains and mature CDR3 for the light chains. These V-regions were synthesized from gBlocks (IDT) and then used for the production of recombinant mAbs as full human IgG1 as described previously. 36,54,55 For the pre-immune version mAbs, pre-immune heavy chains were paired with mature light chains, and for the germline version mAbs, germline heavy chains were paired with germline light chains. After 8 days of transfection, mAbs were purified from the culture supernatant using Magna Protein A beads (Promega).
Enzyme-linked immunosorbent assay
The reactivity profiles of mAbs against HIV Env proteins were detected by ELISA. Briefly, ELISA plates (Nunc MaxiSorp; Thermo Fisher Scientific, NY, USA) were coated overnight at 4°C with 50 μL of 0.5 μg/mL of HIV Env gp120 (MN.B or A244.AE, NIH AIDS Reagent Program) in PBS and blocked with 3% BSA in PBS for 30 min at room temperature. Plasma samples were tested at 1:2,500 dilution and mAbs were tested at 10-fold dilutions (100, 10, 1, and 0.1 μg/mL). Fifty microliters of the sample (diluted in PBS containing 0.05% Tween 20 or PBST) was added per well in triplicate and incubated for 1 h. The reaction was detected using peroxidase-conjugated anti-human IgG (Jackson ImmunoResearch), diluted 1:2,000 in PBST. Mean optical density values of triplicate test samples were divided by control (PBST) and represented as relative units (RU).
Surface plasmon resonance
Surface plasmon resonance (SPR) experiments were performed on a Biacore T200 with a CM5 sensor and human IgG capture kit (Cytiva) at 25°C using HEPES buffered saline (HBS) with 0.005% P20 as the running buffer. Anti-human IgG was captured using the protocol recommended by Cytiva over all four flow paths. Between 535 and 580 RU of each Ab was captured using a contact time of 60 s and a 10 μL/min flow rate in an HBS running buffer containing 3 mM EDTA and 0.005% P20. MN.B gp120 (AIDS Reagent Repository) was used as the analyte diluted between 1.6 and 200 nM. gp120 MN.B was injected over flow paths 1, 2, 3, and 4 with a contact time of 120 s, flow rate of 30 μL/min, and a long dissociation of 1,200 s.
Flow path 1 was used as reference, while antibodies were bound in flow paths 2, 3, or 4. 3 M magnesium chloride was injected over all four flow paths for regeneration using a contact time of 30 s and flow rate of 30 μL/min. 1:1 Langmuir model was used to determine the kinetics of each Ab. Because of the low level of binding of gp120 (or the saturation at low gp120 MN.B concentrations), steady-state kinetics were attempted for both pre-immune 1098B8 and germline 1098B8 by reducing the captured Ab to ∼120 RU and increasing the contact time of analyte to 300 s. The pre-immune 1098B8 mAb did not fit well to 1:1 Langmuir kinetics and the lower binding could only be estimated.
Statistical analysis
Statistical analysis of VH deep sequencing data was done with GraphPad Prism v7.0 or MATLAB, using the unpaired Mann–Whitney test or Wilcoxon matched-pairs singed rank test, as appropriate.
Results
Identification of HIV-1 Env-specific B cells in pre-immune blood
We have previously observed the presence of HIV-1 Env gp140 reactive B cells in healthy participants both within the peripheral blood CD27− naive B cell and CD27+ memory B cell compartments (Fig. 1A), suggesting the presence of pre-existing non-naive HIV-1-Env reactive B cell populations. To characterize this pre-immune HIV-1 Env reactive B cell compartment further, we collected peripheral blood samples from 21 healthy participants of the HVTN 105 HIV-1 vaccine clinical trial before and after immunization (Fig. 1B). The participants received four vaccinations over a period of 6 months with different combinations of AIDSVAX B/E bivalent gp120 protein, which includes gp120 MN.B and gp120
Human gut microbiota reactive antibodies are known to recognize gp41 subunit of the HIV-1 Env.
29,37,56
–58
Therefore, we decided to isolate peripheral blood pre-immune HIV-1-Env-specific (gp120pos) B cells using magnetic bead-based enrichment with gp120 MN.B and
Characterization of pre-immune B cell repertoire in blood
To understand the molecular features of HIV-1 Env-specific pre-immune BCR repertoire, we analyzed characteristics of the heavy chain variable region by deep sequencing of these gp120pos and gp120neg peripheral blood B cells from all 21 participants. Comparing the isotype distribution (Fig. 1C), we found IgM was the predominant isotype in both the gp120pos and gp120neg repertoires overall, as expected, since naive IgM+ B cells compose the majority of the peripheral B cell compartment in healthy individuals. Non-IgM isotypes (i.e., IgG or IgA) were also substantially present in gp120pos (ranging from 3.8% to 52% for IgA and from 1.5% to 50.9% for IgG) as well as in gp120neg (ranging from 2.7% to 41.5% for IgA and from 1.9% to 53.7% for IgG) in pre-immune blood.
However, on the individual participant basis, higher frequencies of IgM (in 15/21; 71% of the participants) and IgG (in 12/21; 57% of the participants) were found in gp120neg compared with gp120pos. In contrast, the gp120pos fraction was significantly enriched (p = .0003) for IgA with a higher IgA proportion in 86% of the participants (18/21) compared with their gp120neg population.
The CDR3 is the most variable region of the Ab molecule and a major determinant for antigen recognition and binding. Among various known characteristics, the presence of long HCDR3, as defined as ≥22 aa, is a common feature for some well-characterized bnmAbs against HIV-1, and several Env-reactive mAbs isolated from HIV vaccine trials also possess this feature. 36,59 We searched for this trait in the pre-immune peripheral blood B cells and found significant (p < .0001) enrichment of lineages with long HCDR3 in the gp120pos over gp120neg fraction in 17 of the 21 (∼81%) participants (Fig. 1D).
Next, we wanted to assess the presence of somatically mutated Abs in the pre-immune repertoire as an indication of antigen experience. Since the predominant population in the peripheral blood of healthy is expected to be naive B cells, which lack somatic hypermutation, BCR sequences with >10% nt mutations compared with germline were analyzed to focus on the non-naive population only. There was no significant difference in the overall VH mutation level between the gp120pos and gp120neg non-naive B cell fractions (not shown). Assessment of expanded clonal lineages, those that predominate the repertoires (Supplementary Fig. S1), revealed that there was a significantly (p = .0083) higher frequency of mutated (>10% nt) VH sequences in gp120pos (Fig. 1E). This increased VH mutation in gp120pos compared with gp120neg was most evident in framework region 1 (FR1) (p = .0098) and FR2 (p = .0001). Together, these results indicate that the peripheral blood pre-immune gp120pos compartment includes IgM, IgG, and IgA B cells, with proportionally more IgA, clones with longer HCDR3, and increased somatic hypermutation in FR1 and FR2.
Expansion of HIV-1 Env-specific pre-immune B cells following HIV Env vaccination
The pre-immune HIV Env-reactive B cell fraction is assumed to not have had any prior exposure to HIV Env and is anticipated to have a certain degree of representation in the resulting HIV Env vaccine-induced B cell repertoire. Previously, for the HIV Env vaccine phase 1 trial HVTN 105, BCR repertoires from peripheral blood 7 days after the final vaccination (M6), expected to be dominated by vaccine-induced IgG plasmablasts, were sequenced. 36 We sought to understand the degree of overlap between the gp120pos prevaccination and the vaccine-induced peripheral blood BCR repertoire. Nine out of 11 participants for whom M6 samples were analyzed had postvaccination lineages that could be traced back to the HIV-1 Env-specific pre-immune B cell compartment. However, as expected, due to the large size and diversity of the repertoire and limitation of sequencing depth, only a few lineages were shared between the pre- and post-vaccination samples.
To understand whether these shared pre-immune lineages originated from the naive compartment or not, pre-immune sequences having ≤1% nucleotide mutations from germline in the variable gene were considered derived from naive B cells, and those with >10% nucleotide mutations from germline in the variable gene were considered derived from non-naive B cells (Supplementary Table S1). These criteria were based on previous observations of somatic hypermutation in naive BCR repertoire analysis to account for sequencing errors and allelic miscalls 43 –48 and to minimize the potential of false identification of a naive B cell within the non-naive bin. Although a similar number of shared lineages between M6 and pre-immune gp120pos were found in the naive and the non-naive compartments (Fig. 2A), the shared lineages between M6 and gp120pos pre-immune comprised a significantly (p = .0117) higher proportion (highest 2.4%) of the non-naive compartment of the gp120pos pre-immune compared with the proportion of the gp120pos pre-immune naive compartment that was shared with M6 (highest 0.20%) (Fig. 2B).

Expansion of HIV-1 Env-specific pre-immune B cells following HIV Env vaccination. gp120pos pre-immune B cells that shared lineages with M6 peripheral blood that was obtained 7 days after final immunization were split into naive (≤1% nucleotide mutations from germline in the variable gene) and non-naive (>10% nucleotide mutations from germline in the variable gene) compartments and characterized. Each symbol represents an individual subject.
Among the lineages that are shared between pre-immune and postvaccination (M6), there was a higher frequency of VH1–2, VH1–8, and VH5–51 lineages in the naive than in non-naive pre-immune lineage members (Fig. 2C). Contrastingly, the frequency of VH1–69 lineages was higher in non-naive (11.7%) than in naive (2.2%) pre-immune members. Among the JH gene usage, increased JH6 usage in the naive than in the non-naive population was most evident (Fig. 2D). Together, these results suggest that the non-naive gp120pos pre-immune compartment is preferentially responsive to this HIV vaccine than the naive gp120pos pre-immune compartment. These results further suggest that the VH1–2, VH1–8, and VH5–51 gp120pos lineages may preferentially arise from the naive pre-immune compartment, whereas the VH1–69 gp120 lineages may preferentially arise from the non-naive pre-immune compartment following HIV Env vaccine.
Mixed origins of vaccine-induced gp120-specific pre-immune repertoire
Observing shared lineage members in the pre- and postvaccination peripheral blood samples, we next sought to definitively characterize the dynamics of the gp120 pre-immune lineages that respond to vaccination. Previously we isolated 66 gp120-specific mAbs from HVTN 105 participants after their last vaccination. 36 We sought to determine whether the precursor B cells for these mAbs were present in the peripheral blood pre-immune B cell compartment and subsequently participated in the Env-reactive B cell response to the vaccination. We parsed our mAbs against the pre-immune deep-sequencing database using the criteria of identical VH and JH, HCDR3 length, ≥80% HCDR3 nucleotide sequence similarity, and clonal assignment based on the Partis multihidden Markov model program, 51 consistent with previous deep-sequencing clonal lineage analyses 60 –62 to identify the pre-immune clonal lineage members, and subsequently identified four lineages from three of the study participants (Table 1; Supplementary Fig. S2). No distinct binding or molecular features such as V or J family gene usage distinguished these mAbs from the other mAbs that were not found in the pre-immune repertoire (not shown).
Summary of Vaccine-Induced gp120-Reactive Monoclonal Antibody Lineages Identified in Pre-immune Peripheral Blood
M1 = 7 dp 2nd vaccination and 1 m.p.f.v; M3 = 7 dp 3rd vaccination and 3 m.p.f.v; M6 = 7 dp 4th and final vaccination, 6 m.p.f.v; M13 = 13 m.p.f.v (7 months postfinal vaccination); BM (7 months postfinal vaccination); tonsil (7 months postfinal vaccination).
BM, bone marrow; dp, days post; mAb, monoclonal antibody; Ig, immunoglobulin; m.p.f.v, month post 1st vaccination.
Three of these four mAbs utilize VH1, which we previously reported to be dominant in the vaccine-induced lineages in HVTN 105. 36 Pre-immune members of 2/4 lineages (1131D12, 1095A8) were found to be IgM and IgA and/or IgG, with a minimum of 6% VH amino acid mutation from germline, indicating they developed from a non-naive pre-immune origin. The 1098B8 lineage members were only found as IgG in pre-immune, also indicating development from a non-naive pre-immune origin. The 1131D12 lineage members were found as IgA and somatically mutated IgM in pre-immune indicating the development from a non-naive pre-immune origin. Members of the 1131D12 lineage were also found in tonsil 7 months after the final vaccination, all as IgA, indicating the mucosal presence of this lineage. Among these three non-naive pre-immune origin lineages, substantial VH somatic hypermutation (6%–37% AA) in the pre-immune lineage members was evident.
The 1131A5 lineage was found as only IgM in pre-immune, including lineage members in their germline configuration, lacking any VH amino acid mutation, suggesting that the 1131A5 lineage developed, at least in part, from a naive pre-immune origin.
We selected three representative lineages to investigate further and constructed lineage trees from the clonally related sequences. The 1131A5 lineage (Fig. 3) had the lowest somatic mutation (0.3%–7.4% nt) observed in pre-immune members (M0), which consisted of only IgM and was considerably mutated when isolated from postvaccination blood as an IgG mAb. This lineage is likely indicative of a naive B cell responding to HIV gp120 proceeding through affinity maturation and class switching. The 1095A8 lineage (Fig. 4) includes pre-immune members that are highly somatically mutated (6.5%–36.6% AA) and IgA, IgG, and IgM. The lineage spans multiple time points following the vaccination course, with IgA dominance in pre-immune and throughout.

HIV-1 Env reactive 1131A5 lineage in pre-immune blood.

HIV-1 Env reactive 1095A8 lineage in pre-immune blood.
The 1098B8 pre-immune lineage members (Fig. 5) were identified as IgG and had substantial mutation from germline (16.3%–17.4% AA) (Fig. 6).

HIV-1 Env reactive 1098B8 lineage in pre-immune blood. Phylogenic analysis of 1098B8 lineage. Lineage members were defined as same heavy-chain V and J gene usage, HCDR3 length, and >80% HCDR3 similarity to the mAb sequence, and clustered based on Partis analysis. M13 sequences were obtained from peripheral blood, total BM, and CD138+ BM PC. Individual nodes indicate identical nucleotide sequences (n = 1–653). Black nodes indicate undefined isotype, symbol shape indicates time point. BM, bone marrow; PC, plasma cells.

Sequence alignment of 1098B8 lineage members. Alignments depict germline, pre-immune lineage members, M6 mAb, and M13 BM PC-derived sequences for 1098B8 lineage.
Lineage members with additional somatic mutations were also present as CD138+ long-lived plasma cells in the bone marrow 7 months after final vaccination and also in peripheral blood memory B cells. This suggests that the lineage represents a non-naive B cell responding to HIV gp120 and resulting in additional clonal expansion and somatic hypermutation.
gp120 binding by pre-immune antibodies
We tested the gp120 binding ability of the pre-immune versions of these lineages (Fig. 7). The representatives of each pre-immune lineage and the germline versions were synthesized and complemented with the mature or the germline light-chain versions. The pre-immune mAbs showed lower reactivity than the postvaccination mature Abs against HIV-1 Env of the vaccine component strains MN.B [AIDSVAX B/E] (Fig. 7A) and

gp120 binding by pre-immune antibodies.
Together, these results suggest that 1098B8 developed from a non-naive pre-immune origin as a result of an initial naive B cell, which had minimal gp120 reactivity, responding to a non-HIV Env antigen and undergoing somatic hypermutation and acquiring HIV Env cross-reactivity, and subsequently responding to the HIV Env vaccine and undergoing HIV Env-directed affinity maturation.
Discussion
While HIV Env is immunogenic and its delivery in numerous forms such as plasmid DNA, protein, and viral vector readily induces Env-specific Ab and B cells, only a minority subset of the responding repertoire is typically able to target the most consequential epitopes or have the potential for conferring protection. As HIV vaccine development progresses, a major emphasis has been placed in trying to focus the humoral vaccine response to highly conserved epitopes and elicit clonal lineages that preferentially target these epitopes. To aid in such nuanced HIV vaccine development efforts, we sought to characterize the state and features of the peripheral blood gp120-specific B cell repertoire before immunization, and the features of those pre-immune B cell clonal lineages that ultimately respond to the vaccine. We determined that in humans a major proportion of the vaccine-induced gp120-specific BCR repertoire originates from non-naive B cells present before immunization.
Most of the pre-immune gp120-specific repertoire consisted of IgM (∼60%), however, IgA and IgG comprised ∼25% and ∼15%, respectively. Although a significant level of Ab diversity has been observed in the pre-immune gp120-specific repertoire, it cannot be ruled out that the use of trimeric gp120 instead of monomeric gp120 protein for isolating B cells could have a higher potential to bind to the rare Ab expressing or lower affinity IgM precursor B cells. 63,64 Another factor to consider is that glycans found on HIV-1 Env and other pathogens are also present on host molecules, 65,66 suggesting that the pre-immune gp120 reactive B cell population may include cross-reactive but weakly glycol-specific B cells. In addition, the absence of particular glycans (e.g., presence of glycan holes) on the antigens has shown to improve HIV Env-specific immune responses. 67 –69 However, several bnAbs, which recognize the natural heavily glycosylated epitopes on HIV-Env protein, have been well-characterized. 70,71
Half of the gp120-specific mAb lineages we were able to track back to pre-immune included pre-immune IgA clonal members. While the functional relevance and consequences of the IgA compartment as a major origin of gp120-vaccine-induced responses are to be determined, it is consistent with past findings during early HIV infection and exposure that an IgA response to predominantly gp41 but also gp12072–74 is observed particularly in mucosa. 75,76 IgA antibodies and B cells are major contributors to mucosal protection, and although we did not look in-depth at the mucosal BCR repertoire and only following immunization, IgA members of the 1131D12 lineage were evident in tonsil after vaccination, and IgA pre-immune members were present in peripheral blood, which may suggest a mucosal association with the origin of this lineage.
The systemic and mucosal IgA BCR repertoire is strongly influenced by B1 B cells, 77,78 which are an unconventional B cell population that responds quickly against mucosal pathogens through innate-like recognition via IgM, but also that can further develop into highly antigen-specific IgG and IgA B cells 77,79,80 indeed, about half of the mucosal IgA has a B1 origin. 77,78 Future studies that directly evaluate pre-immune mucosal B cells or mucosal immunization may provide additional insight into the development of protective IgA responses to HIV vaccines. Together this suggests that further assessment of the contribution of mucosal IgA and B1 B cells to the pre-immune gp120 compartment is warranted.
An enrichment of long HCDR3 and increased somatic hypermutation in FR1 and FR2 was evident in the pre-immune gp120 compartment, suggesting intrinsic germline and acquired molecular features that determine recognition of gp120. This is consistent with the observation that long HCDR3 is frequently found in responses to HIV infection and vaccination, particularly with Abs targeting gp120. 59,81 In addition, there have been reports of acquired FR1 and FR2 motifs that may be conserved among public clonotypes recognizing gp120. 82 –85
VH1–2 and VH1–46 usage among the gp120 repertoire is particularly notable, given their use among VRC01-class antibodies that target the CD4 binding site of gp120 and have a potent and broad neutralizing activity. Although VH1–2 and VH1–46 were not overrepresented in the gp120 pre-immune compartment, they were overrepresented among the pre-immune lineages of naive origin that responded to the immunization. In contrast, VH1–69, which is known to be frequent in antiviral responses, 86 and VH3–23, which is the most common VH gene used in the human Ig repertoire, 45,87 were overrepresented in those vaccine responding pre-immune lineages of non-naive origin. It is anticipated that tuning vaccines to elicit VH-specific/class-type responses, such as being done with the “germline-targeting” approaches, may be more consistent and reproducible among vaccinees when targeting the naive pre-immune compartment, and subsequently more uniform, as being dependent on just germline representations.
However, induction of VH-specific/class-type responses that are overrepresented in the non-naive pre-immune compartment (e.g., VH1–69) would likely be more inconsistent and highly variable among vaccinees, and heavily dependent on exposure history to the cross-reactive non-HIV antigens that stimulated the initial naive B cell and the acquisition of cross-reactivity to gp120. Such a process would likely be difficult to reverse engineer into a vaccination strategy, and it remains unclear, given the likely redundant pathways to generate bnAbs against the same epitope, 88,89 whether pre-immune non-naive and naive lineages can equally develop into bnAbs. Due to the infrequent nature of gp120-specificity in the pre-immune repertoire and limitations of deep sequencing approaches, highly quantitative tracking and characterization of the lineages that respond or do not respond to immunization are difficult but should be pursued further given the fundamental heterogeneity we have defined here.
As mature light chains were used to reconstruct the pre-immune version mAbs, the contribution of the light-chain somatic hypermutation in the affinity maturation could not be resolved in this study. We observed that the non-naive pre-immune VH (e.g., 1098B8, 1095A8) conferred greater gp120 binding activity than the germline VH. This is consistent with previous demonstration that many HIV bnAbs, including 3BNC60, PGT145, CAP256-VRC26, NIH45–46, b12, and 2G12 when reverted to germline, have little to no recognition of unmodified gp120. 12,14,84,90 –92 This supports the possibility that at least a substantial proportion of the non-naive pre-immune gp120-specific compartment acquired affinity for gp120 through the affinity maturation process against another antigen. Through this process, these clonal lineages and their memory B cells would have likely expanded beyond naive frequencies, which could have serious implications for immunodominance, and provide rationale for the substantial representation of non-naive origin lineages in the postvaccine gp120-specific repertoire.
A number of studies have shown that the pre-existing immunity significantly affects the Ab responses in the setting of infections/vaccinations that include influenza, 93,94 SARS-CoV-2, 95,96 Zika, 97 and yellow fever. 98 Several gp120 reactive antibodies have demonstrated cross-reactivity with bacterial antigens, 32,99,100 but the sources and dynamics of these cross-reactive antigens remain to be fully resolved. If these initiating antigens or microbes could be identified, they may be highly beneficial to use in a prepriming approach to establish the non-naive pre-immune gp120 compartment, readied to respond to gp120 immunogens.
The current study was not able to resolve these initiating antigens, nor the precise gp120 epitopes recognized by the described mAbs or their dependence on glycosylated residues, and is a major emphasis of ongoing efforts. Because mature HCDR3 forms as a result of the combined influence of random V-D-J recombination, N additions, and somatic mutation, precisely determining the germline HCDR3 is not feasible, and the contribution of individual germline HCDR3 residues in conferring gp120 reactivity with the lineages we have identified should be examined.
In RV144, the decreased risk of infection observed in vaccinated participants was strongly correlated with the gp120-specific Ab response, but protection did not appear to be long-lived, highlighting that durability is a critical element of effective HIV vaccine. We have previously demonstrated that the HVTN 105 vaccine regimen induces long-lived bone marrow plasma cells, 36 the major source of systemic plasma Ab, and here we observe gp120-specific mAb lineage members persisting in the bone marrow for both lineages that arise from naive (1131A5) and non-naive (1098B8, 1131D12, 1095A8) origins. Although we cannot exclude some pre-immune seeding of the bone marrow with these non-naive origin lineage members, this suggests that origin of the gp120 humoral response, whether naive or non-naive, does not grossly hinder the ability to establish long-lived gp120-specific plasma cells.
It remains to be determined to what extent pre-immune gp120 reactive plasma antibodies may occur and their relationship with bone marrow plasma cells and influence to the humoral response to HIV vaccination and infection.
In conclusion, we have demonstrated that the pool of B cells that respond to initial immunization with HIV Env includes a substantial proportion of antigen-experienced non-naive B cells. Further defining the mechanisms by which the pre-immune Env-reactive B cell compartment develops and responds to vaccination will likely be necessary for developing broadly efficacious HIV vaccines and may provide insights for other viral vaccines seeking to induce precise Ab classes or germlines.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation. The Ab and repertoire RNA sequences are available from the database of Genotypes and Phenotypes (dbGaP) (
Footnotes
Acknowledgments
We are grateful for the assistance of the Rochester Victory Alliance and its staff. We are grateful for the assistance of the University of Rochester Flow Cytometry Core Facility, and the University of Alabama at Birmingham Genomics Core Laboratory and Multidisciplinary Molecular Interaction Core. We are most grateful for the participation of the HVTN 105 study volunteers, and thank the HVTN protocol team for their support of this project. We are very appreciative of the assistance provided by the HIV Vaccine Trials Network in enabling and coordinating this project. A version of this article has been published on preprint server BioRxiv
Authors' Contributions
M.B., A.F.R., J.L., L.-X.M, C.A.B., M.C.K., and J.J.K. conceived and designed the study. J.L. and L.-X.M. provided critical samples. M.B. and M.S.P. performed the experiments. M.B., M.S.P., C.F., A.F.R., and J.J.K. performed the data curation and analysis. M.B. and J.J.K. drafted and edited the article.
Author Disclosure Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Funding Information
This research was partially funded by the National Institutes of Health [5R21AI116285, 5R01AI117787, 5R01DE027245 to J.J.K., UM1AI069511 to M.C.K.], the University of Rochester Center for AIDS Research P30AI078498 (NIH/NIAID), the University of Alabama at Birmingham Center for AIDS Research P30AI027767 (NIH/NIAID), and the University of Alabama at Birmingham HIV Clinical Trials Unit UM1AI069452 (NIH/NIAID).
Supplementary Material
Supplementary Table S1
Supplementary Figure S1
Supplementary Figure S2
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
