Abstract
Genetic factors from an HIV-1 host can affect the rate of progression to AIDS and HIV infection. To investigate the frequency of mutations in the CCR5 gene, HIV-1 samples from infected women and uninfected individuals were selected for sequencing of the CCR5 gene regions encoding the N- and C-terminal protein domains. Physicochemical CCR5 modeling and potential protein domain analysis were performed in order to evaluate the impact of the mutations found in the properties and structure of CCR5. The p.L55Q mutation in the N-terminal protein domain was observed only in uninfected individuals, with an allelic frequency of 1.8%. Physicochemical analysis revealed that the p.L55Q mutation magnified the flexibility and accessibility profiles and the modeling of CCR5 structures showed resulting in a small deviation to the right, as well as a hydrophobic to hydrophilic property alteration. The p.L55Q mutation also resulted in a slight modification of the electrostatic load of this region. Additionally, three novel silent mutations were found at the C-terminal coding region among HIV-1-infected women. The results suggest that the p.L55Q mutation might alter CCR5 conformation. Further studies should be conducted to verify the role of this mutation in HIV-1 susceptibility.
Introduction
T
The mechanisms underlying HIV-1 infection and pathogenesis require a combination of viral and host factors. The CCR5 coreceptor is essential at initial steps of HIV-1 infection, in all routes of transmission, and also has an important role in disease progression, as the CCR5 expression levels were associated with HIV-1-mediated bystander apoptosis. 1,8 –10 Individuals carrying the CCR5Δ32 allele are resistant to R5- and R5/X4- HIV-1 strains. 11 This mutation results in a frameshift and generates a truncated protein that cannot be exported to the cell surface, resulting in reduced expression of the receptor at the cell membrane. 11 –13 In addition to the CCR5Δ32 mutation, other polymorphisms leading to posttranslational modifications, such as tyrosine sulfation and O-glycosylation of the CCR5 N-terminal domain, can affect the R5-strains entry into target cells. For instance, the tyrosine sulfations at positions 10 and 14 of the CCR5 N-terminal domain facilitate HIV-1 entry, contributing to the recognition of the CCR5 binding region by HIV-1 gp120. 3,14 –18 Glycosylation of CCR5 may also play a role in modulating the efficiency of HIV-1 entry at specific target cells. 18 Other mutations in N-terminal domains result in amino acid alteration, such as I12L, C20S, A29S, I42F, and L55Q. 19,20 In addition, some mutations at the CCR5 carboxyl-terminal (C-terminal) domain can impact protein folding and conformation, generating an atypical protein that may not anchor properly in the cell membrane and therefore may affect virus entry. 21
To verify the presence of CCR5 gene mutations and their frequencies in HIV-1-infected women and HIV-1-uninfected individuals from Bahia, Brazil, the CCR5 gene regions encoding the N- and C-terminal protein domains were analyzed.
Materials and Methods
Study population
HIV-1-infected women
CCR5 gene regions encoding the N- and C-terminal protein domains were studied in a convenience sample of 114 HIV-1-infected women belonging to subtypes B and C who were recruited during a gynecological follow-up. These patients were followed at the Reference Center for Sexual Transmitted Disease (RCSTD)/AIDS of Feira de Santana, Bahia, Brazil, a facility that provides free patient management. In 2007, blood samples were collected from all patients who agreed to participate in the study, and were then processed at the Advanced Laboratory of Public Health (LASP)/CPqGM/FIOCRUZ in Bahia, Brazil. Clinical and epidemiological data were obtained from medical records. Informed consent was obtained from the patients and/or the children's parents or guardians.
Control group
A total of 197 DNA samples from HIV-1-uninfected individuals were randomly selected to obtain control frequency, including 104 samples from volunteers with a median age of 25±15.2 years, consisting of 50 males and 54 females attending the Bahia Federal University and 93 samples selected from sentinel surveillance areas of Salvador, Bahia, previously established for the investigation of various infectious diseases. 21 These 93 samples were used to study, at first, both CCR5 regions. For the study of the CCR5 gene region encoding the N-terminal domain, all samples from the first population were also analyzed. However, for the analysis of the CCR5 gene region encoding the C-terminal domain, only 35 samples from the first population were included due to initial results showing no differences at genotypic and allelic frequencies of the CCR5 gene mutations detected when these two control populations were stratified and analyzed separately (data not shown). For this reason, the uninfected individuals from the two control groups were compiled and studied as the same reference group.
The local ethical committee's approval was obtained and all subjects provided written informed consent.
DNA extraction and mutation detection
Genomic DNA was extracted from peripheral blood mononuclear cells (PBMCs) using the QIAamp DNA Blood Mini Kit (QIAGEN Inc., Valencia, CA). The analyses of the N- and C-terminal regions of the CCR5 protein were performed through PCR amplification (300 bp and 344 bp, respectively) followed by sequencing. The primers constructed for these analyses were 5'TTTGCATTCATGGAGGGCA3' and 5'TGGCCAGGTTGAG CAGGTAG3' for the N-terminal region and 5'TCTCTTCTGGGCTCCCTACA3' and 5'CCAGCCCACTTGAGTCCGTG3' for the C-terminal region. All sequences obtained were analyzed with the SeqScape program (Applied Biosystems) and the observed nucleotide variations were confirmed using the BioEdit 22 and GeneDoc 23 programs.
Physicochemical and potential protein domain analyses
To investigate the possible impact of mutations, the physicochemical analysis of the missense mutations observed in the sequences was performed using Network Protein Sequence Analysis (NPSA) (
Homology modeling of CCR5
The Homology Model of Human CCR5 was generated using Modeller 9.10 software26 with the crystal structure (PDB: 2KS9, 3ODU, and 4MBS) as template chosen from the PDB BLAST hit (
Statistical methods
The allelic frequencies were estimated by direct allele counting. Conformity with Hardy–Weinberg equilibrium and the existence of linkage disequilibrium were tested using Genepop v.3.4. 28 The heterogeneity between population samples was evaluated by Fisher's exact test or by χ2 test using the BioEstat v.5.0 program. 29 The median HIV-1 viral load and TCD4 cell counting in infected patients were also calculated using the BioEstat v.5.0 program. A p-value of <0.05 was considered statistically significant.
Results
The HIV-1-infected individuals consisted of 114 women with ages ranging from 20 to 73 years (mean=37.9 years). The epidemiological and clinical data are shown in Table 1. This population was highly heterogeneous regarding the time of antiretroviral (ARV) treatment, the appearance of HIV-1 constitutional symptoms, viral load, and TCD4 cell counting. The median HIV-1 viral load was 754±40.02 copies/ml and the median TCD4 cell counting was 437±99.87 cells/μl. Eighty-eight (77.2%) patients were under ARV therapy and 20 (17.5%) were therapy naive. Six patients (5.3%) did not have available data in their medical records.
NI, not informed.
The analysis of the CCR5 N-terminal domain sequences revealed the missense mutation at position 165 of the CCR5 gene in seven (3.6%) samples from uninfected individuals (allelic frequency of 1.8%) characterized by the substitution of a thymine to an adenine nucleotide, which resulted in an amino acid change of a leucine to a glutamine (p.L55Q mutation) at position 55 of the CCR5 protein. The p.L55Q mutation was not observed in HIV-1-infected individuals (Table 2) and the physicochemical analysis revealed that this mutation magnified the flexibility and accessibility profiles (data not shown). When the mutant protein sequence was subjected to potential protein domain analysis to determine if the p.L55Q mutation had changed any signature into the N-terminal protein domain, no modification was observed. Also, this mutation did not change any posttranslational modification site and did not influence the prediction of the protein secondary structure.
The Homology Model of Human CCR5 generated the final stable p.L55Q CCR5 structure (Fig. 1). After the refinement process, validation of the CCR5 models with mutant and wild-type residues was carried out using Ramachandran plot calculations. Altogether, 91.1% and 89.8% of the residues from mutant and wild-type models, respectively, were in favored and allowed regions, demonstrating that both models have a good stereochemistry quality (Fig. 2). Overlapped structures showed that there was a small deviation to the right as well as a hydrophobic to hydrophilic property alteration (Fig. 1). The p.L55Q mutation resulted in a small modification of the electrostatic load of this region (Fig. 3).

The final model of the stable structure of mutant CCR5 (p.L55Q mutation). The HIV-1 binding site is featured. Right, focus on the p.L55Q mutation region. Color images available online at

Validation of the model using the Ramachandran plot computed with the PROCHECK program with 90.1% and 89.8% of the residues from the mutant and wild-type models, respectively, in favored and allowed regions. Color images available online at

Electrostatic load of the wild-type (L55) and mutant (Q55) models. HIV-1 binding site, L55 and Q55 alleles are featured. Color images available online at
The analysis of the CCR5 gene region encoding the C-terminal protein domain revealed three novel silent mutations present in HIV-1-infected women and uninfected individuals (c.3765C>T, c.3777A>T, and c.3831A>G), which characterized this CCR5 gene region as more divergent than the encoding region of the N-terminal protein domain. Except for the c.3831A>G mutation, the other mutations were considered as polymorphic sites in the studied populations (Table 2).
Genotypic and allelic frequencies of the analyzed sites are shown in Table 1 and did not differ significantly from the frequencies expected under Hardy–Weinberg equilibrium (p=1.00, χ2=0.0001). Linkage disequilibrium was not observed between any of the analyzed mutations when HIV-1-infected women and uninfected individuals were compared (p=1.00, χ2=0.0001).
Discussion
Viral and host factors have been associated with variations in the infection susceptibility and postinfection replication level in different HIV-1-infected individuals. Some of these host factors have already been identified as determinants in HIV-1 infection susceptibility and AIDS progression. However, it is possible that other nonidentified host factors are related to infection control, emphasizing that human genetic variation studies can contribute to a better understanding of HIV-1 infection and AIDS progression mechanisms. 13,30 –33
In this context, many studies have demonstrated an association between mutations in human genes, disease manifestation, and viral infection influence. 34 –36 Regarding HIV-1 infection, the chemokine receptor CCR5 acts as the coreceptor for R5-tropic strains and some mutations in the promoter and the encoding regions of the CCR5 gene influence virus infection and/or AIDS progression. These mutations could affect CCR5 protein expression in the cell surface leading to a high or low protein production phenotype and could also change the protein structure, which could be related to its functionality and affect its activity at the cell membrane. In addition, the CCR5 protein undergoes posttranslational modifications at specific amino acids at the amino and carboxyl domains that are important for the interaction between this coreceptor and HIV-1 gp120, leading to membrane fusion and cell infection. Therefore, CCR5 gene mutations that change N- and C-terminal amino acids could interfere in the ability of CCR5 to bind gp120 and affect HIV-1 entry. 10,37 –39 In this regard, this is the first study to analyze mutations in the CCR5 gene regions encoding the N- and C-terminal protein domains in HIV-1-infected individuals in a Brazilian population.
The p.L55Q mutation, observed in the N-terminal domain of the CCR5 protein, is characterized by a change from a nonpolar amino acid (leucine or L) to a polar amino acid (glutamine or Q), which did not cause changes in any posttranslational modification site. The physicochemical analysis revealed that the mutated allele p.L55Q may create a more flexible protein domain than the wild-type allele, which indicates that the protein may present a greater number of structure conformations. Also, the modeling results demonstrated that this mutation altered the CCR5 3D structure. Altogether, these results could indicate that since this mutation is placed next to the region that interacts with the HIV-1 gp120, comprising amino acids 2 to 18 of the N-terminal domain, this amino acid change interferes in the CCR5 N-terminal conformation, possibly affecting the interaction region and gp120 binding.
In our study, the p.L55Q mutation was observed only in uninfected individuals, suggesting that the 55 amino acid change in the CCR5 N-terminal domain could interfere with HIV-1 entry through the CCR5 coreceptor. Carrington and collaborators showed that L55Q/CCR5-Δ32 heterozygotes were found on infected and uninfected individuals and did not alter the progression to AIDS. 19 Despite this, we suggest that it would be very elucidative to study this mutation in groups of exceptional individuals who deviate from the expected response to HIV exposure, such as long-term nonprogressors (LTNP), who maintain stable TCD4 cell levels and low viral load for 10 or more years; fast progressors (FP), who cannot control viremia and develop AIDS within 3 years of infection 40 ; and elite controllers (EC), who control HIV replication under 50 copies/ml. 41,42 Only one study was performed in this line, by Young and colleagues, who did not find L55Q in a group of highly exposed, persistently seronegative (HEPS) women in Thailand. 43
The three mutations (c.3765C>T, c.3777A>T, and c.3831A>G) observed in the CCR5 gene region encoding the C-terminal protein domain do not cause any amino acid change in protein sequence. The number of mutations detected at this region of the CCR5 gene shows that it is more divergent than the encoding region of the N-terminal protein domain, suggesting that the C-terminal CCR5 domain suffers a minor positive selective pressure when compared to the N-terminal domain, probably due to its localization in the cytoplasmatic side of the cell.
The results presented here suggest that the p.L55Q mutation, observed in the CCR5 N-terminal domain of uninfected individuals, could have a protective role in HIV-1 infection. Despite this, all the available studies failed to find any actual impact of the p.L55Q mutation on HIV entry into cells. Our analysis provides additional reasons to further investigate this and other polymorphisms in the N- and C-terminal CCR5 domains that might contribute to a better evaluation of the major role of CCR5 in HIV-1 infection, as this protein becomes an attractive target for therapeutic development against virus infection.
Footnotes
Acknowledgments
This work was supported by grants from the Brazilian Ministry of Health STD/AIDS Program, CNPq, FBDC, and FIOCRUZ.
Author Disclosure Statement
No competing financial interests exist.
