Abstract
Codon usage has been identified as one of the most important factors that influence gene expression. The frequencies with which the different codons are used vary significantly between different organisms and also between the genes within the same organism. HIV has a remarkable nucleotide composition with an above average percentage of “A” nucleotide
E
The human immunodeficiency virus (HIV) is a well-known translational parasite of humans that uses the host's translation machinery to translate its own mRNA. HIV has a remarkable nucleotide composition with an above average percentage of “A” nucleotide resulting in a codon usage pattern that differs markedly from that of the human host with preferential occupancy of A at the flexible third codon position. 3 In a recent study, we reported that the tat (transcriptor of transactivation) gene of HIV-2 has a higher effective number of codons (ENCs) value as compared to HIV-1, reflecting lower level of expression of HIV-2 Tat protein, and hypothesized that lower levels of HIV-2 Tat protein, particularly in the early stages of infection, result in decreased viral load, lower viral set point, and delayed disease progression that distinguishes HIV-2 infection from that of HIV-1 infection. 4 In the present study, we expanded our previous observations by comparing the codon usage pattern of HIV-1 genes among different groups of HIV disease progressors, viz. Long-Term Nonprogressors (LTNPs), Elite Controllers (ECs), Slow Progressors (SPs), and Typical Progressors (P), to understand the influence of differential codon usage pattern of the viruses on the pathogenic manifestation in the host.
Complete genome sequences of HIV-1 strains isolated from ECs (n = 8), LTNPs (n = 58), SPs (n = 26), and Progressors/Typical Progressors (n = 38) were downloaded from the Los Alamos National Laboratory (LANL) HIV database as available as on December 2015. Base composition analysis was performed using codon W (
Interestingly, we found that the tat gene in EC, LTNP, and SP groups had a significantly higher ENC value as compared to that of typical progressors/progressors (p < .05), whereas that of all other genes was comparable (Table 1). Universally, codon usage has been suggested to be a reliable correlate of gene expression, with higher ENC values correlating with lower levels of gene expression, and vice versa. Tat being an overlapping regulatory gene, participates in a positive feedback mechanism that ensures high levels of HIV transcription following the activation of cells carrying HIV proviruses. The lower ENC value of tat gene in typical progressors reflects higher levels of tat protein expression leading to higher viral load, higher viral set point, and faster disease progression in these individuals as compared with the EC, LTNP, and SP.
The table provides the mean ENC ± SD of HIV-1 genes in EC, LTNP, SP and P. Two-way ANOVA was performed to compare EC, LTNP, and SP groups with P.
p < .05 was considered statistically significant.
ENC, effective number of codons; EC, elite controller; LTNP, long-term nonprogressor; P, typical progressor; SP, slow progressor.
In most instances, HIV infection is associated with progressive viral transcription and replication resulting in disease progression and clinical symptoms of AIDS. Several factors contribute to the control of viral transcription and replication, such as defective viral strains, robust host cell-mediated immune responses, and other host genetic factors that can affect viral replication. We hypothesize based on the provided evidence that codon bias of the tat gene may also contribute to the course of disease and extent of pathogenicity in HIV infection, along with other host and viral factors.
Footnotes
Acknowledgments
The authors thank the Department of Science and Technology, New Delhi, India, and Indian Council of Medical Research, New Delhi, India for their support.
Author Disclosure Statement
No competing financial interests exist.
