Abstract

E
Conventional phylogenetic selection analyses compare the rate of synonymous codon substitution (dS) with the rate of nonsynonymous codon substitution (dN). A ratio of dN/dS exceeding 1 indicates positive selection, as the rate of amino acid change is greater than the rate of (ostensibly) neutral evolution. Selection acting on overlapping coding sequences can invalidate such analyses, as nonsynonymous sites in one overlapping reading frame will correspond to synonymous sites in another. 1
In the HIV-1 genome, the first seven codons at the 5′ end of tat exon 1 overlap with the 3′ end of the vpr gene, while the last 26 codons overlap with rev exon 1 (HXB2 reference sequence;

Overlapping codons in the HIV-1 tat and vpr/rev genes. Boxes represent nucleotide positions and solid black lines show the boundaries between codons. Numbers within boxes represent nucleotide positions within each codon triplet. It can be seen that the nucleotide occupying the first position in vpr/rev codons is the same as the nucleotide at the third codon position in tat.
A number of approaches to studying selection specifically in overlapping coding sequences have been developed, 1 –3 though none are practical for routine analyses. Investigators undertaking phylogenetic selection analyses with conventional codon models must be aware of the bias introduced by ignoring overlapping coding sequences in virus or bacterial genomes. The scope of such work should be limited to regions of genes in which there is no overlap.
Footnotes
Author Disclosure Statement
No competing financial interests exist.
