Abstract
We previously isolated a cDNA clone from cynomolgus macaque encoding a novel CXC chemokine that we termed CXCL1L from its close similarity to CXCL1. However, the cDNA consisted of 3 exons instead of 4 exons that were typically seen in other CXC chemokines. Here, we isolated a cDNA encoding the full-length variant of CXCL1L that we termed CXCL1Lβ. CXCL1Lβ is 50 amino acids longer than the original CXCL1L, which we now term CXCL1Lα. The CXCL1Lβ mRNA is much more abundantly expressed in the cynomolgus macaque tissues than CXCL1Lα mRNA. However, CXCL1Lβ protein was poorly produced by transfected cells compared with that of CXCL1Lα. When the coding region of the fourth exon was fused to the C-terminus of CXCL1 or even to a nonsecretory protein firefly luciferase, the fused proteins were also barely produced, although the mRNAs were abundantly expressed. The polysome profiling analysis suggested that the inhibition was mainly at the translational level. Furthermore, we demonstrated that the C-terminal 5 amino acids of CXCL1Lβ were critical for the translational repression. The present study, thus, reveals a unique translational regulation controlling the production of a splicing variant of CXCL1L. Since the CXCL1L gene is functional only in the Old World monkeys, we also discuss possible reasons for the conservation of the active CXCL1L gene in these monkeys during the primate evolution.
Introduction
C
The gene duplication is a driving force for the generation of evolutionary novelty (Nei and Rooney 2005; Eirin-Lopez and others 2012). After gene duplication, while one of the duplicated genes maintains the original function, the other could be released from the functional constraint to gain a new function or to be degenerated into a nonexpressing pseudogene through nucleotide substitution or deletion. The chemokine superfamily, thus, provides an interesting model of gene duplication in evolution.
CXCL1 (also called Gro-α) is an inflammatory chemokine that mainly attracts neutrophils and is critically involved in inflammation, angiogenesis, and tumorigenesis (Dhawan and Richmond 2002; Amiri and Richmond 2003). Previously, we isolated a novel cynomolgus macaque CXC chemokine that we termed CXCL1L from its highest similarity and close genomic relationship to CXCL1 (Nomiyama and others 2007). Although rodents have only one CXCL1 gene, there are CXCL1 and its duplicated CXCL1L gene in the genomes of cynomolgus macaque (Macaca fascicularis) and rhesus macaque (Macaca mulatta) (Nomiyama and others 2007), both of which are widely used in the medical research and belong to the same genus (Macaca) of Old World monkeys. These monkeys diverged from a common ancestor about 1.5 million years ago (Blancher and others 2008; Osada and others 2010). Our recent database search has further revealed that all 7 Old World monkeys, including Macaca monkeys, have the active CXCL1L gene, whereas the gene has been inactivated in the genomes of higher primates, including humans (Nomiyama and others 2010, 2013). This may suggest a unique role of CXCL1L in the Old World monkeys.
Our original cynomolgus macaque cDNA clone of CXCL1 L corresponded to an EST in the public database (Nomiyama and others 2007). CXCL1L shares 80% amino acid sequence similarity to cynomolgus macaque CXCL1, which we also cloned in the same study (Nomiyama and others 2007). Most CXC chemokine genes are known to have 4 exons. The comparison of the CXCL1L cDNA sequence and the corresponding gene, however, suggested that the cDNA was derived from an alternative splice product lacking the fourth exon sequence.
In this study, we, therefore, isolated the full-length CXCL1L cDNA clone of cynomolgus macaque consisting of the 4 exon sequences. We now term the products of the shorter cDNA and the full-length cDNA as CXCL1Lα and CXCL1Lβ, respectively. To elucidate the function of CXCL1Lα and CXCL1Lβ, we tried to produce their proteins. In contrast to the CXCL1Lα protein, however, we failed to obtain efficient production of the CXCL1Lβ protein by silk worms or transfected cells. Our subsequent study revealed that Ex4 (we use this abbreviation in this article to indicate the coding region of the exon 4) represses the translation in spite of the abundant production of mRNA in transfected cells. Furthermore, the C-terminal 5 amino acids of Ex4 are critical for this suppression. Thus, we have revealed a unique translational regulation for the production of a splicing variant of CXCL1L. We also discuss the conservation of an active CXCL1L gene only in Old World monkeys in functional and evolutionary points of view.
Materials and Methods
Cloning and expression of CXCL1Lβ cDNA
Cynomolgus macaque CXCL1Lβ cDNA was cloned by reverse transcription-PCR (RT-PCR) using a spleen cDNA library prepared as described (Nomiyama and others 2007). The primers used were based on the rhesus macaque CXCL1L gene and were 5′-AGT TCCCCTGCTCCTCTCAC and 5′-GAAAGAACCTTAAT AGACAAGCTTTGA.
The expression plasmids for cynomolgus macaque CXCL1, CXCL1Lα, and CXCL1Lβ were constructed by using each cDNA as a template. The primers used were: 5′-TTTGGT ACCATGGCCCGCGCCGCGCTC and 5′-TTTTCTAGAG TTGGATTTGTCACAGTTCAG for CXCL1; 5′-TTTGGT ACCATGGCCTGCGCCGCACCC and 5′-TTTTCTAGA CCTGTTCAGCATCTTTTGCAT for CXCL1Lα; and 5′-TTTGGTACCATGGCCTGCGCCGCACCC and 5′-TTTT CTAGATAAGAAGCAGTTAAACACATT for CXCL1Lβ. The synthesized cDNAs were digested with KpnI and XbaI and cloned into pEF4/Myc-HisA vector containing polyhistidine tag (Thermo Fisher Scientific). For construction of CXCL1-Ex4, the Ex4 sequence was amplified with primers 5′-TTTTCTAGAGGCAACGCTAACGGACCTAG and 5′-TTTTCTAGATAAGAAGCAGTTAAACACATT by using CXCL1Lβ cDNA as a template, and the XbaI digested fragment was inserted into the XbaI site, located downstream of the CXCL1 coding sequence, of the XCL1 expression plasmid mentioned earlier.
For construction of CXCL1-STOP-Ex4, the Ex4 fragment amplified with primers 5′-GGCAACGCTAACGGACCTAG and 5′-TAAGAAGCAG TTAAACACATT was cloned into the PmeI site, located downstream the polyhistidine tag, of the CXCL1 expression vector. For construction of CXCL1-STOP-ARE, the AU-rich element in the 3′-noncoding region of the human granulocyte-macrophage colony-stimulating factor (GM-CSF) gene (Shaw and Kamen 1986) was amplified with primers 5′-CAGGCATGGCAGAAGAATGGGAATA and 5′-ATTACGGTAAAACATCTTGAATAAA by using the human genome DNA as the template, and the obtained 135 bp fragment was inserted into the PmeI site of the CXCL1 expression vector.
The C-terminal serial deletion mutants were constructed as the construction of CXCL1-Ex4. The downstream primers of the primer pair were 5′-AAATCTAGACACATTA AACACAGTCCA (ΔC5), 5′-TTTTCTAGAAGTGTGCCC AGGAGCTAATTG (ΔC11), 5′- TTTTCTAGATTGTTTTT CTTCTTTCACTTT (ΔC17), 5′- AAATCTAGACACTTTG ATTCCTCTAAGAATAGA (ΔC22), and 5′-AAATCTAG ACTTTAGGAATTGTAGCGG (ΔC31). Other C-terminal deletion mutants of short Ex-4 sequences were generated by inserting the annealed oligonucleotides, 5′-CTAGAGGCA ACGCTAACGGACCTAGAGAGAAGGAGAAAT and 5′-CTAGATTTCTCCTTCTCTCTAGGTCCGTTAGCGTTG CCT (ΔC39), 5′-CTAGAGGCAACGCTAACGGAT and 5′-CTAGATCCGTTAGCGTTGCCT (ΔC45), into the XbaI site of the CXCL1 expression vector.
Luciferase-Ex4 (Luc-Ex4) and Luc-STOP-Ex4 plasmids were also constructed. Luciferase coding region was amplified with primers 5′- TTTAAGCTTATGGAAGACGCC AAAAACATAAA and 5′- AAAGAATTCCACGGCGATC TTTCCGCCCTTCT (Luc-Ex4) or 5′-AAAGAATTCTTAC ACGGCGATCTTTCCGCCCT (Luc-STOP-Ex4) by using pGL3-Control vector (Promega) as a template. Then, the Luc fragments were digested with EcoRI and HindIII and inserted into pcDNA3.1 (+) vector (Clontech).
RT-PCR
Relative expression levels of CXCL1Lα and CXCL1β mRNAs in cynomolgus macaque tissues were determined by RT-PCR. The primer pairs include one common primer annealing to the first exon sequence and a downstream primer specific for either CXCL1Lα or CXCL1Lβ (Fig. 1). The common primer is 5′-CTGCTCGTGCTCCTGGTG, and the CXCL1Lα- and CXCL1Lβ-specific primers are 5′-GCC AGTATTTCTGACCAACG and 5′-TCTAGGTCCGTTA GCGTTGC, respectively. The amplification products are 299 bp for CXCL1Lα and 270 bp for CXCL1Lβ in length.

Identification and expression of splice variants of cynomolgus macaque CXCL1L.
DNA transfection and Western and Northern blot analyses
Transfection of human embryonic kidney HEK293 cells and Chinese hamster ovary CHO cells with the constructed plasmids was performed by using a transfection reagent LipoTrust EX Gene (Hokkaido System Science). The EGFP expression vector pEGFP (enhanced green fluorescent protein)-N1 (Clontech) was used as an internal control for Western blot analysis. Thirty-two hours after transfection, cell lysates were prepared with RIPA lysis buffer (Nakalai). Anti-EGFP, anti-His-tag, and anti-luciferase polyclonal antibodies were obtained from Medical & Biological Laboratories, and peroxidase-goat anti-rabbit IgG was obtained from ZYMED. Total RNAs were prepared from the transfected cells by using Trisol reagent (Thermo Fisher Scientific).
For Northern blot analyses, digoxigenin-labeled probes for CXCL1, luciferase, and β-actin were prepared by DIG RNA Labeling kit (Roche Diagnostics). The CXCL1 RNA probe was prepared from a PCR product that was 210 bp in length and that was amplified with 5′-GCTGC TCCTGCTCCTGGT and 5′-GATGCGGGGTTGAGAC AAG primers. The CXCL1 probe that shares 92% identity with CXCL1L was used to detect both CXCL1 and CXCL1L mRNAs. The washing step after the hybridization was performed at low stringency (0.5X SSC, 60°C). ECL Plus Western blotting detection system (GE Healthcare) or CDP-Star (Roche) was used for the detection of Western and Northern signals, respectively. An image scanner LAS-3000 (Fujifilm) was used to determine relative signal intensity, and an average was obtained by using 3 independent experiments.
Polysome profile analysis
HEK293 cells transfected with plasmids in a 10 cm dish were lysed on the dish with cold lysis buffer (5 mM Tris-HCl, pH 7.4, 1.5 mM KCl, 2.5 mM MgCl2, 1 mM dithiothreitol, 100 μg/mL cycloheximide, 1% Triton X-100, 1% deoxycholate). Polysome profile analysis was performed as described (He and Green 2013). Briefly, the lysate was homogenized with a tissue grinder 10 times followed by centrifugation at 10,000 g for 10 minutes. The supernatant was then layered onto linear sucrose gradient (10%–50% in 80 mM NaCl, 5 mM MgCl2, 20 mM Tris-HCl, pH 7.4, and 1 mM dithiothreitol), and it was centrifuged at 27,000 rpm for 6 h. After centrifugation, fractions were collected whereas the polysome profiles were recorded with a UV spectrometer. RNAs were extracted from the fractions and used for Northern blot analyses as described earlier.
Results
Isolation of cynomolgus macaque CXCL1Lβ cDNA
Previously, we isolated a cDNA clone from cynomolgus macaque spleen encoding a novel CXC chemokine that we termed CXCL1L (Nomiyama and others 2007), which was apparently generated from CXCL1 by gene duplication at the early stage of primate evolution (Nomiyama and others 2013). Since the cynomolgus macaque genome sequence has not been determined at the beginning of the present study, we compared the cDNA sequence with that of the CXCL1L gene in the genome of rhesus macaque, the species closely related to the cynomolgus macaque (Nomiyama and others 2007). Although the genes of most CXC chemokines, including the original CXCL1, have 4 exons with the fourth exon coding the C-terminal region of a few amino acids, the cynomolgus macaque CXCL1L cDNA was found to be composed of only the first 3 exons. This suggested that the cDNA encoded a splice valiant of CXCL1L.
We, therefore, tried to identify the CXCL1L exon 4 in the rhesus macaque genome sequence. We failed to identify the exon 4 by the similarity search using the exon 4 sequence of the rhesus macaque CXCL1 gene as a probe. However, by using a gene prediction software GENSCAN (Burge and Karlin 1997), we identified an exon sequence that was 154 nucleotides long encoding a long peptide of 50 amino acids 812 bp downstream from the third exon of rhesus macaque CXCL1L. Furthermore, we have also identified a similar potential fourth exon sequence of CXCL1L in the cynomolgus macaque genome sequence recently reported (Yan and others 2011; Higashino and others 2012), which is located 532 bp downstream from the third exon in the cynomolgus macaque genome (Fig. 1A).
Based on the nucleotide sequence of the potential exon 4, we designed PCR primers and isolated the full-length cDNA clone from the spleen cDNA library prepared in our previous study (Nomiyama and others 2007). The exon 4 encodes a long peptide of 50 amino acids that is now termed Ex4, which is highly conserved between cynomolgus and rhesus macaques with only one amino acid substitution (Fig. 2A). We, therefore, renamed the spliced variant encoded by the first 3 exons as CXCL1Lα and the one from the 4 exons as CXCL1Lβ. The predicted CXCL1Lα and CXCL1Lβ proteins are 104 and 154 amino acids long, respectively (Fig. 2A).

The amino acid sequences of CXCL1Lα and CXCL1Lβ of 7 Old World monkeys and the phylogenetic tree of primates.
Our previous RT-PCR data showed that the CXCL1L mRNA was expressed at a high level in the spleen and to a lesser level in the brain (Nomiyama and others 2007). To determine the expression of CXCL1Lα and CXCL1Lβ separately, we performed semi-quantitative RT-PCR by using these 2 tissue cDNAs. The primer pairs contain one common primer that anneals to the first exon sequence and the downstream primers that are specific for either CXCL1Lα or CXCL1Lβ (Fig. 1A). The annealing site of the CXCL1Lα-specific primer is located downstream of the coding region of CXCL1Lα exon 3. This site is present in the third intron of the CXCL1Lβ. The results showed that CXCL1Lβ mRNA was dominantly expressed in both tissues (Fig. 1B).
Protein expression in cultured cells
To compare functions of CXCL1, CXCL1Lα, and CXCL1Lβ, we tried to produce their proteins by silkworms that were transfected with the respective expression vectors. Unexpectedly, we could not detect CXCL1Lβ protein in the body fluid of the transfected silkworms (data not shown). We, therefore, constructed mammalian expression plasmids of CXCL1, CXCL1Lα, and CXCL1Lβ with His-tags at their C-terminal ends and compared their expression in transfected CHO cells. An EGFP expression plasmid was also transfected as an internal control. Thirty-two hours after transfection, Western blot analyses were performed by applying 1 μL from 600 μL of culture supernatant to each lane. The protein bands of CXCL1 and CXCL1Lα were strongly visible, whereas the CXCL1Lβ protein band was hardly detected (Fig. 3A). When the same experiments were repeated by using HEK293 cells, CXCL1Lβ protein was again hardly detected in contrast to those of CXCL1 and CXCL1Lα (data not shown but see Fig. 3B). However, when 1 μL from 30 μL of the total cell lysates of transfected HEK293 cells was applied on the gel, a small amount of CXCL1Lβ protein was detected (Fig. 3B). These results suggested that the CXCL1Lβ protein was present at low levels in the endoplasmic reticulum but it was hardly detected in the culture supernatant.

Western and Northern blot analyses of transfected cells.
To examine an effect of the Ex4 sequence on the protein production, a construct was generated to express CXCL1 with Ex4 fused to its C-terminal end (CXCL1-Ex4) (Fig. 3E). The results from the CXCL1-Ex4 construct were quite similar to those of CXCL1Lβ: namely, a poor production of CXCL1-Ex4 protein compared with that of CXCL1 (Fig. 3B). However, when we placed the Ex4 sequence at 6 nucleotides downstream of the stop codon of CXCL1 (CXCL1-STOP-Ex4) (Fig. 3E), the Ex4 sequence did not affect the protein abundance (Fig. 3B). These results suggested that the Ex4 sequence needed to be present within the coding region to suppress protein production.
We further examined whether Ex4 could inhibit the production of a nonsecretory protein by using firefly luciferase. Again, the presence of the Ex4 sequence fused to the C-terminal end of luciferase (Luc-Ex4) (Fig. 3E) but not after the stop codon (Luc-STOP-Ex4) (Fig. 3E) strongly suppressed the protein production in cell lysates (Fig. 3C). This strongly suggested that the inhibitory activity of the Ex4 sequence was not at the secretion step but at the translation step.
Expression of mRNA in cultured cells
We also determined mRNA expression levels by Northern blot analysis. As shown in Fig. 3B, the level of CXCL1Lβ mRNA was also low compared with that of CXCL1Lα in transfected HEK293 cells. Furthermore, the presence of the Ex4 sequence fused to the C-terminal end or after the stop codon of CXCL1 also greatly reduced the mRNA levels. This might be due to the presence of AU-rich sequences that cause instability of mRNA (Shaw and Kamen 1986; Hao and Baltimore 2009). In fact, the CXCL1Lβ mRNA has at least one canonical AUUUA sequence in its 3′ untranslated region, and the coding region of Ex4 is also AU rich (60%) compared with those of the exons 1–3 (38%).
Therefore, for comparison, we also constructed an expression vector of CXCL1 that contained an AU-rich element (ARE) of the human GM-CSF gene (Shaw and Kamen 1986) in the 3′ noncoding region (CXCL1-STOP-ARE) (Fig. 3E). Transfection of the vector showed that its mRNA expression level was as low as those of the Ex4-containing CXCL1-Ex4 and CXCL1Lβ (Fig. 3D). However, the protein expression level of the CXCL1-STOP-ARE was comparable to that of CXCL1-STOP-Ex4. It was also noted that the mRNA expression levels of Luc-Ex4 and Luc-STOP-Ex4 were not reduced compared with those of Luc (Fig. 3C). A similar phenomenon was observed with the constructs using fluorescent protein DsRed gene (data not shown). Collectively, these results suggest that, though the presence of the Ex4 sequence in the C-terminal end strongly reduces the protein production, its effect on the mRNA level is context dependent.
Protein expression normalized by mRNA expression
Since the Ex4 reduced both the protein and mRNA levels, the effects of Ex4 on the protein expression levels of various constructs were compared after normalization with the levels of mRNA expression, assuming that mRNAs of the constructs were equally expressed and not degraded. Accordingly, averages of protein and mRNA signal intensities were determined by using 3 independent experiments, 2 of which are shown in Fig. 3B and D. By setting the normalized protein level of CXCL1 as 100%, the normalized protein levels in the culture supernatants (mean of 3 experiments ± standard error) were 25.8% ± 4.5% for CXCL1Lα, 1.3% ± 0.2% for CXCL1Lβ, 8.8% ± 0.7% for CXCL1-Ex4, and 32.3% ± 2.4% for CXCL1-STOP-Ex4. The data clearly indicated that the protein expression of CXCL1Lβ and CXCL1-Ex4 was constantly low compared with CXCL1α, and CXCL1-STOP-Ex4, although CXCL1 exhibited an even higher protein expression level than CXCL1Lα and CXCL1-STOP-Ex4.
Luc-Ex4 mRNA binds less ribosomes than Luc mRNA
To see whether the decreased translation of proteins containing Ex4 was at the level of initiation or extension, the polysome profile analysis was performed. HEK293 cells transfected with Luc-Ex4 or Luc expression plasmids were homogenized, and the cell lysates were applied to 15%–50% sucrose gradient. This procedure separates mRNAs depending on the number of ribosomes binding to the mRNAs, which reflects the rate of translational initiation and elongation (He and Green 2013). After centrifugation, aliquots were collected. As shown in Fig. 4 (upper panel), the polysome profiles that represented the sum of those of various transcripts were quite similar between the cells transfected with Luc (A) or Luc-Ex4 (B).

Polysome profile analysis. HEK293 cells were transfected with Luc
To determine the actual polysome numbers associated with Luc and Luc-Ex4 mRNAs, we next performed Northern blotting by using mRNAs extracted from the aliquots. As shown in Fig. 4 (lower panel), the Luc mRNA was loaded with 1.5–6 ribosomes with the peak value of 3 ribosomes (A), whereas the Luc-Ex4 mRNA was loaded with 1.5–2.5 ribosomes with the peak value of 2 ribosomes (B). The results indicated that the number of ribosomes bound to the Luc-Ex4 mRNA was much less than that of the Luc mRNA. The results also suggested that the translation was at least initiated. Although it could not be determined from this analysis whether the ribosomes were stalled on the mRNA, the low translation level of Ex4-containing mRNA might be mostly due to the repression of translational elongation.
The C-terminal 5 amino acid sequence of Ex4 is important for the translational repression
To identify the amino acid sequence within the Ex4 that was responsible for the observed translational repression, successive C-terminal deletion mutants were made from the CXCL1-Ex4 construct (Fig. 5). We used CXCL1-Ex4 instead of the natural CXCL1Lβ, because it was easier to replace the Ex4 sequence with truncated Ex4 sequences using the CXCL1-Ex4 construct. Furthermore, the expression of CXCL1 was much higher at both protein and mRNA levels than those of CXCL1Lα or CXCL1Lβ.

Expression of successive C-terminal deletion mutants of CXCL1-Ex4 in HEK293. The Ex4 sequence fused to the cynomolgus macaque CXCL1 was deleted successively from the C-terminal end as depicted on the right. The constructs were transfected into HEK293 cells. The cell culture supernatants, cell lysates, and total RNAs were prepared and analyzed as in Fig. 3. Protein bands shifted upward in the Δ5 and Δ11 lanes of supernatant and cell lysate samples and are indicated by the arrows.
The constructs were transfected into HEK293 cells, and Western blotting analyses were performed by using cell culture supernatants and cell lysates. As shown in Fig. 5, the deletion of only 5 amino acids from the C-terminal end resulted in a tremendous increase in the CXCL1-Ex4 proteins in the cell supernatant, although it showed less effect on the protein abundance in the cell lysates. These results indicated that the C-terminal 5 amino acids (Phe-Asn-Cys-Phe-Leu) have the major impact on the level of protein secretion. Of note, the deletion of the C-terminal 5 amino acids also greatly affected the electrophoretic mobility of the protein band in the gel with an upward shift. This suggested altered tertiary structures of the C-terminally truncated proteins.
The mRNA expression levels were also gradually increased by the successive deletion of the Ex4 sequence. However, almost half (∼22 amino acids) of the Ex4 sequence needs to be removed to make the expression levels comparable to that of CXCL1, suggesting that the region responsible for the reduced mRNA expression is much broader than that of the repression of protein translation.
Discussion
CXCL1L is a gene copy that is duplicated from CXCL1 in the primate lineage after the divergence of primates and rodents (Nomiyama and others 2013). Previously, we isolated a cDNA clone of CXCL1L from cynomolgus macaque (Nomiyama and others 2007). By comparing with the available rhesus macaque genome, however, we found that the cDNA consisted of 3 exons instead of the standard 4-exon structure of CXC chemokines. In the present study, therefore, we have isolated the full-length CXCL1L cDNA consisting of 4 exons. In contrast to a few amino acids encoded by the fourth exons of other CXC chemokines, however, the fourth exon of CXCL1L encodes a peptide that is 50 amino acids long that is termed Ex4. We now call the short version of CXCL1L as CXCL1Lα and the long one as CXCL1Lβ.
However, our subsequent study has revealed that the presence of Ex4 appears to have a negative impact on the stabilization and/or translation of CXCL1Lβ mRNA. Indeed, when the Ex4 sequence was fused to the genes of cynomolgus macaque CXCL1 or firefly luciferase, the translation of these genes was also greatly repressed (Fig. 3). However, the Ex4 sequence had no such repressive activity when present after the stop codon. Thus, the Ex4 sequence inhibits translation of a protein to which Ex4 is directly fused, suggesting that the presence of Ex4 at the C-terminus of a protein is required for the translational repression. Indeed, the presence of Ex in the C-terminus of firefly luciferase greatly affected the ribosome binding to its mRNA (Fig. 4). Furthermore, the C-terminal 5 amino acid sequence of Ex4 is critical in the suppression of protein production (Fig. 5). These results suggest that although the ribosome binds to mRNA and initiates translation of mRNA, the presence of Ex4 sequence at the C-terminus induces ribosome stalling, resulting in poor protein translation (Fig. 6).

Schematic illustration of translational repression of CXCL1Lβ mRNA by Ex4. More ribosomes bind cynomolgus macaque CXCL1Lα mRNA than CXCL1Lβ mRNA. Once the Ex4 sequence of CXCL1Lβ mRNA is translated, the nascent peptide may inhibit translational initiation and/or elongation of the following ribosomes.
Arresting of polysomes is a widely used mechanism in controlling the translational rate (Richter and Coller 2015), and the examples of specific amino acid sequences in the nascent peptides to arrest translational elongation of their own mRNAs are known in species from eubacteria to mammals (Fang and others 2000; Gong and Yanofsky 2001, 2002; Nakatogawa and Ito 2002; Tenson and Ehrenberg 2002; Cruz-Vera and others 2005; Onouchi and others 2005; Vazquez-Laslop and others 2008; Ito and others 2010; Wilson and Beckmann 2011; Chiba and Ito 2012). These sequences generally contain basic amino acid residues, interact with the negatively charged ribosome tunnel that exports proteins from the ribosome, and, thus, may cause ribosome stalling (Lu and Deutsch 2008). Some of such amino acid sequences have been found in proteins that are involved in physiological control systems. In the cases of Escherichia Coli secretion monitor SecM and Bacillus Subtilis membrane insertion/folding monitor MifM, the arrest can be released when the physiological situation is restored (Chiba and others 2009; Ito and others 2010).
In the Ex4 sequence of the CXCL1Lβ, there are 2 Arg and 6 Lys residues, although they are not located in the crucial C-terminal 5 amino acid residues (Fig. 2A). It remains to be seen whether the ribosome pausing site on the Ex4 sequence corresponds to the C-terminal 5 amino acid sequence. Ribosome profiling (Ingolia and others 2009, 2011; Oh and others 2011) by next-generation sequencers may be helpful to detect the ribosome-protected mRNA fragments, thus allowing the identification of ribosome pausing sites.
Arrested nascent peptides are usually degraded by proteasome, although the mRNAs are not subjected to degradation (Ito-Harashima and others 2007; Dimitrova and others 2009; Inada 2013). Our preliminary data show, however, that the proteasome inhibitor MG132 does not substantially affect the protein level of CXCL1Lβ, suggesting that the proteasome degradation is not the major mechanism for the reduction of protein production (data not shown).
The expression levels of Luc-Ex4 and Luc-STOP-Ex4 mRNAs were not appreciably decreased compared with that of Luc mRNA (Fig. 3C). Thus, the presence of the Ex4 sequence does not appear to affect the stability of these mRNAs. In contrast, CXCL1Lβ or CXCL1-Ex4 mRNAs were less abundant than those mRNAs without the Ex4 sequence and, thus, may be degraded (Figs. 3B, D, and 5). Given that the turnover of the Luc mRNA was not appreciably enhanced by the presence of the Ex4 sequence (Fig. 3C), the effect of the Ex4 sequence on the mRNA stability may be context dependent.
Translational elongation can be arrested by RNA secondary structures (Dimitrova and others 2009; Inada 2013). The Ex4 RNA sequence could form several stem-loop structures (unpublished results). Determination of ribosome pausing sites may elucidate whether the RNA secondary structure of Ex4 is also involved in the ribosome stalling.
So far, the active CXCL1L gene has been found only in 7 Old World monkeys (family Cercopithecidae): cynomolgus macaque, rhesus macaque, Southern pig-tailed macaque (Macaca nemestrina), sooty mangabey (Cercocebus atys), drill (Mandrillus leucophaeus), olive baboon (Papio anubis), and African green monkey (Chlorocebus sabaeus) (Fig. 2A, B). On the other hand, the gene has been inactivated by the deletion of the third and fourth exons in other hominids such as human, chimpanzee, and gorilla. In Orangutan, the gene has been inactivated by nucleotide substitutions. Common marmoset (Callithrix jacchus) of New World monkeys, which diverged from Old World monkeys roughly 43.5 million years ago (Perelman and others 2011), also has a pseudogene of CXCL1L inactivated by nucleotide substitutions (our unpublished results). Thus, it appears that there might be a lineage-specific selective force for the preservation of active CXCL1L gene in Old World monkeys. Conversely, there might be a strong negative selection for the presence of active CXCL1L gene in the higher primates except for the Old World monkeys. Our genome analyses have also shown that Philippine tarsier (Tarsius syrichta), which is a separate relict lineage with an independent origin, and bushbaby (Otolemur garnettii) of Strepsirrhines seem to have no CXCL1L gene, though both of them have the CXCL1 gene.
Obviously, the CXCL1L gene was generated from CXCL1 by gene duplication during the evolution of primates. It seems likely that the gene had been active in the primates during the emergence of Old World monkeys but was inactivated in the lineages of New World monkeys and hominids. Notably, the gene also gained a new exon 4 in Old World monkeys. This might have conserved the CXCL1L gene in the Old World monkeys, with a possibly new function. The newly gained exon 4 encodes a long peptide of 50 amino acids, resulting in a quite new CXC chemokine with an exceptionally long C-terminal end compared with CXCL1 and other CXC chemokines. The newly acquired Ex4 sequence is relatively rich in polar amino acids (50% in Ex4 vs. 41% in exons 1–3) and also in charged residues (50% positively charged residues in Ex4 vs. 12% in the exons 1–3; 10% negatively charged residues in Ex4 vs. 2% in the exons 1–3). The pI of Ex4 is 9.36, whereas that of exons 1–3 is 10.11. The net pI of CXCL1Lβ is 9.90.
The function of Ex4 is at present unclear, since the Ex4 peptide has no sequence similarity to other proteins. Of note, however, the newly gained Ex4 sequence is highly conserved in Old World monkeys (Fig. 2A, B). This might suggest that CXCL1L with Ex4 has gained a new function that is important for the Old World monkeys and, thus, has been conserved in these monkey genomes even after they were diverged from hominids 20–38 million years ago (Perelman and others 2011).
Given the extent of evolutionary conservation of the active CXCL1L gene in the Old World monkeys (Fig. 2A), the CXCL1L proteins may have an essential role in these monkeys. For example, CXCL1Lα, which is the major protein product of the CXCL1L gene and lacks a few amino acids encoded by an original exon 4, might have acquired a new function in the Old World Monkeys by deleting the C-terminal amino acids.
Alternatively, CXCL1Lβ, which has extra 50 amino acids encoded by the newly gained exon 4 and is the major product at the mRNA level, may play a specific role in these monkeys in certain circumstances through coevolution of a mechanism that releases the ribosomal pausing of CXCL1Lβ mRNA in such conditions to promote its protein production. For example, since CXCL1 is one of the inflammatory chemokines, CXCL1Lβ may contribute to coping with particular infectious agents that are specific to these monkeys. The long C-terminal of CXCL1Lβ may promote the interaction with extracellular substrates such as heparin with affinities that are different from CXCL1, and, thus, may have acquired a new role in cell migration in those monkeys (Sierra and others 2004). Yet, another possibility, although it may sound quite unusual, is that the CXCL1L gene in the Old World monkeys is also in the process of inactivation like its counterparts in New World monkeys and higher primates and, in fact, already behaves like a pseudogene with its self-inflicted suppression of mRNA translation.
In conclusion, it would be of quite interest to see whether CXCL1Lβ protein can be produced by cells or tissues of Old World monkeys in particular conditions. CXCL1Lα and/or CXCL1Lβ might also have some activities that are different from original CXCL1. The CXCL1L gene with its unique translational control not known in any other genes may also provide an interesting case for the study of evolution and function of duplicated genes.
Footnotes
Acknowledgments
The authors thank B. Okuno, T. Sata, and Y. Endo of Kumamoto Univ. and J. Kusuda of the NIBIOHN for their help.
Author Disclosure Statement
No competing financial interests exist.
