Molecular Characterization of the Kamese Virus,an Unassigned Rhabdovirus,Isolated from Culex pruina in the Central African Republic

Abstract

Rhabdoviridae is one of the most diversified families of RNA viruses whose members infect a wide range of plants, animals, and arthropods. The members of this family are classified into 13 genera and >150 unassigned viruses. Here, we sequenced the complete genome of a rhabdovirus belonging to the Hart Park serogroup, the Kamese virus (KAMV), isolated in 1977 from Culex pruina in the Central African Republic. The genomic sequence showed an organization typical of rhabdoviruses with additional genes in the P-M and G-L intergenic regions, as already reported for the Hart Park serogroup. Our Kamese strain (ArB9074) had 98% and 78.8% nucleotide sequence similarity with the prototypes of the KAMV and Mossuril virus isolated in Uganda and Mozambique in two different Culex species, respectively. Moreover, the protein sequences had 98–100% amino acid similarity with the prototype of the KAMV, except for an additional gene (U3) that showed a divergence of 6%. These molecular data show that our strain of the KAMV is genetically close to the Culex annuliorus strain that was circulating in Uganda in 1967. However, this study suggests the need to improve our knowledge of the KAMV to better understand its behavior, its life cycle, and its potential reservoirs.

Introduction

Rhabdoviruses are enveloped, negative-strand RNA viruses that belong to order Mononegavirales, which also includes families Bornaviridae, Mymonaviridae, Pneumoviridae, Sunviridae, Nyamiviridae, Paramyxoviridae and Filoviridae (Adams, et al. 2016). The family Rhabdoviridae is one of the most ecologically diverse families of RNA viruses with members infecting a wide range of plants, animals, and arthropods in terrestrial, freshwater, and marine ecosystems (Kuzmin, et al. 2009, Walker, et al. 2011). These viruses can either be transmitted directly, through blood, or indirectly, through arthropods or even water (Kuzmin, et al. 2009, Dietzgen, et al. 2016). Currently, the Rhabdoviridae family comprises 13 genera (Cytorhabdovirus, Varicosavirus, Dichorhavirus, Ephemerovirus, Lyssavirus, Novirhabdovirus, Nucleorhabdovirus, Perhabdovirus, Sigmavirus, Sprivivirus, Tibrovirus, Tupavirus, Vesiculovirus) and >150 unassigned viruses (Adams, et al. 2013, Adams, et al. 2016,). However, a recent study on the genome's evolution divides the family into 17 established and newly proposed genera with 2 distinct clades of unassigned viruses (Walker, et al. 2015). Their unsegmented, single-stranded (−) RNA genome is packaged within a bullet- or rod-shaped particle comprising five structural proteins of the virion, namely the nucleoprotein (N), the phosphoprotein (P), the matrix protein (M), the glycoprotein (G), and the RNA-dependent RNA polymerase (L) (Dale and Peters 1981). The rhabdovirus genome features partially complementary untranslated leader and trailer sequences and five open reading frames (ORFs) arranged in the order 5′ N-P-M-G-L 3′. The ends of each ORF are flanked by conserved transcription initiation and transcription termination/polyadenylation sequences, which control expression of the corresponding capped and polyadenylated mRNAs (Dietzgen et al. 2012).

Kamese virus (KAMV) was first identified from female Culex annulioris mosquitoes caught in the Kamese forest, Mawokota County, Uganda, in 1967 (1970). Like many other viruses, the newly identified virus was classified as a rhabdovirus according to its bullet-shaped morphology and antigenic relationships. In addition to the prototype strain (MP6186 from Uganda in 1967), several strains had been isolated, identified as KAMV using seroneutralization assay, but were not characterized at the molecular level. The strains came from different sylvatic species of Culex and Aedes africanus mosquitoes (World Health Organization Collaborating Center for Reference and Research on Arboviruses [CRORA], Institut Pasteur de Dakar, www.pasteur.fr/recherche/banques/CRORA), indicating that the virus may be widespread in Central Africa. Although no human disease has ever been associated with an infection by this virus, antibodies have been detected in Ugandan human sera (7 out of 135) (1970). Until recently, the KAMV was an unassigned virus of the Hart Park serogroup (Francki 1991), but it has been tentatively classified in the newly proposed Hapavirus genus based on its genome size and complexity (Walker, et al. 2015). To date, our knowledge on the genetic diversity of the KAMV is very sparse, only one complete sequence and two partial sequences of the L gene of the KAMV prototype are available in GenBank (Dacheux, et al. 2010, Aznar-Lopez, et al. 2013, Walker, et al. 2015). Several research projects on arboviruses have been carried out at the Institut Pasteur de Bangui, in the CAR, in which several viruses were isolated from different species of mosquitoes or from animal reservoirs. Recently, several strains of Zika virus, Middelburg virus, or Arumowot virus were characterized using broad-spectrum molecular techniques (Berthet, et al. 2014, Tricou, et al. 2014, Berthet, et al. 2016). In this article, we describe the second complete KAMV genome obtained in CAR as part of a project aiming to characterize at the molecular level all the arboviruses that were isolated at the Institut Pasteur de Bangui.

Materials and Methods

Virus isolation

The viral strain described in this study was isolated from Culex pruina mosquitoes caught in 1977 at Lobé (18°03′E, 03°38′N) in the CAR. The mosquitoes were collected in a semi mountainous equatorial forest, identified, and grouped into pools of 30 individuals. Viruses were isolated and amplified by four serial passages in brains of suckling mice (Saluzzo, et al. 1980). The brain suspensions were then lyophilized and stored in sealed glass vials at room temperature until the beginning of our experiment. The lyophilized virus was resuspended in phosphate-buffered saline and inoculated in newborn mice aged 1 to 3 days old for virus amplification.

Nucleic acid sequencing

For molecular characterization, RNA was extracted from the brain tissue of a moribund newborn mouse using the QiaAmp Viral RNA Mini Kit (Qiagen) according to the manufacturer's instructions. Extracted total RNA was treated with Turbo DNase (Life Technologies) to remove the Mus musculus DNA genome and then retrotranscribed into cDNA with Super Script III reverse transcriptase and random hexamer primers (Life Technologies). The generated cDNA was amplified with the Phi29 enzyme, as described previously (Berthet, et al. 2008). Amplified DNA was quantified in the Quant-iT assay (Invitrogen), and a fixed amount of amplified DNA was fragmented using a Covaris M220 ultrasonicator according to the manufacturer's instructions. The 450 bp DNA fragments were used to construct a genomic library with the NEBNext^® Ultra DNA Library Prep kit for Illumina^® (New England Biolabs) according to the manufacturer's recommendations. Illumina sequencing was performed using the MiSeq instrument to give 150 bp in paired-end reads with the MiSeq sequencing kit v2 (Illumina). A total of 5,142,844 reads in paired ends were obtained. All raw reads were filtered according to quality, and the mouse genome sequence was filtered from these reads with Bowtie 2.0 software using the M. musculus Mn10 sequence as a reference. Viral reads corresponding to the KAMV genome were selected using a similarity approach with BLASTN and BLASTX search tools based on the only complete sequence of KAMV (KM204989) available in GenBank. For each selected read, only the region that matched the viral genome was considered as previously described. All reads were assembled with SPAdes software (version 3.5) to obtain the full-length viral genome in one step. The size of the obtained KAMV genome was 13,206 nucleotides with an average coverage of 13648 × . The KAMV sequence was deposited in GenBank under accession number KX497133.

Result and Discussion

The genome showed the typical rhabdovirus organization with the following order of the main genes: 5′-N-P-M-G-L-3′. As already observed in many rhabdoviruses belonging to the Hart Park serogroup, additional genes with unknown functions (UN) were found between the P-M and G-L genes (Gubala, et al. 2008, Walker, et al. 2011). Five additional complete ORFs, whose sizes ranged from 300 to 500 nt, were characterized between P-M and G-L. Actually, four ORFs with unknown function are found in the P-M intergenic region. U1, U2, and U4 are at positions 2210 to 2695, 2717 to 3217, and 3286 to 3718, respectively. U3 is found at positions 2844 to 3144 in the U2 ORF. The fifth ORF (U5) is found in positions 6441 to 6752. These intergenic ORFs have already been described for several Hart Park serogroup members such as FLAV, HPV, NGAV, and WONV (Gubala, et al. 2008, Gubala, et al. 2010, Allison, et al. 2014).

A selected set of rhabdovirus sequences belonging to the Hart Park serogroup was used to determine the phylogenetic relationships of our KAMV variant with other rhabdoviruses. The amino acids of L protein sequences were aligned in Geneious software (version 9.1.4), with manual editing to increase the quality of the alignment. The phylogenetic tree was constructed using the maximum likelihood method, and the statistical significance of the tree was estimated by bootstrapping with 1000 replicates. Phylogenetic analysis indicated that KAMV ArB9074 clustered with other KAMVs isolated from C. annulioris (Walker, et al. 2015) (Fig. 1).

FIG. 1.

Maximum likelihood phylogenetic tree of 16 rhabdovirus L protein sequences. Analysis at the amino acid level was based on sequences available in GenBank. The tree was generated using Geneious software (Geneious R9 version 9.1.4; www.geneious.com) and branch robustness was tested with 1000 bootstraps. Bootstrap values are indicated as percentages. The KAMV isolate sequenced here is shown in gray. Potential hosts (when known) are indicated for each clade. BASV, Bas Congo virus; FLAV, Flanders virus; GLOV, Gray Lodge virus; HPV, Hart Park virus; JOIV, Joinjakaka virus; KAMV, Kamese virus; LJAV, Landjia virus; LJV, La Joya virus; MANV, Manitoba virus; MOSV, Mossuril virus; MQOV, Mosqueiro virus; NGAV, Ngaingan virus; ORV, Ord River virus; PCV, Parry Creek virus; WONV, Wongabel virus.

As shown in Table 1, the full-length KAMV genome isolated in the CAR in 1977 had 98% nucleotide sequence similarity with the prototype KAMV sequence (KM204989), which was isolated in Uganda in 1967. The various conserved proteins such as M, N, P, G, and L and all unknown function proteins, except U3, showed high amino acid similarity, which ranged from 98.0% to 100%, whereas the U3 protein had a 6% divergence with the prototype KAMV sequence. The comparison between the KAMV and Mossuril virus genomes, which is the most closely related viral sequence, showed variable divergence at the amino acid and nucleic acid levels according to the gene considered. Similarity at the amino acid level ranged from 79% to 99.3%. Although the N and unknown function proteins (U1 and U2) showed high nucleotide divergence (7.4% to 12.8%), protein sequences remained conserved (96.9–99.3%) except for the G and unknown function proteins (U3 and U5) (16–22.2% of divergence). This high divergence may be because of the natural evolution of the strain within a novel species of Culex and by the interaction between external viral proteins and host proteins.

Table 1.

Percentage of Nucleotide and Amino Acid Identity Between the Strain of Kamese Virus ArB9074 Isolated in the Central African Republic and the Genome Sequences of Four Strains of the Hark Park Serogroup Available in GenBank

			N		P		M		G		L		U1		U2		U3		U4		U5
Virus	Host	Genome	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.	Nucl.	Prot.
KAMV (1967, Uganda) KM204989	Culex annuliorus	98.3	98.7	99.8	97.7	98.4	98.2	100.0	98.0	98.8	98.3	99.5	98.8	100	98.4	98.8	98.0	94.0	99.4	99.4	97.1	98.0
MOSV (1959, Mozambique) KM204993	Culex sitiens	78.8	88.6	99.3	86.4	93.8	86.8	95.9	72.1	79.0	76.4	91.2	87.2	96.9	92.6	98.8	94.8	84.0	83.2	93.1	71.9	77.8
HPV (1995, USA) KM205011	Culex tarsalis	—	69.5	72.2	53.1	37.1	60.1	58.2	58.7	52.9	67.5	73.4	52.9	38.5	58.8	50.6	—	—	—	—	—	—
FLAV (2007, USA) JX431885	Unknown	—	68.5	72.0	55.0	42.2	62.5	58.7	58.5	52.9	67.5	73.0	53.4	39.8	56.7	50.0	62.5	—	—	—	—	—

FLAV, Flanders virus; G, glycoprotein; HPV, Hart Park virus; KAMV, Kamese virus; L, RNA-dependent RNA polymerase; M, matrix; MOSV, Mossuril virus; N, nucleoprotein; nucl, nucleic acid; P, phosphoprotein; prot, protein; UN, unknown protein.

Although KAMV has only been identified in the CAR and Uganda, C. pruina and C. annulioris are sympatric, hematophagous mosquitoes that are found in forested Central African countries (Mutebi, et al. 2012). Their presence in the same environment appears to suggest that they are infected by feeding on the same vertebrate reservoir, although this presumption remains to be tested.

Conclusion

The obtention of the complete genomic sequence of this KAMV strain demonstrates clearly that it is genetically close to the strain that was found in C. annulioris in Uganda in 1967. However, our strain presented large amino acid divergence for external viral proteins (e.g., the G protein) from other close strains belonging to the Hart Park group. This study demonstrates the usefulness of associating classical virology tools, that is, the isolation of a viral strain, with high-throughput sequencing to obtain a whole sequence of a specific viral variant. To conclude, our molecular data suggest the need to improve our knowledge on KAMV diversity and to better understand KAMV behavior and life cycle.

Footnotes

Acknowledgments

We thank Heïdi Lançon and Dr. Engel-Gautier for revising the english of the article. This study was supported by the Institut Pasteur de Bangui, CAR, and the Institut Pasteur, Paris, France (Programme Transversal de Recherche CEVACAR no. 385). The CIRMF is supported by the government of Gabon, Total-Fina-Elf Gabon, and the Ministère de la Coopération Française. The funders had no role in study design, data analysis, or preparation of the article.

Authors' Contributions

H.D.S.T., E.N., M.K., and N.B. designed and planned the experiments. H.D.S.T, B.S., and N.B. performed the experiments and the bioinformatics analysis. H.D.S.T., N.B., E.N., A.G., J.C.M., and M.K. analyzed the data. H.D.S.T. and N.B. wrote the article. All authors approved the final version of the article.

Author Disclosure Statement

No competing financial interests exist.

References

Adams

, King

, Carstens

. Ratification vote on taxonomic proposals to the International Committee on Taxonomy of Viruses (2013). Arch Virol, 2013; 158:2023–2030.

Adams

, Lefkowitz

, King

, Harrach

, et al. Ratification vote on taxonomic proposals to the International Committee on Taxonomy of Viruses (2016). Arch Virol, 2016; 161:2921–2949.

Allison

, Mead

, Palacios

, Tesh

, et al. Gene duplication and phylogeography of North American members of the Hart Park serogroup of avian rhabdoviruses. Virology, 2014; 448:284–292.

Aznar-Lopez

, Vazquez-Moron

, Marston

, Juste

, et al. Detection of rhabdovirus viral RNA in oropharyngeal swabs and ectoparasites of Spanish bats. J Gen Virol, 2013; 94:69–75.

Berthet

, Nakoune

, Gessain

, Manuguerra

, et al. Complete genome characterization of the Arumowot virus (unclassified phlebovirus) isolated from Turdus libonyanus Birds in the Central African Republic. Vector Borne Zoonotic Dis, 2016; 16:139–143.

Berthet

, Nakoune

, Kamgang

, Selekon

, et al. Molecular characterization of three Zika flaviviruses obtained from sylvatic mosquitoes in the Central African Republic. Vector Borne Zoonotic Dis, 2014; 14:862–865.

Berthet

, Reinhardt

, Leclercq

, van Ooyen

, et al. Phi29 polymerase based random amplification of viral RNA as an alternative to random RT-PCR. BMC Mol Biol, 2008; 9:77.

Dacheux

, Berthet

, Dissard

, Holmes

, et al. Application of broad-spectrum resequencing microarray for genotyping rhabdoviruses. J Virol, 2010; 84:9557–9574.

Dale

, Peters

. Protein composition of the virions of five plant rhabdoviruses. Intervirology, 1981; 16:86–94.

10.

Dietzgen

, Calisher

, Kurath

, Kuzmin

, et al. Rhabdoviridae. In: King

AMQ

, Adams

M.J.

, Carstens

E.B.

, Lefkowitz

E.J.

, ed. Virus Taxonomy: Classification and Nomenclature of Viruses: Ninth Report of the International Committee on Taxonomy of Viruses. San Diego: Elsevier Academic Press, 2012:654–681.

11.

Dietzgen

, Kondo

, Goodin

, Kurath

, et al. The family Rhabdoviridae: Mono- and bipartite negative-sense RNA viruses with diverse genome organization and common evolutionary origins. Virus Res, 2016; 227:158–170.

12.

Francki

RIB

, Fauquet

, Knudson

, Brown

. Classification and nomenclature of viruses: 5th report of the International Committee on Taxonomy of Viruses [for the] Virology Division of the International Union of Microbiological Societies. New York: Springer-Verlag, 1991:450.

13.

Gubala

, Davis

, Weir

, Melville

, et al. Ngaingan virus, a macropod-associated rhabdovirus, contains a second glycoprotein gene and seven novel open reading frames. Virology, 2010; 399:98–108.

14.

Gubala

, Proll

, Barnard

, Cowled

, et al. Genomic characterisation of Wongabel virus reveals novel genes within the Rhabdoviridae. Virology, 2008; 376:13–23.

15.

Kamese (KAM) strain. MP 6186. Am J Trop Med Hyg, 1970; 19:Suppl:1145–1146.

16.

Kuzmin

, Novella

, Dietzgen

, Padhi

, et al. The rhabdoviruses: Biodiversity, phylogenetics, and evolution. Infect Genet Evol, 2009; 9:541–553.

17.

Mutebi

, Crabtree

, Kading

, Powers

, et al. Mosquitoes of western Uganda. J Med Entomol, 2012; 49:1289–1306.

18.

Saluzzo

, Gonzalez

, Herve

, Georges

. Epidemiological study of arboviruses in the Central African Republic: Demonstration of Chikungunya virus during 1978 and 1979. Bull Soc Pathol Exot Filiales, 1980; 73:390–399.

19.

Tricou

, Berthet

, Descorps-Declere

, Nakoune

, et al. Complete genome sequences of two middelburg viruses isolated from arthropods in the central african republic. Genome Announc, 2014; 2:e01078.

20.

Walker

, Dietzgen

, Joubert

, Blasdell

. Rhabdovirus accessory genes. Virus Res, 2011; 162:110–125.

21.

Walker

, Firth

, Widen

, Blasdell

, et al. Evolution of genome size and complexity in the rhabdoviridae. PLoS Pathog, 2015; 11:e1004664.