Abstract
Recent analyses suggest bacterial and/or mitochondrion-like ancestry for giant viruses (Megavirales sensu latu): amoeban mitochondrial gene arrangements resemble those of their candidate homologs in megaviral genomes. This presumed ancestral synteny decreases with genome size across megaviral families at large and within Poxviridae. In this study, analyses focus on Phycodnaviridae, a polyphyletic group of giant viruses infecting Haplophyta, Stramenopiles, and other algae, using syntenies between algal mitogene arrangements and chloroplast genomes and Rickettsia prowazekii as positive controls. Mitogene alignment qualities with Rickettsia are much higher than with viral genomes. Mitogenome synteny with some viruses is higher, for others lower than with Rickettsia, despite lower alignments qualities. In some algae, syntenies among cohosted chloroplast, virus, and mitochondrion are higher, in others lower than expected. This suggests gene order coevolution in cohosted genomes, different coregulations of organelle metabolisms for different algae, and viral mitogenome mimicry, to hijack organelle-committed cellular resources and/or escape cellular defenses/genetic immunity systems. This principle might explain high synteny between human mitochondria and the pathogenic endocellular alphaproteobacterium R. prowazekii beyond common ancestry. Results indicate that putative bacteria/mitochondrion-like genomic ancestors of Phycodnaviridae originated before or at the mitochondrion-bacteria split, and ulterior functional constraints on gene arrangements of cohosted genomes.
Introduction
Origins of viruses remain unknown, partly because viruses are conceptualized while focusing solely on a single phase of their life cycles, the virion capsule particle (Bandea, 1983). However, all dynamical aspects need to be integrated to correctly define and study life's complex phenomena, including viruses. The morphogenesis of some viruses among Poxviridae resembles that of cellular subcompartments, suggesting that some viruses might have originated from cells and/or cellular compartments (Bandea, 2009a, 2009b). Recent analyses suggest that at least some viruses, the giant viruses (Megavirales) (Colson et al., 2013) might have evolved from bacteria- and/or mitochondria-like cells (Seligmann, 2018a, 2019). Indeed, several properties of mitochondria, and especially mitogenomes, are reminiscent of viruses.
Similarities between mitochondria and viruses
Mitochondria are, like viruses, obligatory endocellular organisms (Roger et al., 2017). Mitogenomes frequently insert themselves into their host's nuclear chromosomes (numts) (Calabrese et al., 2017), similar to viruses. Mitogenome insertion frequency increases with colorectal adenocarcinoma (Srinivasainagendra et al., 2017). Insertion sites are nonrandom (Tsuji et al., 2012) and associated with nuclear chromosome double-strand breaks (Hazkani-Covo and Covo, 2008) and replication origins (Lenglez et al., 2010). In these respects, numts resemble some endogenized viruses, but mitogenome insertion mechanisms differ from those observed for Adenoviridae (Hazkani-Covo and Covo, 2008).
Mitochondrial posttranscriptional RNA processing also resembles that found in some viruses. Secondary structures formed by mitochondrial tRNAs within long primary mitochondrial RNAs punctuate mitochondrial RNA splicing (Ojala et al., 1981; Shimpi et al., 2017). Secondary structure signals (El Houmami and Seligmann, 2017) also punctuate processing of noncanonical mitochondrial RNAs (Seligmann, 2016a, 2016b). Megavirales (Mimivirus) (Raoult et al., 2004) and their virophages (La Scola et al., 2008) use similar secondary structure punctuation during posttranscriptional splicing (Byrne et al., 2009; Claverie and Abergel, 2009). In addition, mitochondrial budding-like vegetative replication (Hu, 2014) is reminiscent of bacterial endospores (Ajithkumar et al., 2001; Angert, 2005) and membrane-enveloped viruses (Garoff et al., 1998; Chazal and Gerlier, 2003; Chen and Lamb, 2008; Rossman and Lamb, 2011).
Initiation of Giant virus replication is also bacteria- and mitochondria-like: initiation signals found in bacteria and mitochondria associate with the predicted replication origin of Mimivirus (Akashi and Takemura, 2019). Hence, nuclear endogenization, transcription, and replication bear striking, although partial, resemblances with those known from mitochondria.
Gene homologies in mitochondrial and megaviral genomes
A Mimivirus protein coding gene produces a homolog of an amoeban inner mitochondrial membrane transporter that targets inner mitochondrial membranes (Zara et al., 2018). This specific case might reflect an occasional, putatively adaptive horizontal gene transfer from the amoeban mitochondrion to its endocellular parasite, Mimivirus. Giant viruses and mitochondria share further homologous genes. DNA polymerases of giant viruses and their virophages are homologs of those encoded by polintons (usually capsidless linear DNA parasites) (Kapitonov and Jurka, 2006), linear mitochondrial genomes, and bacteriophages (Yutin et al., 2013; Koonin et al., 2014; Koonin and Krupovic, 2017). The RNA polymerases of these taxonomic groups also cluster together (Kempken et al., 1992; Rohe et al., 1992).
In addition, pieces of eukaryotic nucleus- and mitogenome-encoded ribosomal RNAs were detected in various viruses belonging to Megavirales (Abrahão et al., 2018; Seligmann and Raoult, 2018; Seligmann, 2018a, 2019). Mitochondrial rRNA remnants seem most widely distributed within Megavirales (Seligmann, 2018a, 2019), suggesting that giant viruses have mitochondrion-like ancestors. Viral candidates for rRNA homologs apparently decay faster than viral homolog candidates of other mitogenes (Seligmann, 2018a), in line with rRNA absence in viruses. However, megaviral candidate homologs of mitogenes tend to conserve the mitochondrial gene order, and these synteny levels between megaviral and mitochondrial genomes increase with viral genome size reduction, when comparing megaviral families at large (within each small and large genome size groups, Seligmann, 2018a) and when comparing genomes of Poxviridae (Seligmann, 2019).
Similar gene arrangements
The increase of synteny with genome size reduction occurs independently in all three analyses (small and large megaviral genomes and Poxviridae). However, taking into account phylogenetic structure for each of these groups does not recover any association between synteny levels and genome size. This indicates that correlations observed when not accounting for phylogeny result from information embedded in the phylogenetic process associated with genome reduction. A simple way to explain this is that these correlations are not strictly adaptive, but result from interactions between phylogeny and adaptation.
Reasons for this apparent orthogenetic evolution between megaviral and mitogenome gene arrangements are unknown. However, in association with genome size reduction, gene contents of mitogenomes strongly converge with those of plastids (Maier et al., 2013; Sloan et al., 2018), and chloroplasts (Stefano et al., 2015; Allen, 2017). Future comparisons with very distantly related mitochondria as those found in the predatory flagellate Ancoracysta (Janouškovec et al., 2017) might shed some additional light on these phenomena.
These convergent gene orders are at least, in part, due to functional constraints on gene order, as functionally associated genes frequently form clusters on chromosomes (Botstein et al., 1972; Brinig et al., 2006; Dávila-López et al., 2010; Wicke et al., 2011; Junier and Rivoire, 2016), and are sometimes ordered according to assembly order of proteins in multiprotein complexes (Wells et al., 2016). Not least, supercoiling of chromosomal DNA, which creates close spatial vicinity between genes that may be, from a linear point of view, located far apart on the chromosome, produce coexpression patterns that also affect orders of distant genes (Willenbrock and Ussery, 2004; Sobetzko et al., 2012; Dorman and Dorman, 2016; Noy et al., 2016; Krogh et al., 2018; El Houdaigui et al., 2019).
Megavirales are probably polyphyletic, pooling together large viruses from different, independent origins (Colson et al., 2018; Koonin and Yutin, 2018, 2019). Recent studies suggest that giant viruses are relatively abundant in all ecosystems (Andrade et al., 2018; Mihara et al., 2018; Schulz et al., 2018), including such infecting marine invertebrates (a chaetognath-hosted giant virus has the longest known viral particles, exceeding 4 μm, approximately twice the length of Escherichia coli (Barthélémy et al., 2019)).
Here, analyses test convergences between mitochondrial gene arrangements and genomes of Phycodnaviridae. This polyphyletic group (details in Table 1 according to a phylogeny in Colson et al., 2018) is hosted by algal life forms from groups with very different phyletic origins, such as Stramenopiles, Viridiplantae, and Haptophycea. Phycodnaviridae present the advantage that mitochondrial and chloroplast genomes are available for several of their host species. This enables their inclusion as controls when comparing syntenies between mitochondria and Phycodnaviridae analyses of syntenies between mitogenomes from different species and between mitogenomes and chloroplast genomes (hosts and genomes listed in Table 1).
List of Host Taxa, Mitochondrial, Chloroplast, and Phycodnaviridae Genomes Analyzed for Synteny with Mitogenomes, with Genome Size and Taxonomic Cluster of Hosts and Viruses
C indicates the phyletic cluster of phycodnaviruses according to Colson et al. (2018).
This comparative experimental setup also enables to test whether cohosted genomes have higher syntenies than for comparable genomes with different hosts, independently of the lifeform (mitochondrion, chloroplast, or virus) to which genomes belong. This could detect cellular coregulation of cohosted organelle metabolisms, potential viral hijacking of these pathways, and viral mimicry to escape the host's cellular defenses (Alcami, 2003; Rollins et al., 2019). Moreover, independently of the topic of common origins for giant virus genomes and mitogenomes, the approach developed here sets a methodological and conceptual framework for studies of syntenies between genomes of cohosted organelles.
Previous observations on phyletic constraints producing associations between synteny and viral genome size reduction suggest that genome size effects might not be detectable in a polyphyletic group such as Phycodnaviridae.
Materials and Methods
Genomes analyzed for synteny
Previous analyses used a single mitogenome as reference to test synteny between mitogenomes and megaviral genomes, the mitogenome of Acanthamoeba castellanii (Seligmann 2018a, 2019). To test whether positive results on synteny with viruses are circumstantial, due to mitogene arrangements in that specific mitogenome, synteny analyses for Phycodnaviridae were done using additional reference mitogenomes, notably 8 mitogenomes from hosts infected by Phycodnaviridae, and the human mitogenome, a total of 10 mitogenomes. The human mitogenome is the shortest in the list and is therefore used to illustrate methods estimating synteny between mitogene order and other genomes.
Furthermore, vertebrate mitogenomes, including the human mitogenome, are highly reduced to a specific set of highly conserved genes, whose arrangement on mitogenomes is also highly conserved. Previous analyses (Seligmann, 2018a, 2019) showed that synteny between amoeban and viral genes is greater for mitogenes conserved across distant taxa, such as amoeba and vertebrata, than for lineage-specific mitogenes only present in amoeba. Hence, synteny between the human mitogenome and virus genomes is expected to be relatively high, not because of a closer phyletic relationship than with other mitogenomes, but because the human (and vertebrate) mitogenomes are reduced to few highly conserved mitogenes.
Table 1 presents the different mitogenomes, from which all genes were extracted and compared by BLASTn against a variety of chloroplast and viral genomes (9 and 24, respectively). Results from BLASTn analyses were used to calculate synteny levels.
BLASTn analyses
Synteny analyses as done here are directional, meaning that they search alignments between each mitogene extracted from a given focal mitogenome and another (mitochondrial, chloroplast, or viral) genome, using exactly the same methods as already described previously for Megavirales (Seligmann 2018a) and Poxviridae (Seligmann, 2019). These are explained here on the basis of an example, the mitogenome of Homo sapiens. Vertebrate mitogenomes have the lowest gene number and hence are most amenable as example.
The human mitogenome includes 13 protein coding genes, 2 rRNAs and a Dloop, in the following order according to its GenBank annotation: Dloop, 12s rRNA, 16s rRNA, ND1, ND2, COI, COII, AT8, AT6, COIII, ND3, ND4l, ND4, ND5, ND6, and CytB. These 16 sequences were extracted from GenBank entry NC_012920. For each of them, a separate alignment search was done by BLASTn (Altschul et al., 1997), with the following parameters: maximal Evalue, 10; word size 7; match/mismatch scores 1, −1; and gap costs: existence 0, extension, 2. If more than one alignment was detected, only the alignment with the lowest (best) Evalue was considered. Each of these 16 human mitogene sequences are analyzed by BLASTn, searching for alignments with each of the other mitogenomes, each of the viral and each of the chloroplast genomes in Table 1. Table 2 lists the best alignments obtained between each of these 16 mitogenome sequences extracted from the human mitogenomes and the genome of Acanthocystis turfacea Chlorella virus1, and with the genome of Rickettsia prowazekii. Note that these analyses are directional, from the mitogenes of a focal mitogenome, toward other genomes.
Alignments Between 16 Sequences from the Human Mitogenome and the Genomes of Acanthocystis turfacea Chlorella Virus1 and Rickettsia prowazekii
Mitochondrial sequences are listed in the order of their circular chromosomal arrangement. Last column: mitogene rank producing the highest synteny with 5′ positions of alignments with each Chlorella virus1 and Rickettsia genomes.
Synteny between ranked gene and alignment orders
Because all these genomes are circular, the starting point of genome annotations is arbitrary. Hence, each mitogene is potentially the first sequence. Therefore, the 16 mitogenes are assigned 16 alternative ranks (from 1 to 16), ranks of other sequences are adjusted accordingly. Synteny between these mitogene ranks from the focal mitogenome and other genomes are calculated using Pearson correlation coefficient r. This r estimates the similarity between the ranks of the focal mitogenes and the positions of the sequences aligning with these mitogenes on another genome. These alignments are considered candidate homologs of the focal mitogenes in the other, target genome. Their positions are also ranked. For the human mitogenome, 16 r's are calculated, one for each of the alternative mitogenome sequence ranks, considering a different mitogene as starting the annotation. The highest (most positive) r estimates synteny between mitogene order from the focal mitogenome (in this example from H. sapiens) and their candidate homologs in the other genome. In some cases, some mitogenes lack a candidate homolog. These mitogenes are excluded from correlation analyses. To compare syntenies calculated on different sample sizes (meaning different numbers of candidate homologs), Pearson correlation coefficients are z-transformed, and these z values are adjusted for sample sizes to za as explained in Table 3 and previously (Seligmann et al., 2007).
Synteny levels as presented are Pearson correlation coefficients r × 100, underlined coefficients have p < 0.05, followed by numbers of detected alignments between a genome and mitogenes from the mitogenome indicated in the column heading. Sample size affects r, this effect is neutralized by z-transformation, z = ln[(1 + r)/(1 − r)], sample size adjusted za = z − r/(2n − 5) (Seligmann et al., 2007). Synteny with mitogenome 7 as reference was not calculated. Bold numbers indicate cohosted genomes. Column numbers indicate species according to numbers in rows.
Sign tests
Sign tests use binomial distributions to estimate statistical significances of given events, assuming their expected random frequency is 0.5. For example, considering a given focal mitogenome and its cohosted chloroplast genome, the null hypothesis expects that on average, syntenies with about half of the chloroplast genomes, which are not cohosted with that focal mitogenome, are higher (more positive r) and about the other half are lower than synteny between cohosted organelles. In other words, if the fact that two organelles are cohosted has no effect on synteny, on average, synteny between cohosted organelles should be in the midst of the distribution of syntenies between organelles that are noncohosted. Hence, if, for example, the cohosted organelles have the highest synteny among the nine tested chloroplast genomes paired with that focal mitogenome, the binomial distribution produces the p value of observing the highest synteny specifically between cohosted organelles, with frequency 0.5: p = 0.002. In cases where cohosted organelles have the second highest synteny among nine tested chloroplast genomes paired with that focal mitogenome, the binomial distribution produces p = 0.002 + 0.018 = 0.0195. The binomial distribution requires for calculating p values the number of trials (here nine) and the expected frequency, here 0.5.
Results
Synteny is independent of alignment quality
Table 2 presents positions and Evalues of human mitogene candidate homologs on Rickettsia and A. turfacea Chlorella virus1 genomes. Alignments are frequently short and have relatively high Evalues. Overall, alignment quality is better with Rickettsia than with the virus, as already reported for Poxviridae (Seligmann, 2019).
As expected, synteny analyses with the alignment data from R. prowazekii (Table 2) produce r = 0.7029, one tailed p = 0.0012. This shows that the highly reduced mitogene content of vertebrate mitogenomes conserved (or recovered) the gene arrangement of homologous genes in the presumed common ancestor of mitochondria and Rickettsia. This positive result can also be considered as a positive control showing that the method used to estimate synteny is adequate. It is presented first for that very reason.
Similar analyses for A. turfacea Chlorella virus1 also produce high levels of congruence between human mitogene arrangement and ranked positions of candidate A. turfacea Chlorella virus1 homologs of human mitogenes (Fig. 1A, r = 0.7558, one tailed p = 0.00035). This is surprising because alignment qualities between human mitogenes and their candidate A. turfacea Chlorella virus1 homologs are much lower than for the same mitogenes and the Rickettsia genome. This means that although viral sequences are much more distant from their presumed homolog than bacterial homologs are from mitogenes, the presumed viral genomic arrangement of its candidate mitogene homologs fits well the human mitogene arrangement. These synteny levels are comparable to those observed between human mitogenes and the Rickettsia genome. Hence, mitogene arrangement is conserved in viruses, even if the very sequences are much less conserved. In other words, synteny presumably reflects here a common ancestral gene arrangement.

Ranked position of alignments between mitochondrial sequences and the Acanthocystis turfacea Chlorella virus1, and the ranked position of the mitochondrial sequences for
Synteny for conserved mitogenes
Above analyses show high congruence between the human mitogene order and the order of the viral candidate homologs detected by BLASTn. This is because vertebrate mitogenomes are reduced to few highly conserved mitogenes whose arrangement is also highly conserved, not because vertebrate mitogenomes would be more closely related to virus genomes than plausibly more relevant mitogenomes. Analyses below show this, using the mitogenes from the focal mitogenome of Heterosigma akashiwo.
When considering all Heterosigma mitogenes, synteny with Chlorella virus1 is low (r = 0.20, p > 0.1). However, when restricting synteny analyses to mitogenes common to Heterosigma and vertebrata, synteny is much higher (Fig. 1B, r = 0.54, p = 0.024). Results indicate that vertebrate mitogene arrangement is very similar to a common ancestor of Rickettsia, mitochondrial, and giant virus genomes. Algal and other mitogenomes apparently reshuffled more the arrangement of these mitogenes, although conserved an overall synteny with that common ancestor. This bias for greater synteny between viral and mitogenomes for mitogenes conserved across very divergent mitochondria is a major point favoring the interpretation of synteny as an indicator of common ancestry.
Synteny between mitogenomes
Estimates of synteny such as those presented above are statistically significant at p < 0.05. However, this does not indicate their biological meaning. To get an idea to what extent these synteny levels can be considered as high or low, similar synteny analyses are calculated between all pairs of mitogenomes for the 10 mitogenomes from Table 1 (Table 3). Note that for the sake of simplicity, the mitogenome of Ostreococcus is only included as target for aligning mitogenes from other species because the Ostreococcus mitogenes include gene duplications, which render synteny analyses and comparisons with other mitogene arrangements more complex.
Synteny levels between mitogene orders and mitogenomes in Table 3 show some phyletic signal. For example, synteny between Ectocarpus mitogene order and candidate homologs from Heterosigma (both Stramenopiles) is za = 0.988. This is higher than za calculated from synteny between Ectocarpus mitogene order and any of the other eight mitogenomes not originating from Stramenopiles (p = 0.0039, two-tailed sign test). Mean za within groups is greater than mean za between groups for all seven comparisons from Table 3 (no comparisons can be made for taxa represented by only one species, namely Acanthamoeba and Homo). This produces an overall p = 0.0078 (two-tailed sign test) for greater synteny within taxa than for comparisons between mitogenomes from different taxonomic groups.
Hence, synteny reflects phyletic relatedness: mitogenomes from related taxa tend to have more similar gene arrangements than more distant taxa. Nevertheless, this effect is not very strong, indicating convergences toward similar mitogene arrangements, independently of phylogenetic relatedness.
Synteny between chloroplast and mitochondrial genomes
Results in the previous section indicate orders of magnitudes for synteny, variation in synteny and association of synteny with phyletic relatedness when comparing mitogenomes from different hosts. Synteny between mitogene arrangements and chloroplast genomes from the same algal hosts are analyzed and compared to those obtained for comparisons within mitochondria from the previous section (Table 3). Mitochondria presumably evolved from ancestral Rickettsia-like alphaproteobacteria (Emelyanov, 2001), while chloroplasts evolved apparently from the relatively distant cyanobacteria (Raven and Allen, 2003).
Synteny levels between mitogene arrangements and chloroplast genomes (mean za = 0.819) are comparable to synteny levels among mitogenomes (mean za = 0.809) (Table 3). Both organelles reputedly converge toward similar gene contents, as noted above. Results indicate that they also converge toward similar gene arrangements.
Synteny between genomes of cohosted organelles
The data for synteny between mitogene arrangements and chloroplast genomes in Table 3 indicate interesting patterns in relationship to synteny between chloroplast and mitochondrial genomes sharing the same host. The organelles of Phaeocystis globosa have the highest mitogenome-chloroplast genome synteny when considering all syntenies between the mitogenes from that species and all eight chloroplast genomes analyzed in this study. This has p = 0.0039 according to a two-tailed sign test. Similarly, synteny between the mitogene arrangement in Pavlova lutheri (another Stramenopile) and the genome of chloroplasts hosted by that species is the second highest among the syntenies between that mitogenome and the eight chloroplast genomes considered here (p = 0.0352, two-tailed sign test).
For another Stramenopile, Chrysochromulina parva, synteny between its mitogene arrangement and its chloroplast genome is the lowest among the eight chloroplast genomes included in analyses (p = 0.0039, two-tailed sign test). Hence, among the seven organisms for which syntenies between cohosted organelles were calculated, three have extreme synteny levels compared with other mitogene-chloroplast genome comparisons. Two have higher than expected syntenies and one has a lower than expected synteny between mito- and chloroplast genomes. There is no clear pattern in the remaining species.
This suggests the possibility that in some species, functional constraints increase synteny between genomes of cohosted organelles, and in other species, constraints decrease synteny. It is possible that in the former species, mitochondrial and chloroplast transcriptions are coregulated, perhaps by the same or similar nucleus-encoded transcription factor(s). This might result in similar functional constraints on gene expression that explain similar gene arrangements.
In the species where synteny is lower than expected, nuclear regulations might be organized to dissociate between the metabolisms of the two organelles (e.g., diurnal/nocturnal activities for chloroplast/mitochondrion) (Liang et al. 2015). This would have produced lower than expected synteny between mitogene arrangement and chloroplast genomes. Other, more complex rationales might cause the observed synteny patterns. Note that synteny analyses between mitochondria and chloroplasts were designed as controls for synteny between mitochondria and Phycodnaviridae. Hence, results in relationship to synteny patterns between mitogene arrangement and chloroplast genomes should be considered as preliminary, but indicate that they could associate with different types of coregulations of symbiont organelle metabolisms. This principle might also apply to syntenies between mitochondrial and viral genomes, beyond and/or in addition to phyletic interpretations of common ancestries.
Synteny between mitogene arrangements and Phycodnaviridae
Table 4 presents syntenies calculated between mitogene arrangements from nine species and 23 phycodnavirus genomes. Comparisons between mean syntenies after za-transformations for different taxonomic groups (Stramenopiles, Haptophycea, Viridiplantae, and the remaining various taxa) do not indicate statistically significant trends associated with broad taxonomic affiliations. On average, synteny between mitogene arrangements of Micromonas (Viridiplantae) and genomes of Phycodnaviridae infesting Viridiplantae are relatively high (mean za = 0.80), however, there is no statistically significant difference at p < 0.05 (one-tailed tests) with syntenies obtained for that same mitogenome with Phycodnaviridae from other groups (Haptophycea, mean za = 0.62; Stramenopiles, mean za = 0.54; various, mean za = 0.65). No clear phyletic effects on mitogenome-phycodnaviral syntenies could be detected for any mitogenome.
Synteny Estimated as in Table 1 Between Mitogene Arrangements in 9 Mitogenomes and Genomes of 24 Phycodnaviridae
Underlined: p < 0.05; bold: virus and mitogenome cohosted.
Syntenies between cohosted mitochondrial and viral genomes are greater than expected compared with syntenies with other phycodnavirus genomes for three species, C. parva, Micromonas sp., and P. globosa (p = 0.0000002, p = 0.000043, and p = 0.026, respectively [two-tailed sign tests]). These results, considering above comments on chloroplast and mitochondrion gene expression coregulations, suggest that in some hosts, high syntenies between mitogenomes and the viral genomes reflect coregulation of mitochondrial and viral metabolisms. In other words, viruses might mimic mitochondria, thereby facilitating the hijack of the host's resources. In P. globosa, gene arrangements in all three host genomes (virus, mitochondrion, and chloroplast) seem locked on the mitogene arrangement.
Synteny between mitogene arrangements and Phycodnaviridae versus Rickettsia genomes
Syntenies between mitogene arrangements and Phycodnaviridae, compared with syntenies between mitogene arrangements and the genome of R. prowazekii, show different patterns, according to which mitogenome is used as reference. Patterns are all statistically significant according to two-tailed sign tests, independently of reference mitogenome. For 6 among 9 reference mitogenomes used here, syntenies with Rickettsia's genome are higher than expected by chance, compared to the 23 virus genomes examined. This is the case for two haptophycean mitogenomes (Emiliania huxleyi, p = 0.000772; P. globosa, p = 0.032), two stramenopiles (Ectocarpus siliculosus, p = 0.00000006; H. akashiwo, p = 0.0000029), Micromonas sp. (Viridiplantae, p = 0.000772), and the human mitogenome (p = 0.0000015). In other words, mitogene arrangements in these six species are more similar to the Rickettsia genome than to Phycodnavirus genomes.
For the three remaining species, the opposite is observed: syntenies between mitogene arrangement and the Rickettsia genome are lower than expected, compared with the 23 examined viral genomes. This is the case in two haptophyceans, C. parva (p = 0.000772) and P. lutheri (p = 0.0113), and for the mitogene arrangement of A. castellanii (p = 0.0033). Mitogene arrangements of these three species resemble less that of candidate homologs of these mitogenes in their presumed alphaproteobacterial ancestor than that of various Phycodnaviridae.
Synteny between mitogene arrangements and Phycodnaviridae versus chloroplast genomes
Seven among nine species whose mitogene arrangement is used as reference have chloroplasts. Syntenies between mitogene arrangement and the cohosted chloroplast genome are higher than for viral genomes in five among these seven cases (two-tailed sign tests): E. huxleyi, P. lutheri, and P. globosa (Haptophycea, p = 0.0319, p = 0.00000006, p = 0.00000006, respectively) and for the stramenopiles E. siliculosus and H. akashiwo (p = 0.0113 and p = 0.0013, respectively). The opposite tendency is observed for Micromonas sp. (p > 0.05) and for C. parva (Haptophycea, p = 0.000018). Overall, results suggest higher syntenies between mitogene arrangement and cohosted chloroplast than between mitogene arrangement and viral genomes, with the exception of C. parva.
Synteny with mitogene arrangement as a function of genome size
Previous analyses comparing Megavirales at large, and within Poxviridae, found that syntenies between mitogene arrangements and their viral candidate homologs increase with viral size reduction. This result was not obtained when considering phyletic relatedness among viral species. No association between synteny and genome size was observed for Phycodnaviridae, for none of the 10 mitogenomes used here as reference for synteny calculations with phycodnaviral genomes.
However, the distribution of genome sizes of Phycodnaviridae as in Table 1 is discontinuous (bimodal), one mode with small genomes with <195,000 nucleotides, and another mode with genomes with more than 270,000 nucleotides. When analyzing separately these two groups, synteny might decrease with genome size for small genomes (r = −0.51, one tailed p = 0.068), and increases for large phycodnaviral genomes (r = 0.631, two tailed p = 0.021) when using the P. lutheri mitogene order as reference mitogenome. These analyses are inconclusive, and suggest combinations of confounding phyletic and functional effects that prevent a clearer picture. Putatively, associations between synteny and genome size are not as clear as in other groups. This might be because Phycodnaviridae are polyphyletic and because Phycodnaviridae are cohosted with both mitochondria and chloroplasts. This complex situation enables detecting functional constraints on gene arrangements of cohosted genomes.
General Discussion
Previous analyses of synteny between amoeban mitogene arrangement and candidate megaviral homologs suggested common ancestry for these groups. Discussions considered the possibility that megaviral genomes originated from mitogenomes while increasing size (Seligmann, 2018a), or result from parallel evolution from similar common Rickettsia-like ancestors (Seligmann, 2019). Results presented here do not confirm nor refute the directionality of synteny evolution in relationship to viral genome size. They do confirm high syntenies between mitogene arrangements and viral genomes, comparable in extent to syntenies with other mitogenomes, chloroplast genomes, and the genome of R. prowazekii. DNA supercoiling could be an explanation for gene order conservation/convergence. However, it is difficult to understand how such functional/spatial-structural constraints could conserve or produce syntenies between 16,000 and 50,000 nucleotide-long mitogenomes and candidate homologs of mitogenes distributed across viral and bacterial genomes that are typically 10–100 times larger. In this respect, ancestral synteny seems a more parsimonious explanation at this point, conserved by ulterior functional constraints such as DNA supercoiling.
Analyses here show that beyond common ancestry, functional factors affect synteny. These are putatively due to coregulations of gene expressions of cohosted genomes, notably DNA supercoiling. Such constraints might increase, or decrease syntenies, depending on hosts. In Phaeocystis, viral, mitochondrial, and chloroplast genomes seem locked on the mitogene arrangement. In Micromonas sp., syntenies of mitogene arrangement with candidate homologs from cohosted phycodnavirus and chloroplast are lower than expected, suggesting avoidance of synchronous gene expressions. In this species, higher than expected synteny between mitogene arrangement and homologs in R. prowazekii stresses common ancestry.
Synteny levels seem independent of alignment qualities. This suggests that syntenies reflect very ancient common origins of genomes: divergence between specific genes present in both genomes only produces alignments too short for considering these genes as homologs. This is reminiscent of very ancient evolutionary events in relationship to cellular origins of viruses through structural homologies among proteins (Nasir and Caetano-Anollés, 2015), RNAs (Seligmann and Raoult, 2016, 2018), mean amino acid positions in primary protein structures (Seligmann, 2018b) and genetic code evolution (Seligmann, 2015, 2018c). In that sense, synteny is an interesting tool to complete classical analyses for detecting phyletic relatedness.
Independently of the specific topic of common ancestry for mitochondria and giant viruses, the presented analyses are a useful conceptual and methodological framework to study syntenies among cohosted genomes.
Synteny between human mitogene arrangement and R. prowazekii is particularly high, putatively indicating that beyond common ancestry, high synteny might promote endocellular parasitism, perhaps downplaying genetic immunity systems and/or enabling them to hijack cellular resources usually targeted at mitochondria. The latter hypothesis connecting synteny between mitochondrial gene arrangement and pathogen genomes should be investigated independently of viruses and of hypotheses on common ancestry and in relationship to pathogenicity and genomic immunity mechanisms.
Conclusions
Synteny exists between algal mitogene order and candidate homologs in genomes of Phycodnaviridae.
Synteny between mitogenes and Phycodnaviridae is comparable to that between mitogenes and chloroplast, and mitogenes and Rickettsia.
Alignment qualities between mitogenes and chloroplasts, and mitogenes and Rickettsia, are generally much higher than between mitogenes and viral genomes.
In some algae, synteny among cohosted viral, chloroplast, and mitogenomes is better than between genomes of organelles and viruses with different hosts; in some (fewer) algae, syntenies between cohosted genomes are lower than expected.
The latter point indicates coregulation of expression of cohosted genomes, and the possibility that synteny between mitogenes and viral genomes results, in part, from functional constraints, putatively via DNA supercoiling regulating transcription.
Common ancestry remains the most parsimonious explanation and main cause for mitogene–viral genome syntenies, because viral genomes are 10–100 times larger than mitogenomes.
Independently of the issue of common ancestors for mitochondria and giant viruses, analyses here provide a conceptual and methodological framework for studies of syntenies among cohosted genomes.
Footnotes
Acknowledgment
Thanks to Claudiu Bandea for constructive comments.
Disclosure Statement
No competing financial interests exist.
Funding Information
No funding was received for this article.
