Abstract
I propose a new model, called the “RNA memory” model, for the possible role of RNAs in the maintenance and establishment of cell identity. This is cytoplasmic memory obtained by the transmission of mother noncoding (nc) RNAs to daughter cells. These RNAs are able to activate transcription via sequence homology in daughter cells. Regulation of RNA memory is strictly linked to the regulation of ncRNAs with repressive features, such as the RNAs involved in RNA interference (RNAi). Misregulation of this system could lead to misidentity, and thus it could be involved in cancer transformation, progression of viral or genetic diseases, and progression of senescence.
Introduction
Different models have been developed to explain either the establishment or the maintenance of cell identity, but actual models focus on explaining how an on/off gene status is inherited and not the exact transcriptional profile. In these models, cell identity can be maintained or established mainly by Polycomb and trithorax group proteins, histone variants, DNA methylation, and epigenetic modifications. 3
In symmetric cell division, after mitosis, daughter cells are able to activate the same set of genes of the mother cell at the same transcriptional level for each gene, starting from the high condensed and overall silenced state of metaphase chromosomes, thus leading to a cell lineage. We define the maintenance of cell identity as the continued restoring of chromatin and transcriptional profiles through generations. 4 –6 In asymmetric cell division or during commitment, daughter cells can acquire new functions, and their chromatin and transcriptional profile can be different from the maternal one. 4 –6
Even if actual models can explain several events related to the control of cell identity, some phenomena still remain unexplained or poorly explained. An “RNA memory” model could partially fill the gap between actual models and unexplained or partially explained experimental observations.
RNA Memory Model
The RNA memory theory is a son of Occam's razor. The simplest way for daughter cells to restore the transcriptional profile of a mother cell in a cell line is for the daughter cells to have a direct method of reading the transcriptional profile of the mother cell. During its lifetime, the mother cell produces messenger (m) RNAs that represent the key point of its cell identity. The quantity and quality of mRNAs produced are the sum of all the cascades of regulation borne by the cell. These cascades are driven by intracellular, intercellular, humoral, or environmental stimuli and are integrated in the developmental design. The loss of this information after cell division would represent a severe lack of efficiency. For example, enzymatic induction after exposure to an environmental toxin must be retained after cell division, and it would be gradually lost only if the exposure is not reiterated. 7 –9
In the RNA memory model, there is cytoplasmic memory for cell identity via trans-acting RNA diffusible elements (from now on called memRNAs). These RNAs are able to activate transcription via sequence homology in daughter cells either in a gene- or locus-specific manner. In the cytoplasm of the mother cell, memRNAs are functional RNA remnants that mirror the mother cell's own transcriptional profile. After mitosis, the transcriptional profile can be restored through the reading of memRNAs, which reflect the transcriptional profile of the mother cell. This could be considered a sort of transcriptional inertia.
Development programs could take advantage of this effect. The regulated expression of novel memRNAs during development could change the cell fate in the next generation. In this scenario, daughter cells would be different from mother cells. Asymmetric distribution of memRNAs could easily lead to a different cell fate in daughter cells. In this scenario, daughter cells are different one from the other.
It seems possible that memRNAs could lead to a euchromatic state of chromatin moreso than directly activated transcription. Moreover, it seems highly probable that memRNAs are ncRNAs. memRNAs could arise either from introns, mRNAs untranslated regions (UTRs), or noncoding transcripts from promoters and enhancers.
Regulation of RNA memory is strictly linked to the regulation of ncRNAs with repressive features, such as RNAs involved in RNA interference (RNAi). Double-stranded (ds) RNA-driven degradation could be a system used by the cell to modulate RNA memory. RNAi could destroy memRNAs in a gene-specific manner, thus leading to a lack of activation of the target gene, and thus leading to the silencing of that gene. This mechanism is compatible with the observation that the RNAi pathway is required for the establishment of a silenced state but not for its maintenance. 3,10 Several scenarios could be hypothesized, for example, alternative splicing could be an event of regulation for the production of short- or long-lasting memRNAs.
memRNAs could be important regulators in cell fate, and they could be involved in several biological events. Their misregulation could lead to events such as cancer transformation or senescence. Chronic activation of an oncogene via RNA memory could contribute to cancerogenesis; the only evidence for this would be a change in the epigenetic pattern of that gene. 11 –13 From one generation to the other, the abundance of memRNAs for that gene would induce its own transcription greatly, thus leading the cell to enter the cell cycle, and so on.
In senescence, from one generation to the next, transcriptional noise could increase due to a stochastic floating of the RNA memory system, and this could lead to aging-related diseases. 14 –18 Possible applications both in research and in therapies could be significant. These topics will be discussed in detail in this paper.
RNA and Chromatin Structure
Much evidence suggests that RNA-directed processes help to orchestrate chromatin architecture and epigenetic memory. 19,20 RNA has the capacity for a high degree of sequence and locus specificity and could be an important source of information during development and adaptation. The vast majority of genomes of all metazoans are transcribed in a developmentally regulated manner, mainly into complex patterns of ncRNAs. 21,22 It is known that RNA is an integral component of chromatin 23 and that many of the proteins involved in chromatin modifications as well as transcription factors have the capacity to bind RNA or complexes containing RNA or RNA-binding proteins. 21,24,25 There is evidence that many ncRNAs can act locally to regulate the epigenetic state of the nearby chromatin, often recruiting either repressing or activating chromatin remodeling complexes. 26 –35 These data suggest that RNA can be involved in chromatin remodeling and epigenetic modification in a sequence-specific manner, a key feature of RNA memory model.
RNA and Transcription
RNAs are able to physically contact components of the transcriptional machinery. This is known for the TATA box–binding protein (TBP) 36 and for the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II. Moreover, the CTD can also bind RNA-binding proteins, 37 –39 such as TFIIH, which interacts with U1 small nuclear (sn) RNA, greatly enhancing its kinase and helicase activities. 40,41 These examples show that RNAs have the intrinsic ability to contact components of the transcriptional machinery.
In 1999 it was discovered that some RNA sequences are able to trans-activate transcription, 42 and that these RNAs seem to bear a consensus sequence. 36 Interestingly, a study using longer molecules identified RNAs with a high degree of complexity able to activate transcription even more strongly than in previous experiments. These RNAs do not have a clear consensus sequence. 39,43 These data suggest that RNAs have the ability to trans-activate transcription, a key feature of RNA memory model.
It is also known that in some cases RNAs are able to activate transcription in a highly regulated spatial-temporal way, for example as in the case of trans-activation responsive region (TAR) RNA for the TAT protein in the human immunodeficiency virus type 1 (HIV-1). 44 –46 Other examples are Evf-2 ncRNA, which cooperates with the Dlx-2 protein in a targeted and homeodomain-specific manner, probably the first of a large family to be identified, 32 or the steroid receptor RNA activator (SRA) with the SRC-1 protein, which also seems to confer specificity for steroid receptors. 47 –49 Another example is the short noncoding RNA duplex NRSE, which is able to bind to the neuron-restrictive silencing factor NRSF/REST and able to convert it to a transcriptional activator. Interestingly, NRSE has a sequence homology to DNA sites bound by NRSF and does not disrupt the NRSF–DNA complex. 50 These data suggest that RNAs have in some cases the ability to trans-activate transcription in a regulated way, also by sequence homology. This is another key feature of RNA memory model.
RNA Duplex
Recently, it has been discovered that RNA duplexes with sequence homology with the promoter can promote transcription from that promoter using the same cellular machinery needed for RNAi. This phenomenon is strictly linked to genome-wide antisense transcription. 51 –53 It is interesting to note that there is a great deal of evidence that enhancers and other regulatory sequences are transcribed in the cell in which they are active, 54 –58 and their transcription is often needed to activate the transcription on their target gene. 59 These data suggest a possible mechanism of action of RNA memory. memRNAs could form RNA duplexes with sequence homology to target promoters or other regulatory sequences, thus driving the transcription of the target gene.
Moreover, there is the possibility that the regulation of RNA memory is strictly linked to the regulation of ncRNAs with repressive features, such as RNAs involved in RNAi. dsRNA-driven degradation could be a system used by the cell to modulate RNA memory. RNAi could destroy memRNAs in a gene-specific manner, thus leading to a lack of activation of the target gene and the silencing of that gene. This is compatible with the observation that the RNAi pathway is required for the establishment of a silenced state but not for its maintenance. 3,10
RNA and Polycomb-Mediated Silencing
Polycomb group response elements (PREs) mediate the mitotic inheritance of gene expression programs and thus maintain determined cell fates. By default, PREs silence associated genes via the targeting of Polycomb group (PcG) complexes. Upon an activating signal, however, PREs recruit counteracting trithorax group (trxG) proteins, which in turn maintain target genes in a transcriptionally active state. 3
In Drosophila, experimental data suggest that the switch from the silenced to the activated state of a PRE requires noncoding transcription. Continuous transcription through the PRE prevents the establishment of PcG-mediated silencing. The maintenance of epigenetic activation requires transcription through the PRE not only during embryogenesis but also at late larval stages, suggesting that transcription through endogenous PREs is required continuously as an antisilencing mechanism to prevent the access of repressive PcG complexes to the chromatin. All PREs tested were found to be transcribed in the same tissue as the mRNA of the corresponding target gene, suggesting that antisilencing by transcription is a fundamental aspect of the cellular memory system. 60 It seems that an intergenic ncRNA transcribed from the Polycomb group/trithorax response element (PRE/TRE) of the Ubx gene is retained at the TRE through DNA–RNA interactions and plays an important role in providing an RNA scaffold that is recognized by the epigenetic positive regulator Ash1. 2,26
These data suggest a possible mechanism of action for RNA memory. Transcripts from regulatory sequences could target their specific regulatory sequences, recruiting specific nuclear factors, and thus drive the transcription of the target gene. So, through the inheritance of the transcriptional state of regulatory sequences, the transcriptional state of the target gene could be also inherited. Modulation of transcription of the regulatory sequence could change the cell fate of daughter cells. Transcripts from regulatory sequences with these features would be memRNAs.
Introns and UTRs
MicroRNAs (miRNAs) are 20-nucleotide-long to 24-nucleotide-long ncRNAs acting as posttranscriptional regulators of gene expression in animals and plants. Both human and mouse intronic miRNAs tend to be present in large introns with 5′-biased position distribution. 61 This correlates with the previous observation that most long intronic transcripts are expressed within first introns of the host genes. 62 A few large-scale studies have shown that certain sets of intronic ncRNAs have the same tissue expression pattern as the corresponding protein-coding genes, but this is not a universal rule. 62 –65
It has been reported that long intronic sequences could coordinate waves of gene expression important for particular cellular processes that are functionally related to the protein-coding transcript of the same locus. Note that non-degraded spliced introns can be exported selectively to the cytoplasm 66–67 and be involved in global gene expression regulation. 66,68 These data suggest a possible role of intronic transcript or intron fragments in RNA memory. If sense or antisense, short or long intronic transcripts or fragments could drive the expression of their homolog gene, and these RNA fragments can survive from one generation to the next, these RNAs would be memRNAs.
Moreover there is evidence that many 3′ UTR of mRNAs can act in trans as ncRNAs, 69 –80 so it is possible that other fragments of mRNAs, if they could drive the expression of their homolog gene and survive from one generation to the next, could act as memRNAs.
The Nuclear Membrane
In the vast majority of eukaryotes, the nuclear envelope is disrupted during mitosis. In some rare cases this does not happen, so it is not compulsory, but it is probably an event of regulation. 81 Is it possible that nuclear membrane disruption permits the merging of cytoplasmic and nuclear factors, allowing memRNAs to reach the nucleus? Moreover it is known that RNAs can change conformation and activity in response to interacting proteins or metabolites, so the disruption of nuclear membrane could lead to an ulterior step of regulation. 39,82 –85
β-Globin Locus and Transinduction
In mouse erythroid cell lines, cell identity heritage is a clonal phenomenon. 86 In fact, at least for tested genes, stochastic variations in the transcriptional profile are inherited by daughter cells, without genetic mutations. Tested genes were α- and β-globin genes, which are differentiation markers and have no known nuclear effect. 86 This means that a mechanism should be present in daughter cells to mimic the transcriptional profile of the mother cell. RNA memory could be this system. For example, if at generation 0 there is a random inactivation of the β-globin gene, there will be a lack of memRNAs for that transcript in mother cell cytoplasm. Daughter cells at generation 0 + 1 could not find any β-globin gene memRNAs, thus the inactivation will be inherited. This effect can extend on through generations, producing a clonal phenomenon. Moreover, because the memRNA activation could be related to sequence homology between memRNAs and the noncoding sequence of the gene, the activation/inactivation could spread through cell generations in an allele-specific mode. Also, asymmetric distribution or variation in the half-life of memRNAs could lead in the next generation to a variation of the transcriptional profile, giving a possible mechanicistic explanation both for the clonal inheritance and the stochastic variation of the transcriptional profile. Moreover, analysis of a multicopy transgenic locus for human β-globin gene in mice showed an all-or-none effect in the activation of all the genes in the locus. Each β-globin gene does not act stochastically in an independent way. 86 Trans-acting memRNAs with sequence homology for all of the copies of human β-globin gene in the locus could give an easy explanation to this phenomenon.
In human cultured cells, a transient transfection of a β-globin gene induces transcription of the locus control region (LCR) and intergenic region at the chromosomal β-globin locus in nonerythroid cell lines. However, the β-globin genes remain transcriptionally silent. This effect is dependent on transcription, but not on protein expression, and therefore is RNA mediated. This phenomenon was called transinduction. In one experiment, transinduction was mediated by the transfection of a γ gene of the β-globin locus not able to produce a detectable full-length γ-globin mRNA. This suggests that the effect is not mediated by a functional mRNA, but instead by ncRNAs. 55 A possible explanation is that the transfection is able to produce memRNAs for the euchromatinization of the β-globin locus, but the specific globin genes cannot be expressed due to the lack of the right erythroid transcriptional factors. If this phenomenon is mediated by memRNAs, here there is a clear suggestion that RNA memory can drive a euchromatic state of chromatin more than directly activated transcription.
Moreover, in humans, a regulated intergenic transcription is needed for chromatin remodeling at the β-globin locus to set the right temporal development of the erythroid cell line. 87 This could be a regulated modulation of RNA memory at the β-globin locus to drive the correct expression pattern through development.
Drosophila melanogaster PARP Gene
In Drosophila melanogaster, there is only one PARP gene that encodes two alternative splicing forms: PARP-I, an enzymatically active form, and PARP-e, an enzymatically inactive form. An insertion mutation near an upstream promoter disrupts all PARP expression. The expression of a transgene for PARP-e can restore the presence and the activity of PARP-I. PARP-e expression induces a more physiological pattern in PARP-I expression, leading to fewer deleterious effects than when PARP-I is misexpressed globally. 88 Even if authors suggest a different explanation, this phenomenon could be explained by a trans-activation via sequence homology. In my model, PARP-e transgene could have produced memRNAs responsible for the activation of genomic PARP. If this would be true, it implies that alternative splicing could also be a step for the regulation of RNA memory.
Transvection
Many Drosophila loci show a phenomenon called transvection, wherby a wild-type promoter linked to a mutant coding sequence is able to trans-activate a wild-type coding sequence linked to a nonfunctional promoter in the homolog gene. 89 Usually transvection needs a homologous position on chromosomes of the two interacting genes, but it has been reported some cases in which transvetion occurs over large distances, even between different chromosomes. 90 –93 Moreover, many transvecting promoters are transcribed into ncRNAs, and mutants of epigenetic modifiers can modify transvection. 89,94 –96
Taken together, these observations could suggest an involvement of a diffusible trans-activating signal over great distances, able to modulate locally the epigenetic markers, such RNAs or, in my theory, memRNAs.
Pseudogenes
Pseudogenes bear sequence similarities to a specific protein-coding gene, but they are unable to produce functional proteins. The human genome is estimated to contain up to 20,000 pseudogenes. Pseudogenes may represent a means for additional levels of gene regulation.
Recent studies have suggested the importance of pseudogene transcripts in gene regulation. For example, a mouse not expressing the Makorin1-p1 pseudogene shows abnormal expression of the functional protein-coding Makorin1 gene located elsewhere in the genome. 2,97 –99 Again, this phenomenon could be explained by a trans-activation via sequence homology. In my model, the Makorin1 pseudogene could produce memRNAs responsible for the activation of the functional protein-coding Makorin1.
Celll Reprogramming
The possibility of reprogramming somatic cells to pluripotency through nuclear transfer or cell fusion strongly suggests the presence of trans-acting agents able to radically reprogram the chromatin. These cytoplasmic factors are able to totally erase the epigenetic program of a differentiate cell and convert it into a stem cell–like chromatin pattern. 5 Even if the efficiency of reprogramming is extremely low, these factors are not only able to physiologically drive the first steps of development, but they also have the intrinsic ability to “remember” how the nucleus should behave. I think that some of these factors could be RNAs that can act in a memRNA-like fashion. Even during direct in vivo reprogramming of somatic cells into embryonic stem-like cells, the production of proteic factors is usually obtained by the use of viral vectors. This method raises the production of both proteins and RNAs of the transfected genes in host cells. 5 Is it possible that the maintenance of production of stem cell-specific factors through the generations could be mediated also by memRNAs?
Conclusion
In recent years, it has been discovered that ncRNAs could have a broad range of functions, absolutely unexpected until few years ago. RNAi is now a well-known, evolutionarily conserved phenomenon with several implications in many aspects of chromatin dynamics and transcriptional regulation, almost the totality of them with repressive features. 100,101 This paper describes a theory regarding the role of ncRNAs in the activation of transcription and their implications in the control of cell identity. If the theory is confirmed, we would have a mechanism and a tool opposite and complementary to RNAi. This model is called RNA memory. In this theory, the mother cell produces together with mRNAs a class of ncRNAs called memRNAs. memRNAs reflect the transcriptional profile of the mother cell and are retained until the next cell division. In this way, daughter cells are able to read the memRNAs and mimic the transcriptional profile of the mother cell. This mechanism permits retention of transcriptional information due to local functional perturbation, such as enzymatic induction. In this theory, the abundance of memRNAs can be read by the nucleus, using an unknown mechanism, and drive the euchromatinization of specific genes or loci due to sequence homology. I have presented some experimental data in support of such a theory. If this theory is confirmed, it is highly probably that RNAi and RNA memory could work in a very entangled network, especially in the regulation of the developmental program.
Two major objections could be raised against this theory. Why is epigenetic inheritance not sufficient to explain the observed phenomena? The answer is that epigenetic phenomena act in cis and cannot explain the observed trans-acting events. Many of the events discussed seem to use sequence homology to target the right gene or locus, and an RNA-mediated mechanism seems more probable than an epigenetic one. Moreover, many histone covalent modifications seem to be transient and are able to set the local environment to perform specific functions. Histone modifications are and must be very dynamic, but it is uncertain how many of them can spread in time in the absence of a reiterated epigenetic writing. 102,103 The model I propose is that RNA memory targets the gene or the locus that has to be activated and that chromatin remodelers and epigenetic modifiers are the physical effectors.
The second major objection could be the following: Why has this mechanism not yet been discovered? I think that full-length RNAs and/or transcription from regulatory sequences is needed to trigger RNA memory. The enormous use of complementary (c) DNAs in the last decades could have produced experimental drift, which that could have obscured this phenomenon. Moreover, until recently, the analysis of cellular components involved in transcriptional activation has focused on proteic factors.
If this theory of memRNA is confirmed, the implications in the development and cure of several diseases will be enormous. Overproduction of oncogenes could be driven by an alteration of an RNA memory system, leading to a chronic activation of these genes, thus leading to carcinogenic transformation. Interference with RNA memory could revert this activation and could be used in therapy. Viral infection could also take advantage of RNA memory. Production of memRNAs for viral products could contribute to the increase in transcription from viral genes and could be a key point of the regulation of virulence. In senescence, from one generation to the next, transcriptional noise could increase due to a stochastical floating of RNA memory system, and this could lead to aging-related diseases. As a tool, memRNA will be of great use, permitting the activation of specific transcriptional patterns just driving the production of memRNAs, and it will have possible applications in therapy whenever a disease could result from the underproduction of specific gene product, as, for example, in the widespread Gilbert syndrome. 104
Of course, this theory is not a substitute for consolidated theories regarding the regulation of transcription and the control of cell identity. However, if this theory is confirmed, it will be just another piece in the amazing network of cellular life.
Footnotes
Acknowledgments
I want to thank Giordano Bruno and Nikola Tesla for their inspiring visions, and my former laboratories for their help. I want also thank Swonild Ilenia Genovese for her aid revising the manuscript.
Author Disclosure Statement
No competing financial interests exist.
