Abstract
BACKGROUND:
Stroke is one of the leading causes of disability and mortality worldwide.
OBJECTIVE:
To identify the regulatory network of microRNAs (miRNAs) and mRNAs to clarify molecular mechanisms in stroke.
METHODS:
Four miRNA datasets and two mRNA datasets of stroke were downloaded from the GEO database. R-Studio was utilized to analyze differentially expressed miRNAs (DEmiRNAs) and mRNAs (DEmRNAs) in the blood of stroke and control patients. FunRich software was utilized to conduct GO and biological pathway analysis on DEmiRNAs, and to search for transcription factors (TFs) of DEmiRNAs. Subsequently, we used miRDB, miRTarBase, and TargetScan to identify DEmiRNAs target genes and intersected with DEmRNAs to find common target genes. The miRNA-mRNA regulatory network of common target genes was constructed by using the Cytoscape. The biological and functional roles of target genes in the regulatory network were predicted using GO and KEGG pathway analyses.
RESULTS:
464 DEmiRNAs and 329 DEmRNAs were screened. The top ten TFs (SP1, SP4, EGR1, TCF3, NKX6-1, ZFP161, RREB1, MEF2A, NFIC, POU2F1) were visualized. 16747 target genes of DEmiRNAs were predicted. Target genes were intersected with DEmRNAs, 107 common target genes and 162 DEmiRNAs regulating these common genes were obtained, and then a regulatory network was constructed. Target genes of the regulatory network were primarily enriched in VEGF signaling pathway, lipid and atherosclerosis, T cell receptor signaling pathway.
CONCLUSION:
This study found that VEGF signaling pathway, lipid and atherosclerosis, T cell receptor signaling pathway are implicated in the biological process of stroke by constructing the regulatory network of miRNAs-mRNAs, which may have guide significance for the pathogenesis and treatment of stroke.
Introduction
Stroke is one of the main causes of disability and death in the globe and exhibits high morbidity and recurrence rates. Stroke was rated second among the top ten main causes of disability-adjusted life years (DALYs) in 2019 for both people aged 50–74 and those aged 75 [1, 2]. For every 10-year increase in age, the annual risk of stroke doubles [3]. Stroke exhibits high mortality and physical disability, and usually requires long-term rehabilitation after the acute phase, bringing a heavy burden on patients, families, and society as a whole [4]. It is critical to investigate stroke mechanisms, find novel therapeutic interventions for stroke, provide more effective treatment for patients, and reduce the burden on patients.
MiRNAs are a ubiquitous class of small non-coding single-stranded RNAs composed of 21–23 nucleotides that play important roles in health and disease [5]. MiRNAs modulate gene expression at the post-transcriptional level by modulating target messenger RNAs (mRNAs), leading to changes in levels of target proteins. Single or multiple miRNAs can bind to the 3’ non-translated region of a target mRNA, causing its degradation or inhibiting translation/transcription [6]. Thousands of downstream target genes may be regulated by a single miRNA, which could have an impact on whole gene network and protein synthesis [7]. There is evidence that miRNAs play crucial roles in neuropathological processes resulting from stroke. Therefore, miRNA levels after stroke can provide insight into the underlying pathogenesis of stroke [8].
In this study, DEmiRNAs and DEmRNAs of stroke were screened and analyzed by analyzing multiple miRNA datasets and multiple mRNA datasets of stroke for the first time, so as to construct a regulatory network of miRNAs-mRNAs, increase the understanding of the pathological mechanisms of stroke, and bring new perspective on clinical diagnosis and treatment of stroke.
Materials and methods
Data preparation and processing
Data preparation and processing are shown in Fig. 1. The GEO database (
We used the surrogate variable analysis (SVA, version 3.14) package to remove batch effects of all samples. Probe identifications were transformed as gene symbols and the expression values were log2-transformed. Subsequently, the four miRNA datasets of stroke and two mRNA datasets of stroke were merged respectively to collect common genes to form new gene expression profiles. The “normalizeBetweenArrays” function of limma package in R-Studio (Version 4.1.0) were utilized to normalize data. If a gene has two or more expression levels, the average expression value of gene was utilized.
Data collection
Data collection
The process of data preparation and processing. TFs: transcription factors; GO: Gene Ontology; KEGG: Kyoto Encyclopedia of Genes and Genomes.
Principal component analysis (PCA) graph of miRNA and mRNA in stroke. (a) and (b) respectively represent the sample distribution of stroke miRNA before and after batch effect removal. (c) and (d) respectively represent the sample distribution of stroke mRNA before and after batch effect removal.
Heatmap and volcano map of differentially expressed miRNAs (DEmiRNAs) and mRNAs (DEmRNAs) in stroke. (a) and (b) respectively represent the Heatmap and volcano map of DEmiRNAs. (c) and (d) respectively represent the Heatmap and volcano map of DEmRNAs. Red for upregulated genes and green for downregulated genes.
GO and biological pathway analysis results of DEmiRNAs. (a), (b) and (c) are the results of biological process (BB), cellular component (CC) and molecular function (MF) in GO analysis, respectively. (d) is the results of biological pathway.
TFs prediction of DEmiRNAs.
DEmiRNAs-DEmRNAs regulatory network of stroke. Triangles represent miRNAs, and ovals represent mRNAs. The upregulated genes are shown in red, and the downregulated genes are shown in green.
GO and KEGG enrichment analysis results of DEmRNAs in DEmiRNAs-DEmRNAs network.
The limma program of R-Studio was utilized to screen for differentially expressed miRNAs (DEmiRNAs) and mRNAs (DEmRNAs) across stroke and control groups, respectively. Sets the screening threshold of DEmiRNAs to adjust
Gene Ontology (GO) and biological pathway analysis and transcription factors prediction of DEmiRNAs
FunRich software was used to conduct GO and biological pathway analysis on DEmiRNAs. GO describes the biological functions of target genes in terms of biological processes (BP), cellular components (CC), and molecular function (MF). Based on FunRich software, the transcription factors (TFs) were searched. The condition of statistical significance was adjusted
Prediction of target genes for DEmiRNAs and building regulatory network
The DEmiRNAs were predicted target genes using three databases of TargetScan (
Functional enrichment of target genes in regulatory network
After target genes were extracted from regulatory network, the org.Hs.eg.db and clusterProfiler package in R-Studio were utilized to carry on GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses for target genes of ceRNA network. GO describes the biological functions of target genes in terms of BP, CC, and MF, while KEGG can explain the signaling pathways in which the differentially expressed target genes have been involved. The significant enrichment threshold was adjusted to
Results
Differentially expressed miRNAs and mRNAs
The flowchart for this study is illustrated in Fig. 1. The results after batch effect elimination are presented using the principal component analysis (PCA) graph (Fig. 2). In present study, results showed that 464 DEmiRNAs and 329 DEmRNAs were obtained between the stroke and the control group, including 265 upregulated and 199 downregulated DEmiRNAs, and 195 upregulated and 134 downregulated DEmRNAs. DEmiRNAs and DEmRNAs are displayed as volcano map and heatmap (Fig. 3), with red for upregulated genes and green for downregulated genes.
GO and biological pathway analysis and transcription factors prediction of DEmiRNAs
In order to better understand DEmiRNAs, we use FunRich software for GO and biological pathway analyses. GO enrichment results were nucleotide, nucleoside and nucleic acid metabolism, apoptosis, regulation of nucleobase, regulation of cell growth, cell communication, signal transduction, transport, regulation of gene expression, fatty acid metabolism, regulation of cell cycle, epigenetic, regulation of translation (BP), cytoplasm, nucleus, golgi aparatus, lysosome, early endosome, endosome, actin cytoskeleton, axon, membrane, cytoplasmic vesicle (CC), and receptor signaling complex scaffold activity, ubiquitin-specific protease activity, GTPase activity, transcription factor activity, transcription regulator activity, protein serine/threonine kinase activity, receptor binding, guanyl-nucleotide exchange factor activity, protein serine/threonine phosphatase activity, cytoskeletal protein binding (MF). Biological pathway enrichment results were proteoglycan syndecan-mediated signaling events, beta1 integrin cell surface interactions, glypican pathway, erbB receptor signaling network, TRAIL signaling pathway, integrin family cell surface interactions, VEGF and VEGFR signaling network, sphingosine 1-phosphate (S1P) pathway, PDGF receptor signaling network, plasma membrane estrogen receptor signaling (Fig. 4). TFs can bind to mRNA and perform post-transcriptional regulatory functions, both of which are crucial components of regulatory networks in organisms. The FunRich software was utilized to predict the upstream TFs of DEmiRNAs. The top ten TFs (SP1, SP4, EGR1, TCF3, NKX6-1, ZFP161, RREB1, MEF2A, NFIC, POU2F1) were visualized (Fig. 5).
Target genes prediction for DEmiRNAs and DEmiRNAs-DEmRNAs network construction
Three databases (TargetScan, miRTarBase and miRDB) were utilized to predict the target genes of DEmiRNAs, 16747 target genes were identified, and then the target genes were intersected with DEmRNAs, and 401 DEmiRNAs-DEmRNAs interactions pairs were obtained(Supplementary Table 1). Subsequently, based on the obtained DEmiRNAs-DEmRNAs interaction pairs, Cytoscape was used to construct a DEmiRNAs-DEmRNAs regulatory network for stroke (Fig. 6). The DEmiRNAs-DEmRNAs network contained 162 DEmiRNAs and 107 DEmRNAs.
GO and KEGG enrichment analysis
The functions of the DEmRNAs in DEmiRNAs-DEmRNAs network were investigated by GO and KEGG functional enrichment analyses. With regard to BP, the DEmRNAs were primarily involved in neutrophil mediated immunity, neutrophil activation, regulation of MAP kinase activity, neutrophil degranulation, positive regulation of membrane permeability, neutrophil activation involved in immune response, response to acid chemical, positive regulation of mitochondrial membrane permeability, positive regulation of T cell differentiation in thymus, secretory granule localization. The main CC involving the DEmRNAs in secretory granule membrane, membrane raft, lytic vacuole membrane, lysosomal membrane, azurophil granule membrane, vacuolar membrane, membrane region, membrane microdomain, primary lysosome, azurophil granule. In terms of MF, DEmRNAs were primarily enriched in phospholipid binding, phosphatidylserine binding. In terms of KEGG, the DEmRNAs were mostly enriched in kaposi sarcoma-associated herpesvirus infection, VEGF signaling pathway, T cell receptor signaling pathway, lipid and atherosclerosis, PD-L1 expression and PD-1 checkpoint pathway in cancer. The GO and KEGG results are displayed as bar graphs and bubble diagrams respectively (Fig. 7).
Discussion
Stroke has high morbidity, mortality, and disability rate, which brings heavy burden to economy and society, and is a serious public health problem in China [9, 10]. Despite efforts to manage stroke and reduce risk factors, stroke cases have increased in recent years [11]. There is a growing interest in understanding the pathogenesis of stroke and finding new biomarkers for the early diagnosis and treatment of stroke. Therefore, we identified potential stroke-related genes and constructed a DEmiRNAs-DEmRNAs regulatory network to further understand the pathogenesis of stroke and give novel insights for the prevention, diagnosis and therapy of stroke.
In present study, the differentially expressed genes of stroke were screened by analyzing multiple miRNA datasets and multiple mRNA datasets of stroke for the first time, 464 DEmiRNAs and 329 DEmRNAs were obtained in the stroke group and the control group, including 265 upregulated and 199 downregulated DEmiRNAs, and 195 upregulated and 134 downregulated DEmRNAs. The top ten TFs (SP1, SP4, EGR1, TCF3, NKX6-1, ZFP161, RREB1, MEF2A, NFIC, POU2F1) were visualized. We obtained 401 DEmiRNAs-DEmRNAs interactions pairs through databases prediction, constructed a regulatory network of stroke containing 162 DEmiRNAs and 107 DEmRNAs, and analyzed the functional enrichment of DEmRNAs in the regulatory network. It was found that DEmRNAs of regulatory network are mainly concentrated in VEGF signaling pathway, lipid and atherosclerosis, T cell receptor signaling pathway.
Studies have shown that miRNAs expression can be regulated by TFs [12]. Therefore, we identified upstream TFs that may modulate DEmiRNAs. SP1, SP4 and EGR1 were the most significant transcription factors. SP1, along with SP2, SP3, and SP4, is the specificity protein (SP) family member and functions as a key transcription factor and multifunctional oxidative stress response protein [13]. After transient middle cerebral artery occlusion, 38 transcription factors were discovered to be differentially expressed in mice, among which SP1, SPi1, and Stat3 being the most substantially expressed [14]. It is speculated that SP1 is one of the principal genes involved in stroke [15]. SP1 protects neurons, endothelial cells and glial cells by promoting the transcription of a variety of antioxidant proteins and a variety of antioxidant enzymes, affecting ion transporters on the cytoplasmic side [16]. Early growth response 1 (EGR1) is involved in ischemia-reperfusion injury. EGR1 knockdown effectively reduces infarct size and apoptosis, and improves neurological function [17]. MEF2A, a nuclear element subtype found in myogenic cells, is upregulated in the hippocampus and cerebellum [18]. MEF2A transcriptional activity inhibition can promote neuronal apoptosis. Studies have found that knockdown of MEF2A can aggravate neurological damage and impair cognitive function in rats [19]. These results support the role of selected DEmiRNAs in the pathogenesis of stroke.
Based on functional annotation, VEGF signaling pathway, lipid and atherosclerosis and T cell receptor signaling pathway are significantly enriched pathway in stroke. Angiogenesis can increase collateral circulation, alleviate brain damage, and play a key role in neurological function recovery after stroke [20]. The vascular structures regeneration provides critical “support” for neurogenesis and synapse formation, promotes neural networks recovery, and plays a vital role in enhancing stroke patients’ quality of life [21, 22]. Angiogenesis is regulated by vascular endothelial growth factor (VEGF). VEGF is a pivotal cytokine associated with endothelial cell differentiation, proliferation, and migration. VEGF activates downstream signaling pathways and conveys angiogenic signals when it binds to its receptor [23]. Atherosclerosis (AS) is the pathological basis of several diseases, such as stroke, coronary heart disease and peripheral vascular diseases [24]. Cerebral AS is distinguished by increasing lipid accumulation, fibrous hyperplasia, and inflammatory cell infiltration, which may result in poor clinical consequences such as transient ischemic attack, stroke, and even death [25, 26]. The mechanism of stroke induced by dyslipidemia includes inducing AS, endothelial dysfunction, thrombosis and blood-brain barrier damage, reducing cerebral blood flow (CBF) and promoting neuronal apoptosis [27, 28, 29]. Hyperlipidemia can induce hypertension and cardiovascular disease, and can also interfere with CBF by reducing the synthesis and release of VEGF, thereby increasing the risk of stroke [30, 31].
Inflammation and immunological response play key roles in the pathogenesis of stroke. After stroke, damaged nerve cells induce activation of glial cells, infiltration of peripheral immune cells, and release of inflammatory mediators [32]. T cell receptors are specific receptors on the surface of T cells, which link T cells to antigen-presenting cells and play a crucial role in T cell immune response. Stroke rapidly activates the peripheral immune system, facilitating T or B lymphocytes, neutrophils and monocytes into the damaged region [33]. The study showed that mice lacking T and B cells had smaller infarcts than normal mice [34]. Selective inhibition of T cell receptors reduces infarct size and promotes long-term functional recovery [35]. Therefore, VEGF signaling pathway, lipid and atherosclerosis and T cell receptor signaling pathway are crucial in stroke. However, further experiments are needed to verify our study.
Conclusion
This study found that VEGF signaling pathway, lipid and atherosclerosis, T cell receptor signaling pathway are implicated in the biological process of stroke by constructing the regulatory network of miRNAs-mRNAs, which may have guide significance for the pathogenesis and treatment of stroke.
Footnotes
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grant number 82260243) and the Health Appropriate Technology Promotion Project of Guangxi (Grant number S2021101).
Conflict of interest
The authors declare that they have no conflict of interest.
Supplementary data
The supplementary files are available to download from https://dx-doi-org.web.bisu.edu.cn/10.3233/THC-231357.
