Abstract
BACKGROUND:
Morin, (3,5,7,2′,4′-pentahydroxyflavone), is a polyphenolic compound belonging to bio-flavonoids and is predominantly isolated from the family Moraceae. Previous studies demonstrated the health benefits of morin using human and animal models. Despite its importance as a bioactive compound, the genetic architecture of the morin biosynthetic pathway is still unclear.
OBJECTIVE:
To understand the genetic architecture of the morin biosynthetic pathway, the following components were analyzed: (1) cis-responsive element (CRE)-mediated regulation, (2) microRNAs (miRNA)-mediated post-transcriptional silencing, and (3) tissue-specific in silico gene expression.
METHODS:
To understand the genetic architecture of morin biosynthetic pathway, in silico survey was carried out using different web servers (MorusDB, MEME suite, NCBI database, PlantCARE, and psRNATarget) and collected mRNA, protein sequences, and expressed microarray data. TBtools was employed for depicting protein and promoter motifs and the heatmap preparation of tissue-specific expression of genes involved in the morin biosynthesis.
RESULTS:
The current data mining study highlighted the morin biosynthetic pathway associated genes, namely, phenylalanine ammonia-lyase (MnPAL), chalcone synthase A (MnCSA), chalcone-flavonone isomerase (MnCFI), and flavonoid 3′,5′-hydroxylase (MnFH) are transcriptionally regulated by different growth, development, and stress-responsive CREs. Differential expression profiles shown MnPAL (L484_024373) and MnCFI (L484_011241) genes were upregulated across selected tissues. Moreover, miRNA-mediated post-transcriptional silencing was identified.
CONCLUSIONS:
This study will improve our understanding of morin biosynthesis, and it can improve production via metabolic engineering.
Introduction
Recent advances in nutritional genomics have focused on the importance of gene–nutrient interaction to prescribe natural ingredients and/or their active components to their patients and maximize health benefits. Interestingly, the current trends of medicinal and pharmaceutical research primarily focused on natural products specifically obtained from plants. Morin, 3,5,7,2′,4′-pentahydroxyflavone, is a natural flavonol for many types of plants, although it is predominantly isolated from plants belonging to the Moraceae family and it is recognized as a broad spectrum bioactive compound comprising a wide range of pharmacological properties with extremely low cytotoxicity [1, 2]. Recent findings suggest that treatment with morin can protect against different neurotoxicity induced by acrylamide [3], ifosfamide [4] and effectively reduce oxidative stress, inflammation, and apoptosis during the heart and brain damage in the animal system [5]. There is evidence put forward that morin bears several pharmacological effects specific influences the activity of numerous enzymes to produce antibacterial, antioxidant, antidiabetic, anti-inflammatory, antitumoral, antihypertensive, hypouricemic, and neuroprotective [5]. Hence, as a pharmacological compound morin offers potential application and advantages for medicinal purposes. Recent studies suggest that morin is involved in modulating MAPK, NF-κB, JAKs/STRTs, Keap1/Nrf2, and mTOR signal transduction cascades [6–11]. The chemical structure of morin shows that the 2′ hydroxyl group (-OH) of the B ring forms a hydrogen bond with the1′ oxygen atom of the C ring. Because of this conformational change, the transfer of electrons (e–) from the B ring to the double bond of the C ring makes morin a potential natural radical scavenger [2, 12]. Hence, we established the impact of morin as a bioactive compound.
Despite the importance of morin, understanding molecular regulation and genetic architecture of metabolic pathways are still not specifically emphasized for the Morus spp. The major constraints to understanding the complex metabolic pathways are (a) compartmentalization of metabolites in cellular organelles [13], (b) interaction of metabolites and proteins, (c) tissue-specific accumulation of the bio-active compound, and (d) influence of the environment on multilevel gene regulation [14, 15]. Recently, trends in bioinformatics provide an excellent opportunity to examine and understand the genetic architecture of complex metabolic pathways that will provide an optimum platform for wet lab-related research [14, 15].
Adequate understanding is a prerequisite that will be helpful to improve the productivity of natural morin by metabolic engineering in the future. In this study, we attempted to understand (1) the number of genes involved in the morin biosynthesis pathway, (2) cis responsive elements (CREs)-mediated regulation, (3) miRNA-mediated post-transcriptional silencing, (4) conserved motifs in protein, and (5) in silico gene expression in different tissues. This study may help to understand the morin biosynthesis pathways as well as their genetic architecture.
Materials and methods
Identification of genes involved in morin biosynthesis in M. notabilis
The biosynthetic pathway and genes involved in morin (pentahydroxyflavone) were recognized by Brugliera et al. [16]. Accordingly, we identified the orthologs of M. notabilis C.K. Schneid using the MorusDB (https://morus.swu.edu.cn/morusdb) [17] and retrieved mRNA and protein sequences.
Identification of motifs of isoforms
The retrieved protein sequences of the identified isoforms were subjected to the prediction of the conserved motifs using the MEME suite (https://meme-suite.org/meme) [18] web server.
In silico gene expression analysis
To understand the differential expression pattern of genes involved in morin biosynthesis, we downloaded the expression profiles of five tissues such as root, bark, winter bud, male flower, and the leaf of M. notabilis from MorusDB [17]. TBtools [19] was used to prepare a heatmap of the expression data.
Promoter structure analysis and annotation of CREs
To understand the transcriptional regulation of identified potential genes involved in morin biosynthesis, we retrieved 500bp upstream promoter sequences from the NCBI database (https://www.ncbi.nlm.nih.gov) [20]. Sequences were then subjected to analysis in the PlantCARE database (https://bioinformatics.psb.ugent.be/webtools/plantcare/html) [21] for identifying the consensus motif present in the promoter region. TBtools [19] was then used to analyze promoter and CREs of selected genes.
Prediction of miRNA targets
To understand the impact of miRNA-mediated post-transcriptional silencing, putative miRNAs, which target the transcript of genes involved in morin biosynthesis in M. notabilis, were identified using the psRNATarget (https://www.zhaolab.org/psRNATarget) [22] server with the expectation value of 3.5.
Results
Different isoforms involved in morin metabolism
The gene ortholog isoforms involved in the morin biosynthesis of M. notabilis were identified; Fig. 1Ashows the morin biosynthetic pathway. Four orthologs namely, phenylalanine ammonia-lyase (MnPAL), chalcone synthase A (MnCSA), chalcone-flavonone isomerase (MnCFI), and flavonoid 3′,5′-hydroxylase (MnFH) were identified in M. notabilis, which are involved in morin biosynthesis. Investigation on copy number analysis deciphering except MnPAL (L484_024373), there are multiple isoforms of MnCSA (04 isoforms: L484_027525, L484_017929, L484_017930, and L484_003228), MnCFI (04 isoforms: L484_011241, L484_017264, L484_017265, and L484_020757), and MnFH (02 isoforms: L484_028073, L484_027395) present in the M. notabilis genome.

Identification of a gene involved in morin biosynthesis, tissue expression pattern, and encoded protein structure. (A) Biosynthetic pathway of morin (3,5,7,2′,4′-pentahydroxyflavone); Involvement of enzymes like phenylalanine ammonia-lyase (PAL), chalcone synthase A (CSA), chalcone-flavonone isomerase (CFI), and flavonoid 3′,5′-hydroxylase (FH) and detailed steps were highlighted. (B) Tissue-specific (root, branch bark, male flower, leaf, and winter bud) expression of different isoforms (* mark given for major isoform, identified based on expression potentiality). (C) Structural diversity of proteins involved in morin biosynthesis (the identified conserved motifs were supplemented in Table 1S).
MnPAL, MnCSA, MnCFI, and MnFH belong to the different classes of proteins and comprised a varied length of diverse conserved motifs. MEMEsuite web server analysis suggests that MnPAL (L484_024373) comprises six conserved motifs (Fig. 1B). Both N- and C-terminal domains of all classes of proteins are devoid of any conserved motifs. The identified motifs were situated in the middle position of the protein. All four isoforms (L484_027525, L484_017929, L484_017930, and L484_003228) of MnCSA are of the same length and comprised nine motifs across the peptide. The isoforms of MnCFI (L484_011241, L484_017264, L484_017265, and L484_020757) were varied in length and comprised two conserved motifs. Similarly, the two isoforms of MnFH, L484_028073, and L484_027395 are unequal in length and comprised three conserved motifs.
Tissue-specific expression of different isoforms
Tissue-specific expression data demonstrate that MnPAL was differentially expressed across all selected tissues such as root, bark, winter bud, male flower, and leaf (Fig. 1C). MnPAL is reported to be upregulated in all tissues compared to other genes/isoforms. L484_011241 and L484_017264 isoforms of MnCFI were upregulated in all tissues compared to the other two isoforms. None of the isoforms of MnCSA and MnFH demonstrated upregulation in selected tissues.
Developmental- and stress-associated CREs involved in transcriptional regulation
A number of CREs associated with development (14), stress (21), and response to hormones (13) were identified (Fig. 2A), which are spanning across the promoter (500 bp) in selected major genes such as MnPAL (L484_024373), MnCSA (L484_017929), MnCFI (L484_011241), and MnFH (L484_028073). A number of identified motifs with respect to the consensus sequence, start and end site, and functional annotation was retrieved from the PlantCARE database and described in Table 1. Figure 2B shows the proportional distribution of development (light-responsive, meristem expression, and xylem-specific induction), stress (anaerobic induction, drought-inducibility, SA responsive, and wound responsive), and response to hormones (ABA, ethylene, MeJA, and SA). Figure 2C shows the potential sites of CREs across promoter regions.

(A) Bar diagram represents the number of identified CREs associated with development (14), stress (21), and response to hormones (13). (B) Pie chart represents proportional distribution (in percentage) of development (light-responsive, meristem expression, xylem specific induction), stress (anaerobic induction, drought- inducibility, SA responsiveness, and wound responsiveness), and response to hormones (ABA responsiveness, ethylene responsiveness, MeJA-responsiveness, and SA responsiveness) associated CREs. (C) Dot plot represents the presents of different of CREs across the promoter regions. (D) Distribution of potential sites of CREs across the promoter regions. Different colour boxes represent functional annotation of identified CREs.
Detail of identified motifs in terms of consensus sequence, start and end site and functional annotation retrieved from PlantCARE database
A total of 13 putative miRNAs are identified (cre-miR906-5p, gma-miR1516a-3p, mtr-miR2673a, mtr-miR2673b, ddi-miR-7131c-5p, sly-miR168a-3p, aqc-miR477d, mtr-miR5207, mtr-miR5556-5p, vvi-miR477b-5p, aly-miR838-3p, bra-miR9563b-3p, and sly-miR5302b-3p); they may be involved in the post-transcriptional silencing of morin biosynthesis pathway by inhibiting translation or by cleavage mechanism (Table 2). The transcripts of MnCFI (4), MnCSA (2), MnPAL (4), and MnFH (3) were targeted by more than two numbers of miRNAs. The analysis demonstrated that the transcript region near the 3′region of MnCFI and MnCSA, whereas the transcript of MnPAL and MnFH5′regionare more prompt target sites for the identified miRNAs. miRNA cre-miR906-5p and gma-miR1516a-3p were reported to be involved in translational inhibition, thus all remaining miRNAs were inhibited targets by cleavage mechanism.
List of miRNAs involved in post-transcriptional silencing of transcripts involved in morin biosynthesis of M. notabilis
List of miRNAs involved in post-transcriptional silencing of transcripts involved in morin biosynthesis of M. notabilis
Flavonoids exist in abundance in plants and play an important role in protecting against stresses, modulating developmental processes, and helping in seed dispersion and pollination by coloring flowers and fruits [23, 24]. Morin is a natural bio-flavonol and is predominantly synthesized in the Moraceae family [2]. Genes involved in flavonoid biosynthesis such as phenylalanine ammonia-lyase (PAL), chalcone synthase A (CSA), chalcone-flavanone isomerase (CFI), and flavonoid 3′,5′-hydroxylase (FH) are extensively examined to understand the evolutionary relationship, as well as expression patterns under multiple abiotic and biotic stresses in different crops [23, 26]. The central importance to understand the genetic architecture and how morin biosynthesis is modulated by a multi-level gene regulation system (CREs and/or miRNA) has not yet been studied for the Morus spp. Hence, this study provides a platform as to how morin biosynthesis is controlled in M. notabilis from a molecular point of view.
The copy number variation of haploid M. notabilis genome demonstrates that except MnPAL (L484_024373), other genes (MnCSA, MnCFI, and MnFH) comprise multiple isoforms. Copy number of a gene family is essential for understanding gene family expansion and diversification, a key evolutionary drive [27], and for demonstrating the genetic dosage in a given species or genotype [28, 29]. Hence, a single copy of MnPAL offers an excellent platform for understanding complex regulation and subcellular organization of flavonoids biosynthesis in M. notabilis. Thus, for understanding the biosynthesis of flavonoids, the natural haploid species M. notabilis could be an experimental model.
The expression pattern of genes involved in morin biosynthesis differentially expressed across all selected tissues (Fig. 1). Specifically, MnPAL is reported to be upregulated in all tissues compared to other genes/isoforms, although the upregulation of gene MnCFI is minimum. Mulberry is a perennial tree species with huge adaptability and ploidy variation. A recent transcriptional study elucidated a considerable amount of variation in transcriptional abundance between haploid M. notabilis and diploid M. indica [31], moreover higher amount of up-and down-regulated genes were identified in haploid M. notabilis [31]. Therefore, understanding the expression potentiality of genes involved in morin biosynthesis as well as their isoforms in haploid M. notabilis provides a platform for future metabolic engineering.
The influence of the environment at the gene expression/regulation level and characterization of CREs were recommended by several researchers in both model and non-model systems [32–34]. However, the characterization of promoter structure and distribution of CREs across the promoter of genes involved in morin biosynthesis was not done previously. In the present study, a number of CREs spanning across the promoter region (500 bp) in selected four major genes indicating that morin biosynthetic processes transcriptionally regulated development- (14), stress- (21) and hormone- (13) responsive TFs. CREs associated with development (light-responsive, meristem expression, and xylem-specific induction), stress (anaerobic induction, drought-inducibility, SA responsiveness, and wound responsiveness), and response to hormones (ABA responsiveness, ethylene responsiveness, MeJA-responsiveness, and SA responsiveness) are indicating that all genes are controlled by intra- and extra-cellular factors (Fig. 2). Therefore, CRE-mediated transcriptional regulation implies considerable abiotic factors modulate the morin biosynthesis pathway, though further lab-based research is essential to demonstrate deeper understanding through the development of overexpressed, mutant, and promoter lines.
Furthermore, our result shows that all four genes are subjected to post-transcriptional regulation by many miRNAs either through translational inhibitions or through cleavage mechanisms. In this study, a total of six miRNAs were identified, which are involved in post-transcriptional gene silencing in the morin biosynthesis pathway. Although lab research in the future will require confirming the gene regulation through miRNAs, limited studies have focused on understanding miRNA-mediated gene regulation on mulberry species to date [35].
In the last few years, many in silico approaches were implemented to characterize relevant genes involved in plant cell signalling [36], development [37], gene regulation [38], mitigation of biotic [28], abiotic [39] stresses, and metabolic engineering [40] in the Morus spp. Hence, the present in silico research outcome will enrich the current understanding of the genetic architecture of morin biosynthesis (Fig. 3).

Complex multi-level gene regulation of morin biosynthesis and signal transduction cascades in the animal cell. At the cellular level TFs, CREs, and miRNA mediated the regulation of genes (MnPAL, MnCHA, MnCHI, and MnFH) and translated proteins involved in morin biosynthesis processes. Simultaneously, morin-mediated signal transduction processes and management of metabolic disorders in the animal cell. Plant cell organelles are denoted in box.
In the broader sense, genetic architecture of a metabolic pathway implies complex genetic communications and their association with environmental factors. Understanding of genetic architecture of any metabolic pathway reveals information regarding the genes involved in the pathway/cascade as well as transcriptional and post-transcriptional regulation in responses to a specific environment. In the recent past, the genetic architecture of tissue-specific glucosinolate accumulation in Brassica napus [41], genetic architecture of Prunus mume floral traits [42], genetic architecture of erucic acid and tocopherol isoform variation in B. napus seeds [43] (Havlickova et al., 2018), genetic architecture of amylose biosynthesis Zea mays [44] were investigated. Moreover, the results of the current data mining study support that TFs, CREs, and miRNA played a role in the genetic architecture of morin biosynthetic pathway by regulating the expression pattern in responses to various extra and intracellular stimuli. Likewise, genetic architecture of the Populus lignin production pathway mediated by miRNA and TF are also characteristics [45]. Hence, the present in silico approach offers a fundamental platform to understand complex genetic interaction and their association with the environmental factors involved in morin biosynthesis processes, which can be take-up for further metabolic engineering to enhance morin productivity.
With the help of bioinformatics tools, we attempted to identify genes involved in morin biosynthesis, CREs-mediated regulation, miRNA-mediated post-transcriptional silencing, and in silico expression in different tissues. Moreover, this study suggests that the natural haploid species M. notabilis could be an experimental model for understanding morin biosynthesis pathways. The present bioinformatics study will improve our understanding of the genetic architecture of morin biosynthesis, although experimental evidence will be required to understand the impact of intra- and extra-cellular factors on morin biosynthesis. In regards to improving the production of morin following experimental designees should be taken into account: (1) development of overexpressed and mutant lines of genes involved in morin biosynthesis for comparative assessment; (2) creation of transgenic promoter lines; and (3) integrated omic-study under a set of environment. Such an integrated study will provide a deeper understanding of metabolic engraining to improve the production of morin.
