Abstract
Cancer research calls for new approaches that account for the regulatory complexities of biology. We present, in this study, the differential transcriptional regulome (DIFFREG) approach for the identification and prioritization of key transcriptional regulators and apply it to the case of renal cell carcinoma (RCC) biology. Of note, RCC has a poor prognosis and the biomarker and drug discovery studies to date have tended to focus on gene expression independent from mutations and/or post-translational modifications. DIFFREG focuses on the differential regulation between transcription factors (TFs) and their target genes rather than differential gene expression and integrates transcriptome profiling with the human transcriptional regulatory network to analyze differential gene regulation between healthy and RCC cases. In this study, RNA-seq tissue samples (n = 1020) from the Cancer Genome Atlas (TCGA), including healthy and tumor subjects, were integrated with a comprehensive human TF-gene interactome dataset (1122603 interactions between 1289 TFs and 25177 genes). Comparative analysis of DIFFREG profiles, consisting of perturbed TF-gene interactions, from three common subtypes (clear cell RCC, papillary RCC and chromophobe RCC) revealed subtype-specific alterations, supporting the hypothesis that these signatures in the transcriptional regulome profiles may be considered potential biomarkers that may play an important role in elucidating the molecular mechanisms of RCC development and translating knowledge about the genetic basis of RCC into the clinic. In addition, these indicators may help oncologists make the best decisions for diagnosis and prognosis management.
Introduction
Renal cell carcinoma (RCC) is one of the 10 most common cancers in adults and encompasses >90% of kidney cancers worldwide (Caliskan et al., 2020). It originates from the renal epithelium and is more common in males than in females, and most patients are at an older age. Primarily, RCC is mainly composed of clear cell RCC (ccRCC, also known as KIRC), papillary RCC (pRCC, also known as KIRP), chromophobe RCC (chRCC, also known as KICH), and so on, according to their morphology and immunohistochemical features (Caliskan et al., 2021).
ccRCC is the most common variant, corresponds to 75–80% of all RCCs, and originates from the proximal convoluted tubules of the kidney. The rest are papillary (10–15%), chromophobe (5%), and rare kidney cancers. Although there is improvement of the state-of-the-art treatment technologies, the overall prognosis is still poor in RCCs, because most RCCs are incidentally found at imaging investigations, and about 30% of patients present with metastatic disease at the time of diagnosis (Hsieh et al., 2018). Furthermore, RCC subtypes usually do not respond to conventional radiotherapy and chemotherapy because they have drug resistance. Therefore, focusing on elucidating pathogenesis and biological mechanisms of each subtype of RCC may be promising for early diagnosis, new treatment strategies, and prognosis, and, in turn, might be life-saving.
Gene expression programs are shaped through the interplay of transcription factors (TFs), noncoding RNAs, and their target genes (Cao et al., 2015). Gene regulatory network (GRN) studies reveal complex life events in terms of a descriptive model of such connections introduced by transcriptional and post-transcriptional regulatory datasets, and systems biology approaches integrate heterologous information to understand the function of networks occurring within cells (Emmert-Streib et al., 2014).
In previous years, with the development of high-throughput screening techniques that started with microarray technology and now shifting to next-generation sequencing technologies, a lot of data on gene expressions and their regulations were accumulated and are available to the research community. Various methods and strategies have been developed to create GRNs with a vast amount of information accumulation (Del Vecchio et al., 2017). These regulatory data are used in medicine and molecular biology applications such as identification of disease mechanisms and pathways, the discovery of new drugs and drug repositioning, reducing side-effects of treatments, studying the expression patterns of genes with no evident function, identification of efficacious biomarkers, and development of diagnostic and prognostic screens with high accuracy.
In this study, we investigated the RNA-seq data (n = 1020 samples) form the Cancer Genome Atlas (TCGA) and used the differential regulome methodology that integrates transcriptome data with the human transcriptional regulatory (TF-target gene) network to analyze and compare the differential gene regulations among healthy and tumor groups. Three common subtypes (ccRCC, pRCC, and chRCC) of RCC were investigated and compared in terms of differential regulome profiles. These analyses address the question of which TF-gene interactions are perturbed between the disease and normal conditions and which TFs are likely to change their targets. Differential regulome methodology focuses on the differential regulation between TFs and their target genes rather than differential expression (Fig. 1).

Representative illustration of the basis of the DIFFREG methodology. The figure depicts the alteration of TF-gene interactions between the disease and normal phenotype. DIFFREG focuses on the differential regulation between TFs and their target genes rather than differential expression. DIFFREG, differential transcriptional regulome; TF, transcription factor.
Materials and Methods
Data extraction: TF-target gene interactions
To generate a highly reliable TF-target gene interaction dataset, we used several databases that generate data based on experimental values or computational origin, including Regnetwork (Liu et al., 2015), HTRIdb (Bovolenta et al., 2012), IRegulon (Verfaillie et al., 2014), Pazar (Portales-Casamar et al., 2008), TRRUST (Han et al., 2015), and TFacts (Essaghir et al., 2010). The TF-gene interactome set consisted of 1122603 interactions among 1289 TF and 25177 genes.
Regulatory interactions between TFs, microRNAs (miRNAs), and target genes from 25 selected databases for human and mouse are collected, integrated, and reorganized by the Regnetwork database (Liu et al., 2015). Specifically, the human regulatory network contains 23,079 nodes and 369,277 edges, consisting of 1456 TFs, 1904 miRNAs, and 19,719 target genes in the Regnetwork database. The HTRIdb is an open-access database that keeps TF-target gene information with 284 TFs that regulate 18302 genes, totaling 51,871 TF-target gene interactions (Bovolenta et al., 2012).
The Pazar database includes 1433 annotated publications, 1284 regulated genes, 6869 regulatory sequences, and 708 TFs (Portales-Casamar et al., 2008). TRRUST database includes a significantly increased size of the database consisting of 8444 regulatory interactions for 800 TFs in humans (Han et al., 2015). TFacts concludes data of TFs associated with 2720 target genes and 6401 experimentally validated regulations (Essaghir et al., 2010). iRegulon relies on the analysis of regulatory sequences around each gene in the gene set to detect enriched TF motifs or ChIP-seq peaks, using databases of nearly 10.000 TF motifs and 1000 ChIP-seq data sets or “tracks.” Next, it associates enriched motifs and tracks with candidate TFs and determines the optimal subset of direct target genes (Verfaillie et al., 2014).
Identification of differentially expressed genes
RNA-seq tissue samples (n = 1020) from TCGA, including healthy and tumor lineages, were analyzed with the implementation of differential transcriptional regulome (DIFFREG) methodology. RNA-seq count expression values were used for the identification of differentially expressed genes (DEGs), while RNA-seq FPKM-normalized transcriptome datasets were used for the DIFFREG methodology. Among those, 610 samples (72 control and 538 tumors) for ccRCC, 321 samples (32 control and 289 tumors) for pRCC, and 89 samples (24 control and 65 tumors) for chRCC were examined and compared concerning the differential regulome profiles.
Each dataset was statistically analyzed to identify DEGs independently. DEGs were identified from RNA-seq count expression values using the DESeq algorithm (Anders and Huber, 2010), which implements a negative binomial distribution-based method in which mean and variance are linked by local regression to detect DEGs. Analyses were performed in R/Bioconductor, and statistical significance was determined using log2 (fold change) >1 and adjusted p-value criteria (p < 0.05). Benjamini-Hochberg's method was used as a control for the false discovery rate.
Prediction of differential regulome
The DIFFREG method uses three datasets as input: (1) List of DEGs for the corresponding disease, (2) a transcriptome dataset of tissues, including healthy and control samples, and (3) regulatory interaction data (TF-target gene associations). The algorithm first creates a new TF-target gene list containing only DEGs in the target gene column and then performs all subsequent calculations using this new list. DIFFREG calculates the correlation between the expression levels of TFs and target genes in both healthy and diseased states.
It creates groups by taking each TF and its target genes after the correlation calculations. It then performs a significance test (Student's t-test or Mann–Whitney U test) and calculates the p-value over the correlation values for each group between healthy and diseased states. TFs of groups with a p-value <0.05 were identified as key TFs in the disease condition. These gene modules are determined as highlighted groups because the regulation of these target genes changes under different conditions (control vs. diseased). Consequently, DIFFREG estimates the extent to which each TF-gene interaction is perturbed between disease and control cases (Fig. 2). All steps of the algorithm were implemented in the R language.

Workflow of the DIFFREG methodology. The algorithm first creates a new TF-target gene list containing only DEGs in the target gene column and then performs all subsequent calculations using this new list. DIFFREG calculates the correlation between the expression levels of TFs and target genes in both healthy and diseased states. It creates groups by taking each TF and its target genes after the correlation calculations. It then performs a significance test (Student's t-test or Mann–Whitney U test) and calculates the p-value over the correlation values for each group between healthy and diseased states. TFs of groups with a p-value <0.05 were identified as key TFs in the disease condition. DEGs, differentially expressed genes.
Pathway enrichment analyses
To in-depth analyze the highlighted TF-target gene modules at the functional level, Kyoto Encyclopedia of Gene and Genome (KEGG) pathway analyses were performed using Database for Annotation, Visualization, and Integrated Discovery (DAVID), which is a web-based online bioinformatics tool that aims to provide a comprehensive pool of functional annotation tools for the researchers to figure out the molecular mechanisms associated with extensive gene lists. p < 0.05 was considered statistically significant (Dennis et al., 2003).
Results
Defining the differential regulome
It is important to distinguish between healthy and afflicted tissues to understand the biological mechanism behind illnesses. The ability to simultaneously consider thousands of genes, thanks to gene expression data, has changed the research of diseases. Therefore, it is not unexpected that a number of recent articles explicitly take gene expression data into account in relation to biomolecular networks.
Although there are mutations or post-translational modifications, which can modify the function of TFs without changing their expression level, the majority of studies on biomarkers and therapeutic targets concentrate on the expression level of genes. This could lead to the omission of TFs that are important in illness. Because of these limitations, it is necessary to shift the focus of disease research from differential gene expression to differential networking. To this end, we applied, in this study, the DIFFREG approach to identify and prioritize the key transcriptional regulators that have a strong impact on the expression of a specific group of genes for each RCC subtype (Fig. 1). The DIFFREG strategy looks for significant TFs, including those that are not DEG, but are extremely important in the onset and development of disease.
Prediction of differential regulome
To gain a deeper understanding of molecular mechanisms that distinguish between subtypes, it is necessary to identify the regulatory patterns that are different for each phenotype. To this end, we designed a method that predicts key TFs and differential regulome networks between healthy and disease conditions. One thousand twenty RNA-seq tissue samples were analyzed using a DIFFREG method that combines transcriptome data with human regulatory data to analyze the different regulations between healthy and diseased phenotypes. Six hundred ten samples (72 control and 538 tumors) for ccRCC, 321 samples (32 control and 289 tumors) for pRCC, and 89 samples (24 control and 65 tumors) for chRCC were analyzed and compared in terms of differential regulome profile. DIFFREG estimates the extent to which each TF-gene interaction is perturbed between disease and control cases (Fig. 2).
Three datasets are used as input for the differential regulome method: (1) Regulatory interaction data, (2) a transcriptome dataset of tissues, (3) and a list of DEGs for the associated disease. Before performing any further calculation, the method first generates a new TF-target gene list with only DEGs in the target gene column. In both healthy and diseased conditions, DIFFREG determines the correlation between the expression levels of TFs and their target genes.
Following the correlation calculations for both healthy and diseased samples, groups are created for each TF by taking that TF and its target genes. Following that, a significance test is run (Student's t-test or Mann–Whitney U test), and the p-value is determined for each group's correlation values between the healthy and diseased states. p-Values of <0.05 were used to identify the important TFs in the disease condition. Because the regulation of these target genes varies depending on the situation, these gene modules were chosen as highlighted groupings. DIFFREG thus calculates the degree of each TF-gene interaction perturbation between disease and control cases (Fig. 2).
Key TFs
DIFFREG algorithm provided the list of TFs whose correlation with its targets was significantly different between tumor and control and their target genes for each subtype (Table 1 and Supplementary File S1). For the ccRCC profile, we identified 820 differential interactions between 48 major TFs and 113 target genes (Fig. 3). The differential regulatory pattern of pRCC showed 795 altered interactions between 31 TFs and 140 target genes (Fig. 3). In chRCC, 404 altered interactions were observed between 37 TFs and 67 target genes (Fig. 3). The major TFs and their target genes were also detected for each subtype considering their cancer specificity (Fig. 3).

Key TFs and their target genes were demonstrated for each subtype, also considering their cancer specificity.
Key Transcription Factors Identified by the Differential Transcriptional Regulome Algorithm for Each Subtype
ccRCC, clear cell renal cell carcinoma; chRCC, chromophobe renal cell carcinoma; pRCC, papillary renal cell carcinoma; TFs, transcription factors.
Since their target genes were enriched in two prominent ccRCC-related pathways—focal adhesion and steroid hormone synthesis—MYB and MYBL2 TFs were highlighted as significant TFs. Another important TF that differentially regulates the CYP7A1 gene in the context of ccRCC has been identified as the androgen receptor (AR). PDX1, MNX1, and PAX4 are TFs that enriched in maturity-onset diabetes of the young pathway and conspicuous TFs whose interaction with target genes was changed in the disease state for chRCC (Fig. 3E). GATA2, GATA4, and SHOX2 were shown to be the common TFs targeting all three genes (HTR4, GNGT1, and KCNN2) in the serotonergic synaptic pathway, which was found as disrupted in chRCC (Fig. 3F).
The TFs E2F4 and interferon regulatory factor 8 (IRF8) were discovered to be shared essential TFs for the ccRCC and pRCC subtypes. The DIFFREG method identified ARNTL, ARX, MNX1, POU5F1 (OCT4), and ZNF503 as common TFs for ccRCC and chRCC. Using the DIFFREG method, it was discovered that SOX4 is a mutually important TF for both pRCC and chRCC.
Key pathways
The KEGG pathways of target genes were analyzed using DAVID to find out functional annotations associated with target genes (Fig. 4). The DAVID online software analyzed the potential pathways of target genes with p-values <0.05. The target genes were mainly involved in focal adhesion (p = 3.2 × 10−2) and steroid hormone synthesis (p = 3.6 × 10−2) in the ccRCC. Functional enrichment analysis of target proteins in the chRCC regulatory network indicated several biological pathways mainly associated with insulin secretion (p = 1.8 × 10−3), maturity-onset diabetes of the young (p = 7.2 × 10−3), circadian entrainment (p = 3.1 × 10−2), and serotonergic synapse (p = 4.1 × 10−2). Proximal tubule bicarbonate reclamation (p = 2.1 × 10−5) and aldosterone-regulated sodium reabsorption (p = 3.2 × 10−2) were the most affected pathways in regard to regulatory network in pRCC cases.

The associated KEGG pathways of the target genes. The target genes were mainly involved in focal adhesion and steroid hormone synthesis in the ccRCC profile. Functional enrichment analysis of target proteins in the chRCC regulatory network indicated several biological pathways mainly associated with insulin secretion, maturity-onset diabetes of the young, circadian entrainment, and serotonergic synapse pathways. Proximal tubule bicarbonate reclamation and aldosterone-regulated sodium reabsorption pathways were the most affected pathways in regard to regulatory network in pRCC cases. KEGG, Kyoto Encyclopedia of Gene and Genome.
Discussion
Cancer biology is a culmination of multiple and interacting pathways and molecules that are under complex regulation. New approaches that consider not only particular molecular targets and pathways but also their differential regulation would offer new strides toward a deeper understanding of cancer biology, diagnosis, prognosis, and clinical management. A regulatory lens on human biology in health and disease can be achieved by harnessing the high-throughput omics technologies and systems biology (Aydin et al., 2021; Emmert-Streib et al., 2014; Hsieh et al., 2018; Kori et al., 2021).
However, most studies on biomarkers and drug targets focus on the expression level of genes, independent of mutations or post-translational modifications, which can alter the function of TFs without changing their expression level. This may result in overlooking TFs that play critical roles in disease. Because of these limitations, we applied, in this study, the DIFFREG approach to identify and prioritize the key transcriptional regulators that have a strong impact on the expression of a specific group of genes for each RCC subtype.
Key TFs and their target genes were demonstrated for each RCC subtype, also considering their subtype specificity in Figure 3. The target genes were used in the enrichment analysis to identify the disrupted signaling pathways for each subtype. The tumor specificity of the enriched pathways varied by subtype (Fig. 4). Functional enrichment analysis of proteins in the ccRCC regulome network indicated several biological pathways mainly related to focal adhesion and steroid hormone synthesis (Fig. 4). Five genes, including TNC, VASP, PDGFRB, ACTN4, and PPP1CB, were found to be implicated in the focal adhesion pathway.
Regulation of these five target genes by MYB and MYBL2 TFs was found to be differential in ccRCC patients compared with healthy samples (Fig. 3B). The MYB gene family was significantly associated with poor prognosis, proliferation, invasion, and aggressiveness of various tumors, including ccRCC (Cicirò and Sala, 2021; Sala and Ciciro, 2021). In light of the above, the molecular mechanism underlying the promoting role of the MYB gene family in tumor aggressiveness, metastasis, and proliferation could be explained by the differential regulation of genes (TNC, VASP, PDGFRB, ACTN4, and PPP1CB) involved in the focal adhesion pathway by members of the MYB gene family in ccRCC patients.
Steroid hormone synthesis was found to be another important enriched metabolic pathway in ccRCC. Steroid hormones and their receptors play an important role in normal renal biology. Three genes (UGT2B11, CYP7A1, and HSD17B8) enriched in the steroid hormone synthesis pathway were found to be dysregulated in the differential regulatory network of ccRCC. MYB and MYBL2 were determined to be key TFs that also function as differential regulators of these genes in this pathway. The AR was found to be another prominent key TF that differentially regulates the CYP7A1 gene in the case of ccRCC. Our previous study, in which we have applied a reporter features algorithm (Caliskan et al., 2020), also has revealed AR as a significant TF in ccRCC cases.
Some studies associate AR with RCC types, but they indicate that the function of AR in RCC needs to be further investigated to establish a proper correlation between AR and RCC progression (Bennett et al., 2014). The CYP7A1 protein catalyzes the first reaction of cholesterol degradation in the liver, in which cholesterol is converted to bile acids (Chiang and Ferrell, 2020). We found that AR and CYP7A1 have an activated interaction in a disease situation, leading to the downregulation of CYP7A1. This situation may lead to the accumulation of cholesterol. ccRCC is characterized by the accumulation of cholesterol, cholesterol esters, and other neutral lipids (Drabkin and Gemmill, 2012). All in all, we may conclude that the accumulation of cholesterol during the progression of ccRCC occurs through the downregulation of the CYP7A1 gene by AR (Fig. 3C).
Analysis of the functional enrichment of proteins in the chRCC regulome network indicated several biological pathways mainly associated with insulin secretion, maturity-onset diabetes of the young, circadian entrainment, and serotonergic synapse (Fig. 4). In our previous study, the insulin secretion pathway was enriched for chRCC (Caliskan et al., 2020), as well and our new findings support previous results. It has been reported that diabetes and RCCs have been listed among the risk factors for hyperglycemia (Caliskan et al., 2020). Another notable study indicated that type 2 diabetes was found to be associated with a significantly higher risk of RCC in women, independent of obesity, hypertension, and smoking. The association was strongest for non-ccRCC (Graff et al., 2018). However, they did not specify the type of non-ccRCC, and, based on our current and previous studies, we claim that chRCC differentiates from other subtypes due to its strong association with diabetes.
PDX1, MNX1, and PAX4 are TFs enriched in maturity-onset diabetes of the young pathway and conspicuous TFs whose interaction with target genes were changed in the disease state (Fig. 3E). These results suggest that genes related to diabetes are involved in the activity of the regulome network in chRCC patients. Another significantly enriched pathway in chRCC is the circadian entrainment pathway. Genes involved in the circadian clock system play an important role in biological processes such as DNA damage and repair, cell proliferation, and metastasis, thus influencing tumorigenesis and progression (Zhou et al., 2020). In addition, there is growing evidence of the importance of circadian clock genes in the diagnosis, treatment, and prognosis of cancer. Another prominent pathway is a serotonergic synapse in chRCC patients. Several studies have shown the growth-promoting effect of serotonin on various types of cancer and carcinoids (Balakrishna et al., 2021).
The common TFs targeting all three genes (HTR4, GNGT1, and KCNN2) in the serotonergic synaptic pathway were determined to be GATA2, GATA4, and SHOX2 (Fig. 3F). The TF GATA2 has been associated with poor prognosis of RCC subtypes (Peters et al., 2015), but to our knowledge, the mechanism of action has not been elucidated. Based on our findings, it can be suggested that GATA2 promotes tumor progression through alterations in the regulation of the serotonergic synaptic pathway. Moreover, in the findings of reporter features algorithms, GATA2 was also identified as a major TF in chRCC. The association with GATA4 and SHOX2 TFs and RCC has not been reported in the literature, and to our knowledge, it was reported in this study for the first time.
Pathway enrichment analysis indicated that the proximal tubule bicarbonate reclamation and aldosterone-regulated sodium reabsorption were the most influenced pathways in terms of regulatory networks in pRCC cases (Fig. 4). Among the altered pathways of the pRCC regulome network, one of the prominent pathways is proximal tubule bicarbonate reclamation, which is strongly associated with proximal tubule structure and has been previously associated with pRCC (Huang et al., 2017). Another striking result of the pRCC regulome network enrichment is the disruption of the aldosterone-regulated sodium reabsorption pathway.
Aldosterone stimulates sodium transporters in the aldosterone-sensitive distal nephron, resulting in increased sodium reabsorption, a process that also permits potassium secretion (Alexander and Bockenhauer, 2016) in healthy functioning kidneys (Solocinski and Gumz, 2015). Specific pathologies, which prevent bicarbonate reclamation and lead to metabolic acidosis, are referred to as renal tubular acidosis. It is known that these forms of renal tubular acidosis are typically accompanied by hypokalemia. Consequently, it can be concluded that these two pathways are highly connected and deterioration of the aldosterone-regulated sodium reabsorption pathway may lead to renal tubular acidosis by resulting in disruption of the bicarbonate reclamation pathway in pRCC patients.
E2F4 and IRF8 TFs were found as mutual key TFs for both ccRCC and pRCC. Our previous study, which we have applied a reporter features algorithm, also has found E2F4 as a significant TF in ccRCC. E2F4 is known as a negative regulator of VHL in the human placenta (Alahari et al., 2018). It is known that loss of the VHL gene has been strongly associated with the formation and proliferation of ccRCC through its downstream targets. A study focused on the prognostic capability of the E2F family has shown that E2F4 can be used as a prognostic marker for survival of ccRCC patients (Zhang et al., 2021), but E2F4 has not been associated with the pRCC subtype specifically.
Considering the proliferative and prognostic role of E2F4 in various tumor types and our findings, we argue that it may play an important role in the formation and proliferation of pRCC by negatively regulating its downstream targets. IRF8 has been proposed as a predictor of progression and patient survival in RCC (Muhitch et al., 2019), but it has not been associated with the pRCC subtype specifically. Considering our findings, it can be concluded that IRF8 has been implicated in both ccRCC and pRCC formation and progression.
ARNTL, ARX, MNX1, POU5F1, and ZNF503 were determined as a mutual key TF for both ccRCC and chRCC through the DIFFREG algorithm. The stem cell factor POU5F1 has been correlated with advanced tumor stage and poorer overall survival in RCC patients (Siebenthall et al., 2019). The ARX gene is a homeobox-containing gene expressed during development and functions in the nervous system and brain development. However, intriguing studies of the role of ARX orthologs in diverse models have suggested that it is critical for important developmental processes such as proliferation and cell differentiation (Friocourt et al., 2006) but has not been correlated with RCC. This study is the first time that ARX and ZNF503 are proposed as potential biomarkers for both ccRCC and chRCC.
SOX4 was found as a mutual significant TF for both pRCC and chRCC through the DIFFREG approach. The study focusing on the role of SOX4 in RCC indicated that specific knockdown of SOX4 in renal cancer cell lines significantly suppressed the migration and invasion of cancer cells; specific overexpression of SOX4 in renal epithelial cell line markedly promoted the migration and invasion of the cell line (Ruan et al., 2017).
Our previous study, in which we have applied reporter features algorithm, (Caliskan et al., 2020) also has revealed significant TFs specific to RCC subtypes. AR, E2F4, ESR1, and NR1I2 are common highlighted TFs that both DIFFREG and reporter feature algorithm gave a result for ccRCC cases. ESR1 is an estrogen receptor that is known as an oncogene and has been strongly associated with RCC.
There is significant evidence that these receptors promote the development and progression of a variety of cancers, with new research suggesting that the human kidney could be one of them. This is supported by clinical evidence, such as a large sex difference in RCC incidence, with men having double the rate of women. Furthermore, the study focusing on hormone therapy of RCC in hamsters led to the concept that some kidney cancers are hormone dependent (El-Deek et al., 2018). E2F4 and YBX1 became prominent in both DIFFREG and reporter feature algorithms in pRCC samples. The relationship of E2F4, YBX1, and pRCC has been discussed in detail in the previous paragraphs. Only GATA2 was determined to be mutual for chRCC in both the DIFFREG and reporter feature algorithms.
Conclusion
Understanding the dynamics of cellular systems and disease molecular pathways requires the identification of differential regulators. Most biomarker or drug delivery molecule research focuses on gene expression levels, ignoring mutations or post-translational modifications, which can change TF function without changing their expression levels. Considering these arguments, we developed the DIFFREG algorithm, which allows us to identify important TFs that may not be DEG. The DIFFREG algorithm was applied to three subtypes of RCC in this study; however, the algorithm is applicable to transcriptome data of any illness or to other organisms for which TF-target gene data are available. These analyses address the question of which TF-gene interactions are perturbed between the disease and normal conditions and which regulators tend to alter their targets.
The differential regulome methodology focuses on the differential regulation between TFs and their target genes rather than differential expression. We focused on functional enrichment analyses and highlighted signaling pathways when evaluating the results of this methodology. Focal adhesion and steroid hormone synthesis were highlighted in ccRCC cases, while proximal tubule bicarbonate reclamation and aldosterone-regulated sodium reabsorption were emphasized in pRCC patients. Insulin secretion, maturity-onset diabetes of the young, circadian entrainment, and serotonergic synapse were found as most influenced ones in chRCC disease.
The discovery of these subtype-specific biomarkers may play an important role in elucidating the molecular mechanisms of RCC development and translating the knowledge of the genetic basis of RCC into the clinic. Furthermore, these indicators may aid oncologists in making the best diagnosis and prognosis management decisions.
Footnotes
Authors' Contributions
A.C.: Writing—original draft; conceptualization; software; methodology; and visualization. K.Y.A.: Conceptualization; writing—review, and editing; methodology; and supervision. All authors read and approved the final version of the article.
Author Disclosure Statement
The authors declare they have no conflicting financial interests.
Funding Information
No specific funding was received for this work.
Abbreviations Used
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
