Abstract
Aims:
Diabetic nephropathy (DN) has become a serious public health problem. Genetic factors are involved in the pathogenesis of DN, but the exact mode of inheritance is still unknown. Genome-wide linkage scans (GWLS) have produced inconclusive or inconsistent results. In an effort to test consistency and provide more conclusive results, we applied a heterogeneity-based genome search meta-analysis (HEGESMA) to GWLS regarding DN.
Materials and Methods:
We combined results from eight GWLS in the primary analysis and nine GWLS for a conditional analysis about DN for both diabetes types, as well as in each type of diabetes and ethnicity in subgroup analyses.
Results:
HEGESMA identified cytogenetic locations that rank highly on average in terms of linkage statistics across multiple genome scans, taking into consideration the magnitude of heterogeneity of the results between scans. Main analyses: Our meta-analysis identified 13 cytogenetic locations (bins) with statistical significance (Prank ≤ 0.05), 11 of which were significant in both weighted and unweighted analyses located on chromosomes 1q, 3q, 4p, 5q, 7q, 15q, 16p, 17q, 19q, and 22p. In addition, four novel regions (5q11.2-5q14.3, 5q23.2-5q34, 17q24.3-17q25.3, and 22q12.3-22q13.3) were identified. Seven bins on chromosomes 4p, 5q, 7q, 15q, 22p, and 22q were common between both types of diabetes and in all subgroup analyses, in addition 5q14.3-5q23.2 was significant across all analyses. Conditional analyses: meta-analysis identified nine different cytogenetic locations, among which 7p22.3-7p15.3 was significant only in type 2 diabetes mellitus conditional analysis. Ethnicity subgroup analyses identified 11 different cytogenetic locations, 5 out of which are novel findings. However, none of the chromosomal regions reached genome-wide statistical significance (Prank < 0.00042).
Discussion:
This meta-analysis provides evidence for linkage for nine novel cytogenetic regions that should be further investigated for genes that confer susceptibility to DN.
Introduction
Diabetic nephropathy (DN) is a serious microvascular complication of either type 1 (T1D) or type 2 diabetes (T2D) characterized by progressive clinical course with loss of kidney function. It constitutes the most frequent cause of end-stage renal disease (ESRD) requiring chronic renal replacement therapy (Saran et al., 2016). Main risk factors of DN are poor glycemic control and long duration of diabetes mellitus (Dronavalli et al., 2008). However, patients with good glycemic control may still develop nephropathy, implying the contribution of genetic background, as well as epigenetic mechanisms (Rich, 2006; Kato and Natarajan, 2014). This fact and a proven significant familial clustering of DN (Seaquist et al., 1989; Borch-Johnsen et al., 1992; Quinn et al., 1996) clearly implicate that genetically defined predisposition is involved in the pathogenesis of nephropathy in both DM types. However, the genetic component has not yet been deciphered (Zintzaras et al., 2007a, 2009; Stefanidis et al., 2009, 2014, 2018; Tziastoudi et al., 2017).
Genetic association and family-based studies have shown that a significant number of variants are associated with DN. However, both categories of studies have produced inconclusive results so far (Mooyaart et al., 2011; Palmer and Freedman, 2012). Genome-wide linkage scan (GWLS) is an approach for identifying chromosomal regions that may be linked to complex diseases, such as DN. GWLS in DN have shown that genetic components play a role in the disease, but the exact inheritance pattern remains to be established. The genome searches on DN have produced inconclusive inferences; as linkage signals tend to be rather low in each study, the number of families is relatively small and the individual genome scans have identified evidence for linkage in different chromosomal regions.
Genome-wide scan meta-analysis (GSMA) is a method for synthesizing the results of genome searches to identify loci linked to DN (Wise et al., 1999; Zintzaras and Ioannidis, 2005a, 2005b). Zintzaras and Ioannidis (2005a, 2005b) developed an extension of the GSMA, the heterogeneity-based genome search meta-analysis (HEGESMA), which also evaluates the heterogeneity between different genome scans. HEGESMA is the most established method for meta-analysis of genome scans and it has been applied to genome scans of several complex diseases (Zintzaras and Ioannidis, 2005a, 2005b, 2008; Zintzaras and Kitsios, 2006; Trikalinos et al., 2006; Zintzaras et al., 2006; Zintzaras et al., 2007b; Bouzigon et al., 2010; Rao et al., 2011). In the present article, we performed a HEGESMA to provide conclusive evidence on regions linked to DN as a discreet trait (Zintzaras and Ioannidis, 2005a, 2005b).
Materials and Methods
Eligible whole genome scans
All GWLS for DN published before March 2018 were considered in the meta-analysis. We searched the PubMed database for English articles referred to humans. As a search criterion, we used the combination of the following terms: “genome search,” “genome scan,” “genome screen,” “logarithm of odds (LOD) score,” “linkage,” “nonparametric linkage (NPL) score,” “genome-wide” or “genome-wide linkage analysis,” and “diabetic nephropathy.”
The meta-analysis included GWLS fulfilling the following criteria: (1) subjects were humans (2) with DN or ESRD due to diabetes; (3) the required linkage score data were available or were extractable from published graphs; and (4) there was no sample overlapping.
All potentially eligible articles were read to assess their appropriateness for meta-analysis and references of eligible articles were also perused. In studies with overlapping cases, the largest study or the most recent one, including extractable data, was included. Scans restricted to specific candidate regions or chromosomes were excluded.
Definition of outcome
DN was defined on the basis of a long-standing diabetes mellitus, either T1D or T2D, with macroalbuminuria (urinary albumin excretion rate ≥200 μg/min or ≥300 mg/24 h, equivalent to an overt glomerular proteinuria) (Eknoyan et al., 2003) and/or chronic renal insufficiency (stage 3-5 chronic kidney disease) in the absence of nondiabetic renal disease (KDIGO, 2013). Cases were defined similarly in the included studies. Studies in which diabetic probands were selected regardless of the degree of renal function or albuminuria were rejected. Excluded from the meta-analysis were also GWLS with quantitative surrogate markers for DN as estimated glomerular filtration rate (eGFR), albuminuria, and serum cystatin.
Data extraction
For each study, the following information was extracted: first author, year of publication, ethnicity of study population, type of diabetes, number of families, selection criteria, number of markers and intermarker interval, linkage statistic, type of statistical analysis, metric and software of linkage analysis, and finally, the locations with chromosomal regions corresponding to maximum LOD score, as proposed by Lander and Kruglyak (1995).
Genome scan data, if not available, were derived from published figures by digitizing graphs of linkage scores per distance in each chromosome using Engauge Digitizer (version 2.12, Mark Mitchell, 2002). Regarding included scans, the corresponding investigators were asked to provide whole genome results to avoid a special form of publication bias when results are presented only for regions that show some evidence for linkage.
Genome scan meta-analysis and heterogeneity testing
GSMA starts by splitting the chromosomes into genetic regions (bins) of approximately equal length; usually, each bin has a width of 30 cM giving 120 bins for the whole genome. The 120th bin refers to the highest rank. For nomenclature purposes, bin c.n is the nth bin on chromosome c. For each genome scan, the most significant result within the bin is recorded. Then, for each scan, the bins are ranked according to their significance and the ranks for each bin, weighted or not by a specific factor, are summed across scans. The significance of the average rank of each bin is assessed empirically against the distribution of average ranks.
When a bin has a high summed rank, this is considered evidence for linkage. Equal test statistics for several bins within a study were assigned as tied ranks and negative linkage scores were ranked as 0.
Heterogeneity between studies for each bin was assessed using Q statistic, which is defined as the sum of the squared deviations of each study's bin rank from the mean of the ranks. In GSMA, low between-study heterogeneity indicates consistency of study results for the same bin and the presence of low heterogeneity for a specific bin with high ranks can be interpreted as further supportive evidence for the significance of this bin. The statistical significance of the average rank and the Q metric were assessed using a Monte Carlo method (Zintzaras and Ioannidis, 2005a, 2005b). In this method, in a run, the ranks of each study are randomly permuted and the simulated average rank and Q metric are calculated. The procedure is repeated for 50,000 runs and a null distribution for the average rank, and for the Q metric, is constructed. The significance level (Prank) of the average rank of bins against the null distribution of average ranks is the percentage of simulated average ranks greater or equal than the observed. The statistical significance level (PQ) for low heterogeneity is the percentage of simulated metrics less or equal than the observed. Moreover, a Monte Carlo test has been performed, generating null distributions separately for each bin, considering only the simulated distributions of the Q metric (Qadjusted) for bins with the neighboring simulated average rank (±2) as the bin being considered each time (Zintzaras and Ioannidis, 2005a).
In this study, GSMA (Prank and Porder) and heterogeneity testing were performed unweighted and weighted (Zintzaras and Ioannidis, 2005a, 2005b).
Software and analytic details
Main analysis considered only the genome scans providing results of all bins, while conditional analysis considered one more genome scan, which had evaluated all chromosomes, but results were available only for 22 bins. We also conducted subgroup analyses by type of diabetes and ethnicity. The different ethnicities were classified as European Americans/Europeans/Whites/Caucasians, African Americans, and American Indians.
We performed both unweighted and weighted analyses. In weighted analyses, we weighted bin ranks by √(number of families × number of markers) and then, the weights were scaled to sum up to 1. Weighting ensures that smaller, lower powered studies will have a smaller influence on the overall results. As the optimal weighting for GSMA is unclear, we also performed unweighted analyses. Inferences were performed at the a = 0.05 level. Thus, we considered a p-value ≤0.05 of suggestive significance, and a p-value ≤0.00042 of genome-wide significance (using Bonferroni corrections for 120 comparisons), after adjustment for multiple comparisons. Graph digitizing was performed with Engauge Digitizer (version 2.12, Mark Mitchell, 2002). Genetic distances were obtained using chromosomal maps from the Marshfield Center for Medical Genetics (http://research.marshfieldclinic.org/genetics/GeneticResearch/compMaps.asp). The cytogenetic locations of bins were determined using the Map Viewer and the identification of candidate genes harbored in these regions using the HuGE Navigator and the term “diabetic nephropathies” (last database update on February 2018). The evaluation of the significance of average ranks and the significance of heterogeneity was performed using HEGESMA software (http://biomath.med.uth.gr) (Zintzaras and Ioannidis, 2005a, 2005b).
Results
Literature search of PubMed retrieved 228 articles, out of which 20 were linkage scans for DN or ESRD due to diabetes and 5 articles were retrieved from the references (Fig. 1). Finally, only six articles (nine studies) met the inclusion criteria (Imperatore et al., 1998; Bowden et al., 2004; Osterholm et al., 2007; Rogus et al., 2008; Igo et al., 2011; Wessman et al., 2011). Among the aforementioned scans, nine were partial linkage scans (Yu et al., 1996; Bowden et al., 1997; Freedman et al., 1997; Moczulski et al., 1998; Yu et al., 1999; Freedman et al., 2002; Vardarli et al., 2002; Iyengar, 2003; McDonough et al., 2009), whereas 10 articles referred to surrogate markers for DN as eGFR, albuminuria, and serum cystatin (Covic et al., 2001; Freedman et al., 2005; Krolewski et al., 2006; Placha et al., 2006; Chen et al., 2007; Iyengar et al., 2007; Puppala et al., 2007; Freedman et al., 2008; Schelling et al., 2008; Thameem et al., 2013). The study of Bowden et al. (2004), which presented raw data only for 22 bins, was considered only in conditional analyses. A more detailed justification for the exclusion of the remaining eleven GWLS is presented in Table 1.

Flowchart of retrieved studies for meta-analysis and studies excluded, with justification of reasons. MALD, mapping by admixture linkage disequilibrium; RCT, randomized controlled trial; TDT, transmission disequilibrium testing.
Genome-Wide Linkage Scans on Diabetic Nephropathy Retrieved from PubMed
Both included and excluded genome searches from meta-analysis and the reason of exclusion are shown (listed from earlier to most recent).
eGFR, estimated GFR; ESRD, end-stage renal disease; GFR, glomerular filtration rate; T2D, type 2 diabetes; T2DM, TD mellitus; UAER, urinary albumin excretion rate.
Details on the analyzed studies are shown in Table 2. All included studies used similar diagnosis criteria for DN. They were conducted in populations with different ethnicities and different type of diabetes. According to Igo et al. (2011), the probands were considered to have T2D, as 90-95% of probands were believed to have T2D. The linkage data of Igo et al. (2011) refer to four different studies. Upon request for linkage data, only Igo et al. (2011) provided the raw data of the four ethnic groups. Regarding the remaining studies, the presented graphs showing linkage scores per distance for all chromosomes were digitized for the calculation of linkage statistic. The unpublished data set was requested from Bowden et al. (2004), but it was not available. All studies analyzed only the autosomes.
Characteristics of the Genome-Wide Linkage Studies Included in the Meta-Analysis
ACR, albumin:creatinine ratio; ASP, affected sib pairs; DSP, discordant sib pairs; LOD, logarithm of odds; NPL, nonparametric linkage; NR, not reported.
All studies, except that of Igo et al. (2011), implemented the LOD score as a metric of linkage statistics. The study of Igo et al. (2011) regarding Mexican Americans was the most weighted study (w = 0.24), whereas the least weighted study was that of Imperatore et al. (1998) (w = 0.03). The regions with suggestive linkage identified from each scan are shown in Table 2. This meta-analysis provides the results of analysis with both DM types (Fig. 2) and subgroup analyses regarding the type of diabetes with and without the inclusion of the study of Bowden et al. (2004), as well as ethnicity subgroup analyses (Tables 3 and 4).

Unweighted (open circles) and weighted (filled circles) average ranks from eight diabetic nephropathy genome scans with 120 bins regarding the analysis with both types of diabetes. Bins with significant Prank in unweighted or weighted analysis are above the line at p ≤ 0.05.
Diabetic Nephropathy Genome Scan Meta-Analysis Results—Main and Conditional—in Unweighted and (in Parentheses) Weighted Analyses
AA, African Americans; AI, American Indians; EA, European Americans; MA, Mexican Americans; na, non-applicable.
Diabetic Nephropathy Genome Scan Ethnicity Subgroup Meta-Analysis Results in Unweighted and (in Parentheses) Weighted Analyses
Main analysis
In all analyses regarding the eight studies, 13 different bins were found significant (Prank ≤ 0.05) with both DM types or subgroup analyses, weighted or unweighted analyses. More specifically, in analysis with both DM types, six cytogenetic locations (4p14-4q13.3, 5q14.3-5q23.2, 5q23.2-5q34, 15p13-15q11.2, 22p13-22q12.3, and 22q12.3-22q13.33) were significant in weighted or unweighted analyses (Prank ≤ 0.05), out of which bins 4p14-4q13.3, 5q14.3-5q23.2, and 15p13-15q11.2 were significant in both weighed and unweighted analyses (Prank ≤ 0.05). In subgroup analysis regarding T1DM type, nine cytogenetic locations (1q43-1q44, 3q21.2-3q25.32, 5q11.2-5q14.3, 5q14.3-5q23.2, 16p12.3-16q12.2, 17q24.3-17q25.3, 19q13.33-19q13.43, 22p13-22q12.3, and 22q12.3-22q13.3) were significant in weighted or unweighted analyses (Prank ≤ 0.05), out of which bins on chromosomes 1q, 3q, 16p, 17q, 19q, and 22p were significant in both weighed and unweighted analyses (Prank ≤ 0.05). In subgroup analysis regarding type 2 diabetes mellitus (T2DM), five cytogenetic locations (4p14-4q13.3, 5q14.3-5q23.2, 5q23.2-5q34, 7q22.3-7q34, and 15p13-15q11.2) were significant in both weighed and unweighted analyses (Prank ≤ 0.05). Cytogenetic location 7q22.3-7q34 is identified significant only in T2DM analysis. However, no bin reached the significance threshold after adjustment for multiple comparisons in either analysis with both DM types or subgroup analyses (Prank > 0.00042).
Regarding ordered analyses, in analyses with both DM types, no bin was significant, whereas in T1DM subgroup analysis, only 22p13-22q12.3 was significant (Porder < 0.05). In T2DM subgroup analysis, no bin was significant (Porder < 0.05) (Table 3).
In low heterogeneity testing, chromosomal regions 4p14-4q13.3 and 5q14.3-5q23.2 were statistically significant with left-sided PQ/Ha/B < 0.05 (Table 3). In analyses with both DM types, 16p12.3-16q12.2 was significant with PHa < 0.05 in T1DM subgroup analyses and 5q14.3-5q23.2 and 5q23.2-5q34 in T2DM subgroup analyses with PQ/Ha/B < 0.05, respectively. Chromosomal region 5q14.3-5q23.2 in analysis with both DM types and regions 5q14.3-5q23.2 and 5q23.2-5q34 were significant in both weighted and unweighted analyses regarding all metrics of heterogeneity (PQ/Ha/B < 0.05).
Regarding bins repeatedly identified as significant across analyses with both DM types and subgroup analyses, only chromosomal location 5q14.3-5q23.2 was common across all analyses. However, between analysis with both DM types and T1DM subgroup analyses, chromosomal locations 5q14.3-5q23.2, 22p13-22q12.3, and 22q12.3-22q13.3 were common.
Conditional analysis
Meta-analysis of nine genome-wide scans identified nine different cytogenetic locations (4p14-4q13.3, 5q14.3-5q23.2, 5q23.2-5q34, 7p22.3-7p15.3, 7q22.3-7q34, 15p13-15q11.2, 16p12.3-16q12.2, 22p13-22q12.3, and 22q12.3-22q13.33). Two cytogenetic locations, 7p22.3-7p15.3 in T2DM analysis and 16p12.3-16q12.2 in both DM type analyses, were significant only in relevant conditional analyses.
Regarding ordered analyses, only cytogenetic location 5q14.3-5q23.2 in analysis with both DM types showed statistical significance (Porder < 0.05). Similarly, in low heterogeneity testing, this region was statistically significant in both weighted and unweighted analyses (PQ < 0.05).
The genome scan meta-analysis and heterogeneity testing in DN revealed the significance of four novel locations, 5q11.2-5q14.3, 5q23.2-5q34, 17q24.3-17q25.3, and 22q12.3-22q13.3, and replicated the significance of the remaining cytogenetic locations, whereas conditional analysis showed one more cytogenetic region 7p22.3-7p15.3, indicative of linkage with DN.
Regarding the three ethnicity subgroup analyses, in total, 11 new cytogenetic locations were identified significant (Prank ≤ 0.05). More specifically, in Caucasians, 13 cytogenetic locations were identified significant, 3 out of which (1q23.2-1q31.1, 11q13.3-11q22.1, and 17p12-17q21.2) were only in Caucasians. In African Americans, eight cytogenetic locations are identified significant, five out of which (1p35.2-1p32.1, 4q35.1-4q35.2, 7p13-7q11.23, 12p13.2-12q12, and 13q22.1-13q33.1) were only in African Americans. Last, but not least, in American Indians, all regions (1p36.32-1p36.21, 15q14-15q22.32, and 20p13-20p12.2) identified are significant only in American Indians.
Regarding ordered analyses based on ethnicity, only cytogenetic locations 16p12.3-16q12.2 and 17q24.3-17q25.3 in Caucasians showed statistical significance (Porder < 0.05). Similarly, in low heterogeneity testing, only regions 19q13.33-19q13.43 and 22q12.3-22q13.33 in Caucasians, as well as 16p12.3-16q12.2 in African Americans were statistical significant (PQ/Ha/B < 0.05).
The subgroup analyses based on ethnicity revealed some additional novel cytogenetic locations (1p36.32-1p36.21, 1p35.2-1p32.1, 11q13.3-11q22.1, 13q22.1-13q33.1, and 17p12-17q21.2).
Discussion
The meta-analysis of 8 genome scans identified a total of 13 chromosomal regions with evidence for linkage from Prank. Remarkably, 11 chromosomal regions were significant in both weighted and unweighted analyses, analysis with both DM types, or subgroup analysis based on type of diabetes. However, no bin revealed significance at genome-wide level (Prank ≤ 0.00042). For this reason, we focused on bins that reached statistical significance (Prank < 0.05) and were novel findings, or were common among different analyses. For purposes of completeness, meta-analysis was also conducted after the inclusion of the study of Bowden et al. (2004), as it was clearly eligible, but it presented results only for 22 bins. Furthermore, it has been shown that when results are not available for all bins, there is no increase in false-positive results (Forabosco et al., 2007). The conditional analyses were post hoc analyses that should be viewed as exploratory. Novel findings of this study constitute the chromosomal regions that were identified as significant by the meta-analysis, but not by the individual genome scans, as cytogenetic locations 5q11.2-5q14.3, 5q23.2-5q34, 17q24.3-17q25.3, and 22q12.3-22q13.3. Only chromosomal region 22p13-22q12.33 indicated evidence for linkage from Porder statistic.
Heterogeneity testing revealed low unadjusted heterogeneity at chromosomal regions 4p14-4q13.3 and 5q14.3-5q23.2 in analysis with both DM types, as well as regions 16p12.3-16q12.2 in T1DM and 5q14.3-5q23.2 and 5q23.2-5q34 in T2DM analysis, respectively. All of the aforementioned bins were significant in both weighted and unweighted analysis, except bin on chromosome 16p.
Chromosomal region 5q14.3-5q23.2 was the common finding between analysis with both DM types and subgroup analysis and constitutes the most significant region for DN in analysis with both DM types. However, among included studies, only Rogus et al. (2008) had shown evidence of linkage within this chromosomal region, whereas another genome-wide linkage study, which was excluded from the meta-analysis due to improper inclusion criteria, had detected evidence of linkage to serum creatinine in 5q23 (Chen et al., 2007). Chromosome 5q, as well as 7q and 22q, account for the majority of genetic variance of urinary albumin excretion in both diabetic and nondiabetic individuals (Krolewski et al., 2006). PCSK1 is the only gene located in this chromosomal region that has been examined in genetic association studies regarding DN (Jacobsen et al., 2002; De Cosmo et al., 2002).
The most significant region in T2DM-DN is the cytogenetic location 5q23.2-5q34, which constitutes a novel finding. Locations adjacent to this region have shown strong or suggestive evidence of linkage with DN. Rogus et al. (2008) detected evidence for linkage with DN in 5q at 118 cM. Also, Krolewski et al. (2006) found strong evidence for linkage to urinary albumin excretion on chromosome 5 at 69 cM (LOD = 3.4) both in diabetic and nondiabetic individuals. IL4, GPX3, SMAD5, IL12B, IL13, GALNT10, and MFAP3 are located in this region and have been previously examined in genetic association studies (Arababadi, 2010; Vieira et al., 2011; McKnight et al., 2009; Grzegorzewska et al., 2014; Ng et al., 2012).
Another novel cytogenetic location on chromosome 5, 5q11.2-5q14.3, showed marginal significance in subgroup analysis of T1DM (Prank = 0.05). ITGA2 is the only gene located in this region, which has been examined in genetic association studies (Tsai et al., 2001). ITGA2 encodes the alpha subunit of one major platelet membrane receptor for collagen and related proteins exposed to subendothelial and subsequently, it is assumed to be implicated in the accumulation of extracellular matrix (ECM), one of the hallmarks of DN.
Cytogenetic location 17q24.3-17q25.3 is one more novel region detected from meta-analysis. Among the excluded GWLS, only Chen et al. (2007) identified suggestive linkage at 17q24, with CrCl and GFR with LOD score 1.36 and 1.04, respectively. FN3K is the only gene already examined in genetic association studies in this region (Tanhauserova et al., 2014). Advanced glycation end products (AGEs) have great impact on the development and progression of DN, and FN3K is assumed to play an important role in prevention of further glycation processes (Delpierrre et al., 2004; Ott et al., 2014).
Across the cytogenetic region 22q12.3-22q13.33, none of the included studies identified evidence of linkage with DN. Among candidate genes in this region are APOL1, APOL2, APOL3, MYH9, HMOX1, and MIOX. Although APOL1 explains a substantial proportion of nondiabetic end-stage kidney disease in African Americans, less is known about its contribution in DN (Freedman et al., 2010). Polymorphisms in MYH9 also exhibit strong association with nondiabetic kidney disease, but Genovese et al. (2010) found that the statistical association of APOL1 was stronger compared with MYH9 (Freedman et al., 2010).
The most important regions for T1DM-DN are on chromosomes 3q, 19q, and 22p in weighted or unweighted analyses. Chromosome 3q has been identified as a major locus for DN susceptibility genes from many different studies, either linkage scans or association studies. This meta-analysis replicates the significance of chromosomal region 3q21.2-3q25.32, as individual linkage scans had identified evidence of linkage in different locations across the whole region (Imperatore et al., 1998; Osterholm et al., 2007; Rogus et al., 2008; Moczulski et al., 1998; Placha et al., 2006; Krolewski et al., 2006). However, two other linkage scans identified evidence of linkage at neighboring regions (Bowden et al., 2004; Chen et al., 2007). Among candidate genes in this region are MRAS, NPHP3, ACAD11, TRPC1, and AGTR1, which is one of the most studied genes.
This meta-analysis is the first to search for chromosomal regions harboring genes responsible for DN, considering DN as a discreet trait. Therefore, linkage studies specifically for proteinuria or other quantitative surrogate markers of renal function were excluded. The most relevant meta-analysis to our study was the one by Rao et al. (2011), who combined data of GWLS on renal function traits with the difference that almost half of the studies were conducted in general population, whereas our study was conducted only in diseased population.
GWLS about quantitative kidney phenotypes were not included in this GSMA for many reasons. First, although proteinuria and impaired kidney function are major phenotypes of DN that aggregate in families of diabetics and nondiabetics as heritable traits, they are not genetically correlated even in nondiabetics (Placha et al., 2005). This could bias the results. Second, the consideration of DN as a discreet trait provides more consistent inclusion criteria for the meta-analysis, reducing the heterogeneity between studies, as GFR and albumin:creatinine ratio (ACR) (urinary albumin excretion rate) present high variability in their calculation and measurement. The eGFR values depend on the estimating equations and many factors, whereas in single ACR measurements, the results are not reliable. Both traits may not reveal the initial stages of DN, as GFR is high in the hyperfiltration stage, whereas microalbuminuria can be reversible (Glassok et al., 2017). Antihypertensive treatment could also minimize the variability of both phenotypes, undermining the genetic influences. It has been postulated that the development of DN consists of two distinct processes, one associated with glomerula and proteinuria and another with tubules, interstitium, and kidney function (Placha et al., 2005). Therefore, this integrating approach of this meta-analysis could reveal genes and biological processes common to both processes. More specifically, the novel and significant cytogenetic locations identified from this meta-analysis include genes involved in insulin resistance, immune system, transforming growth factor beta 1 and bone morphogenetic protein signaling pathways, oxidative stress, protein glycosylation and AGEs, EC accumulation, and lipid metabolism, as well as cytokinesis and angiogenesis.
The identified locations derived from analyses, including both DM types, could harbor susceptibility genes either of DN or diabetes (T1D or T2D). Similarly, statistically significant locations in subgroup analyses could reflect effects of either susceptibility genes of DN per se or specific genetic effects of the type of diabetes under investigation. However, any common genetic region between T1D or T2D subgroup analyses is more likely to harbor susceptibility genes of DN, not affected by the type of DM. These genes could constitute the common genetic architecture of all types of DN and the future therapeutic targets. More specifically, genes located in region 5q23.2-5q34 and in chromosome 22, as well as genes in linkage disequilibrium with them, could constitute the genetic determinants of DN regardless of the type of diabetes. It is noteworthy to mention that cytogenetic region 7p22.3-7p15.3, identified significant only in conditional analysis of T2DM, could be related with genetic loci that are specific for African Americans with T2DM, as the population sample in the study of Bowden et al. (2004) was from this ethnic group.
Similarly, ethnicity subgroup analyses could reveal susceptibility loci specific for different ethnicities. More specifically, cytogenetic locations 1q23.2-1q31.1, 11q13.3-11q22.1, and 17p12-17q21.2 are identified significant only in Caucasians and locations 1p35.2-1p32.1, 4q35.1-4q35.2, 7p13-7q11.23, 12p13.2-12q12, and 13q22.1-13q33.1 only in African Americans, while locations 1p36.32-1p36.21, 15q14-15q22.32 and 20p13-20p12.2 only in American Indians. Among the aforementioned locations, novel findings constitute the locations 11q13.3-11q22.1 and 17p12-17q21.2 in Caucasians, locations 1p35.2-1p32.1 and 13q22.1-13q33.1 in African Americans, as well as location 1p36.32-1p36.21 in American Indians. Cytogenetic locations 5q14.3-5q23.2 and 22p13-22q12.3 are identified in all analyses regarding diabetes type and ethnicity, except in American Indian subgroup analysis. Last, but not least, nine regions identified in T1DM analysis and three regions identified in T2DM analysis are also identified in Caucasians, suggesting that Caucasians share more common regions with T1DM rather than T2DM genetic architecture.
Although HEGESMA is based on bins with width of 30 cM, an analysis with bin width <30 cM could provide some more information. HEGESMA currently considers only autosomes. Therefore, no conclusion about possible linkage on X and Y chromosome can be derived. Nevertheless, the fact the HEGESMA calculates the heterogeneity between the included genome scans makes the results more robust.
Conclusions
In conclusion, given the modest results from GWASs on DN, the growing interest in rare variants, and a resurgence of family-based studies, this genome scan meta-analysis and heterogeneity testing in DN replicate the significance of several cytogenetic locations and especially emphasize the significance of locations 5q14.3-5q23.2 and 22p13-22q12.3, which are identified in all analyses regarding diabetes type and ethnicity, except in American Indian subgroup analysis. In addition, this meta-analysis revealed significance of nine novel locations, 1p35.2-1p32.1, 1p36.32-1p36.21, 5q11.2-5q14.3, 5q23.2-5q34, 11q13.3-11q22.1, 13q22.1-13q33.1, 17p12-17q21.2, 17q24.3-17q25.3, and 22q12.3-22q13.3. The conditional analysis revealed one more region, 7p22.3-7p15.3, indicative of linkage with DN. Although the meta-analysis provides some of the strongest evidence of linkage compared to the results of the individual genome scans, especially for cytogenetic locations statistically significant in more than one analyses, we should interpret the results with caution, as it is likely that some important regions have been missed, while some others may result from type I errors, given the aforementioned limitations of the method. However, this genome scan meta-analysis provides some evidence of linkage for several regions to prioritize follow-up studies in candidate locations.
Footnotes
Author Disclosure Statement
No competing financial interests exist.
