Abstract
Background:
Genome-wide and candidate gene association studies have previously revealed links between a predisposition to acute lymphoblastic leukemia (ALL) and genetic polymorphisms in the following genes: IKZF1 (7p12.2; ID: 10320), DDC (7p12.2; ID: 1644), CDKN2A (9p21.3; ID: 1029), CEBPE (14q11.2; ID: 1053), and LMO1 (11p15; ID: 4004). In this study, we aimed to conduct an investigation into the possible association between polymorphisms in these genes and ALL within a sample of Yemeni children of Arab-Asian descent.
Methods:
Seven single-nucleotide polymorphisms (SNPs) in IKZF1, three SNPs in DDC, two SNPs in CDKN2A, two SNPs in CEBPE, and three SNPs in LMO1 were genotyped in 289 Yemeni children (136 cases and 153 controls), using the nanofluidic Dynamic Array (Fluidigm 192.24 Dynamic Array). Logistic regression analyses were used to estimate ALL risk, and the strength of association was expressed as odds ratios with 95% confidence intervals.
Results:
We found that the IKZF1 SNP rs10235796 C allele (p = 0.002), the IKZF1 rs6964969 A>G polymorphism (p = 0.048, GG vs. AA), the CDKN2A rs3731246 G>C polymorphism (p = 0.047, GC+CC vs. GG), and the CDKN2A SNP rs3731246 C allele (p = 0.007) were significantly associated with ALL in Yemenis of Arab-Asian descent. In addition, a borderline association was found between IKZF1 rs4132601 T>G variant and ALL risk. No associations were found between the IKZF1 SNPs (rs11978267; rs7789635), DDC SNPs (rs3779084; rs880028; rs7809758), CDKN2A SNP (rs3731217), the CEBPE SNPs (rs2239633; rs12434881) and LMO1 SNPs (rs442264; rs3794012; rs4237770) with ALL in Yemeni children.
Conclusion:
The IKZF1 SNPs, rs10235796 and rs6964969, and the CDKN2A SNP rs3731246 (previously unreported) could serve as risk markers for ALL susceptibility in Yemeni children.
Introduction
A
The initiation of ALL has been hypothesized to occur in utero or during early infancy (Kaatsch, 2010). However, the exact etiology and development of childhood ALL remain poorly understood. Nevertheless, it is widely recognized that genetic variations, infections, and environmental factors play important roles in the development of ALL (Belson et al., 2007; Guo et al., 2014; Ma et al., 2014).
Genetic variations in both Ikaros family zinc-finger (IKZF1) and dopa decarboxylase aromatic
To better understand the differences in susceptibility to ALL and how genetic and environmental factors interact together to increase or decrease ALL risk, it is crucial to study the impact of genetic polymorphisms among different ethnicities.
Currently, there are limited genetic studies examining ALL susceptibility in the Middle East population. Replicating studies conducted in other populations is imperative to confirm a possible universal impact of these polymorphisms with regard to ALL, in addition to discovering possible genetic differences among various populations to bettering treatment and ALL diagnosis. At present, no study examining the effects of IKZF1, DDC, CDKN2A, CEBPE, and LMO1 on ALL exist in Yemen. As such, the primary aim of this case-control study is to evaluate the influence of polymorphisms in these five genes on childhood ALL susceptibility in the Yemeni population of Arab-Asian ancestry.
Materials and Methods
Ethics statement
We received approval for the current case-control study from the Medical Ethics Committee of the University Malaya Medical Center (RF. No.: 989.39). We also obtained permission from Al-Thawra and Al-Kuwait University Hospital Boards (Sana'a, Yemen) before conducting the study. Written informed consent was signed by parents or guardians before blood sample collection.
Study population and sample processing
A total of 289 children, 2-15 years of age, of whom 136 cases had been diagnosed with ALL and 153 healthy controls in Sana'a, Yemen, were enrolled in this study. Genomic DNA was isolated from 4 mL peripheral blood using the FlexiGene DNA Kit (Qiagen, Valencia, CA) according to the manufacturer's protocol. Extracted DNA samples were stored in 200-μL tubes and immediately kept at −80°C before analysis.
Hematological analysis
Complete blood counts were analyzed using Mythic™ 22 AL fully automatic 22 parameters hematology analyzer (Orphee SA).
Genetic analysis
SNPs for our study were selected based on previous candidate gene association and genome-wide association studies (GWAS) (Papaemmanuil et al., 2009; Treviño et al., 2009; Prasad et al., 2010; Sherborne et al., 2010; Beuten et al., 2011; Pastorczak et al., 2011; Lautner-Csorba et al., 2012; Chokkalingam et al., 2013). The SNPs are as follows: IKZF1 (rs4132601, rs6964969, rs11978267, rs7789635, rs10235796, rs6964823, and rs6944602), DDC (rs3779084, rs880028, and rs7809758), CDKN2A (rs3731246 and rs3731217), CEBPE (rs2239633 and rs12434881), and LMO1 (rs442264, rs3794012, and rs4237770). IKZF1, DDC, CDKN2A, CEBPE, and LMO1 SNPs were genotyped using Fluidigm 192.24 Dynamic Array (Fluidigm Corporation, San Francisco, CA) according to the manufacturer's protocol. The Fluidigm primer design is listed in Supplementary Table S1 (Supplementary Data are available online at www.liebertpub.com/gtmb). The call rates for IKZF1 SNPs rs4132601, rs11978267, and rs10235796 were 100% in both ALL and control groups. The call rates for rs6964969 and rs7789635 were >90% in the ALL group and >86% in the control group. The call rates for DDC SNPs were >97% in both groups. The call rates for CDKN2A SNP rs3731246 and rs3731217 were >96% and >99% in both groups, respectively. The call rates for CEBPE and LMO1 SNPs were >99% in both groups. Quality control was checked using 40% blind duplicate samples and the concordance rate was 100%. No-template controls were included in each run.
Statistical analysis
Hardy-Weinberg equilibrium (HWE) test was performed to assess the departure of genotype distribution by using the DeFinetti software (http://ihg.gsf.de/cgi-bin/hw/hwa1.pl) from the Institute of Human Genetics (www.helmholtz-muenchen.de/ihg). Statistical analysis was performed using SPSS version 20.0 (LEAD Technologies, Inc.). Continuous data were presented as geometric means and comparisons between groups were analyzed using independent samples t-test. Logistic regression analysis was used for IKZF1, DDC, CDKN2A, CEBPE, and LMO1 SNPs controlled for age and gender as covariates. The associations between genotypes and ALL risk are presented as odds ratios (ORs) with 95% confidence intervals. Linkage disequilibrium (LD), haplotypes, and diplotype analysis of IKZF1 SNPs were evaluated using SNP & Variation Suite v8.x software (Golden Helix, Bozeman, MT). A significant difference was considered when p-value <0.05.
Results
The current study is comprised of 136 children diagnosed with ALL and 153 healthy children. The distribution of demographic and hematological variables among ALL cases and controls are listed in Table 1.
Results presented as geometric means (95% confidence interval of mean), ALL versus normal control group, evaluated by independent samples t-test.
ALL, acute lymphoblastic leukemia; HCT, hematocrit; HGB, hemoglobin; RBCs, red blood cells; WBCs, white blood cells.
IKZF1 SNPs (rs4132601, rs6964969, rs11978267, rs7789635, and rs10235796), DDC SNPs (rs3779084, rs880028, and rs7809758), CDKN2A SNPs (rs3731246 and rs3731217), CEBPE SNPs (rs2239633 and rs12434881), and LMO1 SNPs (rs442264, rs3794012, and rs4237770) conformed to HWE in the control group. However, IKZF1 SNPs rs6964823 and rs6944602 showed departure from HWE (p-value = 0.04, 0.007, respectively), hence these two SNPs were excluded from further analysis. The logistic regression model adjusted for age and gender revealed that IKZF1 rs10235796 C allele was a risk factor for ALL (OR = 2.50; p = 0.002) in comparison to the T allele. The results also showed that IKZF1 rs6964969 A>G variant was significantly associated with ALL (OR = 2.14; p = 0.048, GG vs. AA) (Table 2).
Controlled for age and gender. The outliers (studentized residual is greater than 2.0 or less than −2.0) were excluded. Bold italicized values are significant.
CI, confidence interval; OR, odds ratio; SNPs, single-nucleotide polymorphisms.
Our results revealed borderline association between ALL risk and IKZF1 SNP rs4132601 T>G (OR = 2.07; p = 0.050, GG vs. TT). On the other hand, we found no associations between IKZF1 rs11978267 and rs7789635 genotypes with ALL (Table 2). Similarly, there were no associations between DDC SNPs (rs3779084, rs880028, and rs7809758) and ALL risk (Table 2). The findings indicated that CDKN2A rs3731246 G>C polymorphism was associated with ALL risk (OR = 2.01; p = 0.047, GC+CC vs. GG). Specifically, the rs3731246 C allele demonstrated increased risk for ALL (OR = 1.55; p = 0.007) compared to the G allele. However, CDKN2A SNP rs3731217 was not associated with ALL. In addition, we found no association between CEBPE SNPs (rs2239633 and rs12434881) and LMO1 SNPs (rs442264, rs3794012, and rs4237770) with childhood ALL (Table 2).
Upon performing haplotype analysis, two-SNP haplotypes and diplotype blocks were determined to exhibit significant LD. The constructed block consisted of IKZF1 SNPs rs11978267 and rs6964969 (Fig. 1). Two haplotypes and three diplotypes with frequency >4% in the total sample were analyzed for their association with increased risk of ALL (Table 3). The most common haplotype (AA) was composed of the protective alleles of rs11978267 and rs6964969. This haplotype revealed protection from ALL (OR = 0.43, p = 0.015), whereas the risk allele-haplotype (GG) showed borderline association with ALL (OR = 1.97, p = 0.05). Likewise, the diplotype (GG-GG) had borderline association with ALL risk (OR = 1.97, p = 0.05) (Table 3).

Pairwise linkage disequilibrium among IKZF1 SNPs in Yemeni children. SNPs, single-nucleotide polymorphisms.
Controlled for age and gender. The outliers (studentized residual is greater than 2.0 or less than −2.0) were excluded. Bold italicized values are significant.
Discussion
ALL is a heterogeneous disease influenced by both genetic and non-genetic factors. In the present study, we assessed whether polymorphisms within IKZF1, DDC, CDKN2A, CEBPE, and LMO1 genes have any influence on susceptibility to ALL in Yemeni children of Arab-Asian ancestry. We found that IKZF1 SNPs rs10235796 and rs6964969 were associated with increased risk of ALL, whereas SNP rs4132601 merely has a marginal association with ALL. The association between rs6964969 in IKZF1 and ALL has been reported in the Hungarian population (Lautner-Csorba et al., 2012). Of interest, we found a strong association between CDKN2A SNP rs3731246 with increased risk of ALL, a novel result that has never been reported prior. Our results also revealed that IKZF1 SNPs rs7789635 and rs11978267, DDC SNPs, CDKN2A SNP rs3731217, CEBPE SNPs, and LMO1 SNPs have no effect on the susceptibility to childhood ALL. Our findings were inconsistent with previous studies that showed association between IKZF1 rs11978267 and ALL (Treviño et al., 2009; Lautner-Csorba et al., 2012; Orsi et al., 2012; Linabery et al., 2013; Xu et al., 2013). Noteworthy, Bahari et al. (2016) found no association between rs11978267 and childhood ALL in the Iranian population.
The IKZF1 gene, located at 7p12, encodes the Ikaros zinc-finger transcription factors, which are important in normal lymphoid maturation and tumor suppression (Sellars et al., 2011; Kastner et al., 2013; Yoshida and Georgopoulos, 2014). Mutations of Ikaros are common in B cell ALL (Nakase et al., 2000) and poor prognosis is associated with IKZF1 alterations (Feng and Tang, 2013). The activity of hematopoietic stem cells were reduced in Ikaros-null mice (Ng et al., 2009) and deletion of IKZF1 leads to the development of a high-risk form of childhood ALL (Mullighan et al., 2008, 2009; Den Boer et al., 2009). It has been suggested that mutations in IKZF1 affects expression of this gene and disruption of normal Ikaros function has been implicated in the leukemogenesis of ALL in infants (Papaemmanuil et al., 2009).
IKZF1 rs4132601 SNP has been shown to be significantly associated with susceptibility to childhood ALL in both the Caucasian and Hispanic populations (Papaemmanuil et al., 2009; Prasad et al., 2010; Pastorczak et al., 2011; Lautner-Csorba et al., 2012; Orsi et al., 2012; Hsu et al., 2015). However, we found borderline association between rs4132601 and ALL risk in Yemeni children. Contentious findings have been reported by study groups in Taiwan, China, and Korea that show no association between rs4132601 and ALL risk in the Asian population (Han et al., 2010; Wang et al., 2013; Lin et al., 2014). On the other hand, Vijayakrishnan and Houlston (2010) have found that rs4132601 SNP was associated with increased risk of ALL in the Thai population (Vijayakrishnan and Houlston, 2010). Also, the findings in the Tunisian and Iranian population supported the association between this SNP and ALL risk (Bahari et al., 2016; Gharbi et al., 2016). Two recent meta-analysis reports have suggested an association between IKZF1 rs4132601 and increased ALL risk among Caucasians, but not among Asians (Dai et al., 2014; Li et al., 2015). These inconsistencies might be attributed to differences in genetic and environmental factors, as well as variations in LD.
IKZF1 SNP rs4132601 is located in exon 8 and may affect messenger RNA (mRNA) stability (Kastner et al., 2013). Papaemmanuil et al. (2009) found that IKZF1 rs4132601 increases the risk of B cell ALL (Papaemmanuil et al., 2009). They found that the risk genotype of this SNP was associated with reduced expression of IKZF1 in lymphocytes. Upon evaluating the impact of the IKZF1 haplotypes on ALL susceptibility, haplotype GG, containing risk alleles of both IKZF1 SNPs rs11978267 and rs6964969, was found to be marginally associated with ALL risk. Conversely, haplotype AA, containing nonrisk alleles, showed a protection against ALL risk.
DDC is a prime candidate gene to study given its contiguous involvement with leukocytogenesis. DDC gene expression in human peripheral white blood cells leads to
Based on its biology, CDKN2A is a strong candidate ALL-susceptibility gene. Abnormalities in CDKN2A/CDKN2B tumor suppressor genes is present in up to 40% of B-precursor ALL and plays a role in the deregulation of cell cycle in leukemia (Mullighan et al., 2007). CDKN2A-CDKN2B genes encode three tumor suppressor proteins (p16INK4A, p14ARF, and p15INK4B). p16INK4A and p15INK4B are inhibitors of cyclin-dependent kinases, which are important regulators of cell cycle through the retinoblastoma protein (Rb)-E2F pathway (Krug et al., 2002). We investigated the association of CDKN2A SNPs with ALL and found that rs3731246 is the only SNP associated with ALL risk. To the best of our knowledge, this is the first study to demonstrate the contribution of rs3731246 to increase risk of ALL in Yemeni children of Arab-Asian descent. Our data also revealed no association between rs3731217 and ALL, consistent with results from earlier studies that have failed to identify an association between this SNP and ALL (Vijayakrishnan et al., 2010; Pastorczak et al., 2011). A study examining a Tunisian population of Arab-African ancestry similarly found no association between SNP rs3731217 and ALL risk (Gharbi et al., 2016). A former GWAS in the European population revealed no significant association between CDKN2A variations and childhood ALL (Papaemmanuil et al., 2009). However, another GWAS using larger sample size in the European population showed significant association (Sherborne et al., 2010).
CEBPE is a tumor suppressor gene that inhibits myeloid leukemogenesis and may contribute to B cell ALL development (Papaemmanuil et al., 2009). We did not identify any significant associations between CEBPE SNPs rs2239633 and rs12434881 with ALL. Our findings are consistent with the results from the Asian (Vijayakrishnan et al., 2010; Wang et al., 2013) and Polish studies (Pastorczak et al., 2011), which exhibited an absence of association between rs2239633 and childhood ALL. Healy et al. (2010) showed that rs2239633 was not associated with childhood ALL in French Canadians of European descent (Healy et al., 2010). Contradictorily, a Korean study showed moderate association of this SNP with the risk of ALL (Han et al., 2010). Also, two CEBPE SNPs that were not included in this study, rs4982731 and rs10143875, have been found to be associated with increased risk for ALL in the Hispanic population (Hsu et al., 2015). Former studies in Caucasians have shown association between CEBPE variations and childhood ALL risk (Papaemmanuil et al., 2009; Prasad et al., 2010; Orsi et al., 2012). Xu et al. (2013) showed strong association between CEBPE SNPs and ALL risk in a European-American cohort, with dissimilar results in the non-European population (Xu et al., 2013). Wang et al. (2013) found differences in locus of rs2239633 in LD blocks between Chinese and European populations (Wang et al., 2013). Interestingly, SNP rs2239632 located in the promoter region of CEBPE has been hypothesized to increase susceptibility to leukemia as a result of its risk allele increasing the expression of the final gene product (Ryoo et al., 2013).
CEBPE and CDKN2A variants were found to be more restricted to European population (Xu et al., 2012, 2013; Perez-Andreu et al., 2013). The disparities might be explained by the presence of population-specific disease variants and different LD between tag SNPs and causal SNPs. No significant associations were detected between LMO1 SNPs and childhood ALL in this study. Our results contradict those obtained from a previous study in a Caucasian population which showed that LMO1 SNP rs442264 was strongly associated with ALL risk (Beuten et al., 2011). In the same study, LMO1 SNPs rs11041815, rs11041830, and rs11041831 were also found to be associated with ALL; a 14-SNP haplotype block within LMO1 gene was shown to increase susceptibility to ALL (Beuten et al., 2011). High expression of this gene is seen in several tissues, such as bone marrow (Liu et al., 2008), and plays a role in protein-protein interaction (Rabbitts, 1994). Abnormal LIM protein expression contributes to T cell leukemogenesis (Boehm et al., 1991; Izraeli, 2004) and about 45% of T-ALL has aberrant LMO1 expression (Aifantis et al., 2008). The role of LMO1 variants in ALL pathogenesis has yet to be thoroughly investigated.
Conclusion
We have performed the first study investigating the relationship between IKZF1, DDC, CDKN2A, CEBPE, LMO1 gene polymorphisms, and childhood ALL risk in the Yemeni population. We have found that IKZF1 SNPs rs10235796 and rs6964969 and CDKN2A SNP rs3731246 were significantly associated with increased risk of childhood ALL. Replication studies in the Arab population with larger sample sizes are necessary to validate the extent of our findings.
Footnotes
Acknowledgments
The authors would like to thank all children, parents, and medical and nursing staff for their participation in this study. The current study was supported by grants from the University of Malaya (HIR/420001-E000046) and IPPP (PG039/2013A).
Authors' Contributions
Designed study: B.A., S.M., R.S.A., M.F.M.R., and S.M.N. Samples collected: B.A., M.A., R.H.A., and S.D.S. Conducted experiments and statistical analyses: B.A., R.S.A., S.D.S., M.A., and R.H.A. Prepared article: B.A., S.M., S.M.N., R.S.A., and M.F.M.R.
Author Disclosure Statement
No competing financial interests exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
