Abstract
Background:
Previous genome-wide microsatellite screening in Graves' disease (GD) has suggested several regions of linkage to disease. Although replication has been inconsistent, some regions such as chromosome 5q31–33 have been associated with several Oriental GD patient cohorts. Recently, two studies have reported association of single-nucleotide polymorphism (SNP) rs31480 in interleukin 3 (IL-3) and the rs1368408 and SNP75 (−623∼−622 AG/-T) SNPs in secretoglobulin family 3a member 2 (SCGB3A2) with GD and suggested that this may account for linkage to the 5q31–33 region in Oriental GD datasets. We sought to confirm this association in a large Caucasian U.K. GD cohort.
Methods:
The rs31480 SNP was shown to tag all known common variations in IL-3 and the rs1368408 SNP was shown to tag all common variations in SCGB3A2. The SCGB3A2 SNP75 was found to be rare in the U.K. Caucasian population and, therefore, was not screened. We genotyped rs31480 and rs1368408 and performed a case–control association study in 2504 GD cases and 2688 controls from the U.K.
Results:
Association between the SCGB3A2 rs1368408 SNP and GD was detected (p = 0.007, odds ratio = 1.18, 95% confidence intervals = 1.05–1.33). No association between the IL-3 rs31804 SNP and U.K. Caucasian GD patients was observed.
Conclusions:
These data suggest that chromosome 5q31–q33 contains a susceptibility locus for Caucasian GD patients as well as Oriental GD patients. Although association was detected between SCGB3A2 and U.K. Caucasian GD subjects, the size of effect was smaller than that seen in the Oriental population (odds ratio = 1.28–1.73). Fine mapping within this region will be required to determine the exact location of the etiological variants present within this region for both Caucasian and Oriental GD patients.
Introduction
Chromosome 5q31–q33, which encodes a large variety of cytokines, inflammatory mediators, and other immune modulators, represents a region of linkage, which has generated much interest. Linkage of microsatellite marker D5S436 located within chromosome 5q31–33 was first detected in a Japanese GD genome-wide screen (5). Linkage was later replicated in a Han Chinese GD dataset along with additional more strongly associated microsatellite markers including D5S2090 (6) and in a second Japanese GD cohort to an extended 5q12–q33 region (4). Although no Caucasian GD genome-wide microsatellite screen has reported linkage to 5q31–33 (1 –3,9), some evidence for linkage to a separate region of chromosome 5, 5q11.2–5q14.3, has been previously detected (9). Several genes within 5q31–33 have been proposed to explain the association of this region with GD, including beta-2-adrenergic receptor (ADRB2) (10), interleukin 12b (IL-12b) (11,12), interferon regulatory factor 1 (13), IL-13 (13 –15), and IL-4 (13), although replication of these loci has proved difficult. More recently, two independent studies attempted to narrow down association within this region and have identified two distinct GD susceptibility loci approximately 15 Mb apart, which could potentially explain linkage to GD in this region.
The first study used 51 Tag single-nucleotide polymorphisms (SNPs) to screen 19 genes, chosen because of their potential as candidate loci for GD within the 5q31–33 region, in 428 Han Chinese GD and 690 control subjects (16). The rs40401 SNP (and the rs31480 SNP, which was in complete linkage disequilibrium [LD]) located within IL-3 (p = 0.002, odds ratio [OR] = 1.51) showed the strongest association, even after the addition of a further seven Tag SNPs within the surrounding haplotype block (16). Association of rs40401 was further replicated in an independent dataset composed of 468 Han Chinese GD and 471 control subjects (p = 0.0001, OR = 1.76) (16). A second study investigated a total of 179 SNPs within a 3-Mb region containing 25 genes surrounding microsatellite marker D5S2090 in 384 Han Chinese GD and 382 controls (17). Four SNPs were associated with GD, with rs1368408 being the most associated one, located in secretoglobulin family 3a member 2 (SCGB3A2) (17). Further screening of 122 SNPs in a 1-Mb region surrounding rs1368408 in 541 Han Chinese GD and 478 controls revealed 20 SNPs associated with GD, with logistic regression revealing the SCGB3A2 rs1368408 (p = 4.11 × 10−8, OR = 1.73) and SNP75 (-623∼-622 AG/-T) (p = 1.37 × 10−8, OR = 2.17) as the most important SNPs in this region. Association of rs1368408 (p = 1.43 × 10−6, OR = 1.28) and SNP75 (p = 7.62 × 10−5, OR = 1.32) was replicated in another much larger dataset of 2811 Han Chinese GD and 2807 controls, with rs1368408 (p = 0.001, OR = 1.47) also showing association in a further independent dataset of 545 Han Chinese GD and 603 control subjects from Shanghai, although they failed to detect association of SNP75 (p = 0.1348). Haplotypes containing rs1368408 and rs6882292 (another SNP in SCGB3A2) or rs1368408 and SNP75 were also shown to be associated with GD (p = 0.0007 and p = 0.019, respectively), with functional analysis revealing that presence of these haplotypes correlated with lower SCGB3A2 promoter efficiency, leading to lower SCGB3A2 expression (17). The aim of our study was, therefore, to determine if IL-3 and SCGB3A2 also play a role in GD susceptibility in the U.K. Caucasian population.
Materials and Methods
Subjects
A dataset consisting of 2504 unrelated white Caucasian patients with GD from the U.K. National Autoimmune Thyroid Disease Collection was screened. Patients were recruited from specialist clinics in Birmingham, Bournemouth, Cambridge, Cardiff, Exeter, Leeds, Sheffield, and Newcastle in the United Kingdom. Clinical definition of GD was as previously described (18). GD was defined by the presence of biochemical hyperthyroidism with (a) a diffuse goiter on a radionuclide or sonographic scan, (b) Graves' ophthalmopathy (no signs or symptoms; only signs, no symptoms; signs only; proptosis; eye muscle involvement; corneal involvement; sight visual acuity reduction [NOSPECS] classification score ≥2) (19), (c) positive autoantibodies to the thyroid stimulating hormone receptor (TSHR), (d) diffuse goiter on physical examination and positive antibodies to thyroglobulin or thyroid peroxidase, or (e) confirmation of a lymphocytic infiltrate in thyroid histology. A total of 2688 geographically matched white Caucasian control subjects were also obtained from the 1958 British Birth Cohort as previously stated (20). All patients gave informed written consent and the project was approved by the local ethics committee.
Tag SNP selection and genotyping
To screen the IL-3 promotor rs31480 SNP (a proxy for rs40401) and any other common variation within IL-3, genotyping data were downloaded from the International Haplotype Mapping Project website (
Clinical phenotype correlations
Association analysis of SNP genotype with specific clinical phenotype was performed as described previously (21).
Statistical and haplotype analyses
We had greater than 80% power to exclude an OR = 1.15 for rs31480 and an OR = 1.17 for rs1368408 at α = 0.05. Analysis of case–control data was performed using the χ 2 test within the MINTAB statistical package (MINITAB Release 15, © 1972–2003, Minitab, State College, PA) and p < 0.05 was considered significant. OR with 95% confidence intervals were calculated by the method of Woolf with Haldane's modification for small numbers, where appropriate (22). Power and Hardy–Weinberg equilibrium calculations were performed using formula written within Excel (Microsoft® Office Excel, © 2003; Microsoft, Redmond, WA) based on established statistical methodology (23,24).
Results
Genotyping results
Both SNPs were in Hardy–Weinberg equilibrium in both the cases and controls. The SCGB3A2 rs1368408 SNP was shown to be associated within our dataset (p = 0.007, OR = 1.18, 95% confidence intervals = 1.05–1.33; Table 1). No association of the IL-3 rs31480 SNP was detected with GD (p = 0.295).
All SNPs are reported 5′–3′.
SNP, single-nucleotide polymorphism; OR, odds ratio; 95% CI, 95% confidence interval.
Clinical phenotype associations
Tests of correlation were performed with both the SCGB3A2 rs1368408 and IL-3 rs31480 SNPs with GD subphenotype, including severity of ophthalmopathy as determined by the NOSPECS classification (NOSPECS <2 vs. NOSPECS ≥2), age of onset of GD (<31 years vs. ≥31 years), biochemical severity as indicated by serum-free T4 concentration at time of diagnosis (serum-free T4 concentrations of <40 or ≥40 pmol/L), the presence or absence of palpable goiter, and autoantibody-positive subjects compared with autoantibody-negative subjects (for TSHR, thyroglobulin, or thyroid peroxidase antibodies). For rs1368408, a weak allelic association was detected with lower levels of free T4 (p = 0.037), although association with individual genotypes was not detected (p = 0.085). No further associations were observed between the rs1368408 and rs31480 SNPs and any other of the subphenotypes investigated (data available upon request).
Discussion
Although linkage to chromosome 5q31–33 had been found in the Oriental population, no Caucasian GD microsatellite genome scan has ever reported linkage to 5q31–33 (1 –3,9). Weak support for a role for chromosome 5q31–q33 in the development of GD in Caucasian patients has come from the Wellcome Trust Case Control Consortium (WTCCC). The WTCCC performed a 15,000 nsSNP genome screen in which an SNP within ADRB2, rs1042714 (p = 0.04), which is located approximately 948 kb away from SCGB3A2, and a further SNP in RAD50, RAD50-327 (p = 0.03), which is approximately 516 kb away from IL-3, were weakly associated with GD (20). In this study we have replicated the association of SCGB3A2 rs1368408, located within chromosome 5q31–33, which has been previously shown to be associated with GD in a Japanese GD cohort.
Although SCGB3A2 may not seem like an obvious candidate for GD, insights about its role within the immune response suggest a possible explanation for its association with GD. SCGB3A2 encodes a secretary protein that binds to the macrophage scavenger receptor with collagenous structure (MARCO). MARCO is a scavenger receptor present on a subpopulation of macrophages and is believed to be involved in binding lipopolysaccharides and gram-positive and gram-negative bacteria (25). Knockout mouse studies showed that MARCO-deficient mice demonstrated an impaired ability to clear bacteria from their lungs compared with wild-type mice, suggesting a key role for MARCO in host defense against bacterial infections (26). Although MARCO has been shown to be expressed in the spleen, lymph nodes, and thymus (17,27), low levels of MARCO and SCGB3A2 have been more recently detected within the thyroid (17). In lung tissue it has been proposed that SCGB3A2 expressed on the surface of epithelial cells of the bronchioles interacts with MARCO to enable functional interaction between macrophages and lung epithelial tissue to aid the removal of bacteria and other pathogens from the lung (27). It is appealing to hypothesize that, in thyroid, MARCO and SCGB3A2 could interact in a similar manner. Interestingly, the presence of the rs1368408 allele A in a haplotype containing the SCGB3A2 SNP75 allele T or rs6882292 allele A was shown to reduce SCGB3A2 mRNA levels in human thyroid tissue (17). As bacteria are proposed to be one environmental trigger for GD (7,28), any disruption or variations within this interaction could alter how bacteria and other pathogens are recognized by the immune system and the response triggered, which could possibly cross react to cause thyroid autoimmunity. SCGB3A2 is also believed to be a target of homeodomain transcription factor T/EBP (TTF-1), which regulates expression of thyroid-specific genes, including thyroglobulin, thyroid peroxidase, and the TSHR (29 –32). Disruption within this control system could also contribute to the onset of GD, although further work is required to determine the exact role by which SCGB3A2 contributes to GD (17).
We failed to replicate association of the IL-3 rs31480 SNP within our Caucasian GD cohort. This can be due to several potential explanations. The original associations in the Oriental populations could have been a false positive or the size of effect overinflated because of a first time effect (33). This is unlikely as the original study replicated association of IL-3 with GD in two small independent Oriental GD datasets. Our dataset had >80% power to exclude an OR ≥1.15 for rs31480, and although we cannot rule out a minor effect for this SNP in our dataset, we can confidently exclude the same level of association in the Caucasian GD population as was seen in the Oriental GD population (OR = 1.51–1.76) (16). GD is a complex disease caused by both environmental and genetic factors, with well-established differences in presentation between patients including, for example, variation in degrees of thyroid eye disease, autoantibody titers, and appearance of goiter. Polymorphism within IL-3 could be linked to a specific clinical subphenotype or environmental trigger that is more prevalent in GD onset and progression in Oriental patients compared with Caucasian patients. However, our study revealed no major association with disease subphenotype.
It is of note that the magnitude of the association of SCGB3A2 rs1368408 seen in our GD population (OR = 1.18) was lower than that seen in the Oriental cohorts (OR = 1.28–1.73) (17). This could be explained by the well-reported, first time effect phenomenon (33). Equally, the differences observed between the Caucasian and Oriental populations could represent ethnic or geographical differences in genetic architecture. Levels of SNP diversity, allele frequencies, and LD patterns are known to vary significantly between Oriental and Caucasian populations. This suggests that although an etiological variant could be localized to a given set of SNPs in one population, diversity within the other population combined with differences in LD patterns could mean that the location of the etiological variant within the other population could be linked to a different set of SNPs located either elsewhere in the same gene or within the surrounding region. For example, association of the TSHR with GD in U.K. Caucasians has been refined to a probable 40-kb region within intron 1, whereas within the Japanese GD population the largest association is located within intron 7 (34,35).
Our study tagged all known common variations present within SCGB3A2 and IL-3 according to Caucasian Hapmap data (Build 36). However, within the Oriental populations many more SNPs were detected within these genes, suggesting that Hapmap data for this region at the time of project inception may have been limited. We may have missed other common variants within this region, which could be part of another haplotype not tagged by rs1368408.
Conclusion
Although association of SCGB3A2 rs1368408 has been found with GD in U.K. Caucasians, further SNP coverage surrounding SCGB3A2 is essential to localize the etiological variant within this region. This next step is, however, at present hampered by the lack of known SNP data within this region for Caucasians, making it difficult to determine the boundaries of LD. Data from the soon to be published 1000-genome project are likely to provide much more detailed information on the number and location of SNPs and the haplotype structure for Caucasians (
Footnotes
Acknowledgments
The authors would like to thank patients, nurses, and doctors for recruiting into the Autoimmune Thyroid Disease National Collection, and the Wellcome Trust for funding. The authors acknowledge use of DNA from the British 1958 Birth Cohort Collection, funded by the Medical Research Council grant G0000934 and the Wellcome Trust grant 068545/Z/02.
Disclosure Statement
None of the authors have any conflicts of interest or any competing financial interest to declare.
