Abstract
Background:
Nonsyndromic cleft lip with or without cleft palate (NSCL/P) is a complex congenital disease affected by genetic and environmental factors however, the specific pathogenic alleles and regulatory mechanisms remain unclear in many cases. Here, we aimed to study the association between eight potentially functional single nucleotide polymorphisms (SNPs) of the BRCA2 and MGMT genes and NSCL/P in a Chinese population through a case-control study.
Materials and Methods:
To investigate the relationship between potentially functional SNPs of the BRCA2 and MGMT genes and NSCL/P, we selected 200 affected patients and 200 unrelated normal controls in a Chinese population. The BRCA2 gene SNPs (rs11571836, rs144848, rs7334543, rs15869, rs766173 and rs206118) and MGMT gene SNPs (rs12917 and rs7896488) were genotyped using the SNaPshot technique and the resulting data were subjected to statistical and bioinformatic analyses.
Results:
Our study identified for the first time that alleles of the BRCA2 are associated with NSCL/P in a Chinese population and that the s11571836 G allele was protective against NSCL/P. Under four genetic models, rs11571836 had a significant correlation with NSCL/P. Preliminary bioinformatic analyses revealed four potential miRNA matching sites (miR-1244, miR-1323, miR-562, and miR-633) associated with the rs11571836 which is located in the 3′ untranslated region of BRCA2.
Conclusions:
These results support the role of polymorphisms of BRCA2 gene in affecting susceptibility to NSCL/P and its progression, but further research is necessary to elucidate the mechanism by which the BRCA2 gene polymorphisms affect the penetrance of NSCL/P.
Introduction
Cleft lip and/or palate (CL/P) is one of the most common congenital malformations of the maxillofacial region, with a prevalence of ∼1 in 700 live births. Asian and American Indians have the highest incidence rate, while African Americans have the lowest (Dixon et al., 2011). Patients with this condition may have negative effects in a variety of areas, including feeding, speaking, hearing, and social integration.
Therefore, some patients may suffer lifelong adverse effects from the deformity. CL/P is classified as syndromic cleft lip and palate and nonsyndromic cleft lip with or without cleft palate (NSCL/P). As an example, Van der Woude syndrome, which is mostly brought on by mutations in the interferon regulatory factor 6 and grainyhead like transcription factor 3 genes, has a well-studied etiology and pathology (Kondo et al., 2002; Peyrard-Janvid et al., 2014). In contrast, the understanding of the etiology and pathogenesis of NSCL/P, which accounts for ∼70% of CL/P, is very limited.
It is generally accepted that NSCL/P is a genetic disease with a complex etiology caused by genetic and environmental factors. Numerous research has identified a number of genes linked to the prevalence of NSCL/P, although the findings are often unsatisfactory and poorly reproducible. More importantly, previous studies have focused on statistical differences, while neglecting the functional analysis of genetic loci and biological mechanisms.
The process of facial embryonic development involves multiple factors and signaling pathways that together participate in regulating various physiological processes such as cell proliferation, migration, differentiation, and apoptosis. Furthermore, if any of these regulatory components' functions is hampered, it may have an impact on the fusion association of the facial synapse and lead to the emergence of NSCL/P (Ji et al., 2020). Previous studies have found genetic overlap between cancer and NSCL/P and a possible common etiological background (Dunkhase et al., 2016; Wang et al., 2016).
DNA damage repair genes are critical for embryonic development and their abnormalities dramatically increase the risk of cancer. Many animal experiments have demonstrated that mutations in DNA damage repair genes lead to various embryonic developmental abnormalities, including stillbirths, proliferation defects, genetic instability, and developmental malformations of organs, especially craniofacial tissues (Deng and Wang, 2003). The network of genes controlling cellular defense against DNA damage plays a role in the etiology of NSCL/P.
Dysregulation of co-expressed gene networks mainly associated with DNA double strand breaks (DSBs) repair and cell cycle control was found in dental pulp stem cells of NSCL/P patients (Kobayashi et al., 2013). Some adverse environmental factors may lead to oxidative DNA damage, including DSBs, which may increase the risk of developing craniofacial cleft (Feltes et al., 2013). A crucial role for DNA damage response in the formation of the palate has been established by animal research. Furthermore, the breast cancer susceptibility gene (BRCA)1/2-p53-dependent pathway is key in regulating normal palatogenesis in mice (Yamaguchi et al., 2021).
Therefore, DNA damage repair-related genes are closely related to the pathogenesis of NSCL/P, among which, BRCA2 and Methylation of O6-methylguanine-DNA methyltransferase (MGMT) genes are key genes. The BRCA2 gene is required for craniofacial bone development (Kitami et al., 2018) and was found to be weakly associated with NSCL/P in non-Hispanic whites (NHW) and Hispanics (Rodriguez et al., 2018). MGMT is a DNA methyltransferase that repairs damaged DNA by transferring methyl groups at the guanine O6 site to cysteine residues, avoiding gene mutations, cell death, and tumorigenesis caused by alkylating agents (Yu et al., 2020).
10q26.3 distal deletion was associated with cleft palate and the region of deletion contained the MGMT (Plaisancié et al., 2014). Besides, rs7922405, located in the intron of MGMT gene, was associated with NSCL/P (Chen et al., 2018). However, the specific association and mechanism between BRCA2, MGMT gene, and NSCL/P in the Chinese population are still not clear. Therefore, we selected eight functional single nucleotide polymorphisms (SNPs) in BRCA2 and MGMT genes in this case-control study and performed preliminary bioinformatics analysis of the positive results.
Materials and Methods
Subjects
Two hundred NSCL/P patients, 71 males and 129 females, 3 months to 18 years of age, were collected as a case group from Beijing Stomatological Hospital affiliated to Capital Medical University. Among the case group, there were 59 cases of cleft lip only and 141 of cleft lip with cleft palate. The control group was recruited in the Beijing area and consisted of 200 cases, who had no congenital malformations and genetic family history. Among them, there were 80 males and 120 females, ranging from 2 to 25 years old. Besides, the subjects in the case and control groups were unrelated.
Variant selection and genotyping
This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Ethics Committee of Beijing Stomatological Hospital affiliated to Capital Medical University (No. CMUSH-IRB-KJ-PJ-2020-01). Written informed consent for genetic analysis was obtained from all subjects or their guardians. In the case group, 2 mL of venous blood was drawn, anticoagulated with EDTA, and stored at −80°C. In the control group, 2 mL of saliva was collected by DN-39 saliva collector (Aidlab, Beijing, China) and stored at room temperature. DNA was extracted from venous blood or saliva using a TIANamp blood DNA kit (Tiangen, Beijing, China) or a DN-39 saliva extraction kit (Aidlab). Spectrophotometry and agarose gel electrophoresis were used to determine the quantity and quality of extracted DNA.
The databases selected for functional variants were the HapMap (http://hapmap.ncbi.nlm.nih.gov) and NCBI dbSNP (www.ncbi.nlm.nih.gov/snp) databases. The functional variants for BRCA2 and MGMT genes in Chinese Han population were queried, and the screening criteria were as follows: (1) variants with a minor allele frequency of 0.05 (5%) or greater in Chinese Han population and (2) variants were selected at positions such as frameshift variants, missense variants, 5′ untranslated region (UTR), and 3’UTR. Eight functional variants were finally selected for this study (Table 1).
Information About Eight Candidate Single Nucleotide Polymorphisms
MAF, minor allele frequency; SNPs, single nucleotide polymorphisms; UTR, untranslated region.
Primers were designed and synthesized using Primer5 software based on gene sequence information. In this study, genotyping was performed by SNaPshot technique according to the manufacturer's protocols (Applied Biosystems, Warrington, United Kingdom). An ABI 3730XL DNA analyzer was used for sequencing and the DNA sequences were read using GeneMapper version 3.2 software (Applied Biosystems). The variant site corresponding to the extension product was identified based on the position of the peak. Then, the type of base added could be known according to the color of the peak, so as to determine the genotype of the sample. After the results were obtained, the positive results were evaluated for preliminary function by bioinformatics softwares.
Data analysis
Haploview version 4.2 program (www.broad.mit.edu/mpg/haploview/index.php) was applied to perform the Hardy-Weinberg Equilibrium (HWE) test. The chi-square (χ2) test was used to calculate whether the frequencies of each allele were statistically different between cases-controls by Plink version 1.07 software. Correlation between five genotypic test models (codominant, dominant, recessive, overdominant, and log additive) and NSCL/P was analyzed by unconditional logistic regression using the SNPStats online program. The optimal genetic model was determined according to the Akaike information criterion (AIC), and the genetic model with the lowest AIC value was the optimal genetic model for the variant. Bonferroni correction was applied to all results. The significance level in all tests was 0.05. Quanto version 1.2.4 software was used to evaluate the statistical power.
The potential function of positive variants was analyzed using bioinformatics tools, including SNPinfo (https://snpinfo.niehs.nih.gov) (Xu and Taylor, 2009), Fathmm (http://fathmm.biocompute.org.uk) (Shihab et al., 2013), PolyPhen2 (http://genetics.bwh.harvard.edu/pph2) (Adzhubei et al., 2010), and MutationTaster2021 (Steinhaus et al., 2021).
Results
A total of 200 NSCL/P cases and 200 controls were recruited to study the relationship between the functional SNPs of BRCA2, MGMT gene, and NSCL/P in a Chinese population. All SNPs included in this study were genotyped in both the case and control groups and belonged to HWE in the control group. The results of the χ2 test showed that the frequency of BRCA2 rs11571836 G allele was statistically significant between the case and control groups (odds ratio [OR] = 0.6321, confidence interval [95% CI] = 0.4657-0.8579, adjusted p = 0.025). There was no positive result for the remaining SNPs (Table 2).
Comparison of Allele Frequencies of Eight Single Nucleotide Polymorphisms Between Case Group and Control Group
p Value was calculated by the χ2 test.
p Value was calculated by the χ2 test after Bonferroni correction. Significant p values are shown in bold.
95% CI, 95% confidence interval; Freq, allele frequency; OR, odds ratio.
Unconditional logistic regression analysis showed that rs11571836 was significantly associated with NSCL/P under codominant, dominant, overdominant, and log additive (codominant model: OR [G/A vs. A/A] = 2.78, 95% CI [G/A vs. A/A] = 1.79-4.33, OR [G/G vs. A/A] = 1.4, 95% CI [G/G vs. A/A] = 0.74-2.66, adjusted p < 0.0001; dominant mode: G/A-G/G vs. A/A, OR = 2.35, 95% CI = 1.56-3.54, adjusted p < 0.0001; overdominant model: G/A vs. A/A-G/G, OR = 2.60, 95% CI = 1.70-3.96, adjusted p < 0.0001; and log-additive model: OR = 1.53, 95% CI = 1.14-2.06, adjusted p = 0.035). The overdominant model was the optimal genetic model of rs11571836 (AIC = 521.1) (Table 3) with a power value of 96.39%.
Association Between Eight Single Nucleotide Polymorphisms and Nonsyndromic Cleft Lip With or Without Cleft Palate Under Different Genetic Models
Significant p values are shown in bold. Nominally significant p values are shown in italics.
p Value was calculated by the nonconditional logistic regression.
p Value was calculated by the nonconditional logistic regression after Bonferroni correction.
AIC, Akaike information criterion.
The rs144848 was correlated with NSCL/P under codominant, recessive, and overdominant models (codominant model: OR [G/T vs. T/T] = 1.51, 95% CI [G/T vs. T/T] = 0.99-2.31, OR [G/G vs. T/T] = 0.55, 95% CI [G/G vs. T/T] = 0.26-1.16, p = 0.017; recessive model: G/G vs. T/T-G/T, OR = 0.46, 95% CI = 0.22-0.95, p = 0.032; and overdominant model: G/T vs. T/T-G/G, OR = 1.64, 95% CI = 1.09-2.48, p = 0.018). However, after the Bonferroni correction, the p-value was >0.05. The codominant model was the optimal genetic model for rs144848 (AIC = 536.8) with a power value of 96.9%. Unconditional logistic regression results of the remaining SNPs were negative (Table 3).
In this study, the BRCA2 rs11571836 was found to be associated with NSCL/P, which is located in the 3′UTR. SNPinfo online software predicted potential miRNA binding sites and identified hsa-miR-1244, hsa-miR-1323, hsa-miR-562, and hsa-miR-633 as possible miRNAs bound by the BRCA2 gene (Table 4). Before Bonferroni correction, a missense mutation rs144848 was associated with NSCL/P. In addition, the potential biological effects of this missense mutation on protein function were predicted using Fathmm, PolyPhen2, and MutationTaster2021 software. The result of Fathmm was tolerated, while the result of PolyPhen2 and MutationTaster2021 was benign.
Potential Matching Sites for rs11571836 Using SNPinfo Software
Discussion
Our study revealed for the first time that BRCA2 was associated with NSCL/P. The rs11571836 G allele was protective against NSCL/P pathogenesis in a Chinese population (OR <1 and U95 < 1). Rs11571836 had a significant correlation with NSCL/P under four genetic models, with the overdominant model being the optimal genetic model. Rs144848 was associated with NSCL/P under three genetic models, and the codominant model was the optimal genetic model. However, rs144848 was not statistically different after the Bonferroni correction. Bioinformatics analysis identified hsa-miR-1244, hsa-miR-1323, hsa-miR-562, and hsa-miR-633 were potential binding miRNAs for rs11571836. Moreover, the missense mutation rs144848 may be a possible harmless mutation on the BRCA2.
With the development of genetics, there is an increasing number of studies on the association of common and complex genetic congenital diseases such as NSCL/P. In particular, the advent of genome-wide association studies has identified dozens of loci associated with the risk of NSCL/P (Avasthi et al., 2021; Beaty et al., 2010; Gowans et al., 2022; Mukhopadhyay et al., 2021), most of which are located in intronic or intergenic regions, but the clinical insights gained from these results are limited. In contrast, functional variants are the most likely to be associated with disease.
Association studies based on functional SNPs are relatively efficient for better understanding their function in complex diseases because they study a group of SNPs that are most likely to cause diseases (Carlton et al., 2006). Therefore, eight functional SNPs of BRCA2 and MGMT genes were selected for our study and the results were subjected to preliminary bioinformatics analysis to lay the foundation for further functional studies.
There are 27 exons in the BRCA2 gene, which is found on chromosome 13q12 (Lancaster et al., 1996). The BRCA2 gene is involved in maintaining genomic stability, especially in the homologous recombination pathway involved in DNA double strand repair. It encodes for the production of proteins that play an important role in regulating the replication of human cells, normal cell growth, and repair of DNA damage to genetic material (D'Alessandro et al., 2018; Roy et al., 2011).
The BRCA2 gene is mainly studied as an oncogene with its mutations associated with a range of malignancies, including breast, ovarian, prostate, and pancreatic cancers (King et al., 2003; Lee et al., 2021). The BRCA2 gene was specifically damaged in epithelial and/or ectodermal mesenchymal cells throughout the formation of the mouse craniofacial structure, resulting in abnormalities in the frontal nasal bone and maxilla (Kitami et al., 2018). BRCA2 played a key role in palatal cleft formation in mouse neural crest-derived ectodermal mesenchymal cells and was widely expressed in palatal epithelial and mesenchymal cells. Neural crest mice lacking BRCA2 in cells exhibit severe cleft palate. The neural crest-specific BRCA1 and BRCA2 double deletion animals showed a more severe cleft palate phenotype, indicating they functioned synergistically in palatogenesis in mice (Yamaguchi et al., 2021).
In NHW and Hispanics, BRCA2 rs206115 was modestly associated with NSCL/P (p = 0.01) and the other BRCA2 SNPs (rs144848 and rs9534342) showed borderline associations with NSCL/P (p = 0.05), but none was statistically significant after Bonferroni correction (Rodriguez et al., 2018). Chinese people made up our research sample, and the findings revealed a connection between rs144848 and NSCL/P. Codominant model was the best genetic model of rs144848, in which the risk of G/T is 1.51 times that of T/T, and G/G is the protective genotype compared with T/T.
However, rs144848 was also not statistically significant after Bonferroni correction. The key finding of this study was to determine the association between the BRCA2 rs11571836 and NSCL/P. The rs11571836 G allele frequency was statistically significant between the case and control groups and the G allele was protective against the incidence of NSCL/P. The overdominant model was the optimal genetic model for rs11571836, in which the risk of NSCL/P of G/A was 2.6 times higher compared with A/A-G/G. Our findings are also the same as those mentioned above, suggesting that cancer and NSCL/P may share a common genetic background (Dunkhase et al., 2016; Wang et al., 2016).
The rs11571836 is located in the 3′UTR of the BRCA2 gene. RNA stability, mRNA translation, and localization are among the activities that are regulated by the 3′UTR, which is situated downstream of the coding sequence (Steri et al., 2018). SNP at 3′UTR of gene can regulate the expression level of gene in both temporal and spatial dimensions, and control the translation process of corresponding protein (Jia et al., 2013; Kuersten and Goodwin, 2003). The 3′UTR is where most of the miRNA and mRNA binding sites are found.
Moreover, miRNAs can pair completely with a segment of target sequence on the 3′UTR of target genes to form a complex that induces gene silencing, inhibits translation, or promotes mRNA degradation, resulting in a decrease in the expression of target genes (Bartel, 2009). We predicted that hsa-miR-1244, hsa-miR-1323, hsa-miR-562, and hsa-miR-633 might be in a targeting relationship with the 3′UTR of BRCA2 gene. Therefore, the BRCA2 gene 3′UTR polymorphic locus rs11571836 may affect the progression of NSCL/P by modifying the targeted binding of corresponding miRNAs to mRNAs and thus regulating the mRNA expression level of BRCA2 gene.
The rs144848 is a missense mutation in BRCA2 gene. The amino acid type and sequence of the polypeptide chain are altered by the missense mutation, which occurs when the codon encoding one amino acid turns into the codon encoding another amino acid following base replacement. Missense mutations affect protein stability, hydrogen bond, kinetics, and activity, may or may not cause disease, and may affect susceptibility to disease and drug therapy (Gong and Blundell, 2010; Stefl et al., 2013).
Around 1.26% of human SNPs occur in protein-coding regions, of which 0.64% are nonsynonymous SNPs (including missense and nonsense mutations) (Gong and Blundell, 2010). Nonsynonymous SNPs are generally prevalent and seem to have no apparent functional significance, according to studies of human nonsynonymous SNPs (Ng et al., 2008). To assess the possible biological effects of mutations on protein function, we employed three types of software; however, all the outcomes were benign.
Although the efficacy of these techniques to predict variant pathogenicity is growing, grouping pathogenic from benign variants according to score does not further our knowledge of the underlying molecular processes in important disorders. At the same time, although a large number of disease-related missense variants can be found in publicly accessible databases, most of them are still of uncertain significance (Iqbal et al., 2020; Landrum et al., 2018). Therefore, it is of great interest to investigate the effect of missense mutations on NSCL/P by functional assay experiments.
A DNA repair protein, which is an effective direct DNA damage repair enzyme, is encoded by the MGMT gene, which has five exons and is located at 10q26. It participates in the cell's resistance to the toxicity of alkylating agents and the induced gene mutation, and prevents mismatch errors during DNA replication (Sharma et al., 2009). Glioblastoma has been the primary focus of research on the MGMT gene (Butler et al., 2020). In the association study between MGMT and NSCL/P, it was found that rs7922405 was associated with NSCL/P in Chinese people (Chen et al., 2018). However, rs7922405 is an intron variant; our study did not include the study of rs7922405, an intron SNP. Furthermore, no association between candidate functional SNPs of MGMT and NSCL/P was found in our study.
Conclusions
Our study identified for the first time the association of rs11571836 and rs144848 in the BRCA2 gene with NSCL/P in a Chinese population, but no statistical difference in rs144848 after Bonferroni correction was found. The results were also subjected to preliminary bioinformatics analysis. A bigger sample size might strengthen the validity of our findings since the number of cases and controls in our research was relatively limited. In addition, more cellular and animal studies are required for this work to investigate the potential role of the BRCA2 gene in the pathogenesis of NSCL/P.
Footnotes
Acknowledgments
We thank the study participants for their support of the study.
Authors' Contributions
S.G.: conceptualization (equal); funding acquisition (lead); investigation (lead); formal analysis (lead); writing—original draft (lead); and writing—review and editing (lead). Z.Z.: investigation (supporting); data curation (equal); and validation (equal). T.G.: investigation (supporting); writing—original draft (supporting); and resources (supporting). Y.X.: investigation (supporting); validation (equal); and resources (supporting). X.Y.: formal analysis (supporting) and data curation (equal). Y.W.: formal analysis (supporting) and data curation (supporting). R.C.: conceptualization (equal); resources (lead); and project administration (lead).
Author Disclosure Statement
No competing financial interests exist.
Funding Information
This study was supported by grants from Discipline Construction Fund of Beijing Stomatological Hospital Affiliated to Capital Medical University (Grant No. 19-09-09).
