Abstract
Abstract
SLC22A2 facilitates the transport of endogenous and exogenous cationic compounds. Many pharmacologically significant compounds are transported by SLC22A2, including the antidiabetic drug metformin, anticancer agent cisplatin, and antiretroviral lamivudine. Genetic polymorphisms in SLC22A2 can modify the pharmacokinetic profiles of such important medicines and could therefore prove useful as precision medicine biomarkers. Since the frequency of SLC22A2 polymorphisms varies among different ethnic populations, we evaluated these in South African Bantu speakers, a majority group in the South African population, who exhibit unique genetic diversity, and we subsequently functionally characterized promoter polymorphisms. We identified 11 polymorphisms within the promoter and 9 single-nucleotide polymorphisms (SNPs) within the coding region of SLC22A2. While some polymorphisms appeared with minor allele frequencies similar to other African and non-African populations, some differed considerably; this was especially notable for three missense polymorphisms. In addition, we functionally characterized two promoter polymorphisms; rs138765638, a three base-pair deletion that bioinformatics analysis suggested could alter c-Ets-1/2, Elk1, and/or STAT4 binding, and rs59695691, an SNP that could abolish TFII-I binding. Significantly higher luciferase reporter gene expression was found for rs138765638 (increase of 37%; p = 0.001) and significantly lower expression for rs59695691 (decrease of 25%; p = 0.038), in comparison to the wild-type control. These observations highlight the importance of identifying and functionally characterizing genetic variation in genes of pharmacological significance. Finally, our data for SLC22A2 attest to the importance of considering genetic variation in different populations for drug safety, response, and global pharmacogenomics, through, for example, projects such as the Human Heredity and Health in Africa initiative.
Introduction
G
SLC22A2 expression not only occurs mainly at the basolateral membrane of renal proximal tubule cells in the kidney but is also found in the placenta, skin, spleen, lung, inner ear, small intestine, thymus, and brain (Koepsell et al., 2007). SLC22A2 facilitates the transport of endogenous and exogenous cationic compounds (Table 1), with one major physiological function being the removal of these from the circulation into renal epithelial cells of the kidney, where they are finally excreted into the urine (Motohashi et al., 2002). As such, SLC22A2 plays a significant role in the pharmacokinetic profiles of cationic drugs.
NSAID, nonsteroidal anti-inflammatory drug.
Genetic polymorphisms in the SLC22A2 coding region have been identified and characterized in various ethnic populations and these have shown that indeed some coding region polymorphisms result in missense mutations that can significantly alter transporter function of SLC22A2 substrates (Fukushima-Uesaka et al., 2004; Leabman et al., 2002; Wang et al., 2007). In comparison, noncoding and promoter polymorphisms have been less well characterized and their biological effects are more difficult to ascertain (Ogasawara et al., 2008), but could still have a significant impact on SLC22A2 function.
While the genetic diversity among African populations is greater when compared to other populations, there are relatively few studies regarding genetic variation in drug transporter genes in South Africans, despite the fact that populations within South Africa have unique genetic profiles and allele frequencies, which can differ considerably from other African and non-African populations (Gurdasani et al., 2014; Ramsay, 2012; Tishkoff et al., 2009; Warnich et al., 2011). A major factor for these differences is the relatively high proportion of Khoisan admixture in these populations (Barbieri et al., 2014; Busby et al., 2016; Pickrell et al., 2012, 2014). Determining genetic polymorphisms with an aim to understanding their effect on transporter efficacy in the South African Bantu-speaking population is significant in the context of precision/personalized treatment.
The profile of drugs that SLC22A2 transports is very important for the South African population as these include the most commonly prescribed antidiabetic drug, metformin; the antimalarial, quinine; and the antiviral, lamivudine (Table 1). Therefore, identifying and characterizing genetic variation could have a significant impact on understanding patient response to these drugs.
In this study, we identified genetic polymorphisms within the SLC22A2 promoter and coding regions in the majority Bantu-speaking South African population and subsequently functionally characterized two promoter polymorphisms.
Materials and Methods
Research ethics statement
Permission to carry out this study was granted from the Human Research Ethics Committee (Medical) of University of the Witwatersrand, Johannesburg, #M10745 and #M120640. A written informed consent was obtained from all subjects.
Identification of genetic polymorphisms in the SLC22A2 gene
Genetic polymorphisms were identified within SLC22A2 (GRCh38: ENSG00000112499) promoter region (6:160260216–6:160258477) and coding regions (6:160216762–6:160259016), using publically available databases from various studies examining genetic variation within Bantu-speaking individuals. These included the whole-genome sequencing study of 40 Bantu-speaking individuals (personal correspondence with N. Carstens, 2016; unpublished data) and 100 Zulu individuals (Gurdasani et al., 2014).
We performed Sanger sequencing for the SLC22A2 promoter on 10 Bantu-speaking individuals to isolate and subsequently characterize two promoter polymorphisms that we determined could be of functional significance. We considered the recent study by Jacobs et al. (2015) that evaluated specifically the single-nucleotide polymorphisms (SNPs) in SLC22A2 in 96 Xhosa individuals. We compared minor allele frequency (MAF) values for known polymorphisms from the South African data to the overall MAF obtained from the 1000 Genomes Project (The 1000 Genomes Project Consortium, 2015). We compared these values to other African populations and to the overall MAF values excluding the African population, which were calculated to determine allele frequencies within the non-African populations (i.e., European, American, South Asian, and East Asian). These comparisons allowed us to determine how similar or dissimilar the South African Bantu-speaking population is to other non-African populations and to other African populations, since South Africa is not represented in these groups (Table 2).
HC, high coverage; LC, low coverage; n, number of subjects; WGS, whole-genome sequencing.
Prediction of effect of polymorphism on expression and function
We screened the polymorphisms identified in the coding region using the Variant Effect Predictor tool (McLaren et al., 2016). Potential deleterious effects were predicted using the Sorting Intolerant From Tolerant (SIFT) and Polymorphism Phenotyping (Polyphen) programs through the Ensembl Genome Browser (Adzhubei et al., 2010; Ng and Henikoff, 2003). Modifications to potential transcription factor binding sites within the SLC22A2 promoter were predicted using ALGGEN PROMO version 3.0.2 (Farré et al., 2003; Messeguer et al., 2002), through which the wild-type or mutant alleles with 15 base-pair flanking sequences were screened for whether the polymorphisms could modify putative transcription factor binding.
Functional characterization of the SLC22A2 promoter polymorphisms
We performed Sanger sequencing on 10 Bantu-speaking individuals to isolate and functionally characterize the promoter polymorphisms rs59695691 and rs138765638, which bioinformatics analysis predicted to affect transcription factor binding. Ethics approval for this study was obtained from the Human Research Ethics Committee (Medical) of the University of the Witwatersrand, Johannesburg #M10745 and #M120640.
First, DNA was extracted from blood using a standard salting-out method, and then, quality was assessed using gel electrophoresis and quantified using a NanoDrop 1000 spectrophotometer. Next, we designed primers to amplify across a 1718 bp region of the SLC22A2 promoter (−1421 to +277 relative to the translation start site). Polymerase chain reaction (PCR) products were sequenced three times in both the forward orientation and reverse orientation to confirm the presence of the polymorphisms (Inqaba Biotechnical Industries).
For cloning into the pGL4.10 luciferase reporter vector (Promega), these primers were redesigned to incorporate NheI and EcoRV restriction enzyme sites to allow for directional cloning into pGL4.10: forward primer, OCT2FNhe 5′-ATAA
Recombinant vectors were identified through colony PCR with primers designed to amplify across the vector multiple cloning site pGL4.10FWD 5′-CTAGCAAAATAGGCTGTCCC-3′ and pGL4.10REV 5′-TTCATGGCTTTGTGCAGCT-3′ using Kapa Taq ReadyMix (Kapa Biosystems) with the following cycling conditions: 95°C for 2 min; 25 cycles of 95°C for 30 sec, 57°C for 30 sec, 72°C for 2 min; 72°C for 2 min, and a 4°C hold. A standard alkaline lysis plasmid extraction protocol was used to isolate the vector from transformed cells that were cultured for 16 h in LB broth supplemented with 100 μg/mL ampicillin. The three different SLC22A2 promoter sequences (wild type, rs59695691, and rs138765638) were confirmed by sequencing and alignment with the reference sequence before analysis by luciferase assay. The wild-type promoter sequence used as the control is the one that appears in Ensembl for SLC22A2 (ENSG00000112499; GRCh38; 6:160260216—6:160258477).
Next, the luciferase assay was performed to determine expression levels of luciferase from the wild type and SNP containing SLC22A2 promoter vectors. The MRC-5 human fetal lung fibroblast cell line was maintained in a humidified atmosphere of 5% CO2 and 95% air at 37°C and cultured in Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum with 1% penicillin–streptomycin. Cells were seeded into 96-well culture plates at a density of 12,000 cells per well. After 24 h, when the cells had reached ∼80% confluency, cells were transfected with 200 ng of vector using the TurboFect transfection reagent (Thermo Fisher Scientific).
An empty (promoterless) pGL4.10 vector was used as a negative control. After 24 h, the plates were removed from the 37°C incubator, left to stand at room temperature for 15 min, and then 100 μL of the Steady-Glo® luciferase assay substrate (Promega) added to each well. After 5 min at room temperature, each well was mixed by pipetting and transferred into a white 96-well plate.
The expression level of luciferase, in the form of light, was measured using a GloMax 96 microplate luminometer (Promega). The expression of luciferase was determined for each vector and these were compared to the wild-type SLC22A2 promoter. Statistical significance was analyzed on n = 4 using a paired t-test, and p < 0.05 considered to be statistically significant.
Results
Genetic polymorphisms within the SLC22A2 promoter region
Overall, a total of 11 genetic polymorphisms were identified within the SLC22A2 promoter region in the Bantu-speaking population (Table 3). We then screened each polymorphism, with a 15 bp flanking sequence on either side, using the ALGGEN PROMO program, which identifies potential alterations in transcription factor binding sites. In this way, we determined whether each promoter polymorphism may have the potential to alter gene expression levels. We identified two polymorphisms that may influence the binding of transcription factors to the DNA.
Bantu-speaking individuals-40 whole genome (personal communication with N. Carstens, 2016; unpublished data).
Zulu individuals (Gurdasani et al., 2014).
African populations from the 1000 Genomes Project Phase 3.
All individuals from the 1000 Genomes Project Phase 3: African, American, East Asian, European, and the South Asian.
Indicates position relative to the translation start site.
—, either there is no information available or the variation could not be identified; MAF, minor allele frequency; n, number of individuals; SA, South African; SNP, single-nucleotide polymorphism; WT, wild type.
First, the SNP rs59695691, which is found at −195 relative to the translation start site, lies in the binding site for TFII-I, an integral member of the basal transcription machinery (Roy, 2001). Second, rs138765638, a three base-pair deletion, was predicted to modify transcription factor binding of c-Ets-1/2, Elk1, and/or STAT4. We then functionally analyzed these polymorphism promoters by cloning these promoter regions into luciferase reporter vectors. We observed a significant decrease of 25% in luciferase expression from the rs59695691-containing promoter pGL4.10 vector, and a significant increase of ∼37% in luciferase expression for rs138765638, in comparison to the wild-type control (Fig. 1).

Functional characterization of the effects of rs59695691 and rs138765638 on luciferase reporter gene expression. The rs59695691-containing vector, which carries the T>C base change at −195 (−195-SNP), showed a decrease in luciferase reporter expression (relative light units), while the rs138765638-conatining vector with the CTT deletion showed an increase in expression, in comparison to the wild-type control (WT), n = 4 ± SD. SD, standard deviation; SNP, single-nucleotide polymorphism.
Genetic polymorphisms within the SLC22A2 coding regions
A total of nine SNPs were identified within the SLC22A2 coding region in the South African Bantu-speaking population (Table 4). Five SNPs were found to be synonymous with the rs772717144 (Val94Val) and rs58264151 (Gly370Gly) variants identified only in the South African Bantu-speaking populations, although at low frequency, and were not seen in any of the 1000 Genomes Project populations. The polymorphism rs624249 (Thr130Thr) was found at a lower frequency compared to the African and non-African populations, which exhibit similar MAF values, while rs316003 (Val502Val) showed a notably higher allele frequency in the African population in comparison to the South African populations in this study and to the non-African population. Four SNPs were identified as nonsynonymous, missense mutations, these being rs316019 (Ser270Ala), rs8177516 (Arg400Cys), rs8177517 (Lys432Gln), and rs139045661 (Ile552Asn). The MAF of Ser270Ala was found to be similar to all other populations. Interestingly, Arg400Cys, Lys432Gln, and Ile552Asn all appear in the South African populations in our study, and seem to be prevalent in African over non-African populations in general.
Bantu-speaking individuals (personal communication with N. Carstens, 2016; unpublished data).
Zulu individuals (Gurdasani et al., 2014).
African populations from the 1000 Genomes Project Phase 3.
All individuals from the 1000 Genomes Project Phase 3: African, American, East Asian, European, and the South Asian.
These missense SNPs have been functionally characterized by Leabman et al. (2002).
—, there is no information available; n, number of individuals; WT, wild type.
Discussion
Genetic polymorphisms within SLC22A2 have been shown to change the expression level or activity of the protein and these have been documented to modify the pharmacokinetics of SLC22A2 substrates (Ieiri et al., 2006). Many pharmacologically significant drugs utilize SLC22A2 for transport and these include the some of the most commonly prescribed antidiabetic, antiviral, and anticancer drugs, including quinine for malaria and lamivudine for HIV, which are of considerable importance for the South African population (Blumberg, 2015; Kimura et al., 2005; Lazarus et al., 2013). Due to the unique level of genetic diversity within the African continent, along with advancement in technologies that allow for personalized medicine, there has been an increased drive to identify and characterize genetic variation in African populations (Ramsay, 2012).
Recent large-scale genome-sequencing projects are providing a more comprehensive view of African genomic diversity; the African Genome Variation Project has generated 300 whole-genome sequences from Eastern and Southern African populations (Gurdasani et al., 2014), and more such data are expected from ongoing projects such as the Southern African Human Genome Programme, the Human Heredity and Health in Africa (H3Africa) Project, and the TrypanoGEN study. We focused on the identification of genetic polymorphisms in the SLC22A2 promoter and coding regions in the South African Bantu-speaking population, as these are yet to be identified and characterized, especially since this gene seems to exhibit considerable population- specific genetic variation that could impact on protein function and hence patient drug response.
Polymorphisms in the promoter region of a gene can have a significant impact on gene expression levels through various mechanisms, including the modification of transcription factor binding sites. We identified and subsequently functionally characterized key promoter polymorphisms. We found that 6 of the 11 polymorphisms identified had frequencies that ranged from 1.4% up to 9.5% in the South African Bantu-speaking populations in this study, while in non-African populations these appeared at less than 1%. In comparison to the African population, some of these were similar but some were dissimilar.
We also found that SNP rs113384645 was observed only in the South African population and not in any of the 1000 Genomes Project populations. Of the remaining four polymorphisms that displayed an MAF of greater than 0.01 in the non-African population, the polymorphisms rs138765638 and rs3127573 displayed considerably lower MAF in non-Africans, in comparison to the African and South African Bantu-speaking populations, whereas the polymorphisms rs66512417 and rs316023 were notably lower MAFs in the African and South African populations, in comparison to the non-African populations.
We then considered the potential functional effect of two of the promoter polymorphisms, rs59695691 and rs138765638. We focused on these since bioinformatics analysis suggested that the transcription factor binding sites could be modified by these two polymorphisms. The SNP rs59695691 lies in the binding site for TFII-I, member of the basal transcription machinery that assembles around the core promoter. TFII-I also acts as an activator that binds to upstream regulatory regions and thereby facilitates communication between these upstream transcriptional regulatory sites and the core promoter (Roy, 2001). ALGGEN PROMO predicted that the TFII-I site 5′-GGACA/
We observed these polymorphisms in the South African populations with an MAF 0.085 for the Zulu-only study and 0.025 in the South African Bantu study, while Jacobs et al. (2015) identified this SNP with an MAF of 0.263 in the Xhosa population (n = 96). In other African and non-African populations, rs59695691 was observed at 0.023 and <0.01, respectively. As such, we decided to functionally characterize this SNP using the luciferase reporter assay, considering the fundamental role TFII-I has in transcription. As anticipated, luciferase reporter gene expression decreased by a significant 25% from this SNP-containing promoter region.
We also analyzed rs138765638, a 3-bp deletion, since ALGGEN PROMO predicted that the binding of the transcription factors c-Ets-1/2, Elk1, and/or STAT4 could be affected. While there is relatively little characterization of the transcription factors that regulate SLC22A2 expression, STAT4 and SLC22A2 have overlapping expression patterns, such as in parts of the brain. We found that this deletion polymorphism, which was identified with an MAF of 0.21 in the South African Zulu population, 0.138 in Africans, and at 0.084 in non-Africans, showed a significant 37% increase in luciferase reporter gene expression.
While the physiological effects of promoter polymorphisms can be difficult to ascertain, they have previously been shown to affect SLC22A2 gene expression, and as such may affect individuals carrying these gene variants. Ogasawara et al. (2008) identified a functionally significant genetic polymorphism in the SLC22A2 promoter in 63 Japanese nephrectomized patients. An AAG deletion at position −578 to −576 (−578_-576delAAG) was shown to significantly reduce the activity of the SLC22A2 promoter and heterozygotes of this deletion were shown to have lower SLC22A2 mRNA levels.
With regard to coding region polymorphisms, we identified a number of synonymous mutations with high MAF values, as well as being considerably different to other African and non-African populations. While it is difficult to determine the role of such synonymous genetic variations, their effect on SLC22A2 cannot be discounted as they may alter protein expression and function through pretranscriptional, posttranscriptional, and posttranslational modifications, thereby influencing patient drug response (Sauna and Kimchi-Sarfaty, 2011).
We also identified four missense mutations in the coding region of SLC22A2. The MAF of rs316019 (Ser270Ala) was found to be similar to all other populations, and since it is found in up to 14% of the population, it has been the subject of a number of functional characterization studies (Leabman et al., 2002; Tzvetkov et al., 2009; Zolk, 2009). However, whether this amino acid change alters the function of SLC22A2 seems to depend on the specific cationic compound tested. For example, the renal clearance of metformin, which is transported by SLC22A2, seems to also be contradictory in various studies, with some reporting higher and others lower renal clearance, although study design and population ethnicity may play a significant role in these conflicting results (Chen et al., 2009; Song et al., 2008).
In addition, while mutations tend to be seen as negative traits, this Ser270Ala variant has also recently been associated with protective roles against cisplatin-induced nephro- and oto- toxicity, which are serious side effects of cisplatin chemotherapy (Filipski et al., 2009; Lanvers-Kaminsky et al., 2015). For rs8177516 (Arg400Cys) and rs8177517 (Lys432Gln), which do not appear in non-African populations, we found these at between 1.3% and 4.5% in the African and South African populations. Previous functional characterization of these SNPs has shown that they deleteriously affect transporter function (Leabman et al., 2002). Finally, for rs139045661 (Ile552Asn), which also has a prevalence of less than 1%, there has not been any functional characterization as yet, although SIFT and PolyPhen programs predict this amino acid change to be tolerated.
In the context of precision medicine, our results could have a considerable impact on patient treatment in South Africa as well as for global science of rational therapeutics. Our data for SLC22A2 raise the issues of considering genetic variation in different populations for the purposes of assessing drug safety and for drug response biomarkers, which further highlights the importance of projects such as the H3Africa initiative.
The role that pharmacogenetics will play in future global health must consider many factors, not least, the best course of clinical implementation, especially in resource-poor settings (Ozdemir et al., 2015). In addition, social and political science research on the ways in which precision medicine is currently emerging in resource-limited settings, shaping the local communities, and the dependency of knowledge-based innovations on the context, power, and funding warrant further attention and interdisciplinary scholarship (Birch and Tyfield, 2013; De Vries, 2004; Petersen, 2013). With the continuing rise of precision medicine, we envisage that studies such as ours will enable patients in developing countries to benefit from specifically targeted treatment regimens.
Conclusions and Expert Outlook
Our study highlights differences and similarities between the South African Bantu-speaking population and other African and non-African populations. This was evident with the promoter polymorphisms that we functionally characterized and the missense mutations that appear at frequencies that are considerably different to other populations. In the age of precision medicine and considering the important role of SLC22A2 in transporting drugs of considerable clinical significance, our finding warrants further investigation into the possible in vivo effects, especially during drug therapy where these polymorphisms may have additional confounding effects (Jung et al., 2008).
Furthermore, while many studies look at the pharmacokinetics of exogenous compounds, the effects of altered transporter function on endogenous substrates must also be taken into account, where, for example, decreased clearance of the neurotransmitters, norepinephrine or serotonin via SLCA22A2, may lead to changes in mood-related behavior (Bacq et al., 2012).
In all, our research reinforces the fact that the genetic diversity in South Africans is distinct from other African populations and contributes to understanding genetic variation in SLC22A2 in South African populations, which has important consequences for rational drug prescription and healthcare provided in South Africa as well as globally.
Footnotes
Acknowledgments
This study was funded by the University of the Witwatersrand, Johannesburg, and the National Research Foundation of South Africa with a grant to DMD (Grant No. 90710). NCW was supported by an NRF Masters Scholarship. A.C. was supported by the National Human Genome Research Institute of the National Institutes of Health under Award Number U54HG006938 (H3Africa Consortium). Any opinion, findings and conclusions, or recommendations expressed in this publication are those of the authors and do not necessarily reflect those of the authors' affiliated institutions or the funders.
Author Disclosure Statement
The authors declare that no conflicting financial interests exist.
