Abstract
Abstract
Mannose binding lectin (MBL) is a pathogen pattern recognition protein involved in antimicrobial activities. Variation in MBL2 gene has been extensively implicated in differential outcomes of infectious diseases in studies conducted outside Africa, but virtually very little is known on the role of this candidate gene in the African continent. We investigated human genetic variations in MBL2 in a Zimbabwean pediatric population and their putative associations with HIV infection in perinatally exposed children. One hundred and four children aged 7 to 9 years comprising 68 perinatally exposed to HIV (32 who were born infected and 36 who were uninfected) and 36 unexposed controls were recruited. DNA samples were genotyped for MBL2 polymorphisms using PCR-RFLP and sequencing. HIV infected children had markedly variable and significantly lower mean height (p=0.03) and weight (p=0.005) when compared to the uninfected children. Using all samples, frequencies for MBL2 genetic variants for the Zimbabwean population were calculated. Twelve single nucleotide polymorphisms were observed and minor alleles occurred with the following frequencies: −550C>G (G: 0.02), −435G>A (A: 0.08), −428A>C (C: 0.39), −394A>G (A: 0.39), −328AGAGAA ins/del (AGAGAA ins: 0.44), −245G>A (A: 0.05), −221C>G (C: 0.12), −111A>T (T: 0.10), −70C>T (C: 0.46), +4C>T (C: 0.45), novel −595G>A (A: 0.02), and 170G>A (0.24). We found that the MBL2 +4T variant displayed a trend for association with reduced risk of HIV transmission from mother-to-child but the remaining vast majority of the genetic markers did not show a significant association. We conclude (1) the MBL2 gene is highly polymorphic in the Zimbabwean population, and (2) MBL2 genetic variation does not appear to play a major role in influencing the risk of mother-to-child HIV transmission in our study sample. These observations contest the hitherto significant role of this candidate gene for HIV transmission from mother-to-child in non-African populations and thus, further speak to the limits of extrapolating genomic association studies directly to the African populations from studies conducted elsewhere. It is hoped that more OMICS research in a diverse set of African countries can shed further light on the putative role (or the lack thereof ) of this candidate gene in HIV transmission in the continent, a major global health burden in Africa.
Introduction
M
Genetic variation in the promoter and exon 1 of the MBL2 gene affects expression and oligomerization of MBL, leading to differential susceptibility to HIV infection and disease progression (Mangano et al., 2008; Singh et al., 2008). Distribution of MBL2 allelic variants differs among populations, for example, of the exon 1 SNPs implicated in HIV infection and disease progression, African populations predominantly carry the C allele, whilst Caucasian populations mainly present with the B allele (Lipscombe et al., 1992; Madsen et al., 1998). However, the distribution of MBL2 variants and their possible role in HIV infection and neurocognitive function is not well described among African populations. Identifying genetic variants that influence HIV transmission and disease progression may help predict disease course, guide therapy, and provide potential therapeutic targets (Singh and Spector, 2009). This study therefore describes the variations in the MBL2 gene and its possible role in risk of HIV infection in children born to HIV-infected mothers. We further discuss the extent to which MBL2 genetic variation might play a role (or not) in HIV transmission from mother-to-child in the African continent.
Methods
Study participants
Participants were recruited from the Better Health for the African Mother and Child (BHAMC) cohort, a longitudinal study of mother–child pairs followed up since 2002 at three Harare peri-urban clinics in Epworth, St Mary′s Chitungwiza, and Seke North. One hundred and four (N=104) unrelated children aged 7 to 9 years were of bantu African origin (Kurewa et al., 2010). Participants included 32 children perinatally-infected with HIV (EI) and 72 healthy HIV-uninfected children comprising of 36 exposed to HIV in utero but not infected (EU), and 36 unexposed and uninfected (UEUI) were recruited as controls. All HIV-infected mothers of children included in the study were administered a single dose of 200 mg nevirapine at delivery of the children to prevent mother-to-child transmission of HIV. A 5 mL blood sample was collected from each child for CD4+ T-cell count and genotyping purposes. The demographic characteristics of the recruited children were captured from their medical records. Written informed consent was obtained from each child's parent or legal guardian before the samples were collected. The study received ethics approval from the Medical Research Council of Zimbabwe and the University of Cape Town, Faculty of Health Sciences Research ethics committee.
Genotyping for MBL2 genetic variants
Genomic DNA was extracted from blood samples using the Nucleospin® Blood L kit (Macherey-Nagel, Germany) according to the manufacturer's instructions. Primers for amplification of genomic DNA regions were designed using Primer 3 online package http://frodo.wi.mit.edu/, NCBI Primer Blast and OligoAnalyzer SciTool (Integrated DNA Technologies®; http://eu.idtdna.com) bioinformatics tools. To investigate genetic variation in the promoter and exon 1 of MBL2 gene, a 1187 bp fragment was amplified by polymerase chain reaction (PCR) (Fig. 1), and subsequently sequenced. The PCR contained 10 picomoles of each of the sense (5′-AGGCTGCTGAGGTTTCTTAGG-3′) and antisense (5′-ATGCCAGAGAATGAGAGCTGA-3′) primers, 200 μM dNTPs (Bioline, UK), 1X GoTaq Flexi Green buffer (Promega, USA), 1.5 mM MgCl2 (Fermentas, Canada), 1U Taq polymerase (Fermentas), and 50 ng genomic DNA in a total volume of 25 μL. The cycling conditions were as follows; initial denaturation at 94°C for 5 min, followed by 35 cycles of denaturation at 94°C, primer annealing at 64°C, and primer extension at 72°C for 30 sec at each step. The reaction was completed by an extension step at 72°C for 5 min. The PCR products were further purified using alkaline phosphatase (FastAP™) (Fermentas) and exonuclease I, and then sequenced including capillary electrophoresis on ABI® PRISM 3130 Genetic Analyser (Life Technologies, Grand Island, NY). Alignment of sequences with a reference sequence was performed using the Lasergene® 10 software (DNASTAR®, Madison, WI).

Schematic representation of the MBL2 gene. The top panel shows the full gene structure whilst the bottom panel zooms in on the region that was investigated in this study. The arrows indicate the positions of the SNPs that were genotyped and the genotyping methods employed.
Statistical analysis
Genotype and allele frequencies in the HIV exposed infected (EI), exposed uninfected (EU), and unexposed (UE) children cases and controls were calculated using Stata 11.2 (StataCorp LP, College Station, TX) and/or SHEsis online version (Yong and Lin, 2005). All the samples were combined to calculate the frequencies of the MBL2 variants in the Zimbabwean population. Conformation to Hardy Weinberg Equilibrium (HWE) was determined. Chi-square and Fisher's exact tests were also used to evaluate the effects of MBL2 genotypes and alleles on HIV and disease status. p<0.05 was considered statistically significant. Pairwise Linkage Disequilibrium (LD) analyses between MBL2 SNPs was carried out using SHEsis (Li et al., 2009). Lewontin's D′ value and r2 were used to quantify the level of LD. Haplotypes were inferred using the expected maximisation algorithm in SHEsis.
Results
Demographic characteristics
HIV infected (EI) children presented with significantly lower mean height (cm) of 117.11±6.2 SD (p=0.03) and lower mean weight (kg) of 20.31±1.99 SD (p=0.005) when compared to uninfected children (exposed uninfected+unexposed and uninfected) with 120.37±6 SD and 22.35±3.01 SD, respectively, as shown in Table 1. This indicated possible slowed growth due to HIV/AIDS-related challenges.
EI-HIV, exposed and infected; EU, exposed but uninfected; N/A, non-applicable; UEUI, unexposed and uninfected.
MBL2 genetic polymorphism distribution and HIV infection
All 104 participants were used to calculate frequencies for MBL2 genetic variants to impute the likely frequencies in the Zimbabwean population in general. 12 single nucleotide polymorphisms (SNPs) were observed and the minor alleles occurred with the following frequencies; −595G>A (A: 0.02), −550C>G (G: 0.02), −435G>A (A: 0.08), −428A>C (C: 0.39), −394A>G (A: 0.39), −328AGAGAA ins/del (AGAGAA ins: 0.44), −245G>A (A: 0.05), −221C>G (C: 0.12), −111A>T (T: 0.10), −70C>T (C: 0.46), +4C>T (C: 0.45), novel −595G>A, and 170G>A (0.24). Two previously reported SNPs, 154C>T (rs5030737) and 161G>A (rs1800451), were monomorphic in the Zimbabwean population. However, none of the SNPs investigated was associated with differential susceptibility to HIV infection, although the MBL2 +4T variant appeared to show a trend towards association with reduced risk. Table 2 shows the genotype and minor allele frequencies of the SNPs detected. All SNPs conformed to the Hardy-Weinberg Equilibrium.
F, failed samples; MA (Freq), minor allele frequency; N, total number of samples genotyped; N.t, nucleotide; wt, wild type allele; mt, mutant allele.
SNPs were further genotyped in the samples remaining after sequencing; *wt=refers to the starting allele as indicated in the nucleotide base substitution column, and **mt=to the second allele. This designation has nothing to do with functional significance.
We report a novel G>A variation at position −595 upstream of the exon 1 start site (MBL2 −595G>A) which occurred with a frequency of 2% for the −595A allele (Fig. 1). No homozygous −595A/A genotype was observed. Functional analysis of −595G>A variants done using a predictive bioinformatics software called TFSearch (Heinemeyer et al., 1998) showed that the MBL2 −595G>A position does not carry any transcription factor binding site regardless of the presence of G or A allele.
In addition to the MBL2 −595G>A, distribution of genotypes of four SNPs previously implicated in HIV infection and disease progression in the literature (−550C>G, 221C>G, +4C>T, and 170G>A) were also compared between HIV EI and EU children (Table 3). None of the MBL2 genotypes showed a significant difference in distribution between HIV EI and EU children. Pairwise LD calculation for the five MBL2 SNPs; −595G>A, −550C>G, −221C>G,+4C>T, and 170G>A in the HIV uninfected (EU+UEUI) children showed even distribution of strong and weak pairwise LD (Fig. 2). Strong LD limited the number of MBL2 haplotypes inferred from the SNPs −550C>G, −221C>G, +4C>T, and 170G>A. The mutant alleles of MBL2 exon 1 nonsynonymous SNPs 170G>A (G57E), 154C>T (R52C), and 161G>A (G54D) are alternatively termed “C,” “D,” and “B,” respectively, whilst the promoter and 5′-UTR variants −221C/G and +4C>T are referred to as; −221X/Y and +4P/Q, respectively (Lipscombe et al., 1992; Madsen et al., 1998; Sumiya et al., 1991). This alternative nomenclature is commonly used to describe MBL2 haplotypes (Fig. 3). Seven haplotypes with respect to the −550C>G, −221C>G, +4C>T, and 170G>A SNPs were observed, three being reported for the first time (HYQA, HYPC, and HXPA) (Fig. 3). None of the haplotypes' distribution was significantly different between the HIV groups.

Linkage disequilibrium plot of MBL2 SNPs. The plot shows pairwise LD of MBL2 variants in the promoter and exon1. The numbers in the boxes are percentage D′ values indicating strength of LD.

Frequency of MBL2 haplotypes in the Zimbabwean population and how they compare to other populations (Madsen et al., 1998).
EI-HIV, exposed infected; EU-HIV, exposed uninfected; N/A, non-applicable; UEUI- HIV, unexposed; OR (95%CI), odds ratio (95% confidence interval).
Discussion
Deficiency in MBL protein, which is mainly due to reduced gene expression and poor oligomerization, has been linked with increased susceptibility to HIV (Boniotto et al., 2000; Garred et al., 2006). We describe MBL2 genetic variation among Zimbabweans and discuss their possible role in differential susceptibility to HIV infection in utero.
The frequency of MBL2 +4C (P) variant (45%) observed in our study population is comparable to what has been reported among other African populations (38%–50%) but higher than the frequency observed in Caucasian (17%) and Asian populations (12%) (Mangano et al., 2008; Ou et al., 2011; Thye et al., 2011). The difference in the frequency of the +4C allelic variant in different populations may be a pointer to possible molecular causes of the observed geographical differences in HIV prevalence. However, this is somewhat contradicted by the lower frequency of the +4C variant among Asians (12%) compared to Caucasians (17%), yet HIV prevalence is much higher in the former group (Ou et al., 2011; Singh et al., 2008). This could be due to the fact that HIV susceptibility is influenced by a multitude of factors of which MBL2 variation could be a constituent player. Moreover, presence of the +4C allelic variant has been associated with mild downregulation of MBL2 gene expression and is often overshadowed in the presence of the MBL2 −221C allele (Israëls et al., 2012). MBL2 +4C and −221C are in strong LD, thus, presence of +4C allele is unlikely to cause a detrimental reduction in antimicrobial activity of MBL protein. MBL2 −221C allele has been reported to significantly reduce serum MBL. However, frequency of the −221C allele (12%) in Zimbabweans fell in the same range with what has been reported among other African populations (12%–17%), Caucasians (17%), and Asians (15%) (Ou et al., 2011; Singh et al., 2008; Thye et al., 2011), casting doubt on its importance in the interindividual differences in susceptibility to HIV infection and distribution.
The frequency of MBL2 170A allele (C) observed in the Zimbabwean population (24%) is comparable what has been reported in other African populations (20%–30%) (Lipscombe et al., 1992; Thye et al., 2011), but higher than in both Asian (Ou et al., 2011) and Caucasian populations (Lipscombe et al., 1992). Neither D nor B alleles were observed in the Zimbabweans, yet the D allele occurs in 15% of Caucasian and 25% Asian populations whilst the B is rare in most populations (<5%) (Chen et al., 2009; Lipscombe et al., 1992). The opposing frequency distribution between MBL2 C and B variants between Caucasians and Africans is unlikely to explain the differences in HIV prevalence because the effects of the B and C alleles on MBL oligomerization is thought to be almost similar (Israëls et al., 2012). The resulting amino acid changes disrupt the α-helical structure of the MBL polypeptide chain, thus interfering with the formation of functional oligomers (Eisen and Minchinton, 2003). This reduces the antiviral activity of MBL and may increase susceptibility to a wide range of diseases, nonetheless, on a similar scale for both B and C alleles. We therefore speculate that MBL2 genetic variation may not solely explain differences in HIV distribution in world populations. This is supported by the findings on BST-2 (tetherin) differential genetic variation in Southern African populations, which is thought to affect or mirror the HIV-1 infection prevalence (Skelton et al., personal communication).
Our observations are contradicted by earlier reports where MBL deficiency variants have been identified as strong risk factors for HIV infection when compared to MBL sufficiency variants (Boniotto et al., 2003; Kuhn et al., 2006). Others have reported accelerated disease progression in individuals carrying MBL deficiency variants (Mangano et al., 2008; Singh et al., 2008). However, there are several explanations for the failure to observe any association between MBL2 variation and HIV infection. Distribution of MBL2 genetic variants is likely to have been influenced by other micro-organisms more ancient than HIV. For example, the MBL2 170A (C) allele, which results in MBL deficiency, may have been selected for in African populations because it protects the host against MBL-mediated Mycobacterium africanum (M. africanum) infection (Søborg et al., 2007; Thye et al., 2011). MBL-mediated opsonization is used by some intracellular organisms such as M. africanum to enter host cells. Moreover, the lectin (MBL) pathway of the complement system can also be activated by two other pathways (Endo et al., 2006); hence reduction in MBL-induced complement activation may not be detrimental to life (Mangano et al., 2008), thus may have a negligible effect on HIV outcomes.
The distribution of MBL2 haplotypes, like that of the individual SNPs differs among world populations. Frequencies of MBL2 haplotypes in the Zimbabwean population studied were comparable to those reported among Kenyans but differed from Caucasians and Argentinians (Fig. 3) (Madsen et al., 1998). The HYA containing haplotypes, which have been linked with high secretion of MBL (Jensen et al., 2005), were present at very low frequencies in both Zimbabwean and Kenyan populations whilst they are present at high frequencies in Caucasians and Argentinians (Madsen et al., 1998). The differences in haplotypes may account for the higher prevalence of infectious diseases such as HIV, hepatitis, and TB among Africans when compared to other populations.
Our study makes a contribution towards understanding host genetic variation with regard to HIV infection among Africans, particularly, Zimbabweans. However, these observations must be interpreted cautiously as the study was carried out on a limited sample size whose participants may also have suffered “survivor bias”, since only those children who were alive at the time the cross-sectional study was undertaken (7–9 years after birth). The study population was limited to individuals alive after close to a decade of follow-up; information on the genetic make-up of those children who died during follow-up period was not available.
Conclusions
We demonstrate high variability in the promoter and exon 1 of the MBL2 gene among Zimbabwean children, with differences in SNP and haplotype distribution when compared to other populations. Discovery of a novel −595G>A polymorphism suggests that there may be more undetected gene variants in African populations as most have not been studied at genome level. The novel SNPs may hold the key to our understanding of contribution of most of the so-called susceptibility genes. Our observations collectively contest the hitherto significant role of this candidate gene for HIV transmission mother-to-child in non-African populations and thus further speak to the limits of extrapolating genomic association studies directly to the African populations from studies conducted elsewhere. It is hoped that more OMICS research in a diverse set of African countries can shed further light on the putative role (or the lack thereof ) of this candidate gene in HIV transmission in the continent, a major global health burden in Africa.
Footnotes
Acknowledgments
We would like to thank Letten Foundation, Norway; University of Cape Town; National Research Foundation (NRF) of South Africa, Medical Research Council of South Africa and, the University of Cape Town for funding both the student and research project costs. We are also indebted to the National Institute of Health Research and University of Zimbabwe for facilitating some of the research work. None of this would have been possible without the voluntary participation of members of the BHMAC cohort for which we are thankful. We also want to thank the late Dr. EN Kurewa for facilitating this study.
Author Disclosure Statement
The authors declare that there are no conflicting financial interests.
