Abstract
Introduction
A major obstacle to the development of successful therapeutic strategies is the lack of a mechanistic understanding of how Mtb maintains a persistent, nonreplicating state in human tissues for decades, unresponsive to antimycobacterial drugs, to then abruptly resume growth and cause disease (24). This requires a detailed understanding of the metabolic flexibility of the bacilli that survive in the varied environments within the human host, ranging from high oxygen tension in the lung alveolus to hypoxic conditions within the tuberculous granuloma (6, 10). An important challenge is to understand how in vivo gases [e.g., oxygen (O2), nitric oxide (NO), carbon monoxide (CO), carbon dioxide (CO2), hydrogen (H2), and hydrogen sulfide (H2S)] and nutrients (e.g., host fatty acids), several of which are thought to play a crucial role in persistence, affect Mtb physiology and consequently redox homeostasis (Fig. 1). Furthermore, frontline anti-Mtb drugs such as isoniazid (34), ethionamide (7), and PA-824 (32) require bioreductive activation to exert antimycobacterial effects, which magnifies the importance of understanding how Mtb maintains redox balance during treatment. It is this lack of drug efficacy against dormant Mtb that represents a fundamental challenge to investigators worldwide. A better understanding of how environmental host factors affect Mtb redox homeostasis (Fig. 1) may shed light on the biology of Mtb persistence and lead to the development of new and improved antimycobacterial drugs.

Exciting progress has been made in understanding how members of the WhiB family sense and respond to host gases NO and O2 through their iron–sulfur (Fe-S) clusters, and how they may modulate Mtb virulence. Here we review the current knowledge of the Mtb WhiB-like family of proteins, focusing on the best characterized member, WhiB3.
Phylogenetic Distribution of the Wbl Proteins
Comprehensive phylogenetic analysis showed that Mtb WhiB1, WhiB2, WhiB3, WhiB4, and WhiB7 are evolutionary closer to Corynebacterium and Nocardia WhiB-like (Wbl) proteins (Fig. 2A). Furthermore, using Interpro and Pfam databases, protein bioinformatic analyses have identified more than 1600 Wbl homologues (database ID: IPR003482/PF02467), which are exclusive to actinobacteria and the double-stranded DNA (dsDNA) containing siphoviridae family of phages. Notably, WhiB5 and WhiB6 are restricted to mycobacteria (Figs. 2B and 2C) with WhiB5 only present in virulent strains (Fig. 2C). All Wbl proteins with the exception of WhiB5 contain four invariant Cys residues arranged in a Cys-X14-22-Cys-X2-Cys-X5-Cys motif (Fig. 3A), which is typically associated with metal co-coordinating DNA-binding proteins. Secondary structure prediction of Wbl proteins indicates an acidic amphiphatic α-helical region near the N-terminus and a basic α-helical region at its C-terminus with a central β-sheet region (43).


Most Wbl proteins (∼90%) contain a “WhiB” domain (including all Wbl homologues in phages, and Mtb WhiB1, WhiB2, WhiB4, WhiB5, and WhiB6), whereas ∼7% possess an additional AT-hook like domain at their C-terminus (e.g., Mtb WhiB3 and WhiB7). The AT-hook like domain is a DNA-binding motif with a preference for A/T-rich regions (4) (Fig. 3B). Approximately 2% of Wbl members contain an additional RNA polymerase (RNAP) sigma70r4 binding region and a clearly distinguished DNA-binding 'helix-turn-helix' (HTH) motif that binds to the −35 promoter region. A small percentage of Wbl members contain a RNAP sigma70r4t2 signature (Fig. 3B). Although a systematic functional analysis of Wbl protein domains is lacking, the diversity in protein architecture may be responsible for the varied functions including redox homeostasis(40), cAMP regulation (1), oxidative stress (28), cell division (23), sporulation (16), quorum sensing (5), drug resistance (33), and virulence (44).
Mtb harbors seven Wbl proteins, WhiB1 (Rv3219), WhiB2 (Rv3260c), WhiB3 (Rv3416), WhiB4 (Rv3681c), WhiB5 (Rv0022c), WhiB6 (Rv3862c), and WhiB7 (Rv3197A) (Fig. 3C). Only moderate aa sequence similarity exists between members of the Mtb Wbl family (<70%). BLAST analysis of Mtb Wbl proteins against A CLAssification of Mobile genetic Elements (ACLAME), a database of mobile genetic elements such as plasmids, phages, and viruses, identified several Wbl homologues that are present on plasmids. For example, WhiB1 and WhiB4 share significant similarity (E-value=≤ e−15) with gene products from Streptomyces plasmids. Also, WhiB2 and WhiB3 show similarity with Rhodococcus plasmid-encoded gene products (Table 1). Thus, it is unlikely that Mtb Wbl members evolved through gene duplication events. Rather, it is tempting to hypothesize that the Mtb Wbl proteins have been independently acquired from mycobacterial ancestors via lateral gene transfer from mobile elements such as plasmids or phages. The selective distribution of Mtb Wbl proteins in different mycobacterial species (e.g., the absence of WhiB5 in nonpathogenic species) coupled with considerable sequence heterogeneity, suggest a discrete functional role for each Mtb Wbl protein.
Mtb Wbl proteins were used in a BLAST analysis against the ACLAME database that contains mobile genetic elements such as plasmids and phages. Only hits with e−10 were considered as significant. The presence of Wbl proteins on plasmids and phages suggest that Mtb wbl genes were laterally acquired. WhiB5 and WhiB7 did not yield any significant hits.
Genetic Regulation of Mtb wbl Genes
To better understand the genetic regulation of Mtb wbl genes, Geiman et al. (22) performed an elegant real-time PCR expression study that examined the differential expression of all seven Mtb wbl genes following exposure to a variety of environmental stresses, including detergent exposure, oxidative stress, acid exposure, nutrient starvation, ethanol, antibiotic, heat, and low iron stress. WhiB3 expression was highly increased upon acid stress, but weakly upon exposure to oxidants. WhiB7 was highly upregulated under low iron conditions, whereas whiB7 and whiB2 expression was shown to be significantly increased upon exposure to antibiotics (22). The whiB7 finding is consistent with a previous study implicating members of the WhiB family in antimicrobial stress (33). Overall, whiB6 appeared to be responsive to the widest variety of stress conditions (22).
Exploiting an Mtb transposon library screen, whiB1 and whiB6 were both implicated in the switchover from a slow to fast growth rate (8). In another genome-wide expression study, whiB6 expression was consistently upregulated under prolonged hypoxia, a condition associated with dormant infection, whereas whiB4 and whiB2 were moderately upregulated (35, 36). Last, whiB2 expression was shown to be significantly increased upon re-aeration following hypoxia, suggesting a possible role for whiB2 in the activation of Mtb from the latent state (39).
When comparing the microarray expression profiles of several clinical Mtb strains within macrophages (27), it became clear that the wbl expression profiles are highly variable. Figure 4 depicts a summary of the relative wbl expression profiles obtained from the literature. Noticeably, whiB6 appears to be prominently upregulated under several conditions, consistent with the report by Geiman et al., (22). WhiB4 expression was upregulated under hypoxia, but downregulated during starvation and within macrophages. Surprisingly, starvation in the presence of oxygen leads to increased whiB4 expression (9), suggesting a possible role in Mtb reactivation. Mtb whiB3 appeared to be upregulated under a range of conditions.

Metabolic Regulation of Mtb WhiB3
Metabolism
Mtb WhiB3 senses fluctuations in the intracellular redox environment associated with O2 depletion, including the metabolic switchover to the favored in vivo carbon source, fatty acids (40). This has important consequences for understanding how Mtb persists within the host, as it is widely believed that fatty acids serve as a main source of carbon and energy during long-term infection
Several lines of evidence implicate WhiB3 as a metabolic regulator. First, WhiB3 is essential for maintaining bacterial cell shape and size (40, 41). Second, WhiB3 controls the anabolism of complex virulence lipids such as polyacyltrehaloses (PAT), diacyltrehaloses (DAT), sulfolipids (SL-1), and phthiocerol dimycocerosates (PDIM), which requires large quantities of reducing equivalents (NAD[P]H). Third, WhiB3 was shown to maintain redox homeostasis during infection of macrophages (40). Fourth, the MtbΔwhiB3 mutant showed substantially decreased production of the methyl-branched polar lipids PAT, DAT, and SL-1, but increased production of PDIM, and to a lesser extent triacylglycerol (TAG) (40). Mtb ΔwhiB3 grows on much higher concentrations of propionate compared to wild-type (wt) Mtb, and accumulates PDIM and TAG during intra-macrophage growth, suggesting that the increased resistance to propionate toxicity is because this intermediate is channeled into PDIM via the methyl-malonyl CoA (MMCoA) pathway and into TAG (40). These metabolic analyses implicate WhiB3 in core intermediary metabolism and are reminiscent of studies executed on FNR and ArcA, which regulate key enzymes in the tricarboxylic acid cycle (TCA) cycle under carbon source starvation conditions (31, 38).
Protein networks and Wbl function
Originally, Mtb WhiB3 was discovered through a yeast two-hybrid screen using SigA as bait (44). Recently, an E. coli two-hybrid system was exploited and showed that 113 proteins interacted with WhiB3 and 52 with WhiB7 (average number of interacting neighbors ∼6). As is typically the case with microbial protein linkage maps, many interactions are difficult to explain in the context of what is biologically known about WhiB3 (or WhiB7) function, and await future verification. Also, the frequency of false positives and negatives, often due to “sticky” interacting proteins (a common observation in yeast two-hybrid screens) is unknown. Nonetheless, the potentially large number of proteins interacting with WhiB3 and WhiB7 is difficult to ignore and may point to biological properties that are distinct among Mtb Wbl proteins. Lastly, it is uncertain to what extent post-translational modification (e.g., assembly of the WhiB3 and WhiB7 Fe–S clusters) of Mtb proteins expressed in E. coli influences protein–protein interaction. Thus, although an admirable resource, these data await further verification and should be interpreted with caution in order to generate accurate knowledge on the biological function of Mtb Wbl proteins.
A provocative finding by one group was that several Wbl proteins in their apo- forms demonstrated nonspecific protein disulfide reductase activity (2, 3, 20, 21). The authors proposed that the Fe–S cluster blocks the four active Cys residues until endogenous oxidants destroys the Fe–S cluster, which then leads to the formation of an intramolecular disulfide bond. Subsequent reduction of the disulfide bond (through a yet unknown mechanism) stimulates disulfide reductase activity. This debate was fueled further when independent groups were unable to detect such activity with WhiB1 or the S. coelicolor orthologue of WhiB3 (13, 42), and also when it was reported that Mtb WhiB1 interacts with GlgB (an α-1,4-glucan branching enzyme) to reduce the intramolecular GlgB disulfide bond (20). Although these enzymatic studies are in contrast with increasing evidence suggesting that the Wbl proteins are regulatory DNA-binding proteins (37, 40, 42), or transcription factors as was recently demonstrated for Mtb WhiB1 (42), it cannot be discounted that WhiB1 may function as a highly selective reductase.
In sum, in the past few years novel genome-wide technologies (46) and comprehensive biochemical analyses (3) have been employed to characterize the Mtb Wbl proteins. However, the methodology and exact experimental conditions should be critically evaluated in order to yield accurate information on the biological functions of Wbl proteins.
The WhiB3 Fe–S Cluster
To date, biochemical evidence suggests that all Wbl proteins contain Fe–S clusters, and it is implicitly assumed that this prosthetic group plays a crucial role in Wbl protein function. However, evidence suggests that the Fe–S clusters from different Mtb Wbl proteins vary in their behavior towards O2 exposure (3) (Fig. 5). Furthermore, the fact that Fe–S cluster proteins are stable under anaerobic conditions and unstable under aerobic conditions directly implicates the Fe–S clusters in Wbl function. Host factors may also react with Fe–S clusters. For example, Fe–S cluster enzymes in the TCA cycle are known to be inhibited by O2 •− and by high pO2 (26). In addition, NO can react with the Fe–S clusters of TCA cycle enzymes, leading to the formation of a protein-bound dinitrosyl-iron complex (DNIC), which can alter enzymatic activity (17). Also, in elegant biochemical studies on Streptomyces WhiD (the homologue of Mtb WhiB3) and Mtb WhiB1, protein stability in the presence of H2O2 and O2 •− (13) and NO (14) was examined. The reaction of S. coelicolor WhiD and Mtb WhiB1 with NO was particularly interesting as each reacted 104-fold faster with NO than O2 in a multiphasic reaction containing eight NO molecules per [4Fe–4S] cluster (14). Using electron paramagnetic resonance spectroscopy, Singh et al. (41) demonstrated that O2 exposure can cause cluster degradation, and that Mtb WhiB3 can function as a sensor of the two dormancy gases O2 and NO.

What protects the WhiB3 Fe–S cluster against oxidation and eventually complete loss of the cluster during aerobic growth? One possibility is that a mycobacterial redox buffer in the cytoplasm protects cellular Fe–S clusters. Using methyl-mycothiol, it was shown that the extent of cluster loss was substantially less compared to untreated Streptomyces WhiD samples (13). Methyl-mycothiol also reduced the sensitivity of native WhiD against low pH. Lastly, Cys and thioredoxin were also shown to be protective against cluster loss due to O2 exposure (13). This provides theoretical evidence for how mycobacteria may maintain cytoplasmic Fe–S cluster integrity during aerobic growth.
An important physicochemical factor that may influence Fe–S cluster assembly (and thus WhiB3 function) is the acid dissociation constant (pK a) of residues adjacent to the conserved Cys residues and factors that may modify their properties. For example, WhiB3 Cys residues are flanked by multiple arginine residues (Cys23-Arg24-Cys53-Arg54-Arg55-Cys56-Cys62-Arg63). At neutral pH, Arg residues located close to Cys residues can significantly lower the pK a of Cys, resulting in Cys thiolates, which are highly susceptible to oxidation. Not surprisingly, Mtb whiB3 was highly upregulated when exposed to acidic stress (22). In sum, the inherent biochemical properties of the 4Fe–4S cluster allow WhiB3 to function as a sensitive redox switch that responds to a range of environmental signals implicated in virulence.
Mtb WhiB3 DNA-Binding and Transcriptional Regulation
Since the identification of a Wbl protein in Streptomyces spp. in 1992, it has been hypothesized that Wbl proteins bind DNA due of the presence of putative DNA-binding motifs (16). Initial evidence became stronger when it was demonstrated that Mtb WhiB3 physically interacts with the principle sigma factor, SigA (RpoV) (44). However, it took nearly two decades to provide experimental proof that a Wbl protein, WhiB3, binds DNA (40). The evidence came from characterizing the DNA-binding properties of apo- and holo- forms of WhiB3 to the upstream sequence of lipid biosynthesis genes pks2 and pks3 (40). Important findings were: (i) the redox state of reconstituted holo-WhiB3 did not affect DNA binding, (ii) oxidized apo-WhiB3 bound DNA much stronger than either holo-WhiB3 or reduced apo-WhiB3, and (iii) the oxidized apo-WhiB3 form demonstrated the strongest DNA binding. It was further demonstrated that WhiB3 regulates Mtb pks2 and pks3 genes in a redox-dependent manner, which provides further proof of a role for WhiB3 in the redox-dependent regulation of virulence factors (40). In another study, an E. coli bacterial one-hybrid system was used to screen for Mtb regulatory proteins that bind to promoter DNA (25). Mtb WhiB3 was shown to bind to the promoter regions of several in vivo induced genes. Although an important resource, the use of E. coli as surrogate host to examine Mtb DNA-binding proteins has the same drawbacks as the E. coli protein–protein interaction technology (see Protein Networks and Wbl Function) and awaits independent verification.
Since the WhiB3 4Fe–4S cluster reacts with NO to form a DNIC complex, any structural or redox-mediated changes associated with complex formation could potentially affect DNA binding and/or transcriptional regulation of WhiB3 target genes (Fig. 6). Notably, while the redox sensing capacity of WhiB3 is based on the presence of 4Fe–4S cluster and Cys residues, there is no clearly defined DNA-binding domain. Nonetheless, the C-terminus of WhiB3 contains a weak AT-hook motif TMGRTRGIRRTA, which is prevalent in eukaryotic nuclear proteins that nonspecifically interact with DNA (4). AT-hook motifs are known to interact with the minor groove of AT-rich DNA sequences, which in Mtb are typically restricted to the promoter regions. Similar to other AT-hook-containing proteins (4), the AT-hook like domain may enable WhiB3 to nonspecifically bind, or “coat” DNA as a protection against free radical-mediated damage.

Collectively, these WhiB3 studies suggested that the Wbl proteins interact strongly with DNA in their oxidized apo-forms and weakly in the holo-forms. It was subsequently shown that Mtb apo-WhiB1 (42), and Mtb apo-WhiB2 and its mycobacteriophage homologue apo-WhiBTM4 bind DNA strongly, as opposed to the holo-form of these proteins that bind DNA weakly or not at all (37). Interestingly, Mtb WhiB2 and WhiBTM4 bind to a conserved promoter sequence upstream of whiB2 to regulate its transcription (37).
Recent studies have shown that oxidized Mtb apo-WhiB1 as well as nitrosylated WhiB1 bind DNA, and that DNase I footprinting of apo-WhiB1 bound to whiB1 promoter DNA revealed a 37-bp protected region (42). Importantly, transcriptional analyses demonstrated that apo-WhiB1 repressed whiB1 transcription in vitro, and provided evidence for the first time that a Wbl member functions as a transcription factor (42).
In sum, it is clear that the oxidation states of holo- or apo-Wbl proteins alter their DNA binding and likely also transcriptional activity. Physiological modifications to either WhiB3 Cys residues (e.g., oxidized, reduced) or the Fe–S cluster (e.g., the apo- or holo-states, the Fe–S cluster oxidation state, nitrosylation, etc) represent a flexible switch that allows the microbial cell to rapidly respond to a wide range of environmental conditions (Fig. 6). Since virtually all DNA-binding proteins influence transcription to some extent, this suggests that the Wbl proteins are transcription regulators with DNA binding capability.
Role of WhiB3 in Mtb Disease
The first in vivo genetic complementation studies identified the principle sigma factor SigA (RpoV) as the first mycobacterial virulence factor (12). A caveat of this finding is that sigA is an essential gene, thereby making it exceedingly difficult to study. In this regard, WhiB3 was identified as a putative regulatory protein that interacts with SigA (44). This was determined by using a yeast two-hybrid screen to examine the independent interactions of wt SigA or the mutated allele of SigA (Arg515-His) with WhiB3. Noticeably, the WhiB3:SigA interaction was strong, whereas the SigA(Arg515-His):WhiB3 interaction was significantly decreased. This finding suggests that the SigA:WhiB3 interaction is necessary for full virulence and that the Arg515-His mutation contributes to the attenuation observed in the avirulent M. bovis strain. It was hypothesized that the disruption of this interaction by the SigA Arg515-His mutation caused alterations in expression of virulence genes under WhiB3 control (44).
Further studies demonstrated that deletion of whiB3 in Mtb and M. bovis caused no phenotype in vitro, but led to the loss of virulence in mouse and guinea pig models of TB (44). The authors proposed that virulence lipids were responsible for triggering the severe pathology. Indeed, subsequent lipid analyses by the same authors (40) have shown that the lack of WhiB3 was strongly associated with decreased production of the virulence lipids SL-1, PAT, and DAT, albeit with increased synthesis of PDIM (see section on Metabolic Regulation of Mtb WhiB3). The fact that the levels of some lipids are upregulated and others downregulated makes it difficult to accurately predict the exact contribution of each lipid species towards virulence. The identification of gene products under WhiB3 control that are responsible for the differences in host lung pathology is eagerly awaited.
Mtb WhiB3 Modulates Oxido-Reductive Stress
In vitro metabolic labeling studies performed under defined thiol-specific oxidizing (diamide) or reducing (dithiothreitol) conditions demonstrated that Mtb WhiB3 regulates the production of the lipids PAT, PDIM, and TAG in a redox-dependent manner. Furthermore, MtbΔwhiB3 is able to survive in the presence of toxic concentrations of propionate in vitro and within macrophages (40). A related finding was that reductive stress (dithiothreitol) exposure upregulated TAG anabolism. Earlier studies of Mtb showed that TAG production is highly upregulated under hypoxic conditions and exposure to NO and CO (29, 30). Oxygen, CO, and NO are modulatory ligands of the heme sensor kinases DosT and DosS, which modulate DosR, the regulator of the 47-member Dos dormancy regulon, which includes the tgs1 gene that controls TAG biosynthesis (45). It was recently suggested that TAG is the prime carbon source utilized upon reactivation from a nonreplicating persistent state (15). Thus, these findings provide a link between the intracellular WhiB3 pathway and the two-component DosR/S/T signaling pathway.
During intracellular growth, Mtb utilizes host fatty acids as a primary energy source via β-oxidation, wherein a large quantity of toxic propionate is generated. WhiB3 appears to play a central role in dissipating propionate toxicity and regulates the metabolic switchover between the in vitro carbon source (e.g., glucose) and the in vivo carbon source (e.g., acetate) (40, 41). Furthermore, β-oxidation is accompanied with the generation of high concentrations of reducing equivalents such as NADH and NADPH (Fig. 7). Extraordinarily high levels of NAD(P)H, but not NAD(P)+ were identified in Mtb isolated from infected mouse lungs, demonstrating that Mtb experiences reductive stress in vivo (11). In macrophages, MtbΔwhiB3 was shown to experience more severe reductive stress upon infection compared to Mtb. These observations point to a major role for WhiB3 in dissipating excessive reductive stress (40) [for a review on reductive stress see (18)]. While the mechanism of how WhiB3 maintains redox balance is largely unknown, a likely explanation is that excess reducing equivalents are assimilated via lipid anabolism (Fig. 7).

Conclusion and Future Directions
Understanding the mechanisms of how Mtb is able to persist in host tissue for decades without causing disease, and the development of new therapeutic strategies against latent TB depends on an advanced understanding of the role of redox homeostasis in Mtb pathogenesis. Novel paradigms are urgently needed, and the Wbl protein family represents an unexplored avenue of research. This poorly characterized group of Fe–S cluster proteins influences a diverse range of biological events including redox homeostasis (40), cAMP regulation (1), oxidative stress (28), cell division (23), drug resistance (33), and virulence (44), and makes them ideal candidates for further study.
Notably, the existence of more than 1600 Wbl homologues across numerous bacterial species and the fact that the ds DNA siphoviridae bacteriophage family contains Wbl homologues may shed light on the mechanism of horizontal and vertical gene transfer of wbl gene products. Subsequent to the original WhiB3 virulence study (44), a rapid surge in Wbl studies followed, particularly in the area of biochemistry (3, 13, 14, 41, 42), which has established a broad mechanistic framework for understanding the mode of action of Wbl members. Combining these mechanistic findings with in vivo Mtb virulence studies should lead to a better understanding of the biological function of these proteins. Demonstrating that Mtb WhiB3 binds DNA (40) was an important finding and has paved the way for studying how other Wbl proteins may bind DNA. Although technically challenging, obtaining the crystal structure of any Wbl protein would significantly contribute to the field. The identification of Mtb WhiB3 as a metabolic redox sensor (41) is particularly interesting as the concept of reductive stress emerged from these findings (18).
In sum, the Wbl family represents an important group of Mtb proteins of which WhiB3 represents the prevailing paradigm. Employing genome-wide technologies geared towards identifying new pathways and metabolites should lead to increased understanding of the biological function of Wbl proteins.
Footnotes
Acknowledgments
We thank Joel N. Glasgow for critical reading of this manuscript. Research in the laboratory is supported in whole or in part by the National Institutes of Health Grants AI058131, AI076389 (to AJCS). This work is also supported by the University of Alabama at Birmingham (UAB) Center for AIDS Research, and UAB Center for Free Radical Biology (AJCS). AJCS is a Burroughs Welcome Investigator in the Pathogenesis of Infectious Diseases.
Author Disclosure Statement
No competing financial interests exist.
