Abstract
Central oxidoreductase enzymes (eg, dehydrogenases, reductases) in microbial metabolism often have preferential binding specificity for one of the two major currency metabolites NAD(H) and NADP(H). These enzyme specificities result in a division of the metabolic functionality of the currency metabolites: enzymes reducing NAD+ to NADH drive oxidative phosphorylation, and enzymes reducing NADP+ to NADPH drive anabolic reactions. In this work, we introduce the computational method OptSwap, which predicts bioprocessing strain designs by identifying optimal modifications of the cofactor binding specificities of oxidoreductase enzyme and complementary reaction knockouts. Using the Escherichia coli genome-scale metabolic model iJO1366, OptSwap predicted eight growth-coupled production designs with significantly greater product yields or substrate-specific productivities than designs predicted with gene knockouts alone. These designs were identified for the production of L-alanine, succinate, acetate, and D-lactate under modeled conditions. Simulations predicted that production of L-alanine and D-lactate can be strongly coupled to growth by knocking out three reactions and swapping the cofactor specificity of one oxidoreductase reaction, while growth coupling was not predicted with four or fewer reaction knockouts under identical conditions. A succinate production design and an acetate production design were predicted to have higher maximum growth rates and higher substrate-specific productivities than designs predicted solely with reaction knockouts. The OptSwap formulation can be readily extended to additional organisms, and the constraints enforcing oxidoreductase specificity swaps can be extended to target other specificity sets of interest.
Introduction
In microorganisms, anabolism and catabolism are precisely controlled by regulation of flux through metabolic enzymes. This regulation of anabolic and catabolic activity is an important element of the adaptive system that enables a microorganism to find the optimal phenotype for growth and reproduction in its environment. The currency metabolites NAD(H) and NADP(H), which carry and transfer reducing equivalents, play unique roles in metabolism. The primary role of the reduced respiratory cofactor NADH is to transfer electrons to oxygen via the electron transport chain, generating the proton gradient that is used for oxidative phosphorylation of adenosine diphosphate (ADP) to adenosine triphosphate (ATP). 1 –3 Concurrently, the reduced cofactor NADPH donates electrons to anabolic reactions and drives biosynthetic pathways in the cell. 2,3 Despite the chemical similarity between NAD(H) and NADP(H), many central dehydrogenase and reductase enzymes in the cell preferentially catalyze reduction or oxidation of a specific carrier. 4 –7 Thus, the functional separation of these electron carriers and the specificity of enzymes to one electron carrier allow the system to direct resources to energy production or anabolism on a whole-cell scale.
Cellular control over the direction of reducing equivalents to NADH or NADPH is facilitated both by the specificity and activity of dehydrogenase reactions and by the activity of transhydrogenase enzymes, which transfer reducing equivalents between the two cofactors. Two transhydrogenases are encoded in the genome and expressed in Escherichia coli. The soluble transhydrogenase encoded by the gene sthA catalyzes the reaction [NAD+]+[NADPH]→[NADP+]+[NADH]. The membrane-bound transhydrogenase encoded by pntAB couples reduction of NADP+ with inward proton translocation, catalyzing the reaction [NADP+]+[NADH]+[H+ out]↔[NADPH]+[NAD+]+[H+ in]. It has been reported that 35–45% of the NADPH required for biosynthesis is generated by the transhydrogenase encoded by pntAB during aerobic batch growth on glucose. 2 The large contribution of this transhydrogenase enzyme to NADPH production reflects the importance of transhydrogenases for balancing cofactor generation to meet the needs of the cellular environment. 2,8 Therefore, any strategy to modify cofactor production should consider the role that transhydrogenases play in maintaining homeostasis.
Oxidoreductase specificity for NAD(H) or NADP(H) is a central parameter in determining the direction of cellular resources. Thus, strategies have been developed to modulate the cofactor specificity of dehydrogenase enzymes to increase production of desirable cellular products. Protein engineering has been used to switch the carrier specificity of dehydrogenase enzymes by modifying the amino acid residues of the nucleotide-phosphate binding site. In E. coli, the enzymes dihydrolipoadmide dehydrogenase (a component of the pyruvate dehydrogenase complex and the 2-oxoglutarate dehydrogenase complex) and isopropylmalate dehydrogenase were engineered to prefer NADP(H) over NAD(H) as cofactor, and the enzyme isocitrate dehydrogenase was engineered to prefer NAD(H) over NADP(H). 4,6,9,10 Furthermore, the cofactor specificity of isocitrate dehydrogenase was reversed by protein engineering, and a competition study was performed in which the modified strain outcompeted wild-type when grown on glucose. 11 Despite these efforts, dehydrogenase enzymes modified by protein engineering have not been exploited for metabolic engineering purposes.
An alternative strategy to engineer cofactor specificity is to replace native dehydrogenase reactions in central carbon metabolism with heterologous dehydrogenases that have specificity for the opposite cofactor. These non-native enzymes may be more efficient than enzymes produced using protein engineering because the heterologous enzymes have benefited from evolutionary optimization. To this end, the NAD(H)-dependent glyceraldehyde-3-phosphate dehydrogenase in E. coli was replaced with the NADP(H)-dependent glyceraldehyde-3-phosphate from Clostridium acetobutylicum to increase lycopene yield. 12 In Saccharomyces cerevisiae, the native NAD(H)-dependent glyceraldehyde-3-phosphate dehydrogenase was replaced with the NADP(H)-dependent enzyme from Kluyveromyces lactis to increase fermentation of D-xylose to ethanol. 13 Thus, researchers have exploited homologous swaps to boost production phenotypes.
The scientific interest shown in modifying electron carrier availability and the yield improvements seen in these modified strains suggest that a computational approach to model oxidoreductase specificity modifications could guide future experimental work and ultimately affect metabolic engineering. Determining which oxidoreductase specificity modifications are likely to have the greatest impact on the system and how oxidoreductase specificity changes can be paired with reaction knockouts most effectively can be answered with an in silico modeling procedure.
As a tool for investigating metabolic networks in silico, constraint-based reconstruction and analysis (COBRA) methods have been shown to enable accurate prediction of bacterial behavior under many conditions. 14,15 COBRA methods utilize genome-scale models (GEMS), which are built by pairing the reconstructed metabolic network of an organism with governing constraints based on physico-chemical conservations (ie, known reaction stoichiometries), spatial limitations, and environmental parameters. Governing constraints are formulated as a set of linear inequalities that enclose the solution space available to metabolic fluxes. 16 Solution spaces can be examined by optimizing for objectives using flux balance analysis (FBA) and other linear programming methods. 17 The optimal solutions predicted by FBA can match in vivo behavior for cases in which the metabolic network of the cell is optimized for the same objective (eg, growth). A powerful approach to achieve in vivo optimality is adaptive laboratory evolution (ALE). ALE optimizes the genotype of the organism, and the result is often a match between the observed growth phenotypes and the model predictions. 18 –21 Growth-coupled strains—strains in which growth is directly coupled to the production of a given molecule—have drawn attention because these strains are predicted to produce high yields of the target molecule after ALE selecting for cell growth. 22 Computational algorithms that identify growth-coupled designs have proliferated and include OptKnock, RobustKnock, and OptGene. 17,23 –25 Thus, in silico COBRA methods coupled with ALE constitute a strategy for rational engineering of high-yield production strains. A COBRA method for optimizing cofactor specificity has not been reported. Previous investigations have explored the importance of cofactor balancing for microbial production strains, but de novo strain design has not been explored using non-biased cofactor optimization. 8,13,26
In this work, we present OptSwap, an in silico, constraint-based modeling technique for generating strategies to optimize the production of native cellular compounds by modifying the electron carrier specificity of oxidoreductase reactions in the metabolic network. A mixed-integer linear programming (MILP) method is presented that optimizes growth-coupled product yield by pairing oxidoreductase specificity swaps with reaction knockouts. Utilizing OptSwap, we predict novel, growth-coupled designs for the production of valuable compounds by E. coli, and we compare these designs to solutions that are predicted utilizing reaction knockouts alone.
Materials and Methods
Modeling and Computational Tools
The iJO1366 metabolic construction of E. coli K-12 MG1655 was used for all simulations in this work. 27 As described previously, formate-hydrogen lyase (FHL) and the oxidative stress reactions catalase (CAT), cytosolic superoxide dismutase (SPODM), and periplasmic superoxide dismutase (SPODMpp) were constrained to zero. 27 The pyruvate:ferredoxin oxidoreductase (POR5) reaction was made irreversible, as supported by biochemical data. 28
FBA, parsimonious flux balance analysis (pFBA), flux variability analysis (FVA), and RobustKnock were implemented in MATLAB (The MathWorks Inc., Natick, MA) as described in the literature. 24,29 –31 All simulations were performed using MATLAB and the COBRA Toolbox software packages with TOMLAB/CPLEX (Tomlab Optimization Inc., San Diego, CA) and Gurobi (Gurobi Optimization, Inc., Houston, TX) LP/MILP solvers. 32
Substrate uptake rates for the solitary carbon substrates in each simulation were constrained to a maximum uptake rate of 20 mmol/gDW/h. For aerobic simulations, the oxygen uptake rate was set to a maximum of 20 mmol/gDW/h. These values were chosen based on experimental observations of aerobic and anaerobic growth of E. coli. 33,34
Model Reduction and Selection of Reaction Set for Knockouts
The reaction set available for knockout was restricted based on a previously reported method for model reduction and target reaction selection. 22 After setting the bounds for the primary carbon source and oxygen exchange, we performed the following procedure, as previously reported. 22 First, to generate a “reduced model,” reactions that could not be utilized under the conditions of the simulation were removed, and upper and lower bounds of all reactions were set to maximum and minimum obtainable values, as determined by FVA. Non-gene associated reactions, spontaneous reactions, transport reactions, reactions in peripheral metabolic pathways, and reactions acting on high-carbon containing molecules were then removed from the knockout reaction set. Finally, sets of coupled reactions were identified by examining the null space of the stoichiometric matrix, and only one reaction in each correlated set was considered for knockout. 35
With glucose as the substrate, the reactions available for knockout during optimization numbered 217 anaerobically and 243 aerobically. With D-xylose as the substrate, the reactions available for knockout numbered 216 anaerobically and 238 aerobically (Supplementary Table S1
; Supplementary Data are available online at
Selection of Reaction Set for Cofactor Specificity Swaps
The set of oxidoreductase reactions available for modification in the OptSwap procedure was determined by finding the oxidoreductase reactions in the high-flux backbone. 36 In the metabolic model iJO1366, all reactions that utilize NAD(H) or NADP(H) as a cofactor were located. After pFBA optimization for flux, the reactions were sorted by flux magnitude through the biomass objective function under conditions of aerobic and anaerobic growth on glucose and D-xylose minimal media. The reactions with highest flux under each set of conditions were selected. D-Lactate dehydrogenase, malic enzymes, and L-1,2-propanediol oxidoreductase were added to the set based on interest in the literature. 37 –39 Malate dehydrogenase was removed from the pool of oxidoreductase enzymes that can be swapped with OptSwap because the NADPH–specific malate dehydrogenase allowed non-physiological loops to form in the flux solutions. Under anaerobic conditions, NADH:oxidoreductase I was removed from the pool of reactions during model reduction because the reaction cannot carry flux during simulations of anaerobic growth. Thus, 22 oxidoreductase enzymes were chosen under aerobic conditions and 21 oxidoreductase enzymes under anaerobic conditions (Table 1). 40 –43
Oxidoreductase Reactions Targeted for Analysis with OptSwap a
Reactions were chosen by identifying oxidoreductase reactions with highest predicted flux in the genome-scale model during aerobic and anaerobic growth on glucose and D-xylose minimal media. Relative flux values are reported from pFBA simulation optimizing flux through the biomass objective function under these conditions. The enzymes L-1,2-propanediol oxidoreductase, D-lactate dehydrogenase, and malic enzymes were also included in the analysis because interest has been shown in these enzymes in the literature. Malate dehydrogenase was removed from the reaction pool because thermodynamically infeasible loops were created in the presence of the swapped enzyme and, under aerobic conditions, NADH:ubiquinone oxidoreductase I was removed from the model during the model reduction procedure.
MILP Formulation
OptSwap is a bi-level MILP problem based on RobustKnock with new constraints that enforce swaps of the cofactor specificity of oxidoreductase reactions (Fig. 1).
24
The following procedure was used to incorporate these constraints into the RobustKnock problem. First, for each oxidoreductase enzyme in the pool of the reactions that can be “swapped,” a reaction with opposite specificity—either NAD(H) or NADP(H)—was added to the model. New Boolean decision variables were utilized. The variable sd
represents the on/off state of the native oxidoreductase reactions, and td
represents the on/off state of the swapped reactions, where a value of 0 means the reaction is off and a value of 1 means the reaction is on. D is the set of oxidoreductase reaction pairs (native and swapped):

The OptSwap formulation for optimizing cofactor specificity of major metabolic enzymes [NAD(H) vs. NADP(H)] coupled to reaction knockouts. Constraints are added to the MILP problem to enforce swapping the cofactor specificity of reactions catalyzed by oxidoreductase enzymes
These variables are present in addition to the RobustKnock Boolean variable ye
, which represents the on/off state of all reactions that can be knocked out in the model. E is the set of reactions that can be knocked out.
Second, a constraint was added to the outer problem that requires either the native or the swapped reaction to be knocked out for each oxidoreductase reaction pair in D. This is the constraint that forces an oxidoreductase swap:
Third, a constraint was added to the outer problem to limit the number of swaps to be less than or equal to the parameter L:
The knockout limitation constraint from RobustKnock ensures that the number of reaction knockouts is less than or equal to the parameter K:
Fourth, a constraint was added to the outer problem to limit the number of interventions (oxidoreductase swaps and reaction knockouts) to be less than or equal to the parameter X:
These three constraints on the number of knockouts and swaps can be included or excluded from the problem according to the desired simulation scenario. If the oxidoreductase set D is empty, then OptSwap reduces to the RobustKnock problem.
Fifth, a set of constraints was added to limit flux to zero for any native or swapped oxidoreductase reaction whose corresponding decision variable is equal to zero. The function x maps the set of oxidoreductase reaction pairs, D, to the corresponding fluxes and bounds for native oxidoreductase reactions in the model, and the function y maps the set of oxidoreductase reaction pairs to the corresponding fluxes and bounds for the non-native, swapped oxidoreductase reactions. Thus, when sd
=0, the flux vx(d)
is constrained to zero:
These are present in addition to the knockout constraint from RobustKnock. The function z maps the set of reactions that can be knocked out, E, to the corresponding fluxes:
As an illustration of the variables s and t, consider the case in which sd =0. Then, by Equation 4, td =1. Equation 9 reduces to LBy(d)<=vy(d)<=UBy(d) , so flux through the swapped oxidoreductase reaction is constrained only by the lower and upper bounds—the reaction is “on.” For the same case, Equation 8 reduces to 0<=vx(d)<=0, so flux through the native oxidoreductase reaction is constrained to zero—the reaction is “off.”
Thus, the final formulation of the OptSwap problem can be stated as follows: J is the set of all reaction fluxes, and I is the set of all metabolites in the model:
max min v chemical
s.t.
To solve this bi-level max-min problem, we implemented the techniques that were used to simplify and solve RobustKnock. 24 The function dual_embed from the RobustKnock implementation was used to generate the dual of the inner problem and linearize the bilinear terms (Equations 8 –10). By the strong duality theory of linear programming, the outer problem and the dual of the inner problem could be integrated. 24 Finally, the minimization problem in the resulting min-max problem was converted to a maximization problem using the dual_embed function. After this second conversion and integration, the OptSwap problem is a max-max optimization that can be solved with standard MILP solvers (Supplementary Table S2).
A slight variation of the OptSwap MILP was also investigated. Equation 4 can be converted to an inequality so that oxidoreductase reactions can be swapped (sd=0, td=1) or knocked out (sd=td=0):
However, this more general formulation proved to be more computationally intensive and presented numerical challenges that we could not resolve with the TOMLAB/CPLEX solver. Results with this formulation were limited to run for 12 hours, and many simulations did not solve to optimality (Supplementary Table S4).
Results and Discussion
The in silico metabolic optimization procedure OptSwap was used to predict production strains that couple cellular growth to production of targeted products using a combination of oxidoreductase specificity swaps and reaction knockouts. To optimize for maximal growth-coupled yield, a MILP problem was developed based on the RobustKnock problem. 24 RobustKnock is a bi-level optimization problem similar to OptKnock, except that OptKnock maximizes production of a target molecule subject to maximizing biomass production, whereas the RobustKnock problem maximizes the minimum production of a target product subject to maximizing growth rate. This procedure ensures that non-unique solutions are not produced—a concern that was first addressed by tilting the objective function prior to the appearance of RobustKnock. 22 OptSwap contains additional constraints so that both reaction knockouts and oxidoreductase swaps can be utilized to identify a growth-coupled design.
OptSwap was demonstrated using the iJO1366 genome-scale metabolic model of E. coli. 27 The pool of oxidoreductase enzymes that could be swapped in the analysis was determined by identifying central, high-flux-carrying reactions. Where reaction knockouts were considered for use with OptSwap, a reaction selection workflow was utilized, as previously reported. 22 All swap designs were based on the transhydrogenase-deficient ΔpntAB ΔsthA genotype to ensure that unconstrained transhydrogenase activity in the model did not counteract the effect of swapping oxidoreductase specificity.
Growth-coupled designs predicted by OptSwap with both oxidoreductase specificity swaps and reaction knockouts were compared with solutions from RobustKnock in which only reaction knockouts were considered. The designs were categorized by the number of interventions; an intervention is a reaction knockout or a dehydrogenase swap. Thirteen high-value products were investigated based on previous screening criteria for industrially significant compounds produced natively in E. coli. 22 For each product, simulations were performed under conditions of glucose and D-xylose minimal media for both aerobic and anaerobic growth (Supplementary Table S3).
For the conditions considered in these simulations, OptSwap produced 10 designs in which coupling oxidoreductase specificity swaps with reaction knockouts resulted in superior predicted production rate compared to just reaction knockouts (Table 2). The eight designs in which OptSwap predicted significantly stronger growth coupling or higher substrate-specific productivity (yield×growth rate) were investigated in more detail (Fig. 2). OptSwap predicted a strongly growth-coupled design for L-alanine production that secretes no coproducts and generates L-alanine with high yield under anaerobic conditions on glucose minimal media. The design requires only three knockouts and one swap of the native enzyme (Fig. 3; Supplementary Fig. S6); it utilizes an NAD(H)-specific glutamate dehydrogenase in place of the native NADP(H)-specific reaction (ghdA). NAD(H)-specific glutamate dehydrogenase transfers electrons from NADH to glutamate. Subsequently, L-alanine transaminase (alaA or alaC) produces L-alanine and 2-oxoglutarate from glutamate and pyruvate. Knockouts of D-lactate dehydrogenase (ldhA) and ethanol dehydrogenase (adhP or adhE) prevent D-lactate and ethanol production. With the new NADH-specific glutamate dehydrogenase pathway in place, L-alanine secretion is energetically favorable to succinate secretion. Knockout of acetyl-CoA acetyltransferase (atoB) is also predicted to be necessary for growth coupling (Fig. 4).

Calculated production envelopes for OptSwap designs predicted to have significantly higher optimal production rate or substrate-specific productivity than designs with just reaction knockouts. The wild-type (solid grey line) and the 0pntAB 0sthA genotype (dashed black line) are shown. All designs were found by first knocking out transhydrogenases (0pntAB 0sthA). The optimized phenotypes for OptSwap (solid red line) and RobustKnock (dashed orange line) are compared. Each phenotype required four interventions (one intervention being either a reaction knockout or a oxidoreductase swap). The star (*) indicates that the same design was identified in both cases. The orange X indicates that no growth-coupled design was identified by RobustKnock under these conditions. OptSwap predicts growth-coupled designs for producing L-alanine under all four conditions and D-lactate aerobically on D-xylose substrate; growth coupling is not predicted for these products with just four or fewer reaction knockouts under identical conditions. The OptSwap designs for succinate and acetate are predicted to have higher yield than designs with just reaction knockouts. The designs for succinate production and acetate production on glucose substrate are also predicted to have higher substrate-specific productivities than the designs predicted by RobustKnock.

Network diagrams showing the shift in reduced cofactor usage between wild-type and the L-alanine production design. Simulated wild-type flux distribution during anaerobic fermentation in E. coli. The fluxes shown are unique solutions calculated using pFBA, optimizing flux through the biomass objective function

The predicted effect of combinatorial interventions on the design for anaerobic production of L-alanine on glucose minimal media. Glutamate dehydrogenase must be swapped for high yields of L-alanine to be produced at high growth rates; dashed blue envelope shows production with the native glutamate dehydrogenase. With fewer than three reaction knockouts and the swapped glutamate dehydrogenase, high yield of L-alanine is possible, but L-alanine production is not growth-coupled. The final set of three reactions knockouts and one dehydrogenase swap causes a strong coupling between L-alanine production and cell growth. LDH_D: D-lactate dehydrogenase; ACACT1r: acetyl-CoA acetyltransferase; ALCD2x: ethanol dehydrogenase; GLUDy: glutamate dehydrogenase.
Calculated Maximum Optimal Production Rate for Designs Selected by OptSwap and Designs Selected by RobustKnock a
Results are arranged by number of interventions, in which one intervention is either one knockout or one oxidoreductase swap. For OptSwap, any combination of swaps and knockouts was allowed. The cases in which OptSwap predicted growth-coupled designs with higher maximum optimal production rate are in bold, and the cases in which the RobustKnock designs are predicted to have higher maximum optimal production rate are in italics. Ten OptSwap designs were identified with higher maximum optimal production rate than designs with just reaction knockouts.
For anaerobic production of L-alanine from D-xylose and for aerobic production of L-alanine from glucose and D-xylose, slightly different designs were predicted. Phenotypes for aerobic production of L-alanine are predicted to have lower yield and higher maximum growth rate than the anaerobic designs (Fig. 2). In all four L-alanine production designs, swapping the cofactor specificity of glutamate dehydrogenase results in predicted phenotypes with strong growth coupling, while growth coupling is not predicted in any designs with four or fewer reaction knockouts. However, previously reported simulations suggest that growth-coupling of L-alanine production may be possible with five reaction knockouts in E. coli. 22
OptSwap predicted that anaerobic production of acetate can be increased with four interventions (Fig. 2). The acetate designs rely on replacing the native NAD(H)-dependent ethanol dehydrogenase (adhP or adhE) with a NADP(H)-dependent ethanol dehydrogenase. On both glucose and D-xylose substrates, maximum optimal acetate production is predicted to be higher with the OptSwap design than with just four or fewer reaction knockouts. Furthermore, simulation predicted that acetate production on glucose substrate with the OptSwap design can be obtained at higher growth rates, and thus with higher substrate-specific productivity.
Simulations predicted that succinate can be produced aerobically with glucose as the substrate using three knockouts and one dehydrogenase swap. The OptSwap design for succinate production utilizes a NAD(H)-dependent glutamate dehydrogenase in place of the native enzyme, coupled to three reaction knockouts. The result is a small increase in the predicted succinate production rate at maximum growth rate, but a large increase in the predicted substrate-specific productivity, so this design is when substrate-specific productivity is desirable.
The OptSwap design for aerobic production of D-lactate on D-xylose substrate is predicted to be strongly growth-coupled, whereas RobustKnock does not predict any growth-coupling for D-lactate with four or fewer interventions under identical conditions. The OptSwap design swaps the native NAD(H)-dependent ethanol dehydrogenase (adhP or adhE) with a NADP(H)-dependent ethanol dehydrogenase. Three knockouts were predicted: acetate kinase (tdcD or ackA or purT), ATP synthase (atpA–H), and pyruvate formate lyase (pflA and pflB). ATP synthase is not predicted to be essential by the metabolic model for growth on either of the substrates considered in these simulations (glucose amd D-xylose). Experimental evidence indicates that ATP synthase knockout strains grow slowly on glucose in vivo. 44
A number of cases exist in which RobustKnock predicted a design with greater optimal yield than the OptSwap solution (Table 2). In these cases, the RobustKnock solution contains a knocked out oxidoreductase reaction. In the formulation of OptSwap presented here, oxidoreductase reactions cannot be knocked out—Equation 4 enforces swaps and disallows knockouts. An alternative OptSwap formulation was considered in which oxidoreductase reaction knockouts are allowed. The equality constraint (Equation 4) is replaced with an inequality (Equation 12) that allows swaps or knockouts. However, the solution space of the more-general MILP problem caused challenges for the MILP solver used for this work. Numerical imprecision caused non-real solutions, and solution times increased to more than 12 hours. However, when the new problems solved to optimality (for a number of substrate/product combinations tested), they identified the same solutions found with RobustKnock (Supplementary Table S4).
Conclusions
This study presents a computational method to predict optimal cofactor specificity of oxidoreductase reactions in the genome-scale metabolic model. The designs identified by OptSwap are non-intuitive solutions to produce desirable cellular products—solutions that are not possible with just reaction knockouts. Growth-coupled designs for L-alanine and D-lactate were identified by OptSwap using four interventions, in which reaction knockouts could not be used to identify growth-coupled production phenotypes under identical conditions. The anaerobic production designs for L-alanine utilize a NAD(H)-specific glutamate dehydrogenase coupled to reaction knockouts that limit the pathways available for anaerobic fermentation in order to produce very high yields of L-alanine at maximum growth rate. For succinate and acetate, yield improvements were smaller, but large increases in substrate-specific productivity were predicted with OptSwap when compared to designs with just reaction knockouts.
The unique anaerobic L-alanine production design reported here is simpler than a previously reported design requiring seven knockouts and one gene insertion. 37 In that study, the native D-lactate dehydrogenase of E. coli was replaced with alanine dehydrogenase from Geobacillus stearothermophilus. To ensure growth coupling of L-alanine, the authors also disabled fermentation pathways for ethanol, acetate, formate, D-lactate, L-lactate, and succinate. By utilizing the predictive power of constraint-based modeling, we predicted a design with high yield and fewer genetic manipulations.
The OptSwap in silico design strategies can be broadly applied to any product in the genome-scale model. A previous analysis demonstrated that growth coupling in genome-scale models is most easily achieved for a class of products related to the native fermentation pathways in E. coli. 22 However, techniques like OptSwap may expand the scope of products that can be growth coupled by exploring a broader set of solutions. The knockout and swap designs predicted by OptSwap are candidates for experimental validation and can be built with current technologies. More complex designs than those presented here have been successfully implemented in E. coli, including the strain for production of L-alanine. 37
The main limitation of OptSwap is that it is more computationally demanding than previous methods, including RobustKnock and OptKnock. The greater complexity of the solution space means that finding an optimal solution is more difficult. MILP problems are nondeterministic polynomial time (NP)-complete, and solution time depends on the structure of the problem and the solving method. The solution times increased with the parameters K, L, and X (Supplementary Table S5), and adopting a more general problem increases solution times further. While the simpler formulation of OptSwap reported here is computationally tractable, the more general problem allowing oxidoreductase knockouts could not be solved in many cases within 12 hours. CPLEX is a sophisticated MILP solver, and tweaking solver parameters did yield improvements in performance. OptSwap can be solved by choosing the less general formulation utilized for these results and restricting interventions (L or X) to be four or less. Improvements in MILP solvers and computational power will reduce these challenges in the future, and even more complex MILP problems will be solvable.
OptSwap adds a new level of complexity to the COBRA methods for identification of useful production phenotypes in microorganisms. 17 The constraints placed on the presence/absence of oxidoreductase reactions with specificity for different cofactors allow direct investigation and manipulation of cofactor pools that are central to directing cellular resources in microorganisms. Changing cofactor specificity alters the metabolic solution space. While reaction knockouts always decrease the size of the solution space, changing cofactor specificity can have more complex effects and may even increase the solution space in the direction of an objective function.
OptSwap can be readily implemented in metabolic models of other organisms. For example, the experimental findings with a dehydrogenase swap in S. cerevisiae could be further investigated with the genome-scale metabolic model of that organism. 13 , 45 The constraints that enforce dehydrogenase specificity swaps can also be extended to target other metabolic specificity sets (eg, nucleoside triphosphates) and can be incorporated into other in silico design strategies.
Footnotes
Acknowledgments
We would like to thank Daniel Zielinski, Ali Ebrahim, and Bernhard Palsson for their assistance and insight on the project. We would like to acknowledge the Novo Nordisk Foundation for providing funding for the project.
Author Disclosure Statement
No competing financial interests exist.
