Understanding the pK_a of Redox Cysteines: The Key Role of Hydrogen Bonding

Abstract

Many cellular functions involve cysteine chemistry via thiol–disulfide exchange pathways. The nucleophilic cysteines of the enzymes involved are activated as thiolate. A thiolate is much more reactive than a neutral thiol. Therefore, determining and understanding the pK_a values of functional cysteines are important aspects of biochemistry and molecular biology with direct implications for redox signaling. Here, we describe the experimental and theoretical methods to determine cysteine pK_a values, and we examine the factors that control these pK_a values. Drawing largely on experience gained with the thioredoxin superfamily, we examine the roles of solvation, charge–charge, helix macrodipole, and hydrogen bonding interactions as pK_a-modulating factors. The contributions of these factors in influencing cysteine pK_a values and the associated chemistry, including the relevance for the reaction kinetics and thermodynamics, are discussed. This analysis highlights the critical role of direct hydrogen bonding to the cysteine sulfur as a key factor modulating the equilibrium between thiol S–H and thiolate S⁻. This role is easily understood intuitively and provides a framework for biochemical functional insights. Antioxid. Redox Signal. 18, 94–127.

I. Introduction

II. pKa Determination Methods

A. Experimental approaches

B. Computational methods

C. Future perspective for pKa calculations applied to cysteines

III. Factors That Control the pKa Values of Cysteine Thiols in Proteins

A. Limited role of charged side chains and long-range electrostatics

B. The strong influence of direct hydrogen bonds on the pKa of cysteines

C. Reinterpretation of the helical effect on the pKas of cysteines

D. How general are the mechanisms modulating the pKa of cysteines?

IV. Functional Properties Influenced by the Cysteine pKas

V. Conclusions

I. Introduction

Cysteine is one of the least abundant amino acids, but cysteine residues are actively involved in protein function in many ways (109, 110). Consistent with their functional role and ability to react chemically, cysteines are frequently found conserved. They are critical for the activity of oxidases, reductases, disulfide isomerases, and peroxidases (32) (Fig. 1). These enzymes play an important role in the redox homeostasis of cells. They are involved in the thiol–disulfide exchange reactions during oxidative protein folding, and in antioxidant defense mechanisms of the cell. Cysteine thiols are also essential in cell-cycle-regulating enzymes, like phosphatases and cysteine proteases. Thus, numerous enzymes depend on redox-active cysteines, the pK_a of which is a determining factor for their reactivity and nucleophilicity (183). The pK_a of a cysteine represents the balance of the equilibrium thiol S–H ⇄ thiolate S⁻. Since thiolates are much more reactive than neutral thiols, they are critical to the function of many cysteines. It is therefore important to characterize the pK_a values of cysteines, not only experimentally but also computationally. These tasks remain challenging; however, much progress has been achieved in these areas and in trying to understand the factors that modulate cysteine pK_a values.

FIG. 1.

Cysteines in thiol–disulfide exchange reactions catalyzed by Trx-fold enzymes. Cysteines present in a thiolate form at physiological pH are more sensitive to reactive oxygen species (ROS). Exposure to hydrogen peroxide (H₂O₂) leads to the oxidation of the thiol group into the reversible sulfenic acid (-SOH), whereas further exposure leads to irreversible cysteine oxidation states: sulfinic acid (-SO₂H) and sulfonic acid (-SO₃H). These higher oxidation states are considered irreversible, since no general sulfinic or sulfonic acid reductase enzymes have been identified yet (144). Human sulfiredoxin is the only known exception (186). Sulfenic acids are protected from irreversible oxidation by different mechanisms: disulfide formation with another cysteine, and mixed disulfide formation with low-molecular-weight thiols (LMW thiols; e.g., S-glutathionylation). Disulfide bond formation does not always proceed via a sulfenic acid intermediate, but can also result from the oxidation of two cysteine residues by oxidative protein-folding catalysts (e.g., disulfide-binding protein A [DsbA]). Protection of the cysteine side chain by reaction with a backbone amide nitrogen to form a sulfenamide is not shown. The cysteines of the enzymes with a thioredoxin (Trx)-fold are essential to catalyze disulfide bond formation and reduction. (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

The intrinsic pK_a value for the free cysteine thiol–thiolate equilibrium in an aqueous solution is close to 8.6 (26, 39, 82, 89, 169). In folded proteins, this pK_a can be shifted by the influence of the three-dimensional protein structure (13). Polar groups (charged or neutral) in the vicinity of a cysteine, and/or a different solvation environment compared to an aqueous solution, can influence the pK_a of a cysteine thiol. Under physiological conditions (pH ∼7), a thiol with a pK_a value below 7 will exist mostly as a more reactive thiolate, critical for catalysis (27). Note that physiological conditions are not always at pH ∼7, since different cellular organelles have different pH values. Lowered pK_a values of catalytic cysteines influence the reaction kinetics and thermodynamics, and strongly influence the catalytic efficiency of an enzyme during thiol–disulfide exchange reactions. The effect of pK_a lowering on reaction rate enhancement will in general be most significant when the pK_a values are close to the solution pH (78). Perturbed pK_a values also influence protein stability (189).

With accurate cysteine thiol pK_a values, one gains insights into catalytic mechanisms and into the factors influencing the pK_a values. It has long been known that the pK_a values of catalytic cysteines in thiol–disulfide oxidoreductases of the thioredoxin (Trx) superfamily can adopt a wide range of values in proteins with a similar structural fold (19). Therefore, this superfamily of enzymes provides a paradigm to study the factors influencing and modulating the pK_a of thiol groups (35, 43, 45, 49, 56, 61, 77, 117, 129, 132, 142, 179). A thorough compilation of experimentally measured pK_a values for these enzymes (and related model systems) should be a helpful resource (Table 1).

Table 1.

Experimentally Measured pK_a Values for Thiol Groups in Proteins of the Thioredoxin Superfamily

System	Residue	Reported pK_a value ^a	Experimental method	References ^b	Comments
Trx wild type
Escherichia coli Trx wild-type (-C32-G33-P34-C35-)	Cys32	6.7	pH dependence of enzyme reaction with iodoacetic acid and iodoacetamide	(82)	The pK _a of Cys35 appeared to be above (but close to) 9
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys32	7.1	Raman spectroscopy	(101)	Raman spectroscopy used to probe cysteine titrations, redox state, and conformations
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys35	7.9	Raman spectroscopy	(101)	Raman spectroscopy used to probe cysteine titrations, redox state, and conformations
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys32 and Cys35	9–10	pH dependence of equilibrium reaction with glutathione and direct UV absorbance	(166)	The pK _as inferred in this study have been controversial, and a reinterpretation of these results is now generally accepted (38).
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys32	7.4	pH dependence of NMR chemical shifts	(77)	Consistent with pK _a derived from UV measurements
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys35	9.5	pH dependence of NMR chemical shifts	(77)	Consistent with pK _a derived from UV measurements
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys32	7.1	UV absorption at 240 nm during pH titration	(38)	Consistent with pK _a derived from NMR measurements
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys35	9.9	UV absorption at 240 nm during pH titration	(38)	Consistent with pK _a derived from NMR measurements
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys32	7.13	pH dependence of reaction with iodoacetamide	(117)	Validated further the method relying on the pH-dependent reaction with iodoacetamide, to estimate thiol pK _as
E. coli Trx wild-type (-C32-G33-P34-C35-)	Cys32	7.0	UV absorption at 240 nm during pH titration	(139)	To compare to the I75T mutant in the same study
E. coli Trx2 wild-type (-C64-G65-P66-C67-)	Cys64	5.1	UV absorption at 240 nm during pH titration	(40)	The pK _a of the active-site nucleophilic cysteine of Trx2 is lower than in Trx1, consistent with Trx2 being more oxidizing than Trx1.
Trx mutants
E. coli Trx mutant D26A	Cys32	8.0	pH dependence of NMR chemical shifts	(38)	D26 is conserved, buried near the Trx active site, and thought to be important for Trx function
E. coli Trx mutant D26A	Cys32	7.8	UV absorption at 240 nm during pH titration	(38)	D26 is conserved, buried near the Trx active site, and thought to be important for Trx function
E. coli Trx mutant K57M	Cys32	8.0	pH dependence of NMR chemical shifts	(38)	K57 forms a buried salt bridge with D26 in the vicinity of the active site
E. coli Trx mutant K57M	Cys32	8.1	UV absorption at 240 nm during pH titration	(38)	K57 forms a buried salt bridge with D26 in the vicinity of the active site
E. coli Trx mutant D26A/K57M	Cys32	8.1	pH dependence of NMR chemical shifts	(38)	K57 forms a buried salt bridge with D26 in the vicinity of the active site
E. coli Trx mutant D26A/K57M	Cys32	8.0	UV absorption at 240 nm during pH titration	(38)	K57 forms a buried salt bridge with D26 in the vicinity of the active site
E. coli Trx mutant (-C32-A33-T34-C35-)	Cys32	6.21	pH dependence of reaction with iodoacetamide	(117)	A Trx reductase-like -C-X-X-C- sequence in Trx
E. coli Trx mutant (-C₃₂-G₃₃-H₃₄-C₃₅-)	Cys32	6.34	pH dependence of reaction with iodoacetamide	(117)	A PDI-like -C-X-X-C- sequence in Trx
E. coli Trx mutant (-C₃₂-P₃₃-H₃₄-C₃₅-)	Cys32	6.12	pH dependence of reaction with iodoacetamide	(117)	A DsbA-like -C-X-X-C- sequence in Trx
E. coli Trx mutant (-C₃₂-P₃₃-Y₃₄-C₃₅-)	Cys32	5.86	pH dependence of reaction with iodoacetamide	(117)	A Grx-like -C-X-X-C- sequence in Trx
E. coli Trx2 mutant (-C₆₄-G₆₅-P₆₆-S₆₇-)	Cys64	7.1	UV absorption at 240 nm during pH titration	(40)	Mutating Cys67 to Ser increased the pK _a of the active-site nucleophilic cysteine by 2
E. coli Trx2 quadruple mutant C5S/C8S/C25S/C28S	Cys64	7.1	UV absorption at 240 nm during pH titration	(40)	Mutating the four cysteines in the N-terminal domain prevented binding of zinc to these cysteines, and increased the pK _a of the active-site nucleophilic cysteine by two
E. coli Trx mutant I75T	Cys32	5.5	UV absorption at 240 nm during pH titration	(139)	The I75T mutation is in the cis-Pro loop in the vicinity of the active site
DsbA wild type
E. coli DsbA wild-type (-C₃₀-P₃₁-H₃₂-C₃₃-)	Cys30	3.5	UV absorption at 240 nm during pH titration	(121)	An early biochemical study of DsbA and its reactivity
E. coli DsbA wild-type (-C₃₀-P₃₁-H₃₂-C₃₃-)	Cys30	3.42	UV absorption at 240 nm during pH titration	(56)	A major model system, extensively studied experimentally and theoretically, including with mutants
E. coli DsbA wild-type (-C₃₀-P₃₁-H₃₂-C₃₃-)	Cys30	3.28	UV absorption at 240 nm during pH titration	(69)	Used DsbA to investigate the -C-X-X-C- motif of other oxidoreductases, and its influence on pK _a and redox properties
E. coli DsbA wild-type (-C₃₀-P₃₁-H₃₂-C₃₃-)	Cys30	3.34 and 3.29	UV absorption at 240 nm during pH titration, and pH-dependent alkylation by iodoacetamide	(117)	Found that the pK _a measured by reaction with iodoacetamide (pK _a=3.29) agreed well with that measured by UV absorption at 240 nm (pK _a=3.34)
E. coli DsbA wild-type (-C₃₀-P₃₁-H₃₂-C₃₃-)	Cys30	3.3	UV absorption at 240 nm during pH titration	(139)	To compare to the V150T mutant in the same study
Vibrio cholerae DsbA wild-type (-C₄₉-P₅₀-H₅₁-C₅₂-)	Cys49	5.1	Kinetics of oxidation of a substrate peptide monitored by tryptophan fluorescence	(146)	The pK _a was attributed to the active-site reactive cysteine, and inferred indirectly from kinetic measurements, which may account for the surprisingly high pK _a reported for Cys49
DsbA mutants
E. coli DsbA mutant (-C₃₀-S₃₁-V₃₂-C₃₃-)	Cys30	4.23	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-S₃₁-F₃₂-C₃₃-)	Cys30	4.34	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-P₃₁-L₃₂-C₃₃-)	Cys30	4.42	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-S₃₁-T₃₂-C₃₃-)	Cys30	4.45	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-Q₃₁-L₃₂-C₃₃-)	Cys30	4.59	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C30-T31-R32-C33-)	Cys30	4.76	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-L₃₁-T₃₂-C₃₃-)	Cys30	4.86	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-P₃₁-P₃₂-C₃₃-)	Cys30	6.73	UV absorption at 240 nm during pH titration	(56)	Random mutagenesis of the -C-X-X-C- active-site sequence of DsbA to test effect on redox potential and pK _a of Cys30
E. coli DsbA mutant (-C₃₀-P₃₁-G₃₂-C₃₃-)	Cys30	4.85	UV absorption at 240 nm during pH titration	(69)	His32Gly mutation introduced to test the influence of electrostatics associated with His32
E. coli DsbA mutant (-C₃₀-G₃₁-H₃₂-C₃₃-)	Cys30	3.71	UV absorption at 240 nm during pH titration	(69)	A PDI-like -C-X-X-C- sequence in DsbA
E. coli DsbA mutant (-C₃₀-A₃₁-T₃₂-C₃₃-)	Cys30	4.34	UV absorption at 240 nm during pH titration	(69)	A Trx reductase-like -C-X-X-C- sequence in DsbA
E. coli DsbA mutant (-C₃₀-P₃₁-Y₃₂-C₃₃-)	Cys30	3.75	UV absorption at 240 nm during pH titration	(69)	A Grx-like -C-X-X-C- sequence in DsbA
E. coli DsbA mutant (-C₃₀-G₃₁-P₃₂-C₃₃-)	Cys30	6.21	UV absorption at 240 nm during pH titration	(69)	A Trx-like -C-X-X-C- sequence in DsbA
E. coli DsbA mutant E37Q	Cys30	3.69	UV absorption at 240 nm during pH titration	(66)	The mutation of E37 in the vicinity of the active site did not result in a significant change of the pK _a of catalytic Cys30
E. coli DsbA mutant E38Q	Cys30	3.52	UV absorption at 240 nm during pH titration	(66)	The mutation of E38 in the vicinity of the active site did not result in a significant change of the pK _a of catalytic Cys30
E. coli DsbA mutant E37Q/E38Q	Cys30	3.84	UV absorption at 240 nm during pH titration	(66)	Mutation of both E37 and E38 in the vicinity of the DsbA active site
E. coli DsbA mutant ΔE38V39L40	Cys30	3.92	UV absorption at 240 nm during pH titration	(66)	Deletion of tripeptide E38V39L40, to mimic the corresponding helix of Trx and Grx
E. coli DsbA mutant ΔE38V39L40/H41P	Cys30	3.95	UV absorption at 240 nm during pH titration	(66)	Deletion of tripeptide E38V39L40 and H41P mutation, to mimic even more closely the corresponding helix of Trx
E. coli DsbA mutant E24Q	Cys30	3.52	UV absorption at 240 nm during pH titration	(74)	This mutation in the vicinity of the active site lowered the pK _a of catalytic Cys30 by 0.03 relative to the wild type
E. coli DsbA mutant K58M	Cys30	3.19	UV absorption at 240 nm during pH titration	(74)	This mutation in the vicinity of the active site lowered the pK _a of catalytic Cys30 by 0.36 relative to the wild type
E. coli DsbA mutant E37Q	Cys30	3.69	UV absorption at 240 nm during pH titration	(74)	This mutation in the vicinity of the active site increased the pK _a of catalytic Cys30 by 0.14 relative to the wild type
E. coli DsbA mutant E24Q/K58M	Cys30	4.46	UV absorption at 240 nm during pH titration	(74)	This mutant in the vicinity of the active site increased the pK _a of catalytic Cys30 by 0.91 relative to the wild type
E. coli DsbA mutant E24Q/E37Q	Cys30	3.81	UV absorption at 240 nm during pH titration	(74)	This mutant in the vicinity of the active site increased the pK _a of catalytic Cys30 by 0.26 relative to the wild type
E. coli DsbA mutant E37Q/K58M	Cys30	3.50	UV absorption at 240 nm during pH titration	(74)	This mutant in the vicinity of the active site decreased the pK _a of catalytic Cys30 by 0.05 relative to the wild type
E. coli DsbA mutant E24Q/E37Q/K58M	Cys30	3.94	UV absorption at 240 nm during pH titration	(74)	This mutant in the vicinity of the active site increased the pK _a of catalytic Cys30 by 0.39 relative to the wild type
E. coli DsbA mutant E24Q/E37Q/E38Q/K58M	Cys30	3.89	UV absorption at 240 nm during pH titration	(74)	This mutant in the vicinity of the active site increased the pK _a of catalytic Cys30 by 0.34 relative to the wild type
E. coli DsbA mutant V150T	Cys30	3.5	UV absorption at 240 nm during pH titration	(139)	The V150T mutation is in the cis-Pro loop in the vicinity of the active site
Glutaredoxins wild type
Human Grx1 wild type	ND	3.5	pH dependence of iodoacetamide enzyme inactivation	(116)	The pK _a of 3.5 is most likely that of the nucleophilic Cys22 in the active site. The glutaredoxin is called thioltransferase.
Human Grx1 C7S/C78S/C82S	Cys22	3.6	pH dependence of iodoacetamide enzyme inactivation	(75)	This construct may be considered representative of the wild type, since the 3 mutated cysteines are far from the active site.
Yeast Grx wild type	Cys26	<4	pH dependence of iodoacetamide enzyme inactivation	(48)	The glutaredoxin is called thioltransferase.
E. coli Grx1 wild type	Cys11	<5	Experimental protocol not reported	(9)	Mentioned as unpublished work
E. coli Grx3 (C65Y mutant)	Cys11	4.1	UV absorption at 240 nm during pH titration	(45)	This construct is a representative of the wild type, since the C65Y mutation is far from the -C-X-X-C- motif. Cys11 titration occurs concurrently with protein unfolding at low pH.
Pig Grx wild type	Cys22	2.5	pH dependence of reaction with iodoacetic acid	(47)	The estimated pK _a value of 2.5 was subsequently revised to be 3.8 in (190).
Pig Grx	Cys22	3.8	pH dependence of reaction with iodoacetamide	(190)	A theoretical study has rationalized much of this experimental biochemical work.
Grxs mutants
Human Grx1 C7S/C25S/C78S/C82S	Cys22	4.2	pH dependence of iodoacetamide enzyme inactivation	(75)	Referred to as single-cysteine construct in the study: SC-Grx. The C25S mutation is in the active site.
Human Grx1 C7S/C25S/C78S/C82S K19L	Cys22	4.6	pH dependence of iodoacetamide enzyme inactivation	(75)	Referred to as single-cysteine construct in the study: SC-Grx. Tests the effect of K19 on the pK _a of catalytic Cys22
Human Grx1 C7S/C25S/C78S/C82S K19Q	Cys22	5.0	pH dependence of iodoacetamide enzyme inactivation	(75)	Referred to as single-cysteine construct in the study: SC-Grx. Tests the effect of K19 on the pK _a of catalytic Cys22
Human Grx1 C7S/C78S/C82S K19L	Cys22	3.7	pH dependence of iodoacetamide enzyme inactivation	(75)	Referred to as triple-mutant construct in the study: TM-Grx. Tests the effect of K19 on the pK _a of catalytic Cys22
Human Grx1 C7S/C78S/C82S K19Q	Cys22	3.7	pH dependence of iodoacetamide enzyme inactivation	(75)	Referred to as triple-mutant construct in the study: TM-Grx. Tests the effect of K19 on the pK _a of catalytic Cys22
Pig Grx mutant K27Q	Cys22	4.3	pH dependence of reaction with iodoacetamide	(190)	A theoretical study has rationalized much of the experimental biochemical work
Pig Grx mutant C78S/C82S	Cys22	4.4	pH dependence of reaction with iodoacetamide	(190)	A theoretical study has rationalized much of the experimental biochemical work
Pig Grx mutant C25S	Cys22	4.9	pH dependence of reaction with iodoacetamide	(190)	A theoretical study has rationalized much of the experimental biochemical work
Pig Grx mutant C25A	Cys22	5.9	pH dependence of reaction with iodoacetamide	(190)	A theoretical study has rationalized much of the experimental biochemical work
Pig Grx mutant R26V	Cys22	ND	pH dependence of reaction with iodoacetamide	(190)	The pK _a of Cys22 could not be measured with the R26V mutant, due to lack of enzymatic activity.
Pig Grx mutant R26V/K27Q	Cys22	ND	pH dependence of reaction with iodoacetamide	(190)	The pK _a of Cys22 could not be measured with the R26V/K27Q mutant, due to lack of enzymatic activity.
E. coli Grx3 (C14A/C65Y mutant)	Cys11	5	UV absorption at 240 nm during pH titration	(45)	To test effect of active-site C14A mutation on pK _a of nucleophilic Cys11
E. coli Grx3 (K8A/C65Y mutant)	Cys11	4.2	UV absorption at 240 nm during pH titration	(45)	To test effect of active-site K8A mutation on pK _a of nucleophilic Cys11
PDI wild type
Bovine PDI	Cys35 and Cys379	6.7	pH dependence of enzyme inactivation with iodoacetamide	(64)	The pK _a of 6.7 was tentatively assigned to Cys1 in the two -Cys₁-Gly-His-Cys₂- motifs of PDI. The true values of those pK _as are now thought to be lower (83).
Bovine PDI	ND	5.6	Kinetics of oxidation of a substrate peptide monitored by tryptophan fluorescence	(146)	The pK _a was attributed to an active-site reactive cysteine, and inferred indirectly from kinetic measurements, which may account for this surprisingly high pK _a.
PDI mutants
Human PDI C56S	Cys53	4.81	pH dependence of the rate of reaction with Ellman's reagent	(83)	Study of the -Cys₅₃-Gly₅₄-His₅₅-Cys₅₆- motif in a catalytic domain active site, to investigate PDI reaction mechanisms
Human PDI C56S/R120Q	Cys53	4.84	pH dependence of the rate of reaction with Ellman's reagent	(83)	Apparently limited role of Arg120 on the pK _a of Cys53 in -Cys₅₃-Gly₅₄-His₅₅-Cys₅₆-
Human PDI C53M	Cys56	8.60	pH dependence of the rate of reaction with Ellman's reagent	(83)	To investigate the pK _a of Cys56 in -Cys₅₃-Gly₅₄-His₅₅-Cys₅₆-, with C53M mimicking a transient mixed disulfide
Human PDI C53M/R120Q	Cys56	9.14	pH dependence of the rate of reaction with Ellman's reagent	(83)	Interpreted as evidence that R120 lowers the pK _a of Cys56 in -Cys₅₃-Gly₅₄-His₅₅-Cys₅₆-, with possible mechanistic implications
Human PDI C53M/R120D	Cys56	9.22	pH dependence of the rate of reaction with Ellman's reagent	(83)	Interpreted as evidence that R120 lowers the pK _a of Cys56 in -Cys₅₃-Gly₅₄-His₅₅-Cys₅₆-, with possible mechanistic implications
ResA wild type
Bacillus subtilis ResA wild-type (-C₇₄-E₇₅-P₇₆-C₇₇-)	Cys74	8.8	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(100)	Apparently unusually high pK _a for an N-terminal cysteine in a -C-X-X-C- motif. X-ray structure of reduced ResA is PDB entry 1SU9
B. subtilis ResA wild-type (-C₇₄-E₇₅-P₇₆-C₇₇-)	Cys77	8.2	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(100)	The pK _a of this C-terminal cysteine appeared lower than the pK _a for the N-terminal cysteine, which is atypical
ResA mutants
B. subtilis ResA (-C₇₄-E₇₅-P₇₆-A₇₇-)	Cys74	8.48	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(100)	Apparently unusually high pK _a for an N-terminal cysteine in a -C-X-X-C- motif. X-ray structure is PDB entry 2H19
B. subtilis ResA (-A₇₄-E₇₅-P₇₆-C₇₇-)	Cys77	8.36	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(100)	X-ray structure is PDB entry 2H1A
B. subtilis ResA (-A₇₄-E₇₅-P₇₆-C₇₇-)	Cys77	8.3	pH dependence of reaction rate with iodoacetate	(100)	X-ray structure is PDB entry 2H1A
B. subtilis ResA (-C₇₄-E₇₅-P₇₆-C₇₇-) E80Q	Cys77	7.4	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(100)	X-ray structure is PDB entry 2H1B
B. subtilis ResA (-C₇₄-P₇₅-P₇₆-C₇₇-)	Cys74	7.0	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(99)	Tests the influence of E75 on the pK _as of Cys74 and Cys77
B. subtilis ResA (-C₇₄-P₇₅-P₇₆-C₇₇-)	Cys77	6.6	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(99)	Tests the influence of E75 on the pK _as of Cys74 and Cys77
B. subtilis ResA (-C₇₄-E₇₅-H₇₆-C₇₇-)	Cys74	7.4	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(99)	H76 in this ResA mutant mimics H32 in DsbA. X-ray structure is PDB entry 3C73
B. subtilis ResA (-C₇₄-E₇₅-H₇₆-C₇₇-)	Cys77	7.5	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(99)	H76 in this ResA mutant mimics H32 in DsbA. X-ray structure is PDB entry 3C73
B. subtilis ResA (-C₇₄-P₇₅-H₇₆-C₇₇-)	Cys74	6.3	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(99)	H76 in this ResA mutant mimics H32 in DsbA. X-ray structure is PDB entry 3C73
B. subtilis ResA (-C₇₄-P₇₅-H₇₆-C₇₇-)	Cys77	5.7	pH dependence of reaction rate with alkylating agent badan, monitored by fluorescence	(99)	H76 in this ResA mutant mimics H32 in DsbA. X-ray structure is PDB entry 3C73
DsbD wild type
E. coli DsbD C-terminal domain (-C₄₆₁-V₄₆₂-A₄₆₃-C₄₆₄-)	Cys461	10.5	NMR chemical shifts determined as a function of pH	(112)	pK _a for Cys461 in the isolated C-terminal domain of DsbD. Unusually high pK _a for an N-terminal cysteine in a -C-X-X-C- motif.
E. coli DsbD C-terminal domain (-C₄₆₁-V₄₆₂-A₄₆₃-C₄₆₄-)	Cys464	>12.2	NMR chemical shifts determined as a function of pH	(112)	pK _a for Cys464 in the isolated C-terminal domain of DsbD.
E. coli DsbD-gamma domain (-C₄₆₁-V₄₆₂-A₄₆₃-C₄₆₄-)	Cys461	9.3	pH dependence of reaction with iodoacetamide and UV absorption at 240 nm	(161)	Unusually high pK _a for an N-terminal cysteine in a -C-X-X-C- motif. X-ray structure is PDB entry 2FWF
DsbD mutants
E. coli DsbD C-terminal domain (-C₄₆₁-V₄₆₂-A₄₆₃-C₄₆₄-) E468Q	Cys461	9.9	NMR chemical shifts determined as a function of pH	(112)	To test the effect of E468 on the pK _a of Cys461
E. coli DsbD C-terminal domain (-C₄₆₁-V₄₆₂-A₄₆₃-C₄₆₄-) D455N	Cys461	9.3	NMR chemical shifts determined as a function of pH	(112)	To test the effect of D455 on the pK _a of Cys461
E. coli DsbD C-terminal domain (-C₄₆₁-V₄₆₂-A₄₆₃-C₄₆₄-) D455N/E468Q	Cys461	8.6	NMR chemical shifts determined as a function of pH	(112)	To test the combined effect of D455N and E468Q on the pK _a of Cys461
DsbC
E. coli DsbC wild type (-C₉₈-G₉₉-Y₁₀₀-C₁₀₁-)	Cys98	4.1	pH dependence of UV absorption at 240 nm	(163)	This pK _a in wild-type dimeric DsbC was found to be very similar to its counterpart in a C-terminal fragment of DsbC (residues 66–216)
E. coli DsbC C-terminal fragment (-C₉₈-G₉₉-Y₁₀₀-C₁₀₁-)	Cys98	4.3	pH dependence of UV absorption at 240 nm	(163)	This C-terminal fragment of DsbC contained only residues 66–216 and was monomeric
E. coli DsbC (-C₉₈-G₉₉-Y₁₀₀-C₁₀₁-) Mutant T182V	Cys98	5.8	pH dependence of UV absorption at 240 nm	(139)	The mutation T182V is in the cis-Pro loop, in the vicinity of the active site
E. coli DsbC wild-type (-C₉₈-G₉₉-Y₁₀₀-C₁₀₁-)	Cys98	4.6	pH dependence of UV absorption at 240 nm	(139)	To compare to the pK _a in the T182V mutant obtained in the same study
DsbG
E. coli DsbG wild-type (-C₁₀₉-P₁₁₀-Y₁₁₁-C₁₁₂-)	Cys109	3.5	pH dependence of UV absorption at 240 nm	(139)	This study also determined the redox potential, but not the pK _a, for two mutants of DsbG
Tryparedoxin
Trypanosoma brucei Tryparedoxin (-C₄₀-P₄₁-P₄₂-C₄₃-)	Cys40	7.2	pH dependence of UV absorption at 240 nm	(138)	Wild-type tryparedoxin
T. brucei Tryparedoxin (-C₄₀-G₄₁-P₄₂-C₄₃-)	Cys40	7.2	pH dependence of UV absorption at 240 nm	(138)	Mutant tryparedoxin with the -C-X₁-X₂-C- motif of a typical Trx
T. brucei Tryparedoxin (-C₄₀-P₄₁-Y₄₂-C₄₃-)	Cys40	≤4	pH dependence of UV absorption at 240 nm	(138)	Mutant tryparedoxin with the -C-X₁-X₂-C- motif of typical Grx
Peptide model systems
Cysteine in random-coil peptides	Various sequences	8.48–8.90	UV absorption at 240 nm during pH titration	(89)	Provides reference values for the pK _a of cysteines in disordered (unfolded) peptides
Alanine pentapeptide	NA	8.55	Potentiometry	(169)	Study designed to provide reference, unperturbed (intrinsic), pK _a values for titratable groups in proteins
Cysteine in 16 model peptides	Various sequences	7.35–9.08	pH dependence of reaction rates with iodoacetamide or of UV absorption at 240 nm	(17)	Carefully designed model systems, to model cysteines of BPTI or of members of the Trx superfamily
Cysteine at N-terminus of helical peptides	Various sequences	7.20–7.63	UV absorption at 240 nm during pH titration	(89)	Carefully designed model systems, testing the influence of peptide helicity and sequence on the pK _a at helical N-termini
CAAC at N-terminus of α-helical peptide	Ncap and N3Cys	6.74	Ellipticity monitored by circular dichroism at 222 nm	(73)	Only an apparent pK _a was determined, since the pK _as of the individual cysteines could not be separated.

^a Experimentally measured pK_a values reported in the literature; these values are listed here with the same precision as in the original publication.

^b For clarity, only references to experimental (noncomputational) work are cited here; see main text for references to theoretical/computational work.

DsbA, disulfide-binding protein A; Grx, glutaredoxin; NA, not applicable; ND, not determined; NMR, nuclear magnetic resonance; PDB, Protein Data Bank; PDI, protein disulfide isomerase; Trx, thioredoxin.

II. pK_a Determination Methods

The pK_a of a cysteine thiol group can be obtained from the equilibrium constant K_a for the deprotonation reaction:

\begin{align*}
CysSH \rightleftarrows CysS^{-} + H^{+} \tag{{\rm Eq.\ 1}}
\end{align*}

\begin{align*}
K_a = \frac{[H^{+}][CysS^{-}]}{[CysSH]} \tag{{\rm Eq.\ 2}}
\end{align*}

in which [H⁺], [CysS⁻], and [CysSH] are the equilibrium concentrations given in mol/l.

The central equation for this equilibrium is the Henderson–Hasselbalch equation (Eq. 3), which is a convenient rearrangement of the K_a equilibrium equation:

\begin{align*}
\log K_a &= \log \frac{[H^{+}][CysS^{-}]}{[CysSH]} = \log [H^{+}] + \log \frac{[CysS^{-}]}{[CysSH]} \\
-\log [H^{+}] &= -\log K_a + \log \frac{[CysS^{-}]}{[CysSH]} \\
pH &= pK_a + \log \frac{[CysS^{-}]}{[CysSH]}
\end{align*}

\begin{align*}
10^{(pK_a - pH)} = \frac{[CysSH]}{[CysS^{-}]} \tag{{\rm Eq.\ 3}}
\end{align*}
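As a brief illustration of how the Henderson–Hasselbalch relation translates a pK_a into a thiolate population, the following Python sketch computes the thiolate fraction 1/(1 + 10^(pK_a − pH)). The function name `thiolate_fraction` is hypothetical, and the choice of pH 7.0 is an assumption made only for this example; the pK_a values are taken from the text (free cysteine, about 8.6) and from Table 1 (E. coli Trx Cys32, 7.1; E. coli DsbA Cys30, 3.5).

```python
def thiolate_fraction(pka: float, ph: float) -> float:
    """Fraction of a cysteine present as thiolate at a given pH.

    Uses [CysSH]/[CysS-] = 10**(pKa - pH), so that
    fraction = [CysS-] / ([CysSH] + [CysS-]) = 1 / (1 + 10**(pKa - pH)).
    """
    thiol_to_thiolate = 10.0 ** (pka - ph)
    return 1.0 / (1.0 + thiol_to_thiolate)


# Illustrative pKa values quoted in the text and in Table 1.
examples = {
    "free cysteine (intrinsic)": 8.6,
    "E. coli Trx Cys32": 7.1,
    "E. coli DsbA Cys30": 3.5,
}

ASSUMED_PH = 7.0  # assumption for this example only

for name, pka in examples.items():
    fraction = thiolate_fraction(pka, ASSUMED_PH)
    print(f"{name}: pKa {pka:.1f}, thiolate fraction at pH {ASSUMED_PH:.1f} = {fraction:.3f}")
```

When the pK_a equals the pH, the function returns exactly one half, which is the usual operational definition of the pK_a in a titration curve.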

In this section, we briefly describe the experimental and calculation methods to obtain pK_a values. We focus on the current widely used techniques to obtain thiol pK_a values in proteins.

A. Experimental approaches

Traditional methods used to determine pK_a values of cysteines in proteins are based on UV absorption spectroscopy, rate constant determination, microcalorimetry, and in some cases, nuclear magnetic resonance (NMR) spectroscopy. Raman spectroscopy (101), quantitative mass spectrometry (122), and potentiometric titrations (169) have also been used.

A well-established spectroscopic method is based on the greater absorption of ultraviolet light by the thiolate than by the thiol at 240 nm (A_red) (Fig. 2). The difference in the ultraviolet extinction coefficient between the thiol and the thiolate anion is about 4000 M⁻¹ cm⁻¹ at 240 nm (14, 131), which has been used to determine the pK_a values of the thiols of disulfide-binding protein A (DsbA) (121), of several wild-type and active-site mutants of Trx (40, 142, 166), and of related proteins (Table 1). The protein in a buffer solution at high pH is titrated with HCl, and A_red is followed at decreasing pH during the titration. However, several corrections to A_red are needed. First, the protein concentration varies because the volume increases upon addition of HCl. To correct for this, the absorption at 240 nm is divided by the absorption at 280 nm, the wavelength at which the protein concentration can be determined spectroscopically (A_240red/A_280red). Second, other chromophores, mainly the side chains of Phe, Tyr, and Trp, also absorb at 240 nm, and their absorbance might change due to changes in their environment during the titration. This source of variability in (A_240red/A_280red), which is thiolate/thiol independent, can be corrected by measuring the absorption of the same protein in which the thiol group is absent (A_240oxid/A_280oxid). For this purpose, one can use the same protein in which the cysteines are alkylated with iodoacetamide (IAM), which will not contribute to the thiolate absorbance at 240 nm. Alternatively, an engineered variant in which the cysteine is mutated to a serine can be used. In the case of a protein with a -C-X-X-C- active-site motif, the oxidized form with a disulfide between the two cysteines can be used. To remove these thiol-independent contributions from each experimentally obtained absorption ratio (A_240red/A_280red), one normalizes it by dividing by the ratio (A_240oxid/A_280oxid), in which no thiol or thiolate is present.
The (A_240red/A_280red)/(A_240oxid/A_280oxid) ratio is determined at varying pH by titration with HCl and plotted against pH (Fig. 2). The pK_a is obtained from a nonlinear least-squares fit to the rearranged Henderson–Hasselbalch equation:

\begin{align*}A_{\exp} = A_{SH} + \frac{(A_{s^-} - A_{SH})}{1 + 10^{(pK_a - pH)}} \tag{Eq. 4}\end{align*}

in which A_exp is the experimentally determined (A_240red/A_280red)/(A_240oxid/A_280oxid) value; A_SH is the A₂₄₀/A₂₈₀ value for the protonated thiol form; and A_s- is the A₂₄₀/A₂₈₀ value for the deprotonated thiolate form.

FIG. 2.

Experimental pK_a titration curve for a cysteine thiol. The curve has been fitted to the rearranged Henderson–Hasselbalch equation (Eq. 4). Similar curves may be obtained for UV spectroscopy results, isothermal titration, nuclear magnetic resonance, and second-order rate constants.

This rearranged equation has been deduced from the relation between the fraction A_exp and the absorption of the protonated and deprotonated forms of the cysteine (Fig. 2), according to the following expression:

\begin{align*}
A_{\exp} &= A_{s^-} R + A_{SH} (1 - R) \\
R &= \frac{A_{\exp} - A_{SH}}{A_{s^-} - A_{SH}} \\
A_{\exp} - A_{SH} &= R (A_{s^-} - A_{SH}) \tag{Eq. 5}
\end{align*}

in which R is the fraction of the total absorption difference between the fully deprotonated and the fully protonated cysteine. By using Eq. 3, this fraction R can be written as:

\begin{align*}
R &= \frac{[CysS^{-}]}{[CysS^{-}] + [CysSH]} = \frac{1}{1 + [CysSH]/[CysS^{-}]} \\
&= \frac{1}{1 + 10^{(pK_a - pH)}} \tag{Eq. 6}
\end{align*}

which, using Eq. 5 and Eq. 6, finally results in the rearranged Henderson–Hasselbalch equation:

\begin{align*}A_{\exp} = A_{SH} + \frac{(A_{s^-} - A_{SH})}{1 + 10^{(pK_a - pH)}}\end{align*}
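The fit of Eq. 4 to a titration curve can be sketched in a few lines of Python. This is only one possible way to perform the nonlinear least-squares fit, not the procedure used in the cited studies. It exploits the fact that, for a fixed pK_a, Eq. 4 is linear in R (Eq. 6), so the two plateaus follow from a straight-line fit and only the pK_a needs a one-dimensional search. The function names, the search settings, and the synthetic data are assumptions made for this illustration.

```python
import math


def eq4(ph, a_sh, a_s, pka):
    """Eq. 4: expected normalized absorbance ratio at a given pH."""
    return a_sh + (a_s - a_sh) / (1.0 + 10.0 ** (pka - ph))


def plateaus_for_pka(phs, ratios, pka):
    """For a fixed pKa, fit A_SH and A_s- by a straight-line fit of ratio vs R."""
    r = [1.0 / (1.0 + 10.0 ** (pka - ph)) for ph in phs]   # Eq. 6
    n = len(phs)
    r_mean, y_mean = sum(r) / n, sum(ratios) / n
    sxx = sum((ri - r_mean) ** 2 for ri in r)
    sxy = sum((ri - r_mean) * (yi - y_mean) for ri, yi in zip(r, ratios))
    delta = sxy / sxx                       # estimate of A_s- minus A_SH
    a_sh = y_mean - delta * r_mean
    sse = sum((yi - (a_sh + delta * ri)) ** 2 for ri, yi in zip(r, ratios))
    return a_sh, a_sh + delta, sse


def fit_eq4(phs, ratios, step=0.05, tol=1e-6):
    """Return (A_SH, A_s-, pKa) minimizing the sum of squared residuals."""
    lo, hi = min(phs), max(phs)

    def sse(pka):
        return plateaus_for_pka(phs, ratios, pka)[2]

    # Coarse scan over the titrated pH range, then golden-section refinement.
    n_steps = int(round((hi - lo) / step))
    best = min((lo + i * step for i in range(n_steps + 1)), key=sse)
    a, b = max(lo, best - step), min(hi, best + step)
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = b - inv_phi * (b - a), a + inv_phi * (b - a)
    fc, fd = sse(c), sse(d)
    while b - a > tol:
        if fc < fd:                          # minimum lies in [a, d]
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = sse(c)
        else:                                # minimum lies in [c, b]
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = sse(d)
    pka = (a + b) / 2.0
    a_sh, a_s, _ = plateaus_for_pka(phs, ratios, pka)
    return a_sh, a_s, pka


# Synthetic, noise-free titration (hypothetical values, not measured data).
phs = [3.0 + 0.5 * i for i in range(15)]                  # pH 3.0 to 10.0
ratios = [eq4(ph, 1.00, 1.80, 6.3) for ph in phs]
a_sh_fit, a_s_fit, pka_fit = fit_eq4(phs, ratios)
print(f"A_SH = {a_sh_fit:.3f}, A_s- = {a_s_fit:.3f}, pKa = {pka_fit:.2f}")
```

The search is restricted to the titrated pH range, since a pK_a outside that range cannot be determined reliably from the data in any case.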

In some cases, the protein structural environment around the cysteine or its local dynamics is affected by the pH, which in turn influences the cysteine absorbance. In these situations, it is difficult to evaluate the transition between protonated and deprotonated cysteine at 240 nm.

Alternatively, thiol pK_as in proteins and peptides can be determined from the rates of alkylation as a function of pH (17, 83, 93, 106, 108), using IAM, fluorescent IAM derivatives, 5,5′-dithiobis(2-nitrobenzoate) (DTNB), 2,2′-dipyridyl disulfide (2DPS), or monobromobimane. The alkylation rates are determined over a certain pH range by following the residual enzyme activity as a function of time. The pH-dependent degree of alkylation can also be used when there is no enzymatic assay to monitor direct changes in the reaction rate, typically when the cysteine of interest is not in an enzyme active site. In that case, the amount of alkylated cysteine may still be determined as a function of time, for example, by chromatographic retention times or by the fluorescence of the alkylating agent linked to the cysteine. New, improved fluorescent probes are being developed that allow highly specific thiol labeling at low pH (127).

At each pH, the pseudo-first-order rate constant k_obs is determined from the slope of the plot of ln(E₀/E_t) versus time, with E₀ the initial enzyme activity (or the initial amount of nonalkylated protein) and E_t the residual enzyme activity (or the amount of protein remaining nonalkylated) at time t.

The second-order rate constant (k₂) is then calculated by dividing k_obs by the concentration of alkylating agent at each pH. The k₂ values are then fitted as a function of pH to one of the following equations to determine the best-fit pK_app.

For a one-pK_a model:

\begin{align*}k_2 = k^{\prime} \bigg( \frac{1}{1 + 10^{(pK_{app} - pH)}} \bigg)\end{align*}

or for a two-pK_a model (if another protein titratable side chain influences the reactivity of the nucleophilic cysteine):

\begin{align*}
k_2 &= k^{\prime} \bigg( \frac{1}{1 + 10^{(pK_{app} - pH)}} \bigg) \bigg( \frac{1}{1 + 10^{(pK_{app2} - pH)}} \bigg) \\
&\quad + k^{\prime\prime} \bigg( \frac{1}{1 + 10^{(pK_{app2} - pH)}} \bigg)
\end{align*}

with k′ and k′′, the two different second-order constants for the thiolate form.
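The kinetic analysis described above can be sketched as follows: k_obs is taken as the slope of ln(E₀/E_t) versus time, k₂ is obtained by dividing k_obs by the reagent concentration, and the one-pK_a and two-pK_a expressions are evaluated as written in the text. This is a minimal Python sketch rather than a validated analysis protocol. The function names, the time course, and the reagent concentration are hypothetical.

```python
import math


def k_obs_from_time_course(times, remaining, e0):
    """Least-squares slope of ln(E0/Et) versus time (pseudo-first-order k_obs)."""
    y = [math.log(e0 / e) for e in remaining]
    n = len(times)
    t_mean, y_mean = sum(times) / n, sum(y) / n
    sxy = sum((t - t_mean) * (yi - y_mean) for t, yi in zip(times, y))
    sxx = sum((t - t_mean) ** 2 for t in times)
    return sxy / sxx


def k2_one_pk(ph, k_prime, pk_app):
    """One-pKa model: k2 = k' / (1 + 10**(pK_app - pH))."""
    return k_prime / (1.0 + 10.0 ** (pk_app - ph))


def k2_two_pk(ph, k_prime, k_dprime, pk_app, pk_app2):
    """Two-pKa model, transcribed term by term from the expression in the text."""
    f1 = 1.0 / (1.0 + 10.0 ** (pk_app - ph))
    f2 = 1.0 / (1.0 + 10.0 ** (pk_app2 - ph))
    return k_prime * f1 * f2 + k_dprime * f2


# Hypothetical single-pH time course: residual activity decaying at 0.02 s^-1.
times = [0.0, 30.0, 60.0, 90.0, 120.0]                 # seconds
e0 = 100.0                                             # initial activity (arbitrary units)
remaining = [e0 * math.exp(-0.02 * t) for t in times]
k_obs = k_obs_from_time_course(times, remaining, e0)
reagent_conc = 1.0e-3                                  # mol/l, assumed
k2 = k_obs / reagent_conc
print(f"k_obs = {k_obs:.4f} s^-1, k2 = {k2:.1f} M^-1 s^-1")
```

Repeating the k₂ determination over a pH series gives the data to which `k2_one_pk` or `k2_two_pk` would then be fitted.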

This method has been applied to Trx (82), thiol/disulfide oxidoreductases (117), glyceraldehyde 3-phosphate dehydrogenase (108), yeast glutaredoxins (Grxs) (36), and peroxiredoxin 6 from Arenicola marina (106). For Mt_AhpE, an alkyl hydroperoxide reductase of Mycobacterium tuberculosis, the pK_as of oxidation and overoxidation (Fig. 1) of the peroxidatic thiol group were measured by determining the rate constants of peroxynitrite- and H₂O₂-mediated oxidation and overoxidation using the difference in intrinsic fluorescence (70). For highly reactive cysteines, kinetic measurements need to be carried out in a stopped-flow apparatus. Overall, the chemical modification method has been very successful with enzymes of the Trx superfamily, since in these enzymes, the main cysteine of interest is catalytically active and solvent exposed. This helps explain the wealth of pK_a data on the Trx superfamily.

This method of chemical modification is, however, not trivial and certainly not applicable to all proteins, as it assumes that the cysteine side chain of interest is accessible for alkylation. Further, the pK_a value of the cysteine is assumed to be the principal determinant of its alkylation rate. However, this is only the case in an idealized situation: some cysteine side chains are partially or fully protected by the protein structure, so their alkylation rates are also affected by steric accessibility, conformational stability, and dynamics. Protected cysteines can only react with alkylating reagents when they are transiently exposed to bulk solvent by global or local unfolding events. Because protected cysteines primarily (or only) react in transiently unfolded (open) states, they will exhibit seemingly unperturbed pK_a values (∼8.6). However, the actual pK_a value of the cysteine in its protected (folded) state could be very different.

Another method that should be applicable to low-solubility proteins is isothermal titration microcalorimetry (ITC) (165). ITC has been used to infer pK_a values of reactive residues of enzyme–substrate complexes by measuring the substrate-binding enthalpy ΔH_binding as a function of pH and buffer composition (162). Tajc and coworkers (165) used ITC to monitor directly the covalent reaction of IAM with a thiolate to produce a thioether, as a function of pH. The ITC instrument is set up in a single-injection mode, and sufficient IAM (up to 300 mM) is introduced to alkylate all the thiolate groups. The heat change produced upon injection, $\partial Q / \partial t$, is proportional to the reaction rate. The maximum absolute value of $\partial Q / \partial t$ is proportional to the initial rate, which, according to the law of mass action, is proportional to the initial concentration of the thiolate. So, the maximum absolute value $(\partial Q / \partial t)_{\max,obs}$ is determined at varying pHs.
The pK_a can be calculated from the $(\partial Q / \partial t)_{\max}$-versus-pH plot using the Henderson–Hasselbalch equation:

\begin{align*}
(\partial Q / \partial t)_{\max,obs} &= (\partial Q / \partial t)_{\max,low} \\
&\quad + \frac{(\partial Q / \partial t)_{\max,high} - (\partial Q / \partial t)_{\max,low}}{1 + 10^{(pK_a - pH)}}
\end{align*}

with $(\partial Q / \partial t)_{\max,low}$ the value at low pH (protonated cysteine) and $(\partial Q / \partial t)_{\max,high}$ the value at high pH (deprotonated cysteine) (Fig. 2).

NMR spectroscopy can also be used to determine the pK_a of a cysteine (38, 44, 77, 89, 112, 129). This method is expensive, requires sophisticated instrumentation, and is labor intensive (resonance assignment). The presence of more than one cysteine in a protein does not, in principle, limit the determination of their pK_as by NMR; ideally, the pK_a values of all cysteine residues may be determined in the same NMR experiment. For the cysteine thiol ionization measured by 1D ¹H NMR spectroscopy, the cysteine C_α and C_β protons can be used as probes for the ionization state of the cysteine thiol group (89). A problem that might occur is that the resonances of the Cys C_α proton cannot be followed over the entire pH range because of signal overlap or low signal intensity. Another potential complicating factor is that pH-dependent changes in a resonance chemical shift may arise from pH-dependent changes in other surrounding residues. Because the chemical shift is particularly sensitive to the local environment, disentangling the effects of ionization at the monitored residue from other changes in proximal residues can be nontrivial. With more expensive ¹³C-labeled material, the chemical shift of the ¹³C_β resonance can also be followed as a function of pH (38). The obtained chemical shifts can then be plotted against pH and fitted, again with the Henderson–Hasselbalch equation:

\begin{align*}\delta = \delta_{SH} + \frac{(\delta_{s^-} - \delta_{SH})}{1 + 10^{(pK_a - pH)}}\end{align*}

with δ, the chemical shift of a resonance as function of pH, and δ_s- and δ_SH, the chemical shifts at high and low extremes of the pH, respectively. The pK _a values obtained by NMR seem to agree well with those obtained by other spectroscopic methods (38, 77, 89).

Since proteins tend to have a solubility minimum at their isoelectric point (pI), this might undermine the measurement of the pK _a during the pH titration. In addition, proteins may unfold when a titration changes a protonation state important for the stability of the folded protein. When proteins start to aggregate, unfold, or precipitate, the experimental measurements become difficult and unreliable. These limitations and other considerations have emphasized the growing interest in the development of computational methods to calculate the pK _as.

B. Computational methods

In proteins, there are many protonation sites, and in many cases, even several cysteine residues, complicating accurate experimental pK _a measurements and interpretation. Therefore, there is an increasing interest in supplementing, guiding, and interpreting experimental approaches with computational methods for pK _a rationalization/prediction (2, 98). Calculations can also help assign accurately measured pK _as to particular residues.

The basis for pK _a calculation techniques that can be used on large systems like proteins relies on estimates of energy terms, which influence the relative populations of the protonated and deprotonated states of the titratable groups. For the relevant chemical functionalities, those terms typically include the desolvation energy, the background interaction energy, and the site–site interaction energies. The desolvation energy is the energy change when transferring the titratable group from an aqueous solvent into the protein environment. The background interaction energy is the interaction energy between the titratable group and the protein, when all other titratable groups are considered as neutral entities. The main contribution to this term comes from hydrogen-bonding interactions between the titratable group and neutral polar groups. The site–site interaction energy describes the electrostatic interactions between pairs of charged titratable groups. In principle, one has to consider the interactions among all the charged groups simultaneously, a challenge when the titrations of several groups are coupled. These energy contributions can be addressed in various theoretical frameworks (2, 98), usually categorized as classical electrostatics with continuum solvation models, physics-based fully microscopic models, or more empirical approaches. All computational methods require a good-quality structural model of the protein, which determines for a large part the outcome of the calculations.

The Poisson-Boltzmann (PB) approach (4, 13, 29, 31, 35, 45, 68, 126, 161, 179, 188) is maybe the most representative and best-developed approach to address the pK _a shifts in proteins in the framework of classical electrostatics (68, 154). The calculation of pK _a shifts in proteins in the PB framework has been described in detail (13, 42). In summary, the pK _a in a protein environment is calculated as a shift relative to the reference intrinsic pK _a of the same residue free in aqueous solution. The reference pK _a value is known from measurements on model systems, typically peptides in solution (bottom of Table 1). For the cysteine thiol, a reference unperturbed pK _a of 8.3–8.6 is commonly accepted, with small variations in the corresponding measurements (17, 26, 39, 82, 89, 169). The range of cysteine pK _as measured in peptides is much narrower than in folded proteins (Table 1). Yet, some pK _a differences are observed in peptides, although the reasons for such spread are obscure, since there are no structures for these flexible peptides. One cannot say for sure if the differences between pK _a values of cysteines in peptides can be rationalized by hydrogen-bonding differences. The differences in measured pK _as also probably reflect different experimental protocols (169). Reference pK _a values obtained recently with pentapeptides with neutral (blocked) termini (169) are of special interest, since they minimize the influence of secondary structure and hydrogen bonding, which can occur in larger peptides, and suggest a reference pK _a value of 8.6 for cysteines. Note that the effect of the bulk aqueous solvent on the pK _a is encapsulated in the reference pK _a. Estimating the influence of the protein relative to such known reference pK _a is the approach currently adopted with all methods.
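The bookkeeping behind a pK_a computed as a shift from a reference value can be illustrated with a short Python sketch. It converts a net free-energy change of deprotonation (protein relative to the reference compound in water) into pK_a units through the standard relation ΔpK_a = ΔΔG/(RT ln 10). The split into desolvation, background, and site–site terms mirrors the energy terms named above. The numerical energies, the function names, and the sign convention (a positive term disfavors the thiolate and raises the pK_a) are assumptions made for this illustration. An actual PB calculation obtains these terms from the structure and treats coupled titrations, which this sketch does not attempt.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def pka_shift(ddg_kcal_per_mol, temperature=298.15):
    """Convert a change in deprotonation free energy into pKa units.

    A positive value (thiolate destabilized relative to the reference
    compound in water) raises the pKa; a negative value lowers it.
    """
    return ddg_kcal_per_mol / (R_KCAL * temperature * math.log(10.0))


def protein_pka(reference_pka, desolvation, background, site_site,
                temperature=298.15):
    """Reference pKa plus the shift from the summed energy terms (kcal/mol)."""
    total = desolvation + background + site_site
    return reference_pka + pka_shift(total, temperature)


# Hypothetical energy terms (kcal/mol), chosen only to illustrate the arithmetic:
# desolvation destabilizes the thiolate, hydrogen bonds (background) stabilize it.
example = protein_pka(8.6, desolvation=1.5, background=-4.0, site_site=-0.5)
print(f"pKa shift per kcal/mol at 298.15 K: {pka_shift(1.0):.3f}")
print(f"illustrative protein pKa: {example:.2f}")
```

At room temperature, roughly 1.4 kcal/mol of net thiolate stabilization corresponds to one pK_a unit, which gives a sense of scale for the contributions discussed in this section.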

The thiol of cysteines is treated like any other titratable group. Therefore, the pK _a shift of a particular thiol is calculated by estimating the electrostatic interactions between the cysteine thiolate (and thiol) and the rest of the protein while taking into account the desolvation energies. In practical applications, neglecting electrostatic interactions between charged titratable groups which are far apart tends to be a valid working hypothesis (53). The desolvation corresponding to the transfer of a charge (e.g., a thiolate) from bulk water into the protein interior is calculated by representing the bulk solvent as a continuous high-dielectric-constant medium and the protein interior as a region of lower dielectric constant (13, 31, 55, 68). Such desolvation is largely electrostatic in nature. It destabilizes a charge, contributing to pK _a shifts, for instance, by destabilizing the charged form of a titratable group. Thus, one expects reactive cysteines to be mostly solvent accessible because of steric requirements allowing encounters with the substrate, but also because proximity to the water contributes to stabilize their thiolate. On the other hand, cysteine residues are very vulnerable to oxidation, which might explain the limited occurrence of nonactive cysteine residues on protein surfaces (109).

Representing the protein interior by a single dielectric constant can only be an approximate, and sometimes crude, treatment. For extended discussions on this subject, see the following references (4, 31, 45, 55, 154, 158, 159). With a system comprised of different dielectric regions (solvent and solute interior), the straightforward use of Coulomb's law in its well-known, simplest form is not applicable, and trying to treat the system in this framework leads to a mathematical description too complex to solve. Therefore, one uses the PB equation instead to obtain the electrostatic properties (54, 154), usually in its linearized form (68). The physical parameters that need to be supplied by the user include the dielectric constant of the protein interior and of the aqueous solvent, the temperature, partial charges, and radii for the protein atoms, and the ionic strength. The atomic radii are critical, since they underpin the boundary between the dielectric regions. The ionic strength represents the screening of electrostatic interactions by counter ions, but the calculated pK _a values tend not to be highly sensitive to this parameter. The general experience is that the calculated pK _a values tend to be much more sensitive to the value of the protein dielectric constant. Calculations with the PB method are more time consuming than empirical methods (e.g., PROPKA); however, they are quite tractable on commodity computers; a PB pK _a calculation on a medium-size protein takes less than a minute.

Best practices for PB-based pK_a calculations are still evolving (2). To gain productive insights from such calculations, one requires a good grasp of the underlying method and approximations. Accordingly, no standard, generic, safe protocol can be recommended. Yet, the physical principles (desolvation, Coulombic interactions, and hydrogen bonds) underlying the PB calculations are intuitively easy to understand, and the PB framework should make the notion of pK_a calculations accessible to the nonexpert, at least in terms of general ideas and as a basis for discussions. Of the several software packages available, none has emerged as systematically more accurate or popular (2). Nevertheless, software to perform PB pK_a calculations is freely downloadable; see Lee and Crippen (98) and Fitch and Garcia-Moreno (42).

As far as we know, no PB-based study has focused on the calculations of cysteine pK _as across proteins systematically. Instead, computational analyses of cysteine pK _as were reported in case studies of individual enzymes. The PB methodology was used to calculate the cysteine pK _a values of wild-type and mutant human Trx and Escherichia coli DsbA (49, 119). For the nucleophilic cysteine of wild-type Trx and DsbA, pK _a values of, respectively, 7.1 and 2.6 were calculated, in good agreement with the experimental values of 6.9 and 3.4. Also for the -X₁X₂- mutants in the -Cys₁-X₁-X₂-Cys₂- active-site motif of Trx and DsbA, the calculated pK _a values were close to their experimental counterpart (119). Only a slightly better agreement with the experimental pK _a values was obtained after conformational relaxation of the flexible ionizable groups. For E. coli Grxs 1 and 3, as well as pig Grx (and its mutants), the PB methodology calculated the relevant cysteine pK _a values in reasonable agreement with experiment (43, 44). In particular, the PB calculations account for the large downshift of the pK _a of Cys₁ in the -Cys₁-Pro-Tyr/Phe-Cys₂- motif of Grxs (pK _a≤5) and correctly assign a significantly higher pK _a to Cys₂. Crucially, these studies provided a detailed theoretical analysis of the factors that lower the pK _a of the catalytic cysteine. This analysis was strengthened by the formulation of true predictions that were subsequently tested, and experimentally supported (45). Small structural differences in the active site of enzymes of the Trx superfamily can largely rationalize the pK _a variations for the catalytic cysteine across this superfamily (Fig. 3). In addition, these local interactions were found to be sensitive to the protein dynamics. This was already apparent in a PB-based study of the pK _a of Cys32 in an NMR structure of E. coli Trx (35).

FIG. 3.

Thiolate stabilization with hydrogen bonds: a unifying theme across the Trx superfamily. Comparison of the hydrogen bond networks (green dotted lines) stabilizing the thiolate in reduced active sites of representative enzymes in the Trx superfamily (left: human Trx, Protein Data Bank [PDB] entry 1ERT; middle: Escherichia coli glutaredoxin [Grx] 3, PDB entry 1ILB; right: E. coli DsbA, PDB entry 1A2L). For each enzyme, only the -Cys₁-X₁-X₂-Cys₂- motif is shown, that is, -Cys₁-Gly-Pro-Cys₂- in Trx, -Cys₁-Pro-Tyr-Cys₂- in Grx, and -Cys₁-Pro-His-Cys₂- in DsbA. The sequence numbering of Cys₁ and Cys₂ is shown, with the thiol of Cys₂ donating a hydrogen bond to the nucleophilic thiolate of Cys₁. The number of hydrogen bonds donated to the thiolate depends on the -Cys₁-X₁-X₂-Cys₂- sequence. In particular, note that the proline in Trx removes a backbone N-H hydrogen bond donor. The more hydrogen bonds can stabilize the thiolate, the lower its experimentally measured pK _a, as illustrated by comparing Trx (pK _a ∼7.1, two hydrogen bonds), Grx (pK _a ∼4.0, three hydrogen bonds), and DsbA (pK _a ∼3.5, four hydrogen bonds). Modulation of the cysteine pK _a by local hydrogen bonds also explains why the pK _a of Cys₁ is lowered, but not that of Cys₂. (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

Importantly, the output of PB pK_a calculations provides a decomposition of the energetic components contributing to a pK_a shift, which allows for interpretation of the calculations. Such decomposition has provided much of the increasing evidence that the pK_as of many titratable groups are primarily influenced by short-range polar interactions such as hydrogen bonds (6, 44, 45, 49, 103, 134, 141, 143). It strongly supports the view that the pK_as are in general very sensitive to the details of the protein structure and dynamics (35, 43–45, 134). The influence of local structural differences on the pK_a calculations was also apparent when comparing pK_as calculated with both X-ray and NMR structures of the same proteins (5, 86). Incidentally, there was no clear systematic improvement in the agreement between calculated and measured pK_as depending on the technique (X-ray or NMR) used to build the structural model. In addition, the raw X-ray or NMR coordinates may have to be refined before pK_a calculations, to orient protein side-chain amide and imidazole groups, as well as conserved water molecules, according to the most likely hydrogen-bonding network (123, 126). Thus, a careful preparation of the protein structure is critical for pK_a calculations, regardless of the pK_a calculation method.

Preparing a relevant system for pK _a investigations might be even more sophisticated, if the pK _a depends on the binding between two or more molecular species. Thus, one may have to consider how the calculated pK _a might be influenced by the binding of cofactors (ions and organic ligands) or by the formation of a protein–protein complex. Since low cysteine pK _as are frequently involved in enzymes and their reactions with substrates, one can imagine situations where building a relevant enzyme–substrate complex may be a prerequisite to pertinent pK _a calculations. Some of these aspects have been illustrated in a study of a covalent complex between Trx and arsenate reductase (ArsC) (141). Another interesting system to study a cysteine pK _a influenced by complexation is DsbD. In the isolated C-terminal domain (gamma-domain), the pK _a of the reactive Cys461 is unusually high, with a pK _a of 9.3–10.5 (112, 161). Yet, there is indirect evidence that the pK _a of Cys461 is lowered upon complexation with the substrate N-terminal domain of DsbD (113). A model in which substrate binding enhances the reactivity of the active-site cysteines has also been proposed for ResA (28). It has been proposed that changes of the relevant pK _as during complexation allow controlled activation of reactive cysteines upon binding of cognate substrates, to restrict the reactivity toward those substrates (28, 113). Macromolecular crowding in the cell may also lead to nonspecific, but significant, interactions, which might alter protein structures, and therefore the pK _a of reactive amino acids. However, these nonspecific effects are expected to be even more challenging to characterize than those arising in the formation of specific complexes.

Empirical pK_a calculation methods use rules derived from experimental observations to predict pK_as and have the advantage of being very fast (seconds per protein conformation). They also lend themselves to updates of the underlying model and associated terms, driven to maximize pragmatically the agreement between experiment and calculations without having to abide by a constraining or interpretable theoretical framework. An overview of the available empirical pK_a prediction methods is given in reference (98). A currently popular empirical method is PROPKA (12, 30, 88, 103, 111, 130, 133, 134, 149, 160), which is available online (http://propka.ki.ku.dk/). PROPKA applies an environmental perturbation ΔpK_a to the unperturbed, solution pK_a of the titratable group.

The ΔpK_a term includes desolvation effects, hydrogen bonding, and charge–charge interactions via empirical relations. The first version, PROPKA1, was developed using a test set of 314 experimental pK_as, which contained only 12 cysteine pK_as. For these cysteine residues, the root mean square deviation between experimental and calculated pK_a values was 1.39. For oxidized (bonded) cysteine thiol groups, no pK_a calculation is performed (flagged by returning a value of 99.99 instead of an estimated pK_a value), with the exception of proteins from the Trx superfamily: even when the cysteines of the conserved -Cys₁-X₁-X₂-Cys₂- motifs are in the oxidized form (i.e., when a disulfide bond is formed between Cys₁ and Cys₂), the pK_a values of the reduced cysteines are evaluated.

PROPKA has been updated twice. In PROPKA1, no pK _a shifts due to ligands, ions, and structural water molecules were considered. These effects were incorporated in PROPKA2 (12). In PROPKA3 (130), residues are no longer classified as either buried or surface residues, but an interpolation between these two extremes is used. This results in a burial ratio by which Coulomb interactions are no longer strictly either turned off (surface residues) or turned on (buried residues). A linear interpolation between the two extremes is made via a position-dependent weight function that depends on the number of heavy atoms within a sphere of 15 Å around the charge center.
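The idea of a burial-dependent weight can be sketched in Python as a clamped linear interpolation on the number of heavy atoms within 15 Å of the charge center. This is an illustration of the principle only, not PROPKA's implementation. The threshold counts `n_min` and `n_max` are placeholder values assumed for this sketch, and the function names and the direct scaling of a Coulomb term by the weight are likewise hypothetical.

```python
import math


def count_heavy_atoms_within(center, heavy_atom_coords, radius=15.0):
    """Number of heavy atoms within `radius` (in angstroms) of the charge center."""
    return sum(1 for xyz in heavy_atom_coords if math.dist(center, xyz) <= radius)


def burial_weight(n_heavy_atoms, n_min=280, n_max=560):
    """Clamped linear interpolation: 0 for a surface-like site, 1 for a buried one.

    n_min and n_max are placeholder thresholds assumed for this sketch.
    """
    if n_max <= n_min:
        raise ValueError("n_max must be larger than n_min")
    w = (n_heavy_atoms - n_min) / (n_max - n_min)
    return min(1.0, max(0.0, w))


def weighted_coulomb_shift(buried_shift, n_heavy_atoms):
    """Scale a fully buried Coulomb pKa shift by the burial weight (illustrative)."""
    return burial_weight(n_heavy_atoms) * buried_shift


# Toy coordinates: three atoms near the origin and one far away.
coords = [(1.0, 0.0, 0.0), (0.0, 5.0, 0.0), (0.0, 0.0, 14.9), (30.0, 0.0, 0.0)]
print(count_heavy_atoms_within((0.0, 0.0, 0.0), coords))
for n in (200, 420, 600):
    print(n, burial_weight(n), weighted_coulomb_shift(-1.0, n))
```

With such a weight, a residue of intermediate burial receives a correspondingly intermediate Coulomb contribution instead of an all-or-nothing one.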

A benchmark study (110) of PROPKA on cysteine residues for several Trx and ArsC proteins revealed a fair correlation with experimental pK _as (R²=0.74, with an average deviation from the experimental values of 0.88 pK _a units) (Table 2). Before the pK _a calculations, the raw X-ray structures were energy minimized with CHARMM. For comparison, Table 2 reports the PROPKA2 values for nonminimized structures, which in some cases show a notably degraded performance. This strengthens the notion that a careful preparation of the protein structure is critical for pK _a calculations. With cysteines, PROPKA3 appears to be less accurate than PROPKA2 (Table 2). The advantage of PROPKA is its balance between speed and performance, and especially its Web-based ease of use. Therefore, PROPKA is a valuable tool to give initial insights into the protonation state of a cysteine. PROPKA illustrates how the development and successive adjustments of empirical models rely on augmented training sets of experimental data. Indeed, the continued experimental determination of pK _a values for cysteines is very valuable for an improved calibration of empirical pK _a calculators.

Table 2.

Comparison of Some Calculated and Experimentally Measured Cysteine pK _as

Species	PDB	Cysteine residue surface (S)/buried (B)	PROPKA3.0	PROPKA2	PROPKA with CHARMM minimized structures	NPA-pK_a correlation (141)	Experimentally obtained pK_a
E. coli Trx1	1XOB (76)	Cys32 (S)	9.11	6.64	6.6^a	6.5	7.1 (38)
Staphylococcus aureus Trx1 (P31T C32S)	2O89 (142)	Cys29 (S)	7.61	3.86	4^a	6.5	6.4 (142)
Rhodobacter capsulatus Trx2	2PPT (191)	Cys73 (S)	8.30	5.84	5.7^a	4.8	5.2^b (40)
B. subtilis resA	1SU9 (28, 99)	Cys76 (B)	13.19	15.83	10^a	8.1	8.2 (28, 99)
S. aureus ArsC	1LJL (194)	Cys89 (S)	10.32	9.21	9.2^a	10.0	9.5^c (141)
S. aureus ArsC	1LJL (194)	Cys10 (B)	8.85	−0.41	6.8^a	6.9	6.8^d (141)
B. subtilis Trx	2GZY (104)	Cys29 (S)	8.37	5.91	5.7^a	5.5	ND
		Cys32 (S)	11.30	8.99	ND	8.2	ND
E. coli Grx3	1ILB	Cys11 (S)	7.59	4.29	ND	5.0	<5.5
		Cys14 (S)	10.92	7.71	ND	14.1	>10.5

^a See (110).

^b pK _a value obtained from E. coli Trx2.

^c pK _a value obtained from C15A/C10S/C82A Sa_ArsC.

^d pK _a value obtained from oxidized C15A Sa_ArsC.

ArsC, arsenate reductase; NPA, natural population analysis.
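As a rough way to read Table 2, the mean absolute deviation from experiment can be computed per method for the six cysteines that have a single measured value (the rows marked ND, <5.5, or >10.5 are left out). The sketch below only re-uses the numbers in the table; the choice of metric and the variable names are ours, and some experimental values come from related constructs, as noted in the table footnotes.

```python
from statistics import mean

# Experimental pK_a values, in table order: E. coli Trx1 Cys32, S. aureus Trx1 Cys29,
# R. capsulatus Trx2 Cys73, B. subtilis ResA Cys76, S. aureus ArsC Cys89 and Cys10.
experimental = [7.1, 6.4, 5.2, 8.2, 9.5, 6.8]

# Calculated values copied from the corresponding columns of Table 2.
calculated = {
    "PROPKA3.0": [9.11, 7.61, 8.30, 13.19, 10.32, 8.85],
    "PROPKA2": [6.64, 3.86, 5.84, 15.83, 9.21, -0.41],
    "PROPKA, CHARMM-minimized": [6.6, 4.0, 5.7, 10.0, 9.2, 6.8],
    "NPA-pK_a correlation": [6.5, 6.5, 4.8, 8.1, 10.0, 6.9],
}


def mean_absolute_deviation(calc, exp):
    """Average of |calculated - experimental| over paired values."""
    return mean(abs(c - e) for c, e in zip(calc, exp))


mad = {method: mean_absolute_deviation(values, experimental)
       for method, values in calculated.items()}

for method, value in mad.items():
    print(f"{method}: {value:.2f} pK_a units")
```

Six data points are far too few for firm conclusions, so the result is best read as a summary of the table rather than a ranking of the methods.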

The observation that pK _as in proteins may be primarily influenced by short-range interactions (35, 43–45, 49, 141) suggests that one may not need to include the full protein in the calculations. This has led to the development of a pK _a calculation protocol in which calculations of site–site electrostatic interaction energies were omitted for pairs of titratable groups beyond a distance cut-off value (125). The immediate environment of the titratable group is treated in detail, while the rest of the protein is described less accurately, which lowers the calculation time significantly without apparently degrading the quality of the calculated pK _a values. These findings suggest that protein active sites might be treated as independent units with respect to pK _a calculations. This opens the possibility of applying computationally demanding quantum mechanical methods to the calculation of pK _as in their protein microenvironment. One possibility is to use a model in which only the titratable group plus its directly interacting protein environment is represented explicitly, while the rest of the protein is represented by a bulk dielectric constant or treated by molecular mechanics in a combined quantum mechanical (QM)–molecular mechanics approach. Alternatively, the bulk protein around the region of interest can be omitted from the calculations altogether; examples can be found in Li et al. (102), Zheng et al. (195), and Ullmann et al. (170). The combination of a model system for the protein site of interest and a surrounding dielectric constant was used by Roos et al. (141). The pK _a values were predicted via a linear relationship between the experimental pK _a values and the natural population analysis (NPA) charge calculated quantum mechanically on the sulfur atom of the deprotonated thiolate. The more negative the NPA charge on the sulfur atom, the higher the tendency to bind a proton and the more basic (higher pK _a) the thiol is.
For Trx and ArsC systems, this linear relation works very well (Fig. 4 and Table 2) (141).

FIG. 4.

Correlation between experimental p K _a values of selected cysteine thiols and the calculated natural population analysis (NPA) charge. The selected cysteines were Cys89 Staphylococcus aureus arsenate reductase (ArsC) (141) (a), Cys76 Bacillus subtilis resA (28, 99) (b), Cys10 S. aureus ArsC (141) (c), Cys32 E. coli Trx1 (38) (d), Cys29 S. aureus P31T C32S Trx1 (142) (e), and Cys73 Rhodobacter capsulatus Trx2 (40) (f). Reprinted with permission of Roos et al. (141).

C. Future perspective for pKa calculations applied to cysteines

The continued development of computational methods has produced powerful tools to calculate pK _a values for each titratable group in a protein. When used wisely, such tools help to formulate specific working hypotheses about the pK _a values of interest and the factors influencing these pK _as. They provide a basis for a productive interplay with experimental approaches, including mutagenesis. However, theoretical pK _a prediction methods still suffer from a number of limitations (2, 30, 98). Large errors can be encountered for residues with unusual pK _a values (2, 31, 98, 126, 157), which tend to be of particular interest, because they are often involved in catalytic activities. Ionizable side chains partially or fully sequestered from bulk solvent by burial in the protein core are particularly prone to marked discrepancies between calculated and measured pK _as. In a recent blind calculation of 77 pK _as with undisclosed experimental values, performed by 12 groups using a range of methods (www.pkacoop.org), no method performed significantly better than the others. Each method had successful and unsuccessful predictions, indicating that each method suffers from limitations (2). Hence, there is a clear need to improve the calculation methods, which could take several directions.

One avenue would be to develop more physically complete models with all atomic details at all stages of the calculations. For instance, one could aim to treat the aqueous solvent with discrete and dynamic water molecules rather than with a continuum dielectric. One can also envisage an increasing role for reliable quantum chemistry to quantify interactions. At the other extreme, one could perhaps improve empirical models by enlarging the training sets on which they are based. There is also room to improve the current theoretical models, for instance, with respect to the protein structural relaxation and dynamics, or the underlying force fields. Polarizable force fields may improve the electrostatic energies when switching between the neutral and charged states of titratable groups (23).

A common issue with pK _a calculations is the assumption that a protein structure is rigid and identical to a representative crystal structure. For example, the same structure is frequently used to evaluate the energetics of both the charged and neutral form of a titratable group, although one would expect the microenvironment of this group to relax according to its charge state. This suggests that different conformers and tautomers should be included in the pK _a calculations (51, 91, 192). Sampling of the hydrogen locations in response to protonation states already yields improvements (91). With the multiconfigurational continuum electrostatic method (MCCE) (51), the sampling was extended to the side-chain orientations. The MCCE approach gives pK _a values in better agreement with experiment than the single-configurational continuum electrostatic method. Another option is to precede the pK _a calculation with a sufficiently long molecular dynamics (MD) run (43–45, 90, 161, 173). There is no recipe to determine the appropriate length of a simulation, but at least tens of nanoseconds (and ideally more) appear to be required, for example, to sample the conformations of long, flexible, charged side chains. In addition, MD simulations have proved of special interest to refine NMR structural models before the pK _a calculations (43). Using the MD snapshots as input, the pK _as can be calculated for regularly spaced snapshots, and then averaged (since the Boltzmann weighting of conformers is in principle implicitly contained in the simulated populations). This yields not only the averaged calculated pK _a values but also the instantaneous pK _a fluctuations along the simulation. Examination of the instantaneous pK _a values and their variation can help interpret, in full microscopic detail, the structural factors that affect the calculated pK _a.
It may reveal that the averaged pK _a (equivalent to its measured counterpart) is a composite of distinct pK _a values for different structural microstates. Such microstates may be associated with different reactivities and may lead to mechanistic insights. MD runs that are too short may produce a bias that can sometimes strongly influence the relative occurrence of the different populations and consequently the outcome of the pK _a calculations (90), for instance, when the pK _a of a semiburied residue depends on the motion of flexible charged side chains at the protein surface (43). Indeed, MD simulations do not always improve subsequent pK _a calculations (125, 185). For instance, MD simulations are still typically performed while keeping the initially assigned protonation states constant, in which case those states might be overstabilized in the MD conformational ensemble, compared to alternative protonation states. This is being addressed with the development of MD simulations at constant pH (85). Alternatively, one can perform simulations at various predefined relevant protonation states (128).
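The snapshot-based protocol described above can be sketched as follows, assuming that a pK _a has already been calculated for each regularly spaced MD snapshot by some external method. The function names are hypothetical, the plain arithmetic average follows the description in the text, and the threshold split is only a crude illustrative way to look for distinct microstates.

```python
from statistics import mean, stdev


def summarize_snapshot_pkas(snapshot_pkas):
    """Return (average pK_a, standard deviation) over per-snapshot pK_a values.

    The snapshots are assumed to be regularly spaced along the trajectory, so
    that conformer populations are already reflected in the sampling.
    """
    if len(snapshot_pkas) < 2:
        raise ValueError("at least two snapshots are needed")
    return mean(snapshot_pkas), stdev(snapshot_pkas)


def split_by_threshold(snapshot_pkas, threshold):
    """Separate instantaneous pK_a values into a low and a high group.

    A hypothetical helper: a real analysis would relate each group to
    structural features (e.g., which hydrogen bonds are formed).
    """
    low = [p for p in snapshot_pkas if p < threshold]
    high = [p for p in snapshot_pkas if p >= threshold]
    return low, high
```

Comparing the sizes and averages of the two groups gives a first indication of whether the averaged pK _a hides distinct populations.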

Another way to improve pK _a calculations is the collection of additional reliable experimental reference data. That may be particularly the case for cysteine residues, for which there are still only a limited number of experimentally measured pK _as, and not enough structural information for redox-active cysteines at the various stages of their redox cycles. Crystallizing the reduced form of some enzymes has proved particularly problematic (72), as the nucleophilic cysteines have the tendency to oxidize to sulfenic, sulfinic, and sulfonic acids (144). This heterogeneous population of partially oxidized proteins may hamper crystallization (10). Therefore, additional good-quality experimental data on thiol pK _as in proteins combined with deeper insights into the factors controlling pK _a values (e.g., regarding protein dynamics) should pave the way toward more accurate pK _a predictors. Meeting these challenges will require genuine collaborative efforts. In this light, the pK _a cooperative was founded to improve pK _a calculations with proteins (www.pkacoop.org). To support this initiative, we deposited the present Table 1 on the website of the cooperative. In the meantime, the state of the art of pK _a calculations is already mature enough to gain important insights, which can hardly be accessed otherwise. Although the data set of measured cysteine pK _as is relatively small, enough evidence has been accumulated to comment on the factors that modulate these pK _as.

III. Factors That Control the pK _a Values of Cysteine Thiols in Proteins

The propensity of a chemical functionality to ionize depends on its environment, in particular on factors that affect the stability of the charged form of the functionality. Therefore, the pK _a of a cysteine thiol can be strongly influenced by its microenvironment in a protein. Charge–charge interactions, hydrogen bonds, aqueous desolvation, and helix–dipole effects are generally invoked to rationalize perturbed pK _as of cysteine residues (62, 124). However, it has become increasingly clear that residues in the immediate environment of a titratable group play a prominent role (45, 153). In this section, we discuss the limited role of charged side chains and long-range electrostatic interactions on the cysteine pK _as in oxidoreductases and the direct link between thiol pK _a and hydrogen bonds. These ideas were initially inspired by studies of the catalytic cysteine in enzymes of the Trx superfamily (45). However, since these notions emerged from analyses rooted in physical chemistry, one expects that they can be generalized to a good extent (103, 134), as confirmed by inspection of a variety of systems (section III.D).
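The link between pK _a and the stability of the charged form can be made quantitative with the standard relation ΔΔG = ln(10)·RT·ΔpK _a. The short sketch below is a worked conversion under an assumed temperature of 298.15 K; the function name is ours.

```python
import math

GAS_CONSTANT_KJ = 8.314462618e-3  # kJ mol^-1 K^-1


def pka_shift_to_free_energy(delta_pka: float, temperature_k: float = 298.15) -> float:
    """Convert a pK_a shift into a deprotonation free-energy change (kJ/mol).

    A negative delta_pka (lowered pK_a) gives a negative value, i.e., the
    thiolate is stabilized relative to the thiol. The default temperature
    is an assumption.
    """
    return math.log(10) * GAS_CONSTANT_KJ * temperature_k * delta_pka


# One pK_a unit corresponds to about 5.7 kJ/mol (about 1.4 kcal/mol) at 298.15 K.
print(f"{pka_shift_to_free_energy(1.0):.2f} kJ/mol per pK_a unit")
```

A shift of a few pK _a units therefore corresponds to stabilization energies comparable to those of a small number of hydrogen bonds.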

A. Limited role of charged side chains and long-range electrostatics

It has long been known that many enzymes have a thiolate cysteine in their active site with a pK _a clearly lower than the pK _a of free cysteine in an aqueous solution (Table 1). For example, proteins of the Trx superfamily have a recurrent -Cys₁-X₁-X₂-Cys₂- active-site motif within a conserved structural framework, where the pK _a of Cys₁ is dramatically lowered. Considerable efforts have been dedicated to characterize enzymes of this superfamily, which includes DsbA, Grx, and Trx, and many variants (Table 1). In these enzymes, the pK _a of Cys₁ covers a broad range of values, from ∼3.5 in DsbA to ∼7.1 in Trx, to >8.0 in atypical Trxs such as ResA (35, 38, 43, 45, 46, 49, 56, 61, 77, 82, 117, 129, 142, 179). Mutagenesis studies have shown that the pK _a of Cys₁ is strongly influenced by the X₁-X₂ residues (56, 117).
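To see what this range of pK _a values means for the thiolate population, the Henderson–Hasselbalch relation gives the deprotonated fraction at a given pH. The sketch below assumes pH 7.0 and a single, non-interacting titrating site; the pK _a values are those quoted above for DsbA and Trx and the value listed for ResA in Table 2.

```python
def thiolate_fraction(pka: float, ph: float) -> float:
    """Fraction of a cysteine present as thiolate for an isolated titrating site."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


ASSUMED_PH = 7.0  # assumption chosen for illustration only

for label, pka in [("DsbA Cys1", 3.5), ("Trx Cys1", 7.1), ("ResA Cys76", 8.2)]:
    fraction = thiolate_fraction(pka, ASSUMED_PH)
    print(f"{label}: pK_a {pka} -> {100 * fraction:.1f}% thiolate at pH {ASSUMED_PH}")
```

Under these assumptions, a cysteine with a pK _a of 3.5 is almost fully deprotonated, whereas one with a pK _a above 8 is mostly protonated.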

The discovery of the very low pK _as of Cys₁ initially fuelled the speculation that the thiolate would probably be stabilized by positively charged side chains. This hypothesis has now been tested with a number of systems, for which it has been mostly dispelled. However, it formed the basis for much mutagenesis work, for example, with pig Grx (190), E. coli Grx3 (45, 129), E. coli DsbA (56, 66, 117), and human Grx1 (75). Early work suggested that Arg26 in pig Grx may be the key for a direct stabilization of the thiolate of Cys22, but structural work has since shown that formation of a salt bridge between these two side chains is sterically precluded (44). In reduced E. coli Grx3, His15 was mutated to Val (129), assuming that His15 would be positively charged and would thereby stabilize the thiolate of Cys11. Yet, His15 was subsequently shown to be neutral at physiological pH, so that it cannot stabilize the thiolate via long-range electrostatics (45). The other candidate basic side chain for stabilization of the thiolate in E. coli Grx3 is Lys8. In MD simulations of E. coli Grx3 (43, 45), the side chain of Lys8 was found to be very flexible, with its amino group spending most of the time solvated by water, away from the thiolate of Cys11 (Fig. 5). In this situation, water screens the electrostatic interactions between the thiolate and the lysine side chain, and the long-range Coulomb interaction between these groups was found to be negligible for the thiolate stabilization (43, 45). The amino group of Lys8 occasionally comes into contact with the thiolate in the computer simulations, which is reflected in a transient further drop in the calculated pK _a of Cys11 of E. coli Grx3. However, the formation of this salt bridge was short-lived and predicted to contribute little to the overall stabilization of the thiolate (45), as subsequently confirmed experimentally (45). Similar arguments probably also explain why Lys19 has only a marginal influence on the pK _a of Cys22 in human Grx1 (75).

FIG. 5.

Importance of exploiting dynamical structural protein models for p K _a calculations. Conformational dynamics of cationic flexible side chains in the vicinity of the -Cys₁₁-Pro₁₂-Tyr₁₃-Cys₁₄- motif in E. coli Grx3 (A) and E. coli Grx1 (B), obtained during molecular dynamics (MD) simulations (43) in explicit solvent for 50 ns (Grx3) and 75 ns (Grx1). The catalytic nucleophilic cysteine Cys11 (in both Grx3 and Grx1) was treated as a thiolate in both proteins. (A) Conformational spread adopted by Lys8 relative to Cys11 during the simulation of Grx3. Lys8 only rarely comes close to Cys11 and rarely hydrogen bonds its thiolate (C). Instead, Lys8 spends most of the time pointing into the solvent (for clarity, the surrounding water molecules are not shown), separated from Cys11. (B) Lys8 is replaced by Arg8 in Grx1. The guanidinium group of Arg8 spends more time hydrogen bonding the thiolate of Cys11 in Grx1 than Lys8 does in Grx3. (C) Distance between the thiolate sulfur of Cys11 and (i) the Lys8 side chain amino nitrogen (blue, Grx3), and (ii) the Arg8 side chain nitrogen N^η2 (red, Grx1). Distances around 3 Å correspond to hydrogen bonds between the thiolate and the cationic side chain. The pK _a of Cys11 depends on how frequently such hydrogen bonds are formed. Importantly, both Lys8 and Arg8 are very flexible, and it is therefore difficult to infer the structural and electrostatic roles of such side chains without the insights provided by simulations in explicit solvent. In turn, such computational models can inform experimental testing (155). (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

The extensive mutagenesis work with E. coli DsbA also supports the notion of a limited influence of the surrounding charged side chains on the pK _a of its catalytic Cys30. There are many mutants of residues X₁ and X₂ in the -Cys₃₀-X₁-X₂-Cys₃₃- active-site motif of DsbA (56, 66, 117), for which the pK _a of Cys30 has been measured (Table 1). In the majority of the DsbA mutants, the pK _a of Cys30 remains below 5.0 (Table 1 and (56, 117)), suggesting that the side chains of residues X₁ and X₂ are not the main factors decreasing the pK _a of Cys30. Indeed, including a basic arginine in mutant -C₃₀-T₃₁-R₃₂-C₃₃- gave a pK _a of 4.76 for Cys30 (56), higher than the pK _a of ∼3.5 for the wild-type sequence -C₃₀-P₃₁-H₃₂-C₃₃-. This supports the view that positioning basic side chains in the vicinity of a cysteine is not sufficient to lower its pK _a. It is consistent with the observation that the pK _a of Cys30 is virtually insensitive to the E37Q and E38Q mutations in the surroundings of the DsbA active site (66). In fact, elimination of all charged residues in the neighborhood of the active site of DsbA had a negligible impact on the pK _a of its nucleophilic cysteine (74). Instead, in DsbA as well as in other members of the Trx superfamily, there is evidence that interactions between the thiolate and the backbone of residues X₁ and X₂ are critical for modulation of the thiolate pK _a (45).

In general, computational and experimental results indicate that the surrounding charged side chains do not primarily control the thiolate pK _a in the Trx superfamily. An interesting possible exception, however, has been uncovered with Arg8 in reduced E. coli Grx1 (43). A comparative study of homologous E. coli Grx1 and Grx3 by MD and pK _a calculations convincingly suggested that Arg8 forms a salt bridge (including a hydrogen bond) with Cys11 relatively frequently in Grx1 (43), in contrast to Lys8 and Cys11 in Grx3 (Fig. 5). This could be explained in terms of subtle differences between the conformational dynamics of Arg8 versus Lys8. Therefore, it was proposed that Arg8 contributes to lowering the pK _a of Cys11 in Grx1 (43), whereas Lys8 does not in Grx3. These computational insights have recently received indirect independent experimental support (155). Thus, peripheral side chains can sometimes influence the stability of the catalytic thiolate, and such a mechanism may play a role in other systems (83, 96, 100). The extracytoplasmic atypical Trx ResA provides another example, for which it has been reported that Glu80, located close to the active site, but not forming hydrogen bonds with the active-site thiolates, plays a role in controlling the acid–base properties of both active-site cysteines (100). Another interesting case is that of protein disulfide isomerase (PDI), where the pK _a values of cysteine residues play a crucial role during oxidative protein folding (83). In PDI, Arg120 was reported to lower the pK _a of the C-terminal cysteine from 9.2 to 8.6 (96). A movement of Arg120 in the active site was reported, but not described in detail (83), so it is unclear whether its effect on the C-terminal cysteine is mediated by a direct hydrogen bond or by longer-range interactions. Overall, however, the role of peripheral side chains for thiolate stabilization in redox enzymes of the Trx superfamily appears limited.

When charged side chains form hydrogen bonds with the thiolate, they stabilize it. Apart from Grx1, this was also demonstrated in human peroxiredoxin, where a conserved arginine residue (Arg127) is hydrogen bonded to the redox-active cysteine Cys51 and strongly diminishes the proton affinity of the thiolate form of Cys51 (16). The proton affinity is related to the Cys51 pK _a. Another conserved arginine residue (Arg150), which is also a part of the active site with Cys51, but is not hydrogen bonded to Cys51, has a much smaller influence on the Cys51 proton affinity. So, hydrogen bond contacts seem important to mediate the influence of charged side chains on cysteine pK _as.

Considering the enzymatic functional requirements, it is perhaps not surprising that nature has not selected flexible charged side chains as the main mechanism for thiolate stabilization in the Trx superfamily and its solvent-exposed active site. First, peripheral side chains have to be long (Arg and Lys), and therefore flexible, to reach the thiolate. Thus, a stable ionic contact with the thiolate would incur an entropic cost. Second, a side chain approaching the thiolate on its solvent-exposed side may occlude the active site and sterically prevent the approach between the thiolate and its substrate. Third, when charges are exposed to a high-dielectric medium such as water, the electrostatic interaction between charges is screened and strongly diminished. Therefore, water molecules in a solvent-exposed active site could easily disrupt the salt bridge. Fourth, the stabilization of the thiolate by charged side chains would expose the enzymatic activity to the influence of ionic strength. In contrast, this ionic strength effect is minor when the thiolate is stabilized by hydrogen bonds with neutral groups, for example, with backbone amides (44, 116). In addition, controlling a cysteine pK _a by local hydrogen bonds means that the peripheral ionized side chains can evolve independently of the maintenance of this pK _a (44). That leaves the peripheral side chains free to evolve under different selection pressures, guided instead, perhaps, by substrate recognition. Also, hydrogen bonds allow a much more precise molecular control of the enzyme chemistry, for instance, by discriminating between the active-site N-terminal and C-terminal cysteines in the -C-X-X-C- active-site motif of enzymes (Fig. 3). Indeed, it is difficult to imagine how long-range electrostatic interactions with flexible peripheral charged side chains would discriminate between the active-site N-terminal and C-terminal cysteines, since these two cysteines are very close in space.
Instead, very directional localized hydrogen bonds can dramatically decrease the pK _a of the N-terminal cysteine without affecting the neighboring C-terminal cysteine, as discussed in the next section.
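The screening argument made above (the third point) can be illustrated with Coulomb's law in a uniform dielectric. This is a deliberately crude sketch: the 6 Å separation and the two dielectric constants (80 for water, 4 for a low-dielectric protein interior) are assumed values chosen for illustration, and a uniform dielectric ignores the heterogeneity of a real protein–solvent system.

```python
COULOMB_CONSTANT = 332.06  # kcal Å mol^-1 e^-2, for charges in units of e


def coulomb_energy(q1: float, q2: float, distance_angstrom: float,
                   dielectric: float) -> float:
    """Coulomb interaction energy (kcal/mol) between two point charges."""
    if distance_angstrom <= 0 or dielectric <= 0:
        raise ValueError("distance and dielectric constant must be positive")
    return COULOMB_CONSTANT * q1 * q2 / (dielectric * distance_angstrom)


# Thiolate (-1) and a cationic side chain (+1) at an assumed 6 Å separation.
in_water = coulomb_energy(-1.0, 1.0, 6.0, 80.0)   # assumed dielectric of water
in_protein = coulomb_energy(-1.0, 1.0, 6.0, 4.0)  # assumed low-dielectric medium
print(f"water-like medium: {in_water:.2f} kcal/mol")
print(f"protein-like medium: {in_protein:.2f} kcal/mol")
```

In this simple picture, the same pair of charges interacts 20 times more weakly in the water-like medium than in the low-dielectric one, which is the ratio of the two assumed dielectric constants.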

B. The strong influence of direct hydrogen bonds on the pK_a of cysteines

Before discussing how hydrogen bonds perturb the pK _a of cysteine side chains, it is helpful to review the information pertaining to hydrogen bonds involving sulfur. Hydrogen bonds are energetically favorable interactions formed between a donor group (D−H) and an acceptor atom (A). D is an electronegative atom that polarizes the D^δ− −H^δ+ bond, resulting in a partial positive charge on the hydrogen atom. The hydrogen positive charge interacts with the (partial) negative charge of the acceptor atom, resulting in a favorable electrostatic interaction. Thus, hydrogen bonds are largely electrostatic interactions in nature, resulting in the well-known pattern: D^δ− −H^δ+–A^(δ)−.

Since sulfur is less electronegative than oxygen or nitrogen, the role of sulfur in hydrogen bonding has been a matter of debate. Compared to oxygen, sulfur has a lower electronegativity and a larger radius, which reduce the ability of sulfur to participate in hydrogen-bonding interactions, as a donor or as an acceptor. However, there is now evidence that the thiol group can act as a moderately strong hydrogen bond donor or acceptor, and that the thiolate is a hydrogen bond acceptor (1, 37, 39, 45, 57, 84, 129, 137, 183, 196). Hydrogen bonds with sulfur are longer than those with nitrogen or oxygen because of the size of the sulfur atom and its more diffuse electron cloud (57, 196). A statistical analysis of more than 500 high-resolution protein crystal structures indicated a 5:1 donor:acceptor ratio for sulfur in protein thiol groups (196), suggesting that sulfur in (neutral) thiols has a greater propensity to donate a hydrogen bond than to accept one. Yet, some QM calculations have suggested that sulfur may, at least sometimes, be almost as strong a hydrogen-bond acceptor as oxygen (137, 182). In the reduced -Cys₁-X₁-X₂-Cys₂- motif of the Trx superfamily, the cysteine thiols act as hydrogen bond donors and acceptors (39, 45, 129).

The perception of hydrogen bonds involving sulfur is probably dominated by crystallographic observations of distances and angles between sulfur atoms and potential donor or acceptor groups (1, 37, 57, 84, 196). A crystallography-based view of the geometric parameters of various types of hydrogen bonds involving sulfur in proteins was recently presented (196). To consider hydrogen bonds that stabilize thiolate anions, we concentrate on the geometric parameters of sulfur as a hydrogen bond acceptor. In protein crystal structures of NH–S systems, the distance between the donor nitrogen and the acceptor sulfur ranged from 3.25 to 3.55 Å, with deviations from linearity of the NH–S angle up to 25° (1, 37). An analysis of 151 high-resolution protein X-ray structures identified a mean distance of 3.54 Å and a mean angle of 138° for hydrogen bonds with cysteine sulfur atoms as hydrogen bond acceptors (196). Table 3 summarizes the mean distances and angles for cysteine sulfur accepting hydrogen bonds from different conventional donor types (196). In principle, one should also consider nonconventional, weaker, C−H hydrogen bonds to the sulfur. Indeed, there is now ample evidence that the C−H bond can be polarized enough to form interactions with a marked hydrogen bond character (33, 59, 167, 174). In proteins, examples of C−H hydrogen bond donors include the backbone C_α−H group and the aromatic C−H groups of aromatic side chains (18, 71, 168, 181). Although the C−H groups are weaker hydrogen bond donors than conventional donors, they could still contribute to tuning the pK _a of some thiols. One such example has been proposed with pig Grx (Fig. 6), for which a detailed analysis (44) suggested that an aromatic C−H from a phenylalanine contributes to stabilizing the thiolate. However, favorable interactions between a sulfur acceptor and nonconventional C−H hydrogen bond donors have barely been explored. Thus, our analysis concentrates on conventional hydrogen bonds to the sulfur.

FIG. 6.

Nonconventional hydrogen bonds may contribute to stabilize a thiolate. The figure shows snapshots of reduced pig glutaredoxin (pGrx) taken every nanosecond from an MD simulation of 120 ns (44), overlaid on the -Cys₂₂-Pro₂₃-Phe₂₄-Cys₂₅- motif. Three hydrogen bonds between the thiolate of Cys22 and conventional hydrogen bond donors (backbone N−H groups of Phe24 and Cys25, and thiol of Cys25) are shown with green dotted lines. A nonconventional hydrogen bond between the thiolate and an aromatic C−H group of the phenyl of Phe24 is also suggested (magenta dotted line). The significance of this interaction was discussed in detail (44), providing evidence that this interaction also contributes to stabilize the thiolate, in addition to conventional hydrogen bonds. Indeed, some C−H dipoles can be regarded as weak, but significant, hydrogen bond donors (33, 59, 167, 174). The C−H bonds of aromatic rings have marked dipoles, with an excess of positive charge on the proton (18, 71, 168, 181). In pGrx, one of the C−H dipoles of Phe24 was frequently positioned to point to the thiolate sulfur, with geometries consistent with some hydrogen-bonding character. The Phe24 side chain is at the protein surface, where it could adopt alternative conformations. However, the conformation with a stabilizing electrostatic contact between the phenyl group and the thiolate was favored. This is consistent with the conservation of an aromatic side chain at the X₂ position in the -Cys₁-X₁-X₂-Cys₂- motif of Grxs, where X₂ is either Tyr or Phe. An aromatic side chain is present at this position even in nonclassical Grx sequences, for example, -Cys₁₄-Val₁₅-Tyr₁₆-Cys₁₇- in phage T4 Grx or -Cys₃₀-Gly₃₁-Phe₃₂-Ser₃₃- in E. coli Grx4. (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

Table 3.

Geometric Characterization of Hydrogen Bonds Accepted by the Sulfur of Cysteines in Proteins with Different Donor Types

Donor type	Number of structures	d (S–H) Å	d (S–X) Å	θ (S–H–X)°
Backbone N	95	2.79 (0.23)^a	3.58 (0.19)	143.5 (24.6)
Amide N	12	2.87 (0.25)	3.61 (0.23)	136.8 (25.0)
Charged N	18	2.78 (0.33)	3.44 (0.23)	128.2 (22.2)
Aromatic N	8	3.05 (0.12)	3.50 (0.18)	109.4 (15.4)
Hydroxyl O	18	2.76 (0.35)	3.45 (0.24)	133.1 (27.8)
All	151	2.80 (0.26)	3.54 (0.21)	138.1 (25.6)

^a Geometric mean with standard deviation in brackets.

d (S–H): distance between the acceptor S and donor H atom. d (S–X): distance between the acceptor S and the donor X atom. θ (S–H–X): angle between sulfur, hydrogen, and donor.

Part of the data used in this table are reproduced from reference (196).

For redox cysteines, the geometric hydrogen-bonding parameters are likely to depend on whether the accepting sulfur is neutral or in the thiolate form. A thiolate acceptor may allow a broader range of angles for interaction with a hydrogen bond donor. In addition, one has to consider the situation where sulfur receives a hydrogen bond from another thiol, which lengthens the hydrogen bond distance. Thus, pragmatically, the following geometric criteria are frequently suitable to capture hydrogen bonds between sulfur and a donor atom: S–D distance <4 Å and S–H−D angle >90° (141, 196). In the following paragraphs, we discuss some examples that demonstrate the influence of hydrogen bond interactions on thiol pK _a in redox enzymes of the Trx superfamily.
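The pragmatic geometric criteria stated above can be applied to atomic coordinates as in the following minimal sketch. The function names and the coordinate format (x, y, z tuples in Å) are our assumptions; hydrogen positions are assumed to be available, for example, from a model with added hydrogens.

```python
import math


def angle_deg(a, vertex, b):
    """Angle a-vertex-b in degrees, from three (x, y, z) points."""
    v1 = [p - q for p, q in zip(a, vertex)]
    v2 = [p - q for p, q in zip(b, vertex)]
    dot = sum(p * q for p, q in zip(v1, v2))
    cos_theta = dot / (math.hypot(*v1) * math.hypot(*v2))
    # Guard against rounding slightly outside [-1, 1].
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_theta))))


def is_sulfur_hbond(sulfur, hydrogen, donor,
                    max_distance=4.0, min_angle=90.0):
    """Apply the criteria S–D distance < 4 Å and S–H–D angle > 90°."""
    close_enough = math.dist(sulfur, donor) < max_distance
    well_oriented = angle_deg(sulfur, hydrogen, donor) > min_angle
    return close_enough and well_oriented
```

For example, a linear arrangement with the donor 3.5 Å from the sulfur satisfies both criteria, whereas a donor at 5 Å, or a hydrogen pointing away from the sulfur, does not.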

For the conserved -Cys₁-X₁-X₂-Cys₂- motif of proteins in the Trx superfamily, observations link the pK _a of Cys₁ to the number of hydrogen bonds received by the acceptor sulfur of Cys₁. Structural analysis revealed that in reduced Trx, two hydrogen bonds are formed with Cys₁; three in reduced Grx; and four in reduced DsbA (45) (Fig. 5). This is consistent with the pK _a of 3.5 for Cys₁ in DsbA, of 4.0 to 5.0 in Grx, and ranging from 6.3 to 7.1 in Trx (38, 46, 77, 82). Note the importance of the hydrogen bonds with the backbone N–H groups, which may also rationalize the largest pK _a variations across the DsbA X₁-X₂ mutants. The predominant role of direct hydrogen bonding in stabilizing the thiolate in the Trx superfamily is consistent with the lack of effect of increased ionic strength (0.05–2 M) on the pK _a of Cys22 of human Grx1 (44, 116). Using PB-based calculations, the pK _a of Cys₁ in Grx was estimated with the hydrogen bonds to the thiolate formed or not, depending on the conformation adopted by the side chain of Cys₁ (45). Simply changing the rotamer of Cys₁ can switch the hydrogen bonds to the sulfur on or off, which showed that disrupting the hydrogen bonds clearly increases the calculated thiol pK _a. In general, the more hydrogen bonds to the sulfur that were present, the lower the pK _a of the thiol in PB calculations (35, 43, 45). The influence of hydrogen bonds on the pK _a of Cys₁ is also seen in calculations with PROPKA1 (103). PROPKA1 calculates a Cys₁ pK _a of 3.4, 4.4, and 5.5 in, respectively, DsbA (Protein Data Bank [PDB] entry 1DSB), human protein disulfide isomerase (hPDI, PDB entry 1MEK), and E. coli Grx3 (PDB entry 3GRX). PROPKA1 attributes these low pK _as to hydrogen bond interactions between the thiolate form of Cys₁ and surrounding residues (103).

There is also evidence that controlling the thiol pK _as with hydrogen bonds plays a role during thiol–disulfide exchange reaction mechanisms, as suggested for Trx and Staphylococcus aureus pI258 arsenate reductase (Sa_ArsC) (Fig. 7). Sa_ArsC is one of the endogenous substrates of the powerful reductase Trx (8, 21, 115). Trx reduces oxidized ArsC via the nucleophilic attack of Cys29 of Trx (Cys₁ in Cys₁-X₁-X₂-Cys₂) on the ArsC disulfide (Cys82–Cys89), leading to the release of Cys82 and formation of the Trx-ArsC Cys29–Cys89 mixed disulfide (115). In a subsequent step, this mixed disulfide needs to be reduced to release reduced ArsC (141). In this process, Cys32 of Trx (Cys₂ in Cys₁-X₁-X₂-Cys₂) attacks Cys29 of Trx in the mixed disulfide, and Trx becomes oxidized (Cys29–Cys32 disulfide formed). Thus, the dissociation of the Trx-ArsC mixed disulfide proceeds via the nucleophilic attack of Cys32 of Trx on Cys29 of the Cys29^Trx–Cys89^ArsC disulfide. In isolated, reduced Trx, Cys32 has a high pK _a (pK _a>9), and is present in its thiol form. In the mixed disulfide, the resolving cysteine Cys32 needs to be activated to its nucleophilic thiolate form. A detailed study of the Trx-ArsC mixed disulfide complex (141) suggested that two hydrogen bonds between the Cys32 sulfur and backbone amides of Cys29 and Trp28 of Trx stabilize the thiolate form of Cys32. In the presence of these hydrogen bonds, the pK _a of Cys32 drops to ∼7.7 (141), which activates Cys32 for its nucleophilic attack on Cys29. Formation of these hydrogen bonds to Cys32 was uncovered by MD simulations after localized conformational rearrangements around Cys32 (141). This illustrates how hydrogen bonds control the reactivity of a thiol via small, but precise, structural rearrangements.

FIG. 7.

Reaction mechanism of disulfide reduction by Trx. A schematic representation of the reaction mechanism is shown on top of the structures of Trx at each step of the reaction. The reaction starts with a nucleophilic attack of the N-terminal cysteine of the conserved -C-G-P-C- motif on the target disulfide (1). The thiolate of the nucleophilic cysteine is stabilized by two hydrogen bonds with the −NH of the glycine and the −SH of the C-terminal cysteine [PDB code: 1TRV (136)] (A). As a result, an intermediate mixed disulfide complex is formed between Trx and the substrate protein, which in turn is reduced by a nucleophilic attack of the C-terminal cysteine of the -C-G-P-C- motif (2). The C-terminal cysteine is primed for nucleophilic attack in the Trx–protein mixed disulfide complex. Selected snapshots from an MD simulation of the B. subtilis Trx and ArsC complex show that the thiolate on the C-terminal cysteine is stabilized with two backbone amide hydrogen bonds, which lowers its pK _a to 7.4 (B) (141). Further, the N-terminal cysteine of Trx has been found to be more susceptible to nucleophilic attack by the C-terminal cysteine and is also sterically closer to the C-terminal cysteine (141). A single catalytic reduction cycle ends with the release of a reduced substrate protein and oxidized Trx [PDB code: 1TRU (136)] (C). The figures were generated using MacPyMol (Delano Scientific LLC 2006). Reproduced with the permission of (24). (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

The above examples, and others (see section III.D), support the notion that hydrogen bonds may be the primary factors modulating thiol pK _as. It follows that a first indication of a decreased thiol pK _a could be gained from structural information, simply by counting the hydrogen bonds to sulfur atoms in the structural model. Since hydrogen bond interactions act at short range, they are very sensitive to the details of the protein structure and dynamics (43–45). Consequently, the details of the structural model used for visual interpretation and pK _a calculations are important, and apparently minor structural changes may strongly affect the calculated pK _a values of interest (3, 35, 43–45, 192). Hence, a degree of manipulation of the raw X-ray coordinates may be required to orientate protein side-chain amide and imidazole groups, as well as conserved water molecules, according to the most likely hydrogen-bonding network (123, 126). Also, NMR structures present special challenges, since their details can be prone to uncertainties, which affect the outcome of pK _a calculations (35, 43, 134).

Apart from hydrogen bonds, another putative influence on the pK _a needs further consideration. In the Trx superfamily active sites, Cys₁ is located at the N-terminus of an α-helix. Helices have long been perceived to decrease the pK _a of residues at their N-terminus. For example, a pK _a decrease of the N-terminal cysteine of, respectively, 1.8 and 2.0 units was measured in rhodanese (151) and human Trx (46). The measured pK _a of an N-terminal aspartate in a helical dodecapeptide was suppressed by 0.6 units (81). Earlier quantum chemical studies on papain have shown that a helix near the active site facilitates the proton transfer from the N-terminal cysteine to the histidine residue of the catalytic dyad (171). The origin of this effect is explored in the following section.

C. Reinterpretation of the helical effect on the pK_as of cysteines

The decreased pK _as of residues at the N-terminus of helices have been attributed for a long time to a helix–macrodipole effect, which would originate from the vector sum of the microdipole moments of the individual peptide units, and would be oriented along the helix-axis (67, 135, 150, 156, 176, 180). However, recent PB calculations on several model helices stressed that the helix dipole depends on the geometry and on the solvent exposure of the helix termini (152). Therefore, the helix macrodipole is not a simple vector sum of the individual dipoles. In vacuum, the helix dipole increases with the helical length, but in transmembrane helices in which both helical termini are solvent exposed, the helical dipole was reported to decrease with the helical length (152). For solvent-exposed helices, the effective dipole is strongly dependent on the orientation of the helix relative to the aqueous medium (152). When aqueous solvent is present, the helix macrodipole is counteracted by the solvent reaction field, which drastically reduces the long-range effects of the helix macrodipole. So, in many situations, there is effectively no helix macrodipole at work. Thus, although the older helix macrodipole hypothesis has been widely influential (19), there is strong evidence that the origin of the so-called helical effect can be explained and reinterpreted without needing to invoke a helical macrodipole. These new insights came from several studies, both computational and experimental (6, 45, 49, 66, 89, 103, 143, 147).

Early new insights into the helical effect on pK _as came from computational studies on papain, for which more than half of the helical effect was attributed to hydrogen bonds with the backbone rather than to the macrodipole (147). This emphasized localized short-range interactions in addition to long-range electrostatic interactions. With sulfate-binding protein, Åqvist et al. (6) calculated the electrostatic contribution to the free energy of interaction between the helix and the substrate SO₄ ²⁻ bound at its N-terminus, and concluded that the charge-stabilizing effect of the α-helix can be best explained by short-range interactions with individual peptide bond N–H dipoles at the N-terminus of the helix. They found that the first two helical turns account for 95% of the overall helical effect. The minimal influence of the helix macrodipole was supported by investigating the effect of introducing a positive charge by mutation at the helical C-terminus. Such charge would have neutralized the dipole charge, but its introduction had a negligible effect according to the calculations (6).

It has also been possible to address the helical effect experimentally, although it is more difficult to control the presence and geometry of a helix experimentally. DsbA offered an opportunity to study experimentally the effect of the helix dipole on the pK _a of its catalytic Cys30 by manipulating a kink in the relevant helix (66). Mutants designed to alter the helical kink were expected to affect the overall helix dipole; however, only a minor effect on the pK _a of Cys30 at the N-terminus of this helix was observed.

Kortemme and Creighton explored experimentally and systematically the nature of the helical effect (89), by monitoring the pK _a of a cysteine at the N- or C-terminus of model α-helical peptides. They observed that a thiol pK _a at the N-terminus of a peptide with high helical content was decreased by up to 1.6 pK _a units (pK _a values from 7.20 to 7.63) relative to a normal thiol pK _a measured in an unfolded peptide. The interpretation was that a combination of electrostatic charge–helix dipole and hydrogen bonding interactions contributes to the pK _a-lowering effect (89). Yet, several observations were consistent with the particular importance of hydrogen bonds and local effects. Thus, variation of the (neutral) amino acids at the peptide N-terminus had an impact on the thiol pK _a at this N-terminus, pointing to the importance of local conformational effects and geometries, compatible with the pK _a being decreased by hydrogen bonds. The same study varied the ionic strength to distinguish between stabilization of the thiolate by hydrogen bonds or by a more-diffuse electrostatic interaction between the thiolate and a helix macrodipole. Hydrogen-bonding interactions should be much less sensitive to screening by salt than charge–helix dipole interactions. Indeed, the thiol pK _a was only weakly affected by changes in ionic strength, suggesting a dominant role for hydrogen bonding. Further evidence for a very weak interaction between the thiolate charge and a helix macrodipole came from pK _as measured for thiols at the C-terminus of peptides with high helical content (89). Such pK _as increased by only 0.2 pK _a units relative to a normal cysteine pK _a. The asymmetry of the magnitude of the pK _a shifts observed at the N- versus C-terminus is difficult to reconcile with a strong overall helix macrodipole. However, this asymmetry can be explained in terms of local hydrogen bond interactions, since the helical N-terminus, but not the C-terminus, presents amide N–H donor groups for interactions with a negative charge.

Recent work showed that the main influence on the pK _a of Cys₁ of the -Cys₁-X₁-X₂-Cys₂- motif located at the N-terminus of an α-helix in proteins of the Trx superfamily is from hydrogen bond interactions. In the PB study of E. coli Grx3 (45), the effect of the entire helix on Cys₁ (Cys11) was investigated by incrementally turning off the charges on the peptide bond dipoles one after another, starting with the amide groups closest to the cysteine of interest (Fig. 8). This revealed that the first two helical amide groups decreased the Cys11 pK _a by ∼1.5 units each; the third and fourth amide groups decreased the pK _a by ∼0.5 and ∼0.2 units, respectively; with the sixth amide group, a decrease of only 0.09 units was found. So, the pK _a value of Cys₁ in the -Cys₁-X₁-X₂-Cys₂- motif could be predicted in satisfactory agreement with experiment without invoking the helix–macrodipole effect (45). This result was confirmed during the development of PROPKA (103), when it was found that pK _a values can be reproduced based on the numbers of hydrogen bonds formed with the cysteine sulfur without intervention of a helix macrodipole (there is no such parameter in the PROPKA model).
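The relative weight of the first two amide groups can be made explicit with simple arithmetic on the approximate per-amide contributions quoted above. This is only a rough sketch: the contribution of the fifth amide group is not quoted in the text and is therefore left out, so the computed share is a slight overestimate. The variable names are hypothetical.

```python
# Approximate pKa decrease attributed to each helical amide group of E. coli Grx3,
# as quoted in the text (amide index -> decrease in pKa units).
# The fifth amide group is not quoted and is omitted here.
amide_contribution = {1: 1.5, 2: 1.5, 3: 0.5, 4: 0.2, 6: 0.09}

total_shift = sum(amide_contribution.values())
first_two = amide_contribution[1] + amide_contribution[2]
share_first_two = first_two / total_shift

print(f"Quoted total downshift: {total_shift:.2f} pKa units")
print(f"Share from the first two amide groups: {share_first_two:.0%}")
```

With these numbers, the two amide groups that hydrogen bond directly to the sulfur account for roughly four fifths of the quoted downshift.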

FIG. 8.

Effect of the helix electrostatics on the p K _a of a cysteine thiol at the helix N-terminus. (A) Shows the reduced Cys11 (space-filling, sulfur in yellow) of E. coli Grx3, at the N-terminus of a helix comprising residues 12–24 (PDB entry 1ILB). For clarity, only the helix backbone and selected hydrogens are shown. The amide N−H groups are depicted in ball and stick. The two direct hydrogen bonds between the helix backbone amide N−H groups and the sulfur of Cys11 are shown as green dotted lines. (B) Shows how the pK _a of Cys11 is influenced by the electrostatics of the helix backbone, represented by the partial charges of its backbone amide groups. The pK _a was calculated with the Poisson-Boltzmann method in the context of the full protein, as described (45). This approach allows calculating the pK _a while the charges (dipoles) for specific amide groups of the helix are removed (i.e., turned off) from the calculation. The plot in (B) shows how the pK _a of Cys11 increases as the partial charges on the helix backbone amide groups are cumulatively removed, from the helix N-terminus to its C-terminus. The calculated pK _a of Cys11 with charges on all residues included (value 0 on the X-axis) was ∼5.2. Removing the charges on the first amide group at the helix N-terminus increased the pK _a to ∼6.7. This corresponds to removing a hydrogen bond between a backbone N−H group and the sulfur. Removing also the charges on the next backbone amide group removes the second hydrogen bond to the sulfur and increased the pK _a to ∼8.2. Importantly, removing the electrostatic influence of further backbone amide groups only has a limited impact on the pK _a. Thus, most of the pK _a downshift due to the helix can be attributed to its two N−H groups directly hydrogen bonding the sulfur. The backbone dipoles further away from the helix N-terminus have an increasingly minor contribution to the pK _a shift, arguing against a significant role for a helix macrodipole. (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

Further evidence came from a computational quantum mechanical study on the effect of the helix length on the pK _a of a cysteine thiol located at the N-terminus of model helices (143). Both 3₁₀- and α-helices of increasing lengths were tested, made of (neutral) alanine residues (143). The pK _as were calculated with the NPA method (see section II.B). An initial decrease of the cysteine pK _a with the first helical turns was found, but this effect weakened quickly after a few added residues. This pK _a decrease was accompanied by the increase of the backbone amide–SγCys hydrogen bond strength. The first Ala residue decreases the cysteine pK _a by 1.5 or 0.6 units in an α- or 3₁₀-helix, respectively. For the second residue, an extra decrease of, respectively, 0.2 or 0.1 pK _a units was found in α- or 3₁₀-helices. The third residue was responsible for an extra decrease of 0.1 units (Table 4). As such, the pK _a decrease diminished for every additional helical residue. Thus, once the hydrogen bonds to the sulfur are formed (typically associated with the first helical residues i.e., cysteine plus one alanine), the next residues strengthen these hydrogen bonds, but this has only a secondary effect on pK _a perturbation. These QM calculations (143) pointed out that the lowering of the pK _a of the N-terminal cysteine of an α- or 3₁₀-helix is largely due to the hydrogen bonds formed between the cysteine sulfur and the helical N-terminus.

Table 4.

Electrostatic Properties of Model 3₁₀-Helices and α-Helices, and Influence on the pK _a of Cysteines at the N-Terminus, Calculated Quantum Mechanically

	Dipole (D)	NPA Sγ (a.u.)	pK_a	Additional pK_a decrease per amino acid
3₁₀-helices
Cysteine	6.45	−0.824	8.70	—
S_3₁₀_2	14.33	−0.791	8.06	−0.64
S_3₁₀_3	14.38	−0.786	7.92	−0.14
S_3₁₀_4	17.97	−0.782	7.80	−0.12
α-helices
S_α_2	11.64	−0.763	7.22	−1.48
S_α_3	12.77	−0.758	7.04	−0.18
S_α_4	15.32	−0.753	6.90	−0.14
S_α_6	26.65	−0.754	6.93	+0.03

Helix–macrodipole, NPA charge and pK _a of N-terminal SγCys obtained in aqueous solution in 3₁₀-helices and in α-helices.

Table adapted from reference (143).
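The last column of Table 4 can be recomputed from the pK _a column as a consistency check. The sketch below assumes that the isolated cysteine (pK _a 8.70, the value implied by the first entry of the last column in both series) is the common reference for the 3₁₀- and α-helix series; the identifiers are hypothetical ASCII renderings of the table labels.

```python
# pKa values from Table 4; the isolated cysteine is assumed to be the
# common starting point of both helix series.
pka_series = {
    "3_10": [("Cysteine", 8.70), ("S_310_2", 8.06), ("S_310_3", 7.92), ("S_310_4", 7.80)],
    "alpha": [("Cysteine", 8.70), ("S_alpha_2", 7.22), ("S_alpha_3", 7.04),
              ("S_alpha_4", 6.90), ("S_alpha_6", 6.93)],
}

def stepwise_changes(series):
    """pKa change of each entry relative to the preceding entry in the series."""
    return [(name, round(cur - prev, 2))
            for (_, prev), (name, cur) in zip(series, series[1:])]

changes = {helix: stepwise_changes(series) for helix, series in pka_series.items()}
for helix, steps in changes.items():
    for name, delta in steps:
        print(f"{helix:>5}  {name:<10} {delta:+.2f}")
```

The recomputed steps make the trend easy to read: the first helical residue carries most of the shift, and later residues add progressively less.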

Overall, the above-mentioned studies consistently show that direct hydrogen bonds between the cysteine sulfur and the helical N-terminus are essentially sufficient to account for the thiol pK _a downshifts. To explain such pK _a shifts, the notion of helix macrodipole is not required. This further supports the idea developed in the previous section that hydrogen bonds to the cysteine sulfur are the main factors influencing the pK _a values of the corresponding thiol groups.

D. How general are the mechanisms modulating the pK_a of cysteines?

Apart from their function in the redox biochemistry, cysteines play important roles in the catalytic processes of a wide variety of enzymes (80, 109, 110) like proteases, transferases, kinases, phosphatases, and isomerases. Cysteine residues coordinate metallic redox centers as in iron–sulfur clusters. Coordination of metals by cysteines can also play structural roles, such as zinc coordination in Zn-finger domains (94). In addition, the structural organization and oxidative folding of proteins rely on disulfide bonds. One can surmise that the structural and physical principles modulating the cysteine thiol pK _a in redox enzymes, as discussed in the previous sections, are likely to be largely transferable to cysteines in other protein families.

For example, the reactive cysteine (Cys106) in human DJ-1 [a protein linked to Parkinson's disease and member of the class I glutamine amidotransferase-like superfamily (107)] has a decreased pK _a of 5.4. A marked contribution to this pK _a shift has been attributed to a hydrogen bond interaction with a conserved protonated glutamic acid Glu18, accounting for a pK _a decrease of 1.0 unit (184). On the other hand, the sulfate anion at 5.9 Å of the reactive cysteine only increases the thiol pK _a by 0.4 units (184). This further illustrates the limited role of long-range electrostatics and identifies hydrogen bonding as a key factor determining the pK _a of Cys106. Another example of hydrogen bonds determining a cysteine pK _a is found in human muscle creatine kinase. In this enzyme, the pK _a of cysteine 282 was measured to be 5.8 (177). QM calculations pointed out that the main determinants of this low pK _a are the hydrogen bonds between Cys282 and the −OH group of a serine side chain and a backbone amide (120, 177). Each hydrogen bond lowers the pK _a by, respectively, 0.8 and 1.5 units (120).

In human α-antitrypsin, the pK _a of the active Cys323 was measured to be 6.9 (58). QM studies indicated that a hydrogen bond with the amide group of a neighboring Leu residue decreases the pK _a of Cys323 by 1 unit (120). In ArsCs with a low-molecular-weight tyrosine phosphatase fold from S. aureus (145) and Corynebacterium glutamicum (175), a hydrogen bond network decreases the pK _a of the nucleophilic cysteine (Fig. 9). Another example is found in Tn501 mercuric ion reductase, the key enzyme involved in reducing Hg²⁺ to Hg⁰ in bacteria (11). Its N-terminal heavy-metal-associated domain contains two cysteines, Cys11 and Cys14, with decreased pK _as of 7.7 and 7.2, respectively (97). The authors invoke the position of both cysteines at the N-terminus of an α-helix as the main factor lowering the pK _as. This helical effect is not further characterized, that is, the authors neither give evidence for a helical dipole effect nor mention hydrogen bonds. Based on section III.C, we propose that the so-called helical effect may be attributed to hydrogen bond interactions between Cys11 or Cys14 and N-terminal helical residues. This is supported by an examination of the structure with the PDB code 2KT2 (97). Based on this NMR structure, the following hydrogen bonds are found between the thiol side chain of Cys14 and backbone amides: Cys14Sγ–NHTyr10: 3.4 Å and Cys14Sγ–NHCys11: 3.7 Å, while no hydrogen bonds formed with Cys11 could be observed. This is consistent with the authors' statement that the lower pK _a found for Cys14 compared to Cys11 may be attributed to the more precise positioning of Cys14 in the first turn of the helix compared to Cys11 located in the more flexible loop preceding the helix (97). Further, the −OH group of Tyr62 accounts for an extra pK _a decrease of 0.8–1 pK _a units for both cysteines, while the electrostatic interaction with the positively charged His8 side chain influences the pK _a of both cysteines by only 0.2 to 0.4 pK _a units (97). Again, this suggests that the decreased pK _as of Cys11 and Cys14 can be attributed to hydrogen bonding rather than to electrostatic interactions between charged residues.

FIG. 9.

A hydrogen-bonding network in ArsCs with a low-molecular-weight protein tyrosine phosphatase (LMW PTPase) fold lowers the p K _a of the nucleophilic cysteine. (A) View of the active-site P-loop of pI258 ArsC from S. aureus (PDB entry code 1LJL) (114). This potassium-binding site is an interesting feature observed in pI258 Sa_ArsC (95), as binding of K⁺ stabilizes the structure of Sa_ArsC and increases the specific activity by a factor of 5 (95, 140). Potassium is here a part of a hydrogen-bonding network (black) that decreases the pK _a of the nucleophilic cysteine thiol in pI258 Sa_ArsC (145). Other sulfur amide hydrogen bonds are in blue. (B) A view of the active-site P-loop of ArsC1′ from C. glutamicum (PDB entry code 3T38) (175). The hydrogen-bonding network from the lysine 144 (K144) via the asparagine 91 (N91) and serine 95 (S95) to the sulfur of C88 is indicated (black) next to the hydrogen bonds with the backbone amides (blue). The asparagine of the active-site P-loop is conserved in the β-conformation of the Ramachandran plot among ArsCs with a LMW-PTPase fold. In ArsC1′ and ArsC2 (not shown) from C. glutamicum, there is no potassium-binding site. Here, the charged N^ζH⁺ of a conserved lysine takes over the role of the potassium. The figure was generated using MacPyMol (Delano Scientific LLC 2006). (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

The lowering of the pK _a of the cysteine in glutathione (GSH) is seen in GSH S-transferases. GSH S-transferases are involved in cellular detoxification in a wide variety of organisms and catalyze the conjugation of GSH to electrophilic substrates by lowering the pK _a of the cysteine of GSH. In relation to this mechanism, the alpha, mu, pi, sigma, and theta GSH S-transferase classes are the best documented (7). GSH S-transferases are homodimers with a hydrophilic subunit interface and each polypeptide chain consists of two domains, an N-terminal domain with a Trx-fold and a C-terminal α-helical bundle. The N-terminal domain contains the active-site functional group, the hydroxyl group of a tyrosine or serine residue (79, 87, 105, 178), believed to activate the cysteine of GSH. The reactive species of GSH in the binary complexes is most probably the thiolate anion, which accepts a hydrogen bond from the seryl or tyrosyl hydroxyl group (E-OH–SG) and gathers additional stabilization from a positive charge of an arginine in the class α enzymes (Fig. 10). In some mu-class GSH S-transferases, extra stabilization of the GSH thiol comes from a second sphere of electrostatic effects in which the π-electron cloud of the tyrosine is involved (PDB code 6GST) (187). Hydrogen bonding and other electrostatic effects lower the pK _a of the GSH thiol from ∼9 to ∼6 in the enzyme–GSH complex (20, 105), so that it is predominantly present as thiolate at physiological pH, and more nucleophilic.

FIG. 10.

Ribbon diagram of the three-dimensional structure of a glutathione (GSH) S-transferase from the alpha class in complex with GSH. [Figure made with MacPyMol (Delano Scientific LLC 2006) using the structural coordinates of PDB code 1F3A (60).] A view of the GSH-binding site is shown. GSH S-transferases are homodimers, each consisting of an N-terminal Trx-fold and a C-terminal helical bundle. The polypeptide chains are in red and gray. The thiol of GSH makes hydrogen bonds with the tyrosine (Y8) in strand β1 and the arginine (R14) located at the N-terminus of helix α1 in the position of the nucleophilic cysteine in Trxs. The hydrogen bonds stabilize the thiolate of GSH for nucleophilic attack and transfer of GSH. (To see this illustration in color, the reader is referred to the web version of this article at www.liebertpub.com/ars.)

It is not surprising that the factors modulating cysteine pK _a values also modulate pK _a values of other titratable groups in proteins (103). Local interactions were also proposed to be the main pK _a determinants for aspartate and glutamate residues in turkey ovomucoid third domain (OMTKY3) (102), since calculated and experimental pK _a values were in agreement when considering only interactions in the immediate vicinity (4–5 Å) of Asp or Glu. The developers of PROPKA generalized these conclusions regarding the pK _as of Asp and Glu residues by identifying hydrogen bond interactions as the main source for their pK _a perturbations (103). It is known that Asp and Glu residues at the N-terminus of an α-helix usually have lower pK _a values (133). As with cysteines at helical N-termini, this effect has long been attributed to the helical macrodipole (67, 176). However, recent analyses concluded that hydrogen bonds between helical backbone amides and Asp and Glu residues, rather than a helical macrodipole, are the main contributors to their decreased pK _a values (133). These conclusions echo those obtained for cysteines (45, 143).

IV. Functional Properties Influenced by the Cysteine pK _as

The previous sections summarized techniques for determining the pK _as of cysteines and gaining insight into the factors that modulate those pK _as. This was largely illustrated based on the active-site cysteines of the redox proteins of the Trx superfamily. During a catalytic cycle, those cysteines undergo oxidation and reduction. The enzymatic activity of these proteins is determined by their reaction kinetics and redox potentials. In this section, we discuss how these properties are influenced by the relevant cysteine pK _as. This underlines the functional relevance of the pK _a of cysteines involved in enzymatic catalysis.

Lowered pK _a values of catalytic cysteines influence the reaction kinetics and thermodynamics, and strongly influence the catalytic efficiency of an enzyme. This is especially true for thiol–disulfide exchange reactions, which are characterized by a Brønsted coefficient 0<β_nuc<1 for the nucleophilic cysteine (17, 78). The β-coefficients (β_nuc, β_lg, and β_c) determine the slope of the plot of the logarithm of the second-order rate constant versus the pK _a: [log(k _s−)=β_nuc×pK_a(nuc)+β_lg×pK_a(lg)+β_c×pK_a(c)+C, with C a constant applicable for a specific thiol–disulfide exchange reaction, and β_lg and β_c, the Brønsted coefficients for respectively the leaving group thiol and the central thiol when a thiolate attacks an unsymmetrical disulfide] (Fig. 11). Brønsted coefficients characterize the sensitivity of the reaction to the pK _a. The coefficient establishes the change in atomic charge as the reaction proceeds from the ground state to the transition state (Fig. 11). Complete proton transfer to the nucleophile gives a value for β_nuc of 1, and no transfer a value of 0. At those extreme values, changing the pK _a has no influence on the reaction rate. For nucleophilic thiols with a pK _a below the solution pH, an increased concentration of the thiolate is less significant than the decreased nucleophilicity resulting from electron withdrawal. The most significant effect in thiol–disulfide exchange reactions comes from the decrease of the pK _a of the leaving group. As the pK _a of the leaving thiolate is decreased by electron-withdrawing substituents or electrostatic effects, the rate constant for the reaction increases by a factor of 3.2 to 5 for each unit decrease in the pK _a of the leaving thiolate (β_lg=−0.5 to −0.7) (164). Decreasing the pK _a of the leaving thiolate from 8.5 to 4.5 should increase the rate for thiol–disulfide exchange by ∼100-fold to 630-fold (52). Also, the electron withdrawal on the central thiol (β_c) will accelerate the thiol–disulfide exchange by a factor of ∼2 for each unit decrease in the pK _a (β_c=−0.3) (164). Altogether, the decrease of the thiolate pK _a in proteins should be more important in the function of the thiolate as a leaving group than in the function of the thiolate as a nucleophile.
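The rate factors quoted in this paragraph follow directly from the Brønsted relation: if log k changes linearly with pK _a with slope β, a pK _a change of ΔpK _a multiplies k by 10^(β×ΔpK _a). The short sketch below reproduces the quoted numbers from the quoted β values; the function name is hypothetical.

```python
def rate_factor(beta, delta_pka):
    """Fold change in the rate constant for a pKa change of delta_pka,
    given a Bronsted slope beta (log k = beta * pKa + constant)."""
    return 10.0 ** (beta * delta_pka)

# Leaving group, one-unit pKa decrease, beta_lg = -0.5 to -0.7
per_unit_lg = (rate_factor(-0.5, -1.0), rate_factor(-0.7, -1.0))

# Leaving group pKa lowered from 8.5 to 4.5 (delta = -4.0)
four_units_lg = (rate_factor(-0.5, 4.5 - 8.5), rate_factor(-0.7, 4.5 - 8.5))

# Central sulfur, one-unit pKa decrease, beta_c = -0.3
per_unit_c = rate_factor(-0.3, -1.0)

print(f"Leaving group, per pKa unit: {per_unit_lg[0]:.1f}- to {per_unit_lg[1]:.1f}-fold")
print(f"Leaving group, 8.5 -> 4.5:   {four_units_lg[0]:.0f}- to {four_units_lg[1]:.0f}-fold")
print(f"Central sulfur, per pKa unit: {per_unit_c:.1f}-fold")
```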

FIG. 11.

The decrease of the thiolate p K a in proteins should be more important in its function as leaving group than as nucleophile. (A) Thiol–disulfide exchange reaction showing the ground state and the transition state structures. The nucleophilic attack of a negatively charged thiolate on an unsymmetrical disulfide results in a transition state with the negative charge distributed on the nucleophilic thiolate (S_nuc), the leaving thiolate (S_lg), and the central sulfur (S_c). (B) The effect of the electron withdrawal on the rate constants for thiol–disulfide exchange at pH 7.0. The figure was reproduced from the paper of Hiram Gilbert (52) showing an educative view based on approximate rate constants of LMW thiols calculated using the Brønsted relationship of Szajewski and Whitesides (164). With a pKa of the nucleophile >7, lowering the pKa (increasing electron withdrawal) will result in an increased reactivity at pH 7 due to an increase of the thiolate concentration [S_nuc ⁻]. With a pKa of the nucleophile <7, lowering the pKa results in a decreased nucleophilicity and reaction rate due to electron withdrawal. These two opposing effects result in an optimum pKa around the solution pH. As the pKa of the leaving thiolate (S_lg ⁻) is decreased by electron withdrawal by electrostatic effects, the rate of the reaction increases linearly with decreasing pKa. Both effects are clearly visualized in the log(k_obs)-versus-pK _a plot for which the slopes indicate the magnitude of the Brønsted coefficient in the Brønsted equation: log(k _s−)=β_nuc×pK_a(nuc)+β_lg×pK_a(lg)+β_c×pK_a(c)+C.

Another example of how the pK _a determines reaction rates is the correlation between a cysteine pK _a and the rate constant of the H₂O₂-induced cysteine oxidation to sulfenic acid. Thiolates react more rapidly with H₂O₂ than thiols (183); here again, a low pK _a means a high reaction rate. Ferrer-Sueta et al. refined this overall view (41). Features that act to decrease the pK _a may also decrease the nucleophilic character of the thiol, and hence make it less reactive. The effect of lowering pK _a on rate enhancement will in general be most significant when pK _a values are close to solution pH. A small increase in the concentration of the thiolate resulting from a further reduction of pK _a will be undermined by the corresponding decreased nucleophilicity resulting from electron withdrawal (Fig. 11).
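The two opposing effects can be combined in a simple model of the observed rate at fixed pH: the intrinsic reactivity of the thiolate follows the Brønsted relation, while the fraction of thiolate follows the Henderson–Hasselbalch relation. The sketch below is illustrative only. It assumes β_nuc = 0.5 (any value between 0 and 1 is allowed by the text), a solution pH of 7.0, and an arbitrary constant C = 0; the function names are hypothetical.

```python
import math

def log_k_obs(pka, ph=7.0, beta_nuc=0.5, c=0.0):
    """log10 of the observed rate constant of a nucleophilic thiol at a given pH.

    log k_obs = (beta_nuc * pKa + c)             # intrinsic thiolate reactivity
                - log10(1 + 10**(pKa - pH))      # log of the thiolate fraction
    """
    log_k_thiolate = beta_nuc * pka + c
    log_thiolate_fraction = -math.log10(1.0 + 10.0 ** (pka - ph))
    return log_k_thiolate + log_thiolate_fraction

def optimal_pka(ph=7.0, beta_nuc=0.5):
    """pKa that maximizes k_obs; from d(log k_obs)/d(pKa) = 0, which gives
    10**(pKa - pH) = beta_nuc / (1 - beta_nuc)."""
    return ph + math.log10(beta_nuc / (1.0 - beta_nuc))

for pka in (5.0, 6.0, 7.0, 8.0, 9.0):
    print(f"pKa {pka:.1f}: log k_obs = {log_k_obs(pka):.2f} (arbitrary offset)")
print(f"Optimal pKa for beta_nuc = 0.5 at pH 7.0: {optimal_pka():.2f}")
```

Under these assumptions the optimum lies at the solution pH; a smaller assumed β_nuc shifts the optimum below the pH and a larger one shifts it above.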

In the Trx superfamily, the pK _a of the nucleophilic cysteine is related not only to reaction rates but also to the disulfide reduction potential. When this pK _a decreases, the associated disulfide/thiol reduction potential increases, that is, the enzyme becomes a stronger oxidant. This was shown for Trx and DsbA active-site mutants for which a linear correlation between pK _a and reduction potential was found (69, 142). Empirical relations between the cysteine pK _a and the disulfide/thiol reduction potential have been proposed for Trx-type oxidoreductases (118). Here, we show a correlation for Trx-fold enzymes of E. coli (Fig. 12). The redox potential increases by ∼54 mV for each unit decrease in pK _a. The only exception is seen for the gamma-domain of DsbD, which has an unusually high pK _a of more than 9 [9.3 (161) and 10.5 (112)] and a redox potential of −241 mV (25). This makes the nucleophilic cysteine (Cys461) poorly reactive toward the disulfide of DsbDα and prevents its nonspecific oxidation (32). However, formation of the complex between DsbDγ (C-terminal domain) and DsbDα (N-terminal domain) seems to lower the pK _a of Cys461, allowing the transfer of electrons between these two DsbD domains (112, 113).

FIG. 12.

The redox potential of Trx-fold enzymes correlates with the pKa of the active-site cysteines. The redox potentials of Trx-fold oxidoreductases from E. coli were plotted versus their respective experimentally determined pKa values. The redox potential increases by ∼54 mV for each unit decrease in pKa (r²=0.9). The pKa values are from Table 1. The redox potentials of the E. coli oxidoreductases are taken from the following articles: DsbA (56), DsbC (193), DsbD (25), DsbG (15, 172), Trx2 (40), Trx (21, 92), and Grx1 and Grx3 (8).
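To give a feel for the size of the ∼54 mV per pK _a unit slope, the following sketch converts a pK _a shift into a redox potential shift and then into the corresponding change in an equilibrium constant through ΔG = −nFΔE. The function names are hypothetical. The two-electron stoichiometry and the temperature of 298.15 K are assumptions made for illustration, and only the slope is taken from the correlation above.

```python
import math

SLOPE_MV_PER_PKA = -54.0   # empirical slope quoted in the text (mV per pKa unit)
FARADAY = 96485.33212      # C/mol
GAS_CONSTANT = 8.314462618 # J/(mol K)


def redox_shift_mv(delta_pka: float, slope: float = SLOPE_MV_PER_PKA) -> float:
    """Estimated change in redox potential (mV) for a given pKa change."""
    return slope * delta_pka


def equilibrium_factor(delta_e_mv: float, n_electrons: int = 2,
                       temperature_k: float = 298.15) -> float:
    """Factor by which an equilibrium constant changes for a potential shift.

    Uses K2/K1 = exp(n * F * dE / (R * T)); n = 2 and T = 298.15 K are
    assumptions for a thiol-disulfide couple at 25 degrees C.
    """
    delta_e_v = delta_e_mv / 1000.0
    return math.exp(n_electrons * FARADAY * delta_e_v
                    / (GAS_CONSTANT * temperature_k))


if __name__ == "__main__":
    shift = redox_shift_mv(-1.0)  # pKa lowered by one unit
    print(f"Potential shift: {shift:+.0f} mV")
    print(f"Equilibrium constant changes by a factor of "
          f"{equilibrium_factor(shift):.0f}")
```

Under these assumptions, lowering the pK _a by one unit raises the potential by 54 mV, which corresponds to a change of well over one order of magnitude in the equilibrium constant of the associated two-electron exchange.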

The importance of pK _as with respect to health and disease can be gathered from many examples. Trx proteins, the function of which largely depends on the pK _a of the cysteines in the -Cys₁-X-X-Cys₂- motif, are involved in the regulation of diverse proteins like peroxiredoxin, transcription factors, and ribonucleotide reductase. As such, Trxs are involved in maintaining the redox homeostasis of the cell (22, 24, 63, 78), antioxidant defense, apoptosis, and DNA synthesis and repair. All these processes are implicated in many diseases, like diabetes, cardiovascular diseases, cancer, Alzheimer's disease, and Parkinson's disease. Likewise, proteins involved in protein folding, like PDI, have a direct impact on heart and kidney failure (34). In addition, GSH S-transferases, which catalyze the conjugation of GSH to a broad range of xenobiotic substrates by lowering the pK _a of GSH, are well known to influence the metabolism of a number of drugs (65, 148). Thus, there is ample evidence that cysteine pK _as are critical to many biomolecular mechanisms essential for cellular functions, and are also involved in pharmacological processes. Therefore, understanding the molecular factors that control these pK _as will help future research to understand the molecular basis of some disease conditions.

V. Conclusions

Empirical and case-by-case determination of the pK _a values of catalytic cysteines remains very valuable, since it provides insights into the chemistry of individual enzymatic reactions. Determining additional reliable experimental pK _a data is also very important to strengthen the foundations for the development of theoretical predictive methods. However, there are limits to what one can expect from time-consuming experimental measurements. It is probably unrealistic to expect to measure all interesting pK _a values. Indeed, many relevant pK _a values will adjust transiently during molecular encounters and complex reaction mechanisms (141). In these situations, it is already clear that computational approaches will become the methods of choice. However, for those theoretical methods to be used routinely and confidently, their performance needs to be further tested and improved. The results of accurate calculations will provide not only the pK _a values themselves but also precise insights into the factors modulating the pK _as and the associated functional properties. Such improvements will require method developments, which would benefit immensely from close collaborations between computational and experimental scientists.

Acknowledgments

JM is group leader of Redox Biology at the Vlaams Instituut voor Biotechnologie (VIB), and thanks NF and GR for this fruitful collaboration. GR thanks the Fund for Scientific Research Flanders (FWO) for a postdoctoral fellowship. We would like to thank all 10 reviewers for their valuable comments, which have enormously improved this review.

References

1. Adman, Watenpaugh, Jensen. NH–S hydrogen bonds in Peptococcus aerogenes ferredoxin, Clostridium pasteurianum rubredoxin, and Chromatium high potential iron protein. Proc Natl Acad Sci U S A, 72:4854–4858. 1975.

2. Alexov, Mehler, Baker, A, Huang, Milletti, Erik Nielsen, Farrell, Carstensen, Olsson, Shen, Warwicker, Williams, Word. Progress in the prediction of pK(a) values in proteins. Proteins, 79:3260–3275. 2011.

3. Alexov, Gunner. Incorporating protein conformational flexibility into the calculation of pH-dependent protein properties. Biophys J, 74:2075–2093. 1997.

4. Antosiewicz, McCammon, Gilson. Prediction of pH-dependent properties of proteins. J Mol Biol, 238:415–436. 1994.

5. Antosiewicz, McCammon, Gilson. The determinants of pKas in proteins. Biochemistry, 35:7819–7833. 1996.

6. Åqvist, Luecke, Quiocho, Warshel. Dipoles localized at helix termini of proteins stabilize charges. Proc Natl Acad Sci U S A, 88:2026. 1991.

7. Armstrong. Structure, catalytic mechanism, and evolution of the glutathione transferases. Chem Res Toxicol, 10:2–18. 1997.

8. Åslund, Berndt, Holmgren. Redox potentials of glutaredoxins and other thiol-disulfide oxidoreductases of the thioredoxin superfamily determined by direct protein-protein redox equilibria. J Biol Chem, 272:30780–30786. 1997.

9. Åslund, Ehn, Miranda-Vizuete, Pueyo, Holmgren. Two additional glutaredoxins exist in Escherichia coli: glutaredoxin 3 is a hydrogen donor for ribonucleotide reductase in a thioredoxin/glutaredoxin 1 double mutant. Proc Natl Acad Sci U S A, 91:9813–9817. 1994.

10. Bacik, Hazes. Crystal structures of a poxviral glutaredoxin in the oxidized and reduced states show redox-correlated structural changes. J Mol Biol, 365:1545–1558. 2007.

11. Barkay, Miller, Summers. Bacterial mercury resistance from atoms to ecosystems. FEMS Microbiol Rev, 27:355–384. 2003.

12. Bas, Rogers, Jensen. Very fast prediction and rationalization of pKa values for protein-ligand complexes. Proteins, 73:765–783. 2008.

13. Bashford, Karplus. pK_a's of Ionizable groups in proteins: Atomic detail from a continuum electrostatic model. Biochemistry, 29:10219–10225. 1990.

14. Benesch, Benesch. The acid strength of the -SH Group in cysteine and related compounds. J Am Chem Soc, 77:5877–5881. 1955.

15. Bessette, Cotto, Gilbert, Georgiou. In vivo and in vitro function of the Escherichia coli periplasmic cysteine oxidoreductase DsbG. J Biol Chem, 274:7784–7792. 1999.

16. Billiet, Geerlings, Messens, Roos. The thermodynamics of thiol sulfenylation. Free Radic Biol Med, 52:1473–1485. 2012.

17. Bulaj, Kortemme, Goldenberg. Ionization-reactivity relationships for cysteine thiols in polypeptides. Biochemistry, 37:8965–8972. 1998.

18. Burley, Petsko. Aromatic-aromatic interaction: a mechanism of protein structure stabilization. Science, 229:23–28. 1985.

19. Carvalho, Fernandes, Ramos. Similarities and differences in the thioredoxin superfamily. Prog Biophys Mol Biol, 91:229–248. 2006.

20. Chen, Graminski, Armstrong. Dissection of the catalytic mechanism of isozyme 4–4 of glutathione S-transferase with alternative substrates. Biochemistry, 27:647–654. 1988.

21. Cheng, Arscott, Ballou, Williams Jr. The relationship of the redox potentials of thioredoxin and thioredoxin reductase from Drosophila melanogaster to the enzymatic mechanism: reduced thioredoxin is the reductant of glutathione in Drosophila. Biochemistry, 46:7875–7885. 2007.

22. Cheng, Zhang, Ballou, Williams Jr. Reactivity of thioredoxin as a protein thiol-disulfide oxidoreductase. Chem Rev, 111:5768–5783. 2011.

23. Click, Kaminski. Reproducing basic pKa values for turkey ovomucoid third domain using a polarizable force field. J Phys Chem B, 113:7844–7850. 2009.

24. Collet, Messens. Structure, function, and mechanism of thioredoxin proteins. Antioxid Redox Signal, 13:1205–1216. 2010.

25. Collet, Riemer, Bader, Bardwell. Reconstitution of a disulfide isomerization system. J Biol Chem, 277:26886–26892. 2002.

26. CRC. Handbook of Chemistry and Physics. Florida: CRC Press, 1994.

27. Creighton. Proteins: Structures and Molecular Properties. New York: W.H. Freeman, 1993.

28. Crow, Acheson, Le Brun, Oubrie. Structural basis of Redox-coupled protein substrate selection by the cytochrome c biosynthesis protein ResA. J Biol Chem, 279:23654–23660. 2004.

29. D'Ambrosio, Pedone, Langella, De Simone, Rossi, Pedone, Bartolucci. A novel member of the protein disulfide oxidoreductase family from Aeropyrum pernix K1: structure, function and electrostatics. J Mol Biol, 362:743–752. 2006.

30. Davies, Toseland, Moss, Flower. Benchmarking pK(a) prediction. BMC Biochem, 7:18. 2006.

31. Demchuk, Wade. Improving the continuum dielectric approach to calculating pK_as of ionizable groups in proteins. J Phys Chem, 100:17373–17387. 1996.

32. Depuydt, Messens, Collet. How proteins form disulfide bonds. Antioxid Redox Signal, 15:49–66. 2011.

33. Desiraju, Steiner. IUCr Monographs on Crystallography, 9. Oxford: Oxford University Press/International Union of Crystallography, 1999.

34. Dickhout, Carlisle, Austin. Interrelationship between cardiac hypertrophy, heart failure, and chronic kidney disease: endoplasmic reticulum stress as a mediator of pathogenesis. Circ Res, 108:629–642. 2011.

35. Dillet, Dyson, Bashford. Calculations of electrostatic interactions and pKas in the active site of Escherichia coli thioredoxin. Biochemistry, 37:10298–10306. 1998.

36. Discola, de Oliveira, Rosa Cussiol, Monteiro, Barcena, Porras, Padilla, Guimaraes, Netto. Structural aspects of the distinct biochemical properties of glutaredoxin 1 and glutaredoxin 2 from Saccharomyces cerevisiae. J Mol Biol, 385:889–901. 2009.

37. Donohue. On N-H–S hydrogen bonds. J Mol Biol, 45:231–235. 1969.

38. Dyson, Jeng, Tennant, Slaby, Lindell, Cui, Kuprin, Holmgren. Effects of buried charged groups on cysteine thiol ionization and reactivity in Escherichia coli thioredoxin: structural and functional characterization of mutants of Asp 26 and Lys 57. Biochemistry, 36:2622–2636. 1997.

39. Dyson, Tennant, Holmgren. Proton-transfer effects in the active-site region of Escherichia coli thioredoxin using two-dimensional 1H NMR. Biochemistry, 30:4262–4268. 1991.

40. El Hajjaji, Dumoulin, Matagne, Colau, Roos, Messens, Collet. The zinc center influences the redox and thermodynamic properties of Escherichia coli thioredoxin 2. J Mol Biol, 386:60–71. 2009.

41. Ferrer-Sueta, Manta, Botti, Radi, Trujillo, Denicola. Factors affecting protein thiol reactivity and specificity in peroxide reduction. Chem Res Toxicol, 24:434–450. 2011.

42. Fitch, Garcia-Moreno. Structure-based pKa calculations using continuum electrostatics methods. Curr Prot Bioinform, 8.11.1–8.11.22. 2006.

43. Foloppe, Nilsson. The glutaredoxin -C-P-Y-C- motif: influence of peripheral residues. Structure, 12:289–300. 2004.

44. Foloppe, Nilsson. Stabilization of the catalytic thiolate in a mammalian glutaredoxin: structure, dynamics and electrostatics of reduced pig glutaredoxin and its mutants. J Mol Biol, 372:798–816. 2007.

45. Foloppe, Sagemark, Nordstrand, Berndt, Nilsson. Structure, dynamics and electrostatics of the active site of glutaredoxin 3 from Escherichia coli: Comparison with functionally related proteins. J Mol Biol, 310:449–470. 2001.

46. Forman-Kay, Clore, Gronenborn. Relationship between electrostatics and redox function in human thioredoxin: characterization of pH titration shifts using two-dimensional homo- and heteronuclear NMR. Biochemistry, 31:3442–3452. 1992.

47. Gan, Wells. Identification and reactivity of the catalytic site of pig liver thioltransferase. J Biol Chem, 262:6704–6797. 1987.

48. Gan Z-R, Sardana, Jacobs, Polokoff. Yeast thioltransferase-the active site cysteines display differential reactivity. Arch Biochem Biophys, 282:110–115. 1990.

49. Gane, Freedman, Warwicker. A molecular model for the redox potential difference between thioredoxin and DsbA, based on electrostatics calculations. J Mol Biol, 249:376–387. 1995.

50. This reference has been deleted.

51. Georgescu, Alexov, Gunner. Combining conformational flexibility and continuum electrostatics for calculating pK(a)s in proteins. Biophys J, 83:1731–1748. 2002.

52. Gilbert. Molecular and cellular aspects of thiol-disulfide exchange. Adv Enzymol Relat Areas Mol Biol, 63:69–172. 1990.

53. Gilson. Multiple-site titration and molecular modeling: two rapid methods for computing energies and forces for ionizable groups in proteins. Proteins, 15:266–282. 1993.

54. Gilson, Honig. Calculation of the total electrostatic energy of a macromolecular system: solvation energies, binding energies, and conformational analysis. Proteins, 4:7–18. 1988.

55. Gilson, Honig. The dielectric constant of a folded protein. Biopolymers, 25:2097–2119. 1986.

56. Grauschopf, Winther, Korber, Zander, Dallinger, Bardwell. Why is DsbA such an oxidizing disulfide catalyst? Cell, 83:947–955. 1995.

57. Gregoret, Rader, Fletterick, Cohen. Hydrogen bonds involving sulfur atoms in proteins. Proteins, 9:99–107. 1991.

58. Griffiths, King, Cooney. The reactivity and oxidation pathway of cysteine 232 in recombinant human alpha 1-antitrypsin. J Biol Chem, 277:25486–25492. 2002.

59. […], Kar, Scheiner. Fundamental Properties of the C-H···O Interaction: Is it a True Hydrogen Bond? J Am Chem Soc, 121:9411–9422. 1999.

60. […], Singh, Ji. Residue R216 and catalytic efficiency of a murine class alpha glutathione S-transferase toward benzo[a]pyrene 7(R),8(S)-diol 9(S), 10(R)-epoxide. Biochemistry, 39:12552–12557. 2000.

61. Guddat, Bardwell, Glockshuber, Huber-Wunderlich, Zander, Martin. Structural analysis of three His32 mutants of DsbA: support for an electrostatic role of His32 in DsbA stability. Protein Sci, 6:1893–1900. 1997.

62. Harris, Turner. Structural basis of perturbed pKa values of catalytic groups in enzyme active sites. IUBMB Life, 53:85–98. 2002.

63. Hatahet, Ruddock. Protein disulfide isomerase: a critical evaluation of its function in disulfide bond formation. Antioxid Redox Signal, 11:2807–2850. 2009.

64. Hawkins, Freedman. The reactivities and ionization properties of the active-site dithiol groups of mammalian protein disulphide-isomerase. Biochem J, 275(Pt 2):335–339. 1991.

65. Hayes, Flanagan, Jowsey. Glutathione transferases. Annu Rev Pharmacol Toxicol, 45:51–88. 2005.

66. Hennecke, Spleiss, Glockshuber. Influence of acidic residues and the kink in the active-site helix on the properties of the disulfide oxidoreductase DsbA. J Biol Chem, 272:189–195. 1997.

67. Hol, van Duijnen, Berendsen. The alpha-helix dipole and the properties of proteins. Nature, 273:443–446. 1978.

68. Honig, Nicholls. Classical electrostatics in biology and chemistry. Science, 268:1144–1149. 1995.

69. Huber-Wunderlich, Glockshuber. A single dipeptide sequence modulates the redox properties of a whole enzyme family. Fold Des, 3:161–171. 1998.

70. Hugo, Turell, Manta, Botti, Monteiro, Netto, Alvarez, Radi, Trujillo. Thiol and sulfenic acid oxidation of AhpE, the one-cysteine peroxiredoxin from Mycobacterium tuberculosis: kinetics, acidity constants, and conformational dynamics. Biochemistry, 48:9416–9426. 2009.

71. Hunter, Sanders JKM. The Nature of π−π Interactions. J Am Chem Soc, 112:5525–5534. 1990.

72. Ingelman, Nordlund, Eklund. The structure of a reduced mutant T4 glutaredoxin. FEBS Lett, 370:209–211. 1995.

73. Iqbalsyah, Moutevelis, Warwicker, Errington, Doig. The CXXC motif at the N terminus of an α-helical peptide. Protein Sci, 15:1945–1950. 2006.

74. Jacobi, Huber-Wunderlich, Hennecke, Glockshuber. Elimination of all charged residues in the vicinity of the active-site helix of the disulfide oxidoreductase DsbA. Influence of electrostatic interactions on stability and redox properties. J Biol Chem, 272:21692–21699. 1997.

75. Jao, English Ospina, Berdis, Starke, Post, Mieyal. Computational and mutational analysis of human glutaredoxin (thioltransferase): probing the molecular basis of the low pKa of cysteine 22 and its role in catalysis. Biochemistry, 45:4785–4796. 2006.

76. Jeng, Campbell, Begley, Holmgren, Case, Wright, Dyson. High-resolution solution structures of oxidized and reduced Escherichia coli thioredoxin. Structure, 2:853–868. 1994.

77. Jeng, Holmgren, Dyson. Proton sharing between cysteine thiols in Escherichia coli thioredoxin: implications for the mechanism of protein disulfide reduction. Biochemistry, 34:10101–10105. 1995.

78. Jensen, Hansen, Winther. Kinetic and thermodynamic aspects of cellular thiol-disulfide redox regulation. Antioxid Redox Signal, 11:1047–1058. 2009.

79. […], von Rosenvinge, Johnson, Tomarev, Piatigorsky, Armstrong, Gilliland. Three-dimensional structure, catalytic properties, and evolution of a sigma class glutathione transferase from squid, a progenitor of the lens S-crystallins of cephalopods. Biochemistry, 34:5317–5328. 1995.

80. Jones, Go. Mapping the cysteine proteome: analysis of redox-sensing thiols. Curr Opin Chem Biol, 15:103–112. 2011.

81. Joshi, Meier. The effect of a peptide helix macrodipole on the pKa of an Asp side chain carboxylate. J Am Chem Soc, 118:12038–12044. 1996.

82. Kallis, Holmgren. Differential reactivity of the functional sulfhydryl groups of cysteine-32 and cysteine-35 present in the reduced form of thioredoxin from Escherichia coli. J Biol Chem, 255:10261–10265. 1980.

83. Karala, Lappi, Ruddock. Modulation of an active-site cysteine pKa allows PDI to act as a catalyst of both disulfide bond formation and isomerization. J Mol Biol, 396:883–892. 2010.

84. Kerr, Ashmore, Koetzle. A neutron diffraction study of L-cysteine. Acta Cryst, B31:2022–2026. 1975.

85. Khandogin, Brooks 3rd. Constant pH molecular dynamics with proton tautomerism. Biophys J, 89:141–157. 2005.

86. Khare, Alexander, Antosiewicz, Bryan, Gilson, Orban. pKa measurements from nuclear magnetic resonance for the B1 and B2 immunoglobulin G-binding domains of protein G: comparison with calculated values for nuclear magnetic resonance and X-ray structures. Biochemistry, 36:3580–3589. 1997.

87. Kolm, Sroga, Mannervik. Participation of the phenolic hydroxyl group of Tyr-8 in the catalytic mechanism of human glutathione transferase P1-1. Biochem J, 285(Pt 2):537–540. 1992.

88. Kongsted, Ryde, Wydra, Jensen. Prediction and rationalization of the pH dependence of the activity and stability of family 11 xylanases. Biochemistry, 46:13581–13592. 2007.

89. Kortemme, Creighton. Ionisation of cysteine residues at the termini of model alpha-helical peptides. Relevance to unusual thiol pKa values in proteins of the thioredoxin family. J Mol Biol, 253:799–812. 1995.

90. Koumanov, Karshikoff, Friis, Borchert. Conformational averaging in pK calculations: improvement and limitations in prediction of ionization properties of proteins. J Phys Chem B, 105:9339–9344. 2001.

91. Koumanov, Ruterjans, Karshikoff. Continuum electrostatic analysis of irregular ionization and proton allocation in proteins. Proteins, 46:85–96. 2002.

92. Krause, Holmgren. Substitution of the conserved tryptophan 31 in Escherichia coli thioredoxin by site-directed mutagenesis and structure-function analysis. J Biol Chem, 266:4056–4066. 1991.

93. Krautwurst, Berti, Encinas, Frey. Reaction of wild-type, C365S, and C458S Saccharomyces cerevisiae phosphoenolpyruvate carboxykinases with fluorescent iodoacetamide derivatives. Arch Biochem Biophys, 327:123–130. 1996.

94. Kroncke, Klotz. Zinc fingers as biologic redox switches? Antioxid Redox Signal, 11:1015–1027. 2009.

95. Lah, Lah, Zegers, Wyns, Messens. Specific potassium binding stabilizes pI258 arsenate reductase from Staphylococcus aureus. J Biol Chem, 278:24673–24679. 2003.

96. Lappi, Lensink, Alanen, Salo, Lobell, Juffer, Ruddock. A conserved arginine plays a role in the catalytic cycle of the protein disulphide isomerases. J Mol Biol, 335:283–295. 2004.

97. Ledwidge, Hong, Dotsch, Miller. NmerA of Tn501 mercuric ion reductase: structural modulation of the pKa values of the metal binding cysteine thiols. Biochemistry, 49:8988–8998. 2010.

98. Lee, Crippen. Predicting pKa. J Chem Inf Model, 49:2013–2033. 2009.

99. Lewin, Crow, Hodson, Hederstedt, Le Brun. Effects of substitutions in the CXXC active-site motif of the extracytoplasmic thioredoxin ResA. Biochem J, 414:81–91. 2008.

100. Lewin, Crow, Oubrie, Le Brun. Molecular basis for specificity of the extracytoplasmic thioredoxin ResA. J Biol Chem, 281:35467–35477. 2006.

101. […], Hanson, Fuchs, Woodward, Thomas. Determination of the pKa values of active-center cysteines, cysteines-32 and −35, in Escherichia coli thioredoxin by Raman spectroscopy. Biochemistry, 32:5800–5808. 1993.

102. […], Robertson, Jensen. The determinants of carboxyl pKa values in turkey ovomucoid third domain. Proteins, 55:689–704. 2004.

103. […], Robertson, Jensen. Very fast empirical prediction and rationalization of protein pK_a values. Proteins, 61:704–721. 2005.

104. […], Hu, Zhang, Xu, Lescop, Xia, Jin. Conformational fluctuations coupled to the thiol-disulfide transfer between thioredoxin and arsenate reductase in Bacillus subtilis. J Biol Chem, 282:11078–11083. 2007.

105. Liu, Zhang, Ji, Johnson, Gilliland, Armstrong. Contribution of tyrosine 6 to the catalytic mechanism of isoenzyme 3–3 of glutathione S-transferase. J Biol Chem, 267:4296–4299. 1992.

106. Loumaye, Ferrer-Sueta, Alvarez, Rees, Clippe, Knoops, Radi, Trujillo. Kinetic studies of peroxiredoxin 6 from Arenicola marina: rapid oxidation by hydrogen peroxide and peroxynitrite but lack of reduction by hydrogen sulfide. Arch Biochem Biophys, 514:1–7. 2011.

107. Macedo, Anar, Bronner, Cannella, Squitieri, Bonifati, Hoogeveen, Heutink, Rizzu. The DJ-1L166P mutant protein associated with early onset Parkinson's disease is unstable and forms higher-order protein complexes. Hum Mol Genet, 12:2807–2816. 2003.

108. Marchal, Branlant. Evidence for the chemical activation of essential cys-302 upon cofactor binding to nonphosphorylating glyceraldehyde 3-phosphate dehydrogenase from Streptococcus mutans. Biochemistry, 38:12950–12958. 1999.

109. Marino, Gladyshev. Cysteine function governs its conservation and degeneration and restricts its utilization on protein surfaces. J Mol Biol, 404:902–916. 2010.

110. Marino, Gladyshev. Analysis and functional prediction of reactive cysteine residues. J Biol Chem, 287:4419–4425. 2012.

111. Mason, Jensen. Protein-protein binding is often associated with changes in protonation state. Proteins, 71:81–91. 2008.

112. Mavridou, Stevens, Ferguson, Redfield. Active-site properties of the oxidized and reduced C-terminal domain of DsbD obtained by NMR spectroscopy. J Mol Biol, 370:643–658. 2007.

113. Mavridou, Stevens, Goddard, Willis, Ferguson, Redfield. Control of periplasmic interdomain thiol:disulfide exchange in the transmembrane oxidoreductase DsbD. J Biol Chem, 284:3219–3226. 2009.

114. Messens, Martins, Van Belle, Brosens, Desmyter, De Gieter, Wieruszeski, Willem, Wyns, Zegers. All intermediates of the arsenate reductase mechanism, including an intramolecular dynamic disulfide cascade. Proc Natl Acad Sci U S A, 99:8506–8511. 2002.

115. Messens, Van Molle, Vanhaesebrouck, Limbourg, Van Belle, Wahni, Martins, Loris, Wyns. How thioredoxin can reduce a buried disulphide bond. J Mol Biol, 339:527–537. 2004.

116. Mieyal, Starke, Gravina, Hocevar. Thioltransferase in human red blood cells: kinetics and equilibrium. Biochemistry, 30:8883–8891. 1991.

117. Mossner, Huber-Wunderlich, Glockshuber. Characterization of Escherichia coli thioredoxin variants mimicking the active-sites of other thiol/disulfide oxidoreductases. Protein Sci, 7:1233–1244. 1998.

118. Mossner, Iwai, Glockshuber. Influence of the pK(a) value of the buried, active-site cysteine on the redox properties of thioredoxin-like oxidoreductases. FEBS Lett, 477:21–26. 2000.

119. Moutevelis, Warwicker. Prediction of pKa and redox properties in the thioredoxin superfamily. Protein Sci, 13:2744–2752. 2004.

120. Naor, Jensen. Determinants of cysteine pKa values in creatine kinase and alpha1-antitrypsin. Proteins, 57:799–803. 2004.

121. Nelson, Creighton. Reactivity and ionization of the active site cysteine residues of DsbA, a protein required for disulfide bond formation in vivo. Biochemistry, 33:5974–5983. 1994.

122. Nelson, Day, Zeng, King, Poole. Isotope-coded, iodoacetamide-based reagent to determine individual cysteine pK(a) values by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Anal Biochem, 375:187–195. 2008.

123. Nielsen, Andersen, Honig, Hooft, Klebe, Vriend, Wade. Improving macromolecular electrostatics calculations. Protein Eng, 12:657–662. 1999.

124. Nielsen, Borchert, Vriend. The determinants of alpha-amylase pH-activity profiles. Protein Eng, 14:505–512. 2001.

125. Nielsen, McCammon. Calculating pKa values in enzyme active sites. Protein Sci, 12:1894–1901. 2003.

126. Nielsen, Vriend. Optimizing the hydrogen-bond network in Poisson-Boltzmann equation-based pKa calculations. Proteins, 43:403–412. 2001.

127. Nielsen, Jensen, Hansen, Gotfredsen, Winther. A fluorescent probe which allows highly specific thiol labeling at low pH. Anal Biochem, 421:115–120. 2012.

128. Nilsson, Karshikoff. Multiple pH regime molecular dynamics simulation for pK calculations. PLoS One, 6:e20116. 2011.

129. Nordstrand, Åslund, Meunier, Holmgren, Otting, Berndt. Direct NMR observation of the Cys-14 thiol proton of reduced Escherichia coli Glutaredoxin-3 supports the presence of an active site thiol-thiolate hydrogen bond. FEBS Lett, 449:196–200. 1999.

130. Olsson HMM, Søndergaard, Rostkowski, Jensen. PROPKA3: Consistent treatment of internal and surface residues in empirical pKa predictions. J Chem Theory Comput, 7:525–537. 2011.

131. Polgar. Mercaptide-imidazolium ion-pair: the reactive nucleophile in papain catalysis. FEBS Lett, 47:15–18. 1974.

132. Porat, Lillig, Johansson, Fernandes, Nilsson, Holmgren, Beckwith. The reducing activity of glutaredoxin 3 toward cytoplasmic substrate proteins is restricted by methionine 43. Biochemistry, 46:3366–3377. 2007.

133. Porter, Hall, Locke, Jensen, Molina. Hydrogen bonding is the prime determinant of carboxyl pKa values at the N-termini of alpha-helices. Proteins, 63:621–635. 2006.

134. Powers, Jensen. Chemically accurate protein structures: Validation of protein NMR structures by comparison of measured and predicted pK_a values. J Biomol NMR, 35:39–51. 2006.

135. Presnell, Cohen. Topological distribution of four-alpha-helix bundles. Proc Natl Acad Sci U S A, 86:6592–6596. 1989.

136. Qin, Clore, Gronenborn. The high-resolution three-dimensional solution structures of the oxidized and reduced states of human thioredoxin. Structure, 2:503–522. 1994.

137. Rablen, Lockman, Jorgensen. Ab initio study of hydrogen-bonded complexes of small organic molecules with water. J Phys Chem A, 102:3782–3797. 1998.

138. Reckenfelderbaumer, Krauth-Siegel. Catalytic properties, thiol pK value, and redox potential of Trypanosoma brucei tryparedoxin. J Biol Chem, 277:17548–17555. 2002.

139. Ren, Stephan, Xu, Zheng, Tang, Harrison, Kurz, Jarrott, Shouldice, Hiniker, Martin, Heras, Bardwell. Properties of the thioredoxin fold superfamily are modulated by a single amino acid residue. J Biol Chem, 284:10150–10159. 2009.

140. Roos, Buts, Van Belle, Brosens, Geerlings, Loris, Wyns, Messens. Interplay between ion binding and catalysis in the thioredoxin-coupled arsenate reductase family. J Mol Biol, 360:826–838. 2006.

141. Roos, Foloppe, Van Laer, Wyns, Nilsson, Geerlings, Messens. How thioredoxin dissociates its mixed disulfide. PLoS Comput Biol, 5:e1000461. 2009.

142. Roos, Garcia-Pino, Van Belle, Brosens, Wahni, Vandenbussche, Wyns, Loris, Messens. The conserved active site proline determines the reducing power of Staphylococcus aureus thioredoxin. J Mol Biol, 368:800–811. 2007.

143. Roos, Loverix, Geerlings. Origin of the pK(a) perturbation of N-terminal cysteine in alpha- and 3(10)-helices: a computational DFT study. J Phys Chem B, 110:557–562. 2006.

144. Roos, Messens. Protein sulfenic acid formation: from cellular damage to redox regulation. Free Radic Biol Med, 51:314–326. 2011.

145. Roos, Messens, Loverix, Wyns, Geerlings. A computational and conceptual DFT study on the Michaelis complex of pI258 arsenate reductase. Structural aspects and activation of the electrophile and nucleophile. J Phys Chem B, 108:17216–17225. 2004.

146. Ruddock, Hirst, Freedman. pH-dependence of the dithiol-oxidizing activity of DsbA (a periplasmic protein thiol:disulphide oxidoreductase) and protein disulphide-isomerase: studies with a novel simple peptide substrate. Biochem J, 315(Pt 3):1001–1005. 1996.

147. Rullmann, Bellido, van Duijnen. The active site of papain. All-atom study of interactions with protein matrix and solvent. J Mol Biol, 206:101–118. 1989.

148. Rushmore, Pickett. Glutathione S-transferases, structure, regulation, and therapeutic implications. J Biol Chem, 268:11475–11478. 1993.

149. Sanchez, Riddle, Woo, Momand. Prediction of reversibly oxidized protein cysteine thiols using protein structure properties. Protein Sci, 17:473–481. 2008.

150. Sancho, Serrano, Fersht. Histidine residues at the N- and C-termini of alpha-helices: perturbed pKas and protein stability. Biochemistry, 31:2253–2258. 1992.

151. Schlesinger, Westley. An expanded mechanism for rhodanese catalysis. J Biol Chem, 249:780–788. 1974.

152. Sengupta, Behera, Smith, Ullmann. The alpha helix dipole: screened out? Structure, 13:849–855. 2005.

153. Sham, Chu, Warshel. Consistent calculations of pKa's of ionizable residues in proteins: Semi-microscopic and microscopic approaches. J Phys Chem B, 101:4458–4472. 1997.

154. Sharp. Electrostatic interactions in macromolecules. Curr Opin Struct Biol, 4:234–239. 1994.

155. Shekhter, Metanis, Dawson, Keinan. A residue outside the active site CXXC motif regulates the catalytic efficiency of Glutaredoxin 3. Mol BioSyst, 6:241–248. 2010.

156. Sheridan, Levy, Salemme. alpha-Helix dipole model and electrostatic stabilization of 4-alpha-helical proteins. Proc Natl Acad Sci U S A, 79:4545–4549. 1982.

157. Simonson, Carlsson, Case. Proton binding to proteins: pK(a) calculations with explicit and implicit solvent models. J Am Chem Soc, 126:4167–4180. 2004.

158. Simonson, Perahia. Dielectric properties of proteins from simulations: tools and techniques. Comput Phys Commun, 91:291–303. 1995.

159. Simonson, Perahia. Internal and interfacial dielectric properties of cytochrome c from molecular dynamics in aqueous solution. Proc Natl Acad Sci U S A, 92:1082–1086. 1995.

160.

Søndergaard

, Olsson

HMM

, Rostkowski

, Jensen

. Improved treatment of ligands and coupling effects in empirical calculation and rationalization of pKa values. J Chem Theory Comput, 7:2284–2295. 2011.

161.

Stirnimann

, Rozhkova

, Grauschopf

, Bockmann

, Glockshuber

, Capitani

, Grutter

. High-resolution structures of Escherichia coli cDsbD in different redox states: a combined crystallographic, biochemical and computational study. J Mol Biol, 358:829–845. 2006.

162.

Subramanian

, Ross

. The enthalpy of protolysis of liver alcohol dehydrogenase upon binding nicotinamide adenine dinucleotide. J Biol Chem, 254:7826–7830. 1979.

163. Sun, Wang. The N-terminal sequence (residues 1–65) is essential for dimerization, activities, and peptide binding of Escherichia coli DsbC. J Biol Chem, 275:22743–22749. 2000.

164. Szajewski, Whitesides. Rate constants and equilibrium constants for thiol-disulfide interchange reactions involving oxidized glutathione. J Am Chem Soc, 102:2011–2025. 1980.

165. Tajc, Tolbert, Basavappa, Miller. Direct determination of thiol pKa by isothermal titration microcalorimetry. J Am Chem Soc, 126:10508–10509. 2004.

166. Takahashi, Creighton. On the reactivity and ionization of the active site cysteine residues of Escherichia coli thioredoxin. Biochemistry, 35:8342–8353. 1996.

167. Taylor, Kennard. Crystallographic evidence for the existence of C-H…O, C-H…N, and C-H…Cl hydrogen bonds. J Am Chem Soc, 104:5063–5070. 1982.

168. Thomas, Smith, Thomas, Feldmann. Electronic distributions within protein phenylalanine aromatic rings are reflected by the three-dimensional oxygen atom environments. Proc Natl Acad Sci U S A, 79:4843–4847. 1982.

169. Thurlkill, Grimsley, Scholtz, Pace. pK values of the ionizable groups of proteins. Protein Sci, 15:1214–1218. 2006.

170. Ullmann, Noodleman, Case. Density functional calculation of pK(a) values and redox potentials in the bovine Rieske iron-sulfur protein. J Biol Inorg Chem, 7:632–639. 2002.

171. van Duijnen, Thole, Hol. On the role of the active site helix in papain, an ab initio molecular orbital study. Biophys Chem, 9:273–280. 1979.

172. van Straaten, Missiakas, Raina, Darby. The functional properties of DsbG, a thiol-disulfide oxidoreductase from the periplasm of Escherichia coli. FEBS Lett, 428:255–258. 1998.

173. van Vlijmen, Schaefer, Karplus. Improving the accuracy of protein pKa calculations: conformational averaging versus the average structure. Proteins, 33:145–158. 1998.

174. Vargas, Garza, Dixon, Hay. How strong is the Cα-H…O=C hydrogen bond? J Am Chem Soc, 122:4750–4755. 2000.

175. Villadangos, Van Belle, Wahni, Tamu Dufe, Freitas, Nur, De Galan, Gil, Collet, Mateos, Messens. Corynebacterium glutamicum survives arsenic stress with arsenate reductases coupled to two distinct redox mechanisms. Mol Microbiol, 82:998–1014. 2011.

176. Wada. The alpha-helix as an electric macro-dipole. Adv Biophys, 1–63. 1976.

177. Wang, McLeish, Kneen, Lee, Kenyon. An unusually low pK(a) for Cys282 in the active site of human muscle creatine kinase. Biochemistry, 40:11698–11705. 2001.

178. Wang, Newton, Huskey, McKeever, Pickett, Lu. Site-directed mutagenesis of glutathione S-transferase YaYa. Important roles of tyrosine 9 and aspartic acid 101 in catalysis. J Biol Chem, 267:19866–19871. 1992.

179. Warwicker, Gane. Calculation of Cys 30 ΔpK_a's and oxidising power for DsbA mutants. FEBS Lett, 385:105–108. 1996.

180. Warwicker, Watson. Calculation of the electric potential in the active site cleft due to alpha-helix dipoles. J Mol Biol, 157:671–679. 1982.

181. Waters. Aromatic interactions in model systems. Curr Opin Chem Biol, 6:736–741. 2002.

182. Wennmohs, Staemmler, Schindler. Theoretical investigation of weak hydrogen bonds to sulfur. J Chem Phys, 119:3208–3218. 2003.

183. Winterbourn, Metodiewa. Reactivity of biologically important thiol compounds with superoxide and hydrogen peroxide. Free Radic Biol Med, 27:322–328. 1999.

184. Witt, Lakshminarasimhan, Remington, Hasim, Pozharski, Wilson. Cysteine pKa depression by a protonated glutamic acid in human DJ-1. Biochemistry, 47:7430–7440. 2008.

185. Wlodek, Antosiewicz, McCammon. Prediction of titration properties of structures of a protein derived from molecular dynamics trajectories. Protein Sci, 6:373–382. 1997.

186. Woo, Jeong, Chang, Park, Yang, Rhee. Reduction of cysteine sulfinic acid by sulfiredoxin is specific to 2-cys peroxiredoxins. J Biol Chem, 280:3125–3128. 2005.

187. Xiao, Liu, Ji, Johnson, Chen, Parsons, Stevens, Gilliland, Armstrong. First-sphere and second-sphere electrostatic effects in the active site of a class mu glutathione transferase. Biochemistry, 35:4753–4765. 1996.

188. Yang A-S, Gunner, Sampogna, Sharp, Honig. On the calculation of pK_as in proteins. Proteins, 15:252–265. 1993.

189. Yang, Honig. On the pH dependence of protein stability. J Mol Biol, 231:459–474. 1993.

190. Yang, Wells. Identification and characterization of the functional amino acids at the active center of pig liver thioltransferase by site-directed mutagenesis. J Biol Chem, 266:12759–12765. 1991.

191. , Cho, Fuselier, Li, Beckwith, Rapoport. Crystal structure of an unusual thioredoxin protein with a zinc finger domain. J Biol Chem, 282:34945–34951. 2007.

192. You, Bashford. Conformation and hydrogen ion titration of proteins: a continuum electrostatic model with conformational flexibility. Biophys J, 69:1721–1733. 1995.

193. Zapun, Missiakas, Raina, Creighton. Structural and functional characterization of DsbC, a protein involved in disulfide bond formation in Escherichia coli. Biochemistry, 34:5075–5089. 1995.

194. Zegers, Martins, Willem, Wyns, Messens. Arsenate reductase from S. aureus plasmid pI258 is a phosphatase drafted for redox duty. Nat Struct Biol, 8:843–847. 2001.

195. Zheng, Zhan C-G, Ornstein. Theoretical determination of two structural forms of the active site in cadmium-containing phosphotriesterases. J Phys Chem B, 106:717–722. 2002.

196. Zhou, Tian, Lv, Shang. Geometric characteristics of hydrogen bonds involving sulfur atoms in proteins. Proteins, 76:151–163. 2009.
