Abstract
Abstract
A probability-based quantification framework is presented for the calculation of relative peptide and protein abundance in label-free and label-dependent LC-MS proteomics data. The results are accompanied by credible intervals and regulation probabilities. The algorithm takes into account data uncertainties via Poisson statistics modified by a noise contribution that is determined automatically during an initial normalization stage. Protein quantification relies on assignments of component peptides to the acquired data. These assignments are generally of variable reliability and may not be present across all of the experiments comprising an analysis. It is also possible for a peptide to be identified to more than one protein in a given mixture. For these reasons the algorithm accepts a prior probability of peptide assignment for each intensity measurement. The model is constructed in such a way that outliers of any type can be automatically reweighted. Two discrete normalization methods can be employed. The first method is based on a user-defined subset of peptides, while the second method relies on the presence of a dominant background of endogenous peptides for which the concentration is assumed to be unaffected. Normalization is performed using the same computational and statistical procedures employed by the main quantification algorithm. The performance of the algorithm will be illustrated on example data sets, and its utility demonstrated for typical proteomics applications. The quantification algorithm supports relative protein quantification based on precursor and product ion intensities acquired by means of data-dependent methods, originating from all common isotopically-labeled approaches, as well as label-free ion intensity-based data-independent methods.
Introduction
As the complexity of datasets increases with the sophistication of the technology, it becomes correspondingly more important to adopt a coherent statistical methodology. To put it differently, the failings of other methods become more serious as technology advances. In fact, standard Bayesian probability calculus offers the only logically acceptable framework (Cox, 1946). Traditional orthodox statistical tests detract from coherent analysis, typically by producing different results when data constraints are applied in a different order (Jaynes, 2003). Such ad hoc testing procedures act to limit the quality and reliability of such inferences as are drawn.
Notwithstanding such criticisms, traditional statistical methods, such as the Student's t and ANOVA tests, are often used in conjunction with gel-based quantification techniques, and are considered to be established approaches (Gustafsson et al., 2004; Karp et al., 2007). More recently, q values have been introduced and used as an extension to false discovery rate calculations with their value assessed and used in quantitative 2D gel proteomics studies (Karp et al., 2007; Storey and Tibshirani, 2003). Multivariate statistical approaches are less frequently applied. The statistical methods mentioned above assume a normal distribution of the 2D gel data, which often requires normalization and/or transformation of the spot volumes (Gustafsson et al., 2004; Potra and Liu, 2006), and alignment of 2D gel images (Dowsey et al., 2008). Image alignment typically involves the identification of landmarks, warping of the images, and optionally creating a so-called composite master gel. This compensates for differences between gels caused by variations in migration, protein separation, stain artifacts, and stain saturation, which can otherwise complicate gel matching and quantification. More recently, a sample multiplexing technique has been developed for 2D gel analysis, which involves labeling of the sample prior to electrophoretic separation, to overcome limitations due to inter-gel variations (Unlü et al., 1997).
LC-MS-based proteomics quantification schemes also require normalization. However, in LC-MS experiments, a protein is typically represented by a greater number of features. Dependent on the labeling method of choice, a feature can be either a deconvoluted peptide precursor ion and its associated intensity, or product ions and their intensities. With inclusion of stable isotopes in the experimental design, normalization is typically conducted post-quantification. The complete data set is normally re-scaled by using either the mean or average value of all calculated ratio values (Li et al., 2007). Alternatively, summed intensities from all or selected components can be used. Components with ratios that lie outside a user-defined number of standard deviations from unity are typically considered to be significantly regulated (Li et al., 2003). Statistical outliers are removed with either Dixon's, Grubbs', or Rosner's tests (Aggarwal et al., 2005; Wong et al., 2008). With label-free LC-MS data, normalization can be conducted prior to the identification and quantification of the peptides and their originating parent proteins. The common nature of the LC-MS data, both within and across the complete data set, can be used to correct for systematic and/or sporadic changes (Kultima et al., 2009). This relies on the implicit assumption that changes in protein expression occur against a dominant background of proteins that do not change in their expression profile. Normalization can be conducted globally, where all LC-MS features are used simultaneously, or locally, where a subset of the features is used to calculate a normalization factor. Both approaches have been advocated and applied elsewhere (Callister et al., 2006; Tabata et al., 2007). Regardless of the type of normalization applied, time and m/z alignment (i.e., clustering of the detected features), has to be conducted first. A multitude of methods and algorithm types have been described, including correlation optimized warping (Prince and Marcotte, 2006), vectorized peaks (Hastings et al., 2002), (semi-) supervised alignment using non-linear regression methods (Fischer et al., 2006), hidden Markov models (Listgarten et al., 2007), statistical alignment (Wang et al., 2007), or more generic clustering methods (Lange et al., 2007; Mueller et al., 2007; Silva et al., 2005).
The above description gives an indication of the bewildering range of options available for analysis of protein expression data. Where a choice of tests is available, different practitioners may disagree about the selection and application for particular situations. For instance, missing data are sometimes dealt with by adding or imputing fictitious data (Little and Rubin, 2002), while the problem of uncertain assignment of observed features to predicted sequences may result in important data being discarded prematurely. This is an unsatisfactory situation, but it is by no means unique in data analysis (Dar et al., 1994). Fortunately, there exists a unique and consistent framework for reasoning with incomplete information that allows all sources of uncertainty to be combined using the standard rules of probability (Cox, 1946). In straightforward situations, the results often agree exactly with more traditional approaches, but awkward and universal features of large datasets, such as missing and outlying measurements, are accommodated automatically. In this article, we will illustrate that the presented framework can be applied to the quantitative analysis of proteomic data both at the protein and the peptide level. The robustness of the approach will be demonstrated by looking at protein quantification, including missing data, weak identifications, homologous peptides, and outlying measurements, with validated example data.
The transition from the qualitative characterization of complex proteomics samples to large-scale quantitative analysis by means of LC-MS has implications for the design of the overall experiment. In particular, the sample complexity and dynamic range, depth of the qualitative and quantitative proteome coverage desired, the required quantitative accuracy, and LC-MS instrumentation and acquisition settings, all impact the experimental strategy that is adopted. For example, the inherent dynamic range present in the sample compared with the dynamic range of the analytical approach chosen should determine whether sample fractionation at the protein or peptide level is required. The level of sample pre-fractionation performed will influence the selection of either a label-free or a label-dependent (stable isotope labeling) quantitative experiment (Patel et al., 2009; Wang et al., 2008a). The MS acquisition method settings are often not considered in quantitative experiments. However, the incorrect use of acquisition settings, such as the MS1 transmission window, precursor and product search tolerances, and instrument duty cycle settings, will all strongly impact the outcome of any type of proteomics experiment (Geromanos et al., 2009). Instrument characteristics such as ion saturation, either storage- (March and Todd, 2005; Williams and Cooks, 2005) or detection-related (Chemushevich et al., 2001), and how they can affect quantification are often not appreciated. The comprehensive statistical analysis of qualitative search results (Choi and Nesvizhskii, 2008; Fenyö and Beavis, 2003; Keller et al., 2002; Qian et al., 2005; Reiter et al., 2009), and that of quantitative LC-MS data (Listgarten et al., 2007; Smit et al., 2008), are attracting more attention with the realization that search algorithms were not designed with current sample complexity in mind. The interplay of these parameters and how they might be addressed are discussed within the context of the presented probabilistic quantification framework.
Materials and Methods
Sample preparation and tryptic digestion conditions
Cytosolic E. coli proteins differentially spiked with a standard protein mixture
An aliquot of 250 μL of 0.5% aqueous formic acid was added to 100 μg of a cytosolic E. coli tryptic digest standard. Tryptic digest stock solutions containing alcohol dehydrogenase, phosphorylase B, albumin, and enolase (Waters Corporation, Milford, MA), were prepared in 0.1% aqueous formic acid and diluted to a concentration of 25 fmol/μL for solution A, and 25, 12.5, 200, and 50 fmol/μL, respectively, for solution B. Equal volumes of the E. coli digest and the standard proteins mixture were combined to yield a sample concentration of 0.2 μg/μL of E. coli digest, and protein concentration ratios for protein mixture solution A versus protein mixture solution B of 1:1, 2:1, 1:8, and 1:2, respectively.
Yeast mitochondrial proteins
Yeast strain GT197 was grown at 30°C on glycerol medium prior to subsequent purification and fractionation of the mitochondria as previously described (Chacinska et al., 2000). Total protein fractions for Western blot analysis were obtained using a standard NaOH-TCA precipitation method (Nandakumar et al., 2003). The mitochondrial fractions were resolubilized in 50 mM ammonium bicarbonate/0.1% RapiGest solution (Waters Corporation), reduced in the presence of 100 mM dithiothreitol (DTT) (Sigma-Aldrich, St. Louis, MO) at 60°C for 30 min, and alkylated in the dark in the presence of 200 mM iodoacetamide (IAA) (Sigma-Aldrich) at room temperature for 30 min. Proteolytic digestion was initiated by adding sequence-grade modified trypsin (1:50 w/w) (Promega, Madison, WI), and incubated overnight at 37°C. Breakdown of acid-labile RapiGest was achieved in the presence of 4 μL of an aqueous 12 M HCl solution at 37°C for 15 min. The tryptic peptide solutions were centrifuged at 13,000 rpm for 10 min and the supernatant collected. Prior to LC-MS analyses, the tryptic peptide yeast solutions were diluted with aqueous 0.1% formic acid solution (Sigma-Aldrich) to provide adequate on-column loads, typically 0.3 μg. The LC-MS analyses were performed using 2 μL of the final protein digest mixtures.
Streptomyces coelicolor proteins
Streptomyces coelicolor M145 was grown in surface cultures and processed for protein extraction at eight different developmental time points as previously described (Manteca et al., 2006). Protein quantification was performed using a Bradford assay with a bovine serum albumin standard (Sigma-Aldrich). Proteins, 50 μg per lane, were separated by SDS-PAGE using precast PAGEr 4–20% Tris-Glycine gels (Lonza, Rockland, ME), and stained with Coomassie brilliant blue G-250 (Sigma-Aldrich). Each lane was divided into six strips and each gel piece was cut, washed, and shrunk with acetonitrile. Next, the cysteine residues were reduced with DTT and alkylated with IAA, swelled with a 10 ng/μL trypsin (Promega), 50 mM triethylammonium bicarbonate digestion buffer, and incubated overnight at 37°C. The supernatants were post-digestion recovered and peptide extraction from the gel fragments was performed with 25 μL of 5% formic acid for 30 min, after which an equal volume of pure acetonitrile was added and the samples were incubated for an additional 30 min at room temperature. Extracts obtained from each gel strip were pooled and vacuum-dried. The peptides were labeled with iTRAQ 8-plex reagent according to the manufacturer's instructions (Applied Biosystems, Foster City, CA). After labeling for 2 h at room temperature, all peptides from the gel pieces were combined (6 samples in total with 8 iTRAQ tags/developmental time points per sample). The concentration of the organic solvent was reduced using a vacuum concentrator. Peptide desalting (Gobom et al., 1999) was performed using GELoader micropipette tips (Eppendorf, Hamburg, Germany), prepared with C18 Empore extraction disks (3M, St. Paul, MN), and Poros R3 material (Applied Biosystems).
LC-MS configuration
Nanoscale LC separation of tryptic peptides was performed with a nanoACQUITY system (Waters Corporation), equipped with a Symmetry C18 5 μm, 5 mm×300-μm pre-column, and an Atlantis C18 3 μm, 15 cm×75-μm analytical reversed phase column (Waters Corporation). The experiments with the four-protein mixture added to the E. coli protein digest, the mitochondrial yeast proteins, were conducted in trapping configuration mode, and the Streptomyces coelicolor M145 sample was analyzed in direct on-column loading injection mode. Generic reversed-phase gradient conditions were applied to all samples and are summarized in Supplementary Table S1 (see online supplementary material at http://www.liebertonline.com). The four-protein mixture was added to the E. coli protein digest, and the mitochondrial yeast proteins and the yeast cytosolic protein fractions samples were all analyzed in triplicate (i.e., three technical replicates), unless stated otherwise.
The precursor ion masses and associated fragment ion spectra of the non-labeled tryptic peptides and the iTRAQ-labeled peptides were mass measured with a hybrid quadrupole orthogonal acceleration time-of-flight Q-Tof Premier or Synapt MS mass spectrometer (Waters Corporation, Manchester, U.K.), directly coupled to the chromatographic system. The time-of-flight analyzer of the mass spectrometer was externally calibrated, with the data post-acquisition lock mass corrected using the monoisotopic mass of the doubly charged precursor of [Glu1]-fibrinopeptide B. The mass spectrometer acquisition parameters are provided in Supplementary Table S1 (see online supplementary material at http://www.liebertonline.com).
Accurate mass-measured data for the protein standards added to the E. coli protein digest, the mitochondrial yeast protein sample, were collected in a data-independent mode of acquisition by alternating the energy applied to the collision cell between a low-energy and elevated-energy state, as described previously (Bateman et al., 2002; Geromanos et al., 2009). The alternate scanning method combines peptide MS and multiplexed, data-independent peptide fragmentation MS analysis, in a single LC-MS experiment for the quantitative and qualitative characterization of a peptide mixture. The iTRAQ fragmentation data were collected in a data-dependent mode of acquisition. The mass spectrometer acquisition parameters scanning method details of the various experiments are summarized in Supplementary Table S1 (see online supplementary material at http://www.liebertonline.com).
LC-MS and LC-MS/MS data analysis
All continuum LC-MS data were processed using ProteinLynx GlobalSERVER version 2.5.2 (Waters Corporation). The data-independent fragment ion spectra from the four-protein mixture differentially spiked into the E. coli protein sample, the mitochondrial yeast proteins, and the yeast cytosolic proteins samples, were database searched with ProteinLynx GlobalSERVER version 2.5.2 as well. Protein identifications were accepted with more than three fragment ions per peptide, seven fragment ions per protein, and more than two peptides per protein identified. The search criteria used for protein identification included automatic peptide and fragment ion tolerance settings (most commonly 10 and 25 ppm, respectively), 1 allowed missed cleavage, fixed carbamidomethyl-cysteine modification, and variable methionine oxidation. The four-protein mixture added to the E. coli sample was database queried against a Comprehensive Microbial Resource (http://cmr.jcvi.org) E. coli K12 database (1 Nov. 2007, 4403 entries) appended with the four spiked proteins and trypsin. The mitochondrial yeast data were searched against a Saccharomyces Genome Database (http://www.yeastgenome.org) S. cerevisiae database (16 Feb. 2009, 6717 entries). The fragment ion spectra of the iTRAQ-labeled peptides of Streptomyces coelicolor were database searched using MASCOT version 2.2 against an NCBI (http://www.ncbi.nlm.nih.gov) Streptomyces coelicolor taxonomy limited database (22 Jan. 2009, 8537 entries). Protein redundancy was cleared with dbtoolkit (Martens et al., 2005). The peptide mass and fragment ion search tolerances were 10 ppm and 0.025 Da, respectively. Trypsin cleavage was specified with a maximum of 2 missed cleavages. Methyl methane thiosulfonate-cysteine, iTRAQ on lysine, and peptide N-termini, were set as fixed modifications. Methionine oxidation was considered as a variable modification. In all instances, for both the data-dependent and the data-independent searches, a one-time randomized decoy version of the individual databases was appended to the original databases.
Probabilistic Peptide and Protein Quantification Model
The problem of measuring the variation in concentration of a protein or peptide across two or more experimental conditions is considered. Each condition is often represented by several replicate acquisitions. Furthermore, each acquisition is subject to systematic errors, such as injection volume errors, and non-systematic effects, such as counting statistics. Due to the complexity of the samples and low concentration of some components, the data may be incomplete, and it can include interferences, and thus the assignment of data to peptides or clusters can be uncertain.
A consistent way of expressing and combining all such sources of uncertainty is required. Acknowledgment that standard probability calculus affords the only consistent route is called the Bayesian standpoint. Formally, x represents the quantities that are to be determined, augmented by any other parameters required to model the experimental data, and D actual data. Bayes' theorem, which is simply the product law of probability, states
The likelihood function is the quantitative probability that a supposed x would produce the observed data, and it acts to modulate the prior possibilities into the posterior estimate. Meanwhile, the evidence is a number that offers quantitative assessment of whatever judgments and assignments led to the prior, should a user wish to compare alternative choices. The requirement of a prior, while logically necessary, puts the Bayesian standpoint in contrast to traditional approaches, which suppose that the issue can be evaded. For example, the Null Hypothesis Significance Test (NHST) attempts to use the likelihood alone to decide whether or not the data are consistent with some chosen null hypothesis x=x0, to be retained or not as a result of the test. However, first, the NHST test is incoherent. Which data are to be used in the rejection of x0? If only some, the analysis is inadequate because the other data are ignored. If all of a large dataset is used, there is very likely to be some minor unacknowledged and improperly modeled effect that damages the fit, leading to inappropriate rejection. Second, any continuous variable x almost certainly does differ from the hypothetical x0, which, as a single point of the continuum, has measure zero. In other words, it is known at the outset that
Peptide quantification
In peptide quantification it is assumed that model ion arrival rates are proportional to the peptide concentration in the injected sample. As an example, the situation in which data (ion counts) for a peptide are collected for two conditions with three replicate acquisitions for each condition is described. The assignments of the data to the peptide are initially not certain, so each has been assigned a reliability p (probability of being “good”) of 0.8. This assignment reflects the typical reliability of such information, and illustrates the sort of choice that often must be made when setting prior probabilities. There is no claim that such assignments are in any sense “true.” Rather, the choice is deemed reasonable, with substantially different settings being, in the authors' opinion, either implausibly strong or implausibly weak. The data are summarized in Table 1.
All of the data are assigned a reliability of 0.8.
In order to infer the peptide concentration ratio, a prior probability distribution Pr(y) for the peptide concentration y is specified. Here an exponential distribution is chosen
where Λ is a hyper-parameter that may be interpreted as the scale of the intensities that are measured. Also, the probabilities of the different configurations of data which may be relevant to the peptide have to be considered, which are calculated from the reliabilities. In a particular configuration, each datum is either good (ON) or bad (OFF). In this case, there are 24 distinct configurations, whose multiplicities sum to 26=64. The likelihood is
where S is the set of ON/OFF states in the current configuration, and
The joint probability of all data for a single configuration is
In this example, it is straightforward to marginalize over one of the y values to give a probability distribution for the ratio, yB/yA. This is plotted for the distinct configurations in Figure 1 with Λ equaling 113.3.

Probability versus log ratio for the 24 distinct configurations of switch ON/OFF states. Note that the scaling of the probability axes varies from plot to plot. The dominant configuration is 110–111 (log ratio ∼ log 2=0.693), shown in red. Its nearest subordinate configuration with a distinct maximum is 001-111 (log ratio ∼ log 4=1.386), shown in green.
The sum over all the configurations is shown in Figure 2. This is the total joint probability of the data and the log ratio.1 It should be noted that no attempt was made to reject outliers; unfavorable configurations are automatically down-weighted in terms of probability. The dominant configuration is 110-111 (i.e., sA3 is OFF, all other switches are ON), as shown by the peak centered at about log 2=0.693. The subsidiary peak at around log 4=1.386 is mainly due to the 001-111 configuration.

The multiplicity-weighted sum of the configurations shown in Figure 1. The contribution from the dominant 110-111 configuration is shown in red, and that from the 001-111 configuration is shown in green.
Protein quantification
The treatment of proteins is very similar to that described for peptides. The data collected for a protein will generally consist of an incomplete set of measurements of ion counts for several peptides across the available conditions and replicates. A peptide is expected to have similar measured ion counts across technical replicates within a condition. On the other hand, different peptides may be produced with different efficiencies under enzymatic digestion, and they will ionize differently depending on their amino acid composition. The observation of different ion counts for different peptides belonging to the same protein is therefore to be expected, even within a condition. A further complication arises when distinct proteins in a mixture share one or more peptide sequence.
To capture this behavior, the model of the experimental situation is extended to include an extra degree of freedom zj for each peptide j that captures its relative digestion and ionization characteristics. Ignoring any differences in digestion efficiency, the predicted ion count for peptide j in condition i becomes
where Hj is the set of proteins that produce peptide j, and yih is the concentration factor due to protein h in condition i. Rather than attempting to estimate the zj, an exponential prior with a mean of unity is simply assigned to each one, allowing the yih to set the scale of the data through their dependence on Λ. The joint probability of the data D and parameters y, z, S is therefore
Since we are not primarily interested in the zj or the Sijk, we are free to integrate and sum over these “nuisance parameters” to obtain the marginal distribution
where the sum runs over all possible ON/OFF configurations and the integrals over the positive z orthant. In principle, this distribution contains all the information that is required to extract marginal posterior distributions for arbitrary functions of the protein concentration values yip, and in particular, the logarithm of the ratios ymh/ynh, between conditions m and n. Given these posterior distributions it is straightforward to extract means, standard deviations, credible intervals, regulation probabilities, and any other desired statistic.
Although this completes the formal definition of the method, in practice the direct summation over the different configurations of switch states used in the peptide quantification example quickly becomes unwieldy as the number of data to be considered increases. Instead, a Monte Carlo method is used to approximate the integral and direct summation. The use of two such techniques has been investigated for the problem of protein quantification, namely Gibbs sampling (Mackay, 2003), and nested sampling (Skilling, 2006). The suggested implementation of nested sampling uses a Gaussian approximation to the Poisson model likelihood
While having negligible effect on the likelihood factors arising from ion counts other than the trivially small, which do not contribute to the inferences anyway, this approximation facilitates incorporation of non-Poisson uncertainties such as dead-time corrections.
Another practical issue arises due to small but noticeable systematic errors that occur during sample handling, enzymatic digestion of samples, and introduction of the samples into the LC-MS equipment. It is assumed that these effects result in ion counts that are systematically high or low in each replicate. It is beneficial to detect these variations and correct for them as part of the quantification process. This correction may use internal standards that are spiked into each sample at the same concentration or endogenous proteins whose concentrations are believed to be constant. Alternatively, normalization can be performed using the entire set of detected peptides as long as the biological perturbation being studied is not strong enough to affect more than a small fraction of the proteins identified. For similar reasons, random errors can be introduced, which result in variations in ion counts that exceed those expected from counting statistics alone. In order to incorporate these extra sources of uncertainty into the probabilistic analysis, a noise level is determined during the normalization process. This noise level is used to soften the observed data before quantification.
Normalization can be considered as another quantification problem in which each replicate is treated as a separate condition. When normalization is performed using the entire set of peptides, those belonging to proteins that do change will tend to be switched off. During the exploration, a relative strength is accumulated for the standards in each replicate. These strengths are used to correct the remaining data before the quantification step.
Results
Peptide and protein quantification
Three replicates of each of the differentially-spiked E. coli samples were acquired by means of data-independent alternate scanning. The raw data, in this instance stemming from a data-independent, label-free experiment, were processed and searched as described earlier. The database search was performed for each of the six individual experiments, which produces a list of protein identifications with associated peptides. Each peptide was accompanied by a reliability value. As an example, Table 2 shows the peptides identified from glycogen phosphorylase B in each experiment.
The counts reported have been normalized using ADH as an internal standard. Green shading indicates a peptide identification probability of 0.95 or above, yellow denotes 0.5–0.95, and red denotes a probability of less than 0.5 (not applicable in this instance). A blank space corresponds to a missing identification. The nominal phosphorylase B concentration ratio equals 2:1 for mixture 1 versus. mixture 2.
Quantification was performed twice, first using Gibbs sampling as implemented in the quantification software and described here, followed by nested sampling. The results for the digest standards are given in Table 3. No results are provided for alcohol dehydrogenase, as it was used as an internal standard. In the course of the Gibbs Monte Carlo exploration, the average settings of the switches sijk for each peptide in each replicate were also accumulated. These numbers are effectively updates of the reliabilities Pijk after all the data have been taken into account, and they give a measure of how strongly each peptide contributes to the reported protein ratio. All of the reported ratios in Table 3 are within a few percent of their nominal values (0.5, 2.0, and 8.0, respectively). Moreover, these values are included in the reported 95% credible intervals. The Gibbs algorithm and the nested sampling algorithm give very similar results. This result is encouraging, because it is expected that nested sampling will be more powerful under some circumstances, for example in the exploration of multi-modal distributions, or where the evidence might be used to calibrate the noise level (Skilling, 2006).
The nominal ratios are 0.5, 2.0, and 8.0. The credible interval applies to the log ratio.
The Gibbs sampling results for all proteins, including the digest standards, are summarized in Figure 3, which shows the reported ratios and credible intervals for all protein identification probabilities above 50%. The digest standards stand out clearly, and the E. coli proteins have ratios that are consistent with each other, but systematically only slightly higher than 1:1. This may be due to a small sample mixing error. Table 4 shows, as an example, the results for the three phosphorylase B peptides underlined in Table 2. The switch states illustrate how the algorithm deals with outlying and inconsistent measurements in real data. All technical replicates of both samples of the first peptide are consistent with the expected ratio. The second peptide is potentially problematic, in that two peptides of the second replicate are more consistent with a 1:1 ratio than the theoretical 2:1 ratio for phosphorylase B. The third peptide has conspicuous outliers in the third replicate of the first sample and the second replicate of the second sample.

Quantification results for all proteins showing the 95% credible intervals. The colors denote identification probabilities. Nearly all of the E. coli proteins lie just below the line corresponding to a 1:1 ratio, indicating a slight differential E. coli amount between samples.
Green shading indicates a probability of 0.95 or above, yellow denotes 0.5–0.95, and red denotes a probability of less than 0.5.
In order to test simultaneous quantification of proteins sharing peptides, a simulated dataset was created. The relationships between the proteins involved are shown in Figure 4. The three proteins A, B, and C, produce three common peptides (PEPTIDEA, PEPTIDEC, and PEPTIDED). Each pair of proteins shares another two peptides, and each protein also produces two unique peptides. The simulated data relates to a two-condition experiment having three replicates in each condition. Each peptide is assigned a relative ionization factor, and the “true” expression ratios for the proteins between the two conditions are A (1:1), B (1:2), and C (4:1). Six data points have been corrupted intentionally, and three data points have been discarded. The input data including pseudo-random Poisson noise are shown in Table 5, and the results of the quantification analysis of the homologous proteins are given in Table 6. The expected relative abundances were within the inferred 95% credible intervals, consistent with the correct operation of the quantification framework for homologous proteins.

Network diagram for the quantification of homologous proteins.
Peptides PEPTIDEA, PEPTIDEG, and PEPTIDEM are assigned a relative ionization value of 1, PEPTIDEC, PEPTIDEH, and PEPTIDEN a value of 2, PEPTIDED, PEPTIDEI, and PEPTIDEP a value of 3, PEPTIDEE, PEPTIDEK, and PEPTIDEQ a value of 4, and PEPTIDEF, PEPTIDEL, and PEPTIDER a value of 5.
The simulated relative protein abundance levels between conditions A and B for proteins A, B, and C are 1:1, 1:2, and 4:1, respectively.
All of the data are assigned a reliability of 0.9.
The nominal expected ratios are 1, 2, and 0.25, for proteins A, B, and C, respectively. The credible interval applies to the log ratio.
A difficulty arises when one or more proteins are identified using a subset of the peptides associated with another protein. For example protein A has associated peptides a, b, and c, protein B has associated peptides a, b, and c, and protein C is associated with peptides a and b. More generally, any unique peptides may be weak or unreliable. In this case, it is impossible to infer precise expression ratios for the proteins in question, but it would be possible to accumulate correlation statistics of the protein strengths during exploration. Proteins with strongly anti-correlated abundances could then be flagged for further investigation. For example, it might be appropriate to rerun quantification accumulating only ratios involving the sum of the strengths of the proteins in question.
Application examples
The first example involves the quantitative data-independent, label-free LC-MS analysis of the mitochondria of [PSI+] and [psi−] Saccharomyces cerevisiae strains (Chacinska et al., 2000). In this instance, no internal standard was added to the samples. As described above, for complex samples it is often possible to measure and correct for systematic errors taking into account slight differences in protein loading amounts without using an internal standard. The assumption is that changes in protein expression occur against a dominant background of proteins that are unaffected by the perturbation being studied. Supplementary Table S2 lists the normalization factors and associated uncertainties for each of the six experiments (see online supplementary material at http://www.liebertonline.com). The results suggest good technical replication, relatively small errors, and an estimated average ratio between the two conditions of interest of approximately 2.5. The latter is either indicative of different initial column loads, or as was the case in this example, significant variation in the studied sample types.
To illustrate the effect of auto-normalization, the monoisotopic doubly-charged precursor mass for tryptic fragment T11 identified for cytochrome c oxidase subunit 2 (Cox2) was extracted for one of the technical replicates of the [PSI+] and [psi−] samples. The ratio between the integrated areas of the chromatographic peaks shown in Supplementary Figure S1 is 0.159 (see online supplementary material at http://www.liebertonline.com). The quantification algorithm reported a ratio of 0.42 for Cox2, considering all technical replicates and identified peptides. Using the information provided in Supplementary Table S2, this reported normalized regulation factor can be converted to an un-normalized value of 0.161, which is in good agreement with the observed value in terms of raw peptide signal intensities (see online supplementary material at http://www.liebertonline.com).
The quantitative protein results were compared with the results obtained by Western blotting. In addition to Cox2, three other proteins were monitored, namely prohibitin-1 (PHB1), PROHIBITIN-2 (PHB2), and malate dehydrogenase (MDH1). A comparison of the results in terms of relative amounts is summarized in Supplementary Table S3 (see online supplementary material at http://www.liebertonline.com). Generally, the relative quantification results obtained by means of label-free LC-MS and Western blotting correlated well. Despite the modest observed relative expression of the Cox2, PHB1, and PHB2, the quantification algorithm gave consistent relative quantification values, supported by the reported upregulation probabilities. For Cox2 and PHB1, an upregulation probability of 0.00 was reported, and that of PHB2 was 0.02, illustrating in all instances a regulation probability greater than 95%. A more detailed description of the proteins identified and quantified in this study is presented elsewhere (Sikora et al., 2009), as well as the application of the quantification algorithm to other quantitative label-free nanoscale LC-MS applications (Chambery et al., 2009a, 2009b; Huang et al., 2007; Wang et al., 2008b; Xu et al., 2008).
The quantification framework described here is not restricted to quantitative label-free applications. Isotopically- and isobarically-labeled mass spectrometric data can also be analyzed under the assumptions that peptide or fragment intensities are provided for each peptide in each replicate. Examples of labeled techniques include iTRAQ, TMT, 18O, ICAT, and SILAC labeling (Gygi et al., 1999; Heller et al., 2003; Ong et al., 2002; Ross et al., 2004; Thompson et al., 2003). To illustrate the latter, iTRAQ relative quantification is demonstrated for the analysis of differences in the proteome of Streptomyces coelicolor during bacterial differentiation. Here, the quantification is based on product ion intensities in a multiplexed fashion. In other words, the samples are pooled and analyzed within a single experiment, whereas the previous label-free examples utilized precursor ion intensities from separate experiments. Figure 5 illustrates the MS/MS spectrum of one of the identified peptides, LLDEGQAGENVGLLLR, from an eight-plex experiment with samples obtained at different developmental stages. Shown inset is the product ion m/z region of interest that is utilized to estimate the ratio of the peptides. For this particular feature, the 113:114 peak-to-peak height ratio can easily be estimated and equals approximately 1.4.

Qualitative and quantitative identification of LLDEGQAGENVGLLLR by means of iTRAQ isotopic labeling. The inset shows the reporter ion region of interest for quantitative analysis.
In this example, the data are reasonably consistent given the observed reporter ion intensities across peptides, as can be expected for iTRAQ data. Considering the 11 non-redundant identified peptides (12 in total) to the protein of interest, elongation factor, the calculated 113:114 reporter ion ratio equals 1.31 (0.27±0.24 for the log ratio), with a probability of upregulation of 1.00. The reported uncertainty is relatively large due to interference (i.e., background and chemical noise), which demonstrates one of the challenges one faces when employing fragment ions in LC-MS/MS-based quantification schemes. Manual inspection of the reporter ion intensities of the non-redundant peptides, excluding statistical outliers, gave a 113:114 ratio of 1.30. This value is plausibly close to the ratio calculated by the algorithm, which addressed the outlier data automatically.
A methodological replicate was labeled with two iTRAQ tags and processed as an internal control as a further test of the quantification algorithm. As expected for a technical replicate, the inferred logarithm of the reporter ion ratios for the majority of the proteins were close to zero, as illustrated by the red dots in Supplementary Figure S2, indicating no variation (see online supplementary material at http://www.liebertonline.com). The 95% credible region excluded a log ratio of zero for less than 6% of quantified proteins. The median protein log ratio was −0.04±0.39 (95% interquartile range). Supplementary Figure S2 also demonstrates the increased quantification variability that might be expected for proteins identified using lower-scoring peptides, and/or a reduced number of peptides, that can be utilized for quantification. In the case of the analysis of the reporter ion ratios by the algorithm for the same proteins between samples of different developmental stages, the calculated variation is in general greater than for the technical replicates, as illustrated by the grey background dots in Supplementary Figure S2 (see online supplementary material at http://www.liebertonline.com). Moreover, the relative protein abundance values in the Streptomyces coelicolor developmental phases analyzed have clear biological meaning, which is presented elsewhere in more detail (Manteca et al., 2010).
Discussion
With the analyses of complex biological samples, a number of aspects have to be acknowledged prior to providing a quantitative result. One of the initial considerations deals with the identification of redundant, non-proteotypic peptides. From a qualitative perspective there is a positive side effect, since the precursor and product ion intensities are cumulative, and the probability of identifying the peptide increases. However, these intensities must be handled carefully in quantitative analyses. High-quality proteotypic identifications can assist in the accurate quantification of isoforms and homologues. A second consideration is the chimeric nature of the data generated from a complex biological matrix. The presence of multiple precursors within the collision cell during an MS/MS acquisition can have a negative effect on both the qualitative and quantitative aspects of the results (Hoopmann et al., 2007; Luethy et al., 2008). These problems are exacerbated in instances in which the data is acquired on an instrument with relatively low resolving power (Cox et al., 2008; Mann and Kelleher, 2008). Qualitatively, lower MS/MS resolution allows for productions of similar mass originating from different precursors to overlap, creating a false impression of sensitivity. Additionally, product ions of dissimilar mass, originating for example from overlapping isotopic distributions, will occupy adjacent mass bins, providing a false sense of selectivity. Quantitatively, lower resolving power results in a precursor ion intensity value that is the sum of all of the co-eluting chimeric species. In addition to adding error in label-free quantitative experiments, chimericy/co-fragmentation is also problematic in labeled applications such as iTRAQ (Ross et al., 2004), or TMT labeling (Thompson et al., 2003). Here the presence of multiple precursors within the gas cell results in summed reporter ion intensities, analogous to the issues regarding isoforms and homologues described above. These and other challenges faced with reporter ion intensity–based quantification methods are discussed by others in greater detail (Ow et al., 2009). With the described probabilistic model, and highly selective and accurate data, these effects can be both acknowledged and addressed.
Other issues that have to be considered in quantitative isotope-labeled data-dependent LC-MS/MS approaches are the previously mentioned inherent increase in chimericy, and the decrease in dynamic range, that are evident when multiple biological samples are analyzed together (Bantscheff et al., 2007). In this instance, increased chimericy stems from the increased mass spectral complexity that occurs when the detected labeled/non-labeled peptides (both precursor and fragment ions) co-elute. With increased multiplexing (i.e., an increased number of combined biological samples), chimericy increases and challenges qualitative identification, as well the ensuing quantification. This is especially severe with lower-intensity ions, close to the detection limit of the mass spectrometer. The combined analysis of biological samples also decreases the effective sample dynamic range that can be measured, since the amount of sample that can be loaded onto the chromatographic system is directly proportional to the amount of available stationary phase. With a smaller amount of protein digest loaded per condition, both the sequence coverage and the opportunity for identifying proteotypic, sequence-unique information is reduced. As a direct result, the number of quantifiable peptides falls, increasing uncertainty in the result (Cui et al., 2009; Van et al., 2008). Alternatively, consistency of peptide identifications across biological and technical replicates might be limited (Ting et al., 2009), despite the application of stringent search criteria.
Conclusions
The presented Bayesian framework for the quantification of label-free and label-dependent LC-MS data provides a robust approach for expressing relative protein and peptide amounts. Missing data and uncertainties in assignments and measurements are easily accommodated. Outliers of any type and homologous peptides can be dealt with automatically, making the quantification algorithm suitable for a wide range of quantitative LC-MS applications. With label-free applications, any number of sample groups and replicates can be compared. In the instance of label-dependent applications, the number of comparisons is only limited by the number of possible labels.
For a well-characterized four-protein mixture in the presence of a complex biological background, the quantitative label-free LC-MS results were found to be consistent with the nominal changes in concentration, both within 2% and the 95% credible interval. The Gibbs algorithm and the newer nested sampling algorithm gave very similar results. This result is encouraging, because it is expected that nested sampling will be more powerful in some circumstances, for example, in the exploration of multi-modal distributions. Quantification of three homologous proteins was tested using a simulated dataset which included intentionally corrupted intensities and missing data. In all cases, the inferred log ratios were consistent with the nominal values.
Further verification of the performance of the quantification algorithm was achieved through comparison with results obtained by independent techniques. In the case of a label-free LC-MS application example, the obtained quantification results were consistent with those obtained using a bioanalytical assay. For a labeled application, iTRAQ in this particular example, the performance of the algorithm was assessed through the analysis of technical replicates. For over 94% of proteins the inferred log ratio was consistent with zero.
Footnotes
Acknowledgments
We kindly acknowledge Iain Campuzano, Phillip Young, Thérèse McKenna, Joanne B. Connolly, Scott J. Geromanos, Michael J. Nold, Ignatius J. Kass, LeRoy B. Martin, Martha Stapels, Craig A. Dorschel, Guo-Zhong Li, Jeffrey C. Silva, Dan Golick, Marc V. Gorenstein, and Timothy Riley for their valuable contributions throughout the development of this work.
Author Disclosure Statement
The authors declare that no conflicting financial interests exist.
1
Natural logarithms are used throughout. Log ratios make the symmetry between up and down regulation clear since they change sign if the labels on the two samples are exchanged. In addition, probability distributions for log ratios are often more symmetric than for the direct ratio. All error bars for log ratios define a 95% credible interval.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
