Abstract
The PulseNet USA subtyping network recently established a standardized protocol for multiple-locus variable-number tandem repeat analysis (MLVA) to characterize Shiga toxin–producing Escherichia coli O157. To enable data comparisons from different laboratories in the same database, reproducibility and high quality of the data must be ensured. The aim of this study was to test the robustness and reproducibility of the proposed standardized protocol by subjecting it to a multilaboratory validation process and to address any discrepancies that may have arisen from the study. A set of 50 strains was tested in 10 PulseNet participating laboratories that used capillary electrophoresis instruments from two manufacturers. Six out of the 10 laboratories were able to generate correct MLVA types for 46 (92%) or more strains. The discrepancies in MLVA type assignment were caused mainly by difficulties in optimizing polymerase chain reactions that were attributed to technical inexperience of the staff and suboptimal quality of reagents and instrumentation. It was concluded that proper training of staff must be an integral part of technology transfer. The interlaboratory reproducibility of fragment sizing was excellent when the same capillary electrophoresis platform was used. However, sizing discrepancies of up to six base pairs for the same fragment were detected between the two platforms. These discrepancies were attributed to different dye and polymer chemistries employed by the manufacturers. A novel software script was developed to assign alleles based on two platform-specific (Beckman Coulter CEQ™8000 and Applied Biosystems Genetic Analyzer 3130xl) look-up tables containing fragment size ranges for all alleles. The new allele assignment method was validated at the PulseNet central laboratory using a diverse set of 502 Shiga toxin–producing Escherichia coli O157 isolates. The validation confirmed that the script reliably assigned the same allele for the same fragment regardless of the platform used to size the fragment.
Introduction
MLVA-related research on foodborne organisms has mainly focused on Shiga toxin–producing Escherichia coli O157 (STEC O157) (Keys et al., 2005; Lindstedt et al., 2007), Salmonella enterica serotypes Typhimurium (Lindstedt et al., 2003) and Enteritidis (Boxrud et al., 2007; Beranek et al., 2009), and Listeria monocytogenes (Murphy et al., 2007; Sperry et al., 2008). PulseNet USA, the national molecular subtyping network for foodborne disease surveillance, recently established a standardized MLVA protocol for STEC O157 (Hyytia-Trees et al., 2006). This protocol was intended to be used as a complimentary technique to pulsed-field gel electrophoresis (PFGE) which the PulseNet system uses as the “gold standard” method for subtyping foodborne bacterial pathogens (Gerner-Smidt et al., 2006). This MLVA assay shows great promise as a molecular epidemiologic tool particularly when used to further discriminate PFGE clusters of isolates with commonly seen restriction patterns (Hyytia-Trees et al., 2006). PulseNet USA was originally built on the concept of decentralized laboratory testing and centralized data management (Swaminathan et al., 2001) to generate and share subtyping results in more timely manner. Thus far, PulseNet USA has used MLVA in centralized manner, that is, isolates requiring MLVA testing have been analyzed at the PulseNet central laboratory at the Centers for Disease Control and Prevention (CDC, Atlanta, GA). The current format often results in delays in the availability of MLVA results by 1 to 2 weeks. Therefore, implementing MLVA at the local public health laboratories has become a high priority for PulseNet USA. Implementation of MLVA protocols into a system like PulseNet, where data will be generated locally by different laboratories and compared nationally in a reference database, demands that such data be reproducible and of the highest quality possible.
Multilaboratory external validations have been part of PulseNet USA's protocol development strategy since the network was established in 1996 (Swaminathan et al., 2001). The external validation phase is designed to further test the robustness of the PulseNet protocol in laboratories outside the agency that developed it. The idea is to include laboratories with different levels of expertise and resources, and to assess how their preferences, in terms of some of the reagents and equipment, may influence the performance of the protocol being tested.
The aim of this study was to test the robustness and reproducibility of the proposed standardized PulseNet MLVA protocol for STEC O157 by subjecting it to a multilaboratory external validation process and to address any discrepancies that may have arisen from the study. As a follow-up, over 500 STEC O157 strains with a wide range of alleles were tested to fully assess the allelic diversity in each VNTR locus.
Materials and Methods
Multilaboratory validation study
The PulseNet standardized MLVA protocol for STEC O157 was validated in 10 PulseNet participating laboratories. PulseNet USA steering committee selected the participating laboratories based on their research proposals and their overall track record in the PulseNet system. Five laboratories used the Beckman Coulter (Fullerton, CA) CEQ™8000 capillary electrophoresis platform and the other five the Applied Biosystems (Foster City, CA) Genetic Analyzer 3100-Avant™ (2) or 3130xl (3) platforms (Table 1). Only 1 (Laboratory A) of the 10 laboratories that volunteered for the study had considerable experience with capillary electrophoresis and MLVA. All 10 laboratories analyzed the same blinded sample set of 50 STEC O157 strains provided by CDC as live cultures following the standard operating procedure (SOP) for each platform. The validation panel included strains that cover a wide spectrum of fragment sizes for each VNTR. Four strains were included in duplicate to test the intralaboratory reproducibility of the method. Six test strains, including one duplicate pair, had one or more null loci. STEC O157 strain EDL933 (ATCC 43895) was included in the panel as a reference control strain from which the fragments generated for each VNTR locus were predicted based on the published sequence. The VNTR raw data generated by the laboratories were sent to CDC as peak files (comma delimited or text delimited) and were evaluated by the CDC staff for its quality. Accuracy of sizing for each strain was compared to sizing results produced at CDC and reproducibility of sizing was determined by comparing results between laboratories.
Number of isolates with a correct MLVA type or allele type out of the 50 strains tested. Alleles were assigned in the Bionumerics software using the look-up table method for the CEQ 8000 and the Genetic Analyzer 3130xl data, and manually for the 3100-Avant data.
MLVA, multiple-locus variable-number tandem repeat analysis.
Standardized MLVA protocols for STEC O157
The MLVA protocol for the Beckman Coulter CEQ 8000 platform was slightly modified from that published earlier by Hyytia-Trees et al. (2006). The original protocol was a refined version of the protocol published by Keys et al. (2005), and hence employed the same locus nomenclature. The most significant modification was the exclusion of one VNTR locus (O157-10) because it showed too much variability during outbreaks generating data that confounded epidemiological investigations. The eight VNTR loci included in the final protocol were amplified in two multiplex PCRs. Reaction 1 contained primers for loci O157-3, O157-34, O157-9, and O157-25 at final concentrations of 0.6, 0.18, 0.13, and 0.05 μM, respectively. Reaction 2 contained primers for loci O157-17, O157-19, O157-36, and O157-37 at final concentrations of 0.1, 0.02, 0.012, and 0.03 μM, respectively. The PCR primer concentrations recommended in the standard protocol served as a starting point but in most cases each laboratory had to reoptimize the primer concentrations to achieve optimal amplification of all targets. Optimization included increasing primer concentrations for targets that did not amplify well (fluorescence below 5000 units) and decreasing primer concentrations for targets that generated too intense amplicons (fluorescence above 80,000 units). No other parameters specified in the SOP should have been changed. The WellRed dyes D3 and D4 (Beckman Coulter) were replaced with more inexpensive substitutes Quas705 and Quas670, respectively (Biosearch Technologies, Novato, CA). The WellRed D2 dye was used as the label for the forward primers of loci O157-3 and O157-17. All primers were synthesized by the CDC Biotechnology Core Facility (Atlanta, GA). The DNA template preparation, PCR amplification, fragment analysis setup, and data analysis in the BioNumerics software (Applied Maths, Kortrijk, Belgium) were performed as described by Hyytia-Trees et al. (2006).
The MLVA protocol for the Applied Biosystems Genetic Analyzer 3100-Avant and 3130xl platforms was developed by modifying the CEQ 8000–specific MLVA protocol described above as follows: (1) the fluorescent dyes Quas670 and Quas705 and WellRed D2 in the forward primers were substituted with FAM, HEX, and CalRed590 (Biosearch Technologies); (2) the following primer concentrations were used as a starting point in Reaction 1: O157-3, 0.5 μM; O157-34, 0.08 μM; O157-9, 0.06 μM; O157-25, 0.05 μM; and in Reaction 2: O157-17, 0.15 μM; O157-19, 0.015 μM; O157-36, 0.012 μM; O157-37, 0.015 μM; (3) after PCR amplification, the PCR products were diluted 1:20 with distilled water, and a 1 μL aliquot of the dilution was mixed with 8 μL of Hi-Di Formamide (Applied Biosystems) and 1 μL of GeneFlo 625 ROX-labeled DNA size standard (Chimerx, Madison, WI); (4) the PCR products were sized in the Genetic Analyzer 3130xl using the default running conditions for 50 cm capillary array and POP7 polymer; laboratories using the Genetic Analyzer 3100-Avant and POP6 polymer had to increase the default running time of 4000 to 6500 s to be able to record all size standard peaks. As with the laboratories using the Beckman Coulter CEQ 8000 platform, the Applied Biosystems instrument users were allowed to optimize the primer concentrations in the PCR amplifications (target peak heights 1000–6000 fluorescence units).
Development and validation of a strategy to normalize fragment size data in BioNumerics
A total of 502 diverse STEC O157 isolates from the CDC strain collection were tested using both the Beckman Coulter CEQ 8000 and the Applied Biosystems Genetic Analyzer 3130xl at CDC. Fragment size ranges for each allele in both platforms were manually recorded in two fragment size range tables. The allele (copy number) assignment in the CEQ 8000–specific table (Table 2) was based on the approach that was described by Hyytia-Trees et al. (2006), that is, the alleles were assigned in the BioNumerics software by using a mathematical algorithm that deducted the offset in base pairs (flanking sequences outside the VNTR region) from the observed fragment size and divided the remaining fragment size by the repeat unit size. In the Genetic Analyzer 3130xl–specific table (Table 3) adjustments were manually made, so that the same allele was always assigned to the same fragment regardless of the platform used to size it. A novel BioNumerics script was developed by Applied Maths that referred to the platform-specific fragment size range tables (look-up tables, i.e., Tables 2 and 3) for allele assignment instead of using the mathematical algorithm described above. The script and the tables were integrated into the BioNumerics software by saving them in specific subfolders in the database file. The new script and the tables were validated by reanalyzing the data from the 502 isolates used to construct the tables in the same database using the look-up table approach. The datasets for the 50 isolates generated by the eight laboratories that possessed either the CEQ 8000 or the Genetic Analyzer 3130xl were also reanalyzed using the look-up table approach.
VNTR, variable-number tandem repeat.
Results
Interlaboratory reproducibility of generating correct MLVA types
Six out of the 10 laboratories were able to generate correct MLVA types for 46 (92%) or more strains out of the 50 included in the study (Table 1). Only two laboratories (A and H) generated correct MLVA types for all 50 isolates. The main reason for incorrect MLVA types was the failure to be able to amplify all eight VNTRs in the two multiplex PCRs due to poorly optimized primer concentrations. This was particularly notable with laboratories E and I. Some of the laboratories that performed well otherwise (B, D, J) had trouble with the locus O157-36 because it frequently contains null alleles. A total of 5 out of the 50 isolates had a null allele in this locus. False-positive results were associated with problems, such as too high primer concentrations that resulted in either high background (peaks with fluorescence level below 1000 units) or fluorescence overflow between channels (a too strong signal at one channel spills over as an artifact to the adjacent channel) causing nonspecific peaks with fragment sizes that could be correct for locus O157-36. On the other hand, all 10 laboratories correctly identified null alleles (one strain each) for loci O157-3 and O157-37. Less frequent reasons for discrepancies in MLVA type assignment included incorrect fragment sizing due to the DNA size standard not running properly because of suboptimal quality of the capillary array or reagents, and an apparent sample transposition. Six laboratories (A, B, D, F, H, and J) were able to generate the same MLVA pattern for the four duplicate isolate pairs. Again, locus O157-36 caused most difficulties for laboratories in terms of reproducibility, since one of the duplicate isolate pairs had a null allele in this locus. Three laboratories (E, G and I) were not able to reproducibly identify the null allele in this locus.
Interlaboratory reproducibility of fragment sizing
The interlaboratory reproducibility of the fragment sizing for the same strain was within 1.5 base pairs when the exact same platform was used with the exception of the Applied Biosystems Genetic Analyzer 3100-Avant. Size ranges of 1.5–3.0 base pairs were detected for the large fragments of locus O157-9 with the 3100-Avant (Table 4). Even more significant sizing discrepancies were detected between the Beckman Coulter CEQ 8000 and the Applied Biosystems Genetic Analyzer 3100-Avant/3130xl (Tables 4 and 5). There were also sizing discrepancies in all three platforms between the observed fragment sizes and the fragment sizes that were predicted from the genome sequence of the reference strain EDL933 (Table 4). The sizing differences were entirely locus dependent, that is, sizing was not consistently off a certain number of base pairs in all eight loci. In two loci (O157-34 and O157-19) sizing agreed well between the platforms. In some loci (O157-3, O157-25, O157-17) sizing varied as much as three to six base pairs which is a discrepancy large enough to result in a difference in the allele type if the allele type is assigned using the mathematical algorithm designed to deduct the offset region from each amplicon (Table 6). In the loci O157-9 and O157-36 the sizing discrepancies were not linear but were decreasing or increasing with the increasing fragment size (Table 5). The locus-dependent sizing discrepancies could not be explained by an association with any particular fluorescent dye, fragment size, repeat unit size, or the sequence GC content. Even though the sizing agreed better between the Genetic Analyzer versions 3100-Avant and 3130xl, there were notable discrepancies in three out of the eight loci (O157-9, O157-17 and O157-37).
Amplicon size based on the EDL933 genome.
Size range observed in six laboratories (study participants and CDC).
Size range observed in four laboratories (study participants and CDC).
Size range observed in two laboratories (study participants).
Fragment size ranges for each locus given for the smallest, the medium-sized, and the largest allele included in the multilaboratory study sample set.
Fragment size ranges in six laboratories (CDC and study participants).
Fragment size ranges in four laboratories (CDC and study participants).
Fragment size ranges in two laboratories.
Comparison between the mathematical algorithm and the look-up table allele assignment methods.
VNTR locus nomenclature as originally described by Keys et al. (2005).
Validation of the look-up tables
Fragment size data from the CEQ 8000 and the Genetic Analyzer 3130xl for 502 diverse STEC O157 isolates were analyzed in the same BioNumerics database. Allele assignment based on the platform-specific look-up tables was in complete agreement between the two platforms for all isolates. Table 6 displays an example for one isolate on how the sizing discrepancies resulted in a difference in the allele assignment when the mathematical algorithm was used and how reanalyzing the data using the look-up table approach corrected the allele discrepancy. The discrepancies in the allele assignment between the laboratories that participated in the external validation (Table 1) were mainly caused by false-positive or false-negative PCRs, as discussed earlier, not by platform-dependent sizing discrepancies.
Discussion
The most commonly seen problems encountered during external validations of PulseNet protocols, regardless of the method being validated, usually stem from differences in reagents and equipments used by the participating laboratories (Kam et al., 2008). Laboratory-specific conditions, such as the type and calibration status of the thermocycler, the type of Taq polymerase used, and the overall familiarity of the staff in working with small pipetting volumes, can drastically affect the amplification efficiency. Therefore, each laboratory in this study had to reoptimize the primer concentrations so that all targets would amplify with approximately the same efficiency. The reoptimization process was especially challenging for many laboratories, since most of them had no significant prior experience in fragment analysis. As a consequence, poorly optimized PCRs and nonauthorized deviations (use of a different type of Taq polymerase) from the SOP were the main reasons for a failure to produce high-quality data or correct MLVA types. If anything, this study demonstrated clearly that staff training is an essential part of technology transfer. It is crucial that the technical staff understand which parameters can be modified in the process of optimization, and how changing different parameters will affect the outcome of the assay. For example, the use of Taq polymerase with a different level of stringency than what was specified in the SOP (Platinum Taq; Invitrogen, Carlsbad, CA) can either cause nonspecific peaks and/or incorrect sizing (lower stringency) or a failure of the target to amplify (higher stringency). As a consequence of this validation study, some adjustments to the SOP were made. For example, some primer working concentrations were made more dilute to enable larger pipetting volumes for easier pipetting. Additionally, the parameters that are flexible are now explicitly stated. Hawes et al. (2006) reported a multilaboratory study assessing the ability of DNA sequencing core facilities to successfully sequence a set of well-defined templates containing difficult repeats. They, too, found that laboratory-specific variables such as instrument robustness, reagent quality, and technical experience affected the results even when a standardized protocol was used.
In addition to the issues in optimizing the PCRs, the comparability of fragment sizing between laboratories using different capillary electrophoresis platforms turned out to be a major problem. In the recent multilaboratory study that evaluated the reproducibility of a microsatellite-based typing assay for Aspergillus fumigatus, de Valk et al. (2009) also reported significant sizing discrepancies between different platforms. The Beckman Coulter and the Applied Biosystems capillary electrophoresis platforms differ in the dye chemistries, DNA size standards, and the types of polymers used. The Applied Biosystems Genetic Analyzer 3130xl and 3100-Avant employ the same dye chemistry and size standard but different polymers (POP7 vs. POP6). The different dye chemistries and DNA size standards probably accounted for some of the sizing discrepancies seen in this study since different fluorescent labels have differing dye motilities due to different structures (Tu et al., 1998). The size standards also had a differing number of fragments in them although the size range covered was similar. However, a more important source for the discrepancies most likely was the different polymers used by these three platforms. Capillary electrophoresis systems employ interpolation algorithms that can convert mobility data from an unknown peak into size information, based on the size versus mobility data from the standard set (Rosenblum et al., 1997). The electrophoretic mobility of DNA is sequence dependent; therefore, DNA fragments of the same length can have different mobility, and hence can vary in the calculated fragment size. The different polymers are likely to have different compositions in terms of types and concentrations of denaturating agents (the exact composition of each polymer is proprietary information). Consequently, each system will have slightly different denaturating conditions that result in various degrees of secondary and tertiary structure and anomalous migration of DNA (Rosenblum et al., 1997). As a result, in most loci the observed fragment size did not agree with the size predicted from the sequence (Table 4) and fragment sizes from different platforms did not agree with each other. To draw an analogy to more commonly used PFGE technology, it is known that the type of agarose used to generate PFGE patterns will affect run time and band resolution, hence resulting in differences in banding patterns between laboratories using different agaroses (PulseNet USA, unpublished data).
Since the sizing discrepancies between the Beckman Coulter and Applied Biosystems platforms were significant in many loci, a strategy had to be developed to allow direct comparison of data in the same database. In their study, de Valk et al. (2009) proposed accomplishing this by using allelic ladders that would contain reference peaks for each allele. From PulseNet's viewpoint such an approach would be impractical, since constructing an allelic ladder would require considerable effort and the ladder would also have to be made available for all laboratories participating in the network. Larsson et al. (2009) recently suggested that fragment sizing data for each platform should be normalized to be comparable to the fragment size predicted from the actual sequence. They proposed achieving this by sending out a reference set of isolates that covers the most common alleles. Since all alleles for each locus were not covered by the reference set even though the authors sequenced at least one representative of each allele, an assumption was made that a mathematical algorithm can be developed to normalize the data for all alleles. The theoretical benefit of this approach is that VNTR data could be compared even when the same primer sequences are not used for the same locus, since multiple competing partially overlapping protocols are being used around the world (Keys et al., 2005; Hyytia-Trees et al., 2006; Noller et al., 2006; Lindstedt et al., 2007). However, the disadvantage is that because the sizing discrepancies between different platforms and the actual sequence are not always linear, developing an algorithm that accurately accounts for nonlinear discrepancies would be extremely challenging. Additionally, sequencing just one representative of each allele may not give adequate information on the actual sequence length, since partial repeats are prevalent in some loci. Therefore, PulseNet USA's strategy was to normalize size data from one platform so that it was comparable with the other platform, and not to give any consideration to normalizing data from both platforms so that it would be comparable with the actual sequence data. Since PulseNet USA has for the last 4 years collected data using the Beckman Coulter CEQ 8000 platform, it was natural to use these data as the basis of the national database and normalize the Genetic Analyzer 3130xl data to be comparable with the CEQ 8000 data. Because the older version (3100-Avant) of the Genetic Analyzer is being phased out by Applied Biosystems, PulseNet USA decided not to support that platform in the network, particularly since the sizing did not completely agree between the new and old versions, and since the reproducibility of sizing large fragments seemed to be compromised with the 3100-Avant. The platform-specific look-up tables (Tables 2 and 3) that were constructed to automate the analysis of MLVA data in the BioNumerics software are flexible in that they can be easily adjusted when new alleles are discovered. In practice, these tables will only be able to be modified by PulseNet USA's database managers, and only after the identity of the new allele has been confirmed by PulseNet USA's central laboratory at CDC. In future, when platforms are upgraded or replaced, new tables will most likely have to be constructed to maintain backward comparability of the data.
Conclusions
The results from the multilaboratory validation study of the PulseNet STEC O157 MLVA protocol reported here highlighted the typical challenges that must be overcome by laboratory networks that collect and analyze data generated in different laboratories. The challenges with PCR-based protocols are magnified due to the availability of multiple different platforms (thermocyclers, capillary electrophoresis equipment) and reagents of various price ranges that do not always produce data of comparable quality. The fact that many of the laboratories that participated in this study performed well, even when experience in fragment analysis technology was lacking, suggests that the challenges can be overcome by strict adherence to the SOPs, providing training and establishing a comprehensive quality assurance/quality control program. One of the major challenges that PulseNet USA had to overcome was to provide public health laboratories in the United States with the ability to directly compare and analyze MLVA data obtained with the different capillary electrophoresis platforms described herein. This was achieved by developing platform-specific look-up tables that can be used in the BioNumerics software to automatically assign allele types based on the fragment size data.
Footnotes
Acknowledgments
The authors wish to thank the following PulseNet USA participating laboratories: Arkansas Department of Health Public Health Laboratories, Colorado Department of Public Health and Environment, Florida Department of Health, Hawaii Department of Health, Michigan Department of Community Health, New Jersey State Department of Health, New Mexico Department of Health, Orange County Public Health Laboratory, Commonwealth of Virginia Division of Consolidated Laboratories; and the following PulseNet International partners: National Microbiology Laboratory, Canada, and National Institute of Infectious Diseases, Japan.
Disclosure Statement
No competing financial interests exist.
