Abstract
There is little published data on the performance of biological indicator tests and immunoassays that could be used by first responders to determine if a suspicious powder contains a potential biothreat agent. We evaluated a range of biological indicator tests, including 3 protein tests, 2 ATP tests, 1 DNA test, and 1 FTIR spectroscopy instrument for their ability to screen suspicious powders for Bacillus anthracis (B. anthracis) spores and ricin. We also evaluated 12 immunoassays (mostly lateral flow immunoassays) for their ability to screen for B. anthracis and ricin. We used a cost-effective, statistically based test plan that allows instruments to be evaluated at performance levels ranging from 0.85 to 0.95 lower confidence bound of the probability of detection at confidence levels of 80% to 95%. We also assessed interference with 22 common suspicious powders encountered in the field. The detection reproducibility for the biological indicators was evaluated at 108 B. anthracis spores and 62.5 μg ricin, and the immunoassay detection reproducibility was evaluated at 107 spores/mL (B. anthracis) and 0.1 μg/mL (ricin). Seven out of 12 immunoassays met our most stringent criteria for B. anthracis detection, while 9 out of 12 met our most stringent test criteria for ricin detection. Most of the immunoassays also detected ricin in 3 different crude castor seed preparations. Our testing results varied across products and sample preparations, indicating the importance of reviewing performance data for specific instruments and sample types of interest for the application in order to make informed decisions regarding the selection of biodetection equipment for field use.
The authors evaluated a range of biological indicator tests, including 3 protein tests, 2 ATP tests, 1 DNA test, and 1 FTIR spectroscopy instrument for their ability to screen suspicious powders for Bacillus anthracis spores and ricin.
R
Our test plan was guided by published standard method performance requirements from AOAC, which require detection with an estimated 5% lower confidence limit on the probability of detection of 0.95 or higher.3-7 Our plan was also guided by an ASTM standard specification for handheld point chemical vapor detectors, which requires that detectors achieve a probability of detection of at least 0.85% as specified by an 80% lower confidence bound. 8
Our plan enables an efficient and cost-effective approach to establishing performance levels of various biodetection products. Rather than simply measuring performance in a pass/fail fashion, we characterized the performance of the assays within a performance range between the ASTM published standard (0.85 probability of detection [POD]/80% confidence level [CL]) and the AOAC standards (0.95 POD/95% CL). While our statistical approach differed slightly from the AOAC and ASTM standards, it was comparable in rigor. Our criteria employ a probability of detection as demonstrated by a lower confidence bound of 0.95 (minimum 0.85) of a one-sided 95% (minimum 80%) confidence interval. We fixed the maximum number of test sample replicates at 79. If an assay failed 10 times, performance of that product fell below the lowest acceptable performance criteria (0.85 POD/80% CL) and the testing was halted. The range of lower confidence bound and confidence level values can be found in Table 1. Further details on our statistical approach can be found in a companion article in this journal issue. 9
Products were tested to the range of LCB and CL shown in the table below. The highest criterion was 0.95 LCB/95% CL, while the minimum criterion was 0.85 LCB/80% CL.
Total number of tests = 79.
Our inclusivity testing included a single biosafety level 2 (BSL-2) strain of B. anthracis, the Ames35 strain that lacks the pXO2 virulence plasmid. Because of the specific application we addressed (field screening of suspicious visible powders by first responders to support tactical decision making and public/responder safety actions), near-neighbor exclusivity organisms were not deemed to be of value to test because the most common material encountered that could generate false-positive results are suspicious powders and not near-neighbor organisms. We also did not conduct any exclusivity testing with ricin for the same reasons. We did conduct testing using a limited numbers of replicates (6) with 3 different preparations of crude castor bean mash to simulate the types of real-world samples that might be encountered by first responders. Prior to B. anthracis inclusivity testing, we evaluated the impact of strain type and growth media used to generate B. anthracis test material and found significant impacts on immunoassay sensitivity for these 2 variables (data not shown). This article summarizes our evaluation of 28 different general biological indicator and immunoassay products for the detection of B. anthracis and ricin.
Materials and Methods
Bacillus anthracis
Four B. anthracis strains were obtained from BEI Resources (Manassas, VA): Ames35 (NR-10355), Sterne 34F2 (NR-1400), Weybridge (NR-10350), and UM23 (NR-10351). Each strain was grown in identical conditions, as previously described. 10 Briefly, overnight cultures were grown at 37°C in tryptic soy broth, then diluted 1:100 in 1.6% nutrient broth with CCY salts (a commonly used mixture of salts that are added to bacterial growth media) and grown at the appropriate temperature with shaking at 200 rpm. B. anthracis spores were harvested after 72 hours by centrifugation for 10 min at 10,000 × g at 4°C. Spore pellets were resuspended in sterile water and stored at 4°C for 7 days to enhance vegetative cell lysis. Spores were then washed 3 times in sterile water prior to use. Spore purity was evaluated using phase contrast microscopy, and all preparations were >95% phase bright, indicating a pure preparation. Four types of growth media were tested to assess the impact of media on limit of detection: (1) Nutrient Broth (NB; Becton Dickinson, Franklin Lakes, NJ) with CCY salts (Sigma Aldrich, St. Louis, MO), (2) New Sporulation Medium (NSM; Sigma Aldrich, St. Louis, MO), (3) Blood Agar (Becton Dickinson, Franklin Lakes, NJ), and (4) Brain Heart Infusion (BHI) agar (Teknova Inc, Hollister, CA). Initial testing indicated the Ames35 strain was the most consistently detected at lower concentrations (see below). Therefore, this strain was chosen for all subsequent B. anthracis testing and was stored at 4°C as a stock suspension of 108 spores/mL.
On the day of testing, the stock was vortex mixed for 2 minutes immediately prior to making dilutions. Dilutions were made using manufacturer-supplied buffer or ultrapure water if not supplied. Serially diluted samples were vortex mixed in between every dilution to maintain spore suspension. B. anthracis test sample concentrations were verified each day of testing by serially diluting an aliquot of the standard test solutions before and after testing to ∼103 spores/mL. The aliquot was cultured on agar plates in triplicate, and the resulting colonies were enumerated the next day after overnight growth at 37°C. Limit of detection for the biological indicator tests was determined using the following amounts of B. anthracis spores per test: 108, 107, 106, 105, and 104. These amounts of spores were obtained by using a 50-μL test volume and B. anthracis concentrations ranging from 2 × 109 spores/mL to 2 × 105 spores/mL.
Ricin
Pure ricin toxin was obtained from Vector Laboratories (Burlingame, CA) as a 5-mg/mL stock solution and stored at 4°C. On the day of testing, dilutions were made from the stock using manufacturer-supplied buffer or ultrapure water if buffer was not indicated, and stored on ice prior to and during testing.
In addition to purified ricin, 3 crude ricin preparations were made at PNNL by (1) crushing castor seeds to produce a mash, (2) acetone extraction of oil from mash, and (3) acetone precipitation from seed mash. 11 Following are brief summaries of each of the ricin preparation methods.
• Crushed seed mash: Soak seeds in NaOH, rinse in cold water, then air dry and peel the seed hulls. Mash the peeled seeds with a mortar and pestle.
• Acetone extraction: Prepare mash as above, then cover with acetone in a flask, and stir for 72 hours. Filter through a double layer of filter paper, dry, and scrape the dried material into a vial.
• Acetone precipitation: Prepare 5 grams of castor seeds as for the crushed seed mash, but add 10 mL phosphate-buffered saline when grinding with a pestle. Centrifuge the resulting slurry at 10,000 × g for 20 min at 4°C, which results in 3 layers. Recover the middle (aqueous) layer. Add acetone to approximately 6 times the aqueous volume and freeze overnight. Centrifuge the next day at 10,000 × g for 20 min, decant the acetone, and air dry the precipitate.
Crude ricin preparations were stored at −80°C. On the day of testing, dilutions were made using manufacturer-supplied buffer or ultrapure water. To ensure stability, all samples were stored on ice prior to and during testing.
Suspicious Powders
Twenty-two powders were selected that are often encountered by first responders during suspected bioterror events and used to assess the assays (Table 2). Many of these powders are listed in the Environmental Factors Panel in AOAC standards.4,7 The powders were purchased from Amazon, Target, and a local garden store. They were categorized into 4 groups based on composition: organic biological, organic protein containing, organic no protein, and inorganic (Table 2). Powders were tested at a concentration of 1 mg/mL for biological indicator tests. The PROFILE® 1 ATP and immunoassays were tested at 0.1 mg/mL.
Commonly encountered suspicious powders analyzed using biological indicator tests and immunoassays
General Biological Indicator Tests
Brief descriptions of the 7 biological indicator tests that we evaluated as part of this study are given below. We evaluated 3 protein test kits: (1) BioCheck™ Powder Screening (20/20 Gene Systems™, Rockville MD), (2) INDIPRO strips (Macherey-Nagel, Bethlehem, PA), and (3) TASKit Bioscreener™ (Field Forensics, St. Petersburg, FL). We evaluated 2 ATP tests: (1) Clean-Trace™ Surface ATP (3M™, St. Paul, MN) and (2) PROFILE® 1 ATP (New Horizons Diagnostics, Baltimore, MD). We evaluated 1 DNA test, the Prime Alert® system (GenPrime™, Spokane, WA). We also evaluated 1 FTIR system, the HazMatID™ 360 FTIR system (Smiths Detection, Edgewood, MD).
Immunoassay Tests
We tested both single-plex assays (1 target biothreat is detected per sample) and multiplex assays (2 or more biothreat agents are detected simultaneously per sample). Immunoassays require a liquid sample, so B. anthracis and powders were solubilized or suspended prior to testing. Some manufacturers offered optical readers, and, when available, we also used the optical readers (in addition to manually reading the assay results); we report the readout option used in the results tables. The immunoassays tested along with the volume of liquid applied are listed below:
• AdVnt (Phoenix, AZ) ○ BADD™: single-plex assays for B. anthracis and ricin (200 μL/test) ○ Pro Strips™: 5-plex assay includes ricin and B. anthracis (600 μL/test) • Alexeter (Wheeling, IN) ○ BioDetect™: single-plex assays for B. anthracis and ricin + optional reader (200 μL/test) ○ RAID™ 5: 5-plex assay includes ricin and B. anthracis (400 μL/test) ○ RAID™ 8: 8-plex assay includes ricin and B. anthracis (800 μL/test) • ANP Technologies® (Newark, DE) ○ NIDS® 3-Plex 3: 3-plex assay includes B. anthracis + optional reader (120 μL/test) ○ NIDS® 4-Plex 5: 4-plex assay includes ricin + optional reader (120 μL/test) • Tetracore® (Rockville, MD) ○ BioThreat Alert®: single-plex assays for B. anthracis and ricin + optional reader (150 μL/test) • Pathsensors (Baltimore, MD) ○ CANARY® Zephyr: single-plex assay for B. anthracis + required measurement platform; no manual read option (200 μL/test) • Environics (Tolland, CT) ○ ENVI: B. anthracis and ricin single-plex assays + optional reader (120 μL/test) • BBI Detection (Madison, WI) ○ IMASS™: 8-plex assay includes ricin and B. anthracis (2.5 mL/test) • Response Biomedical (Vancouver, BC, Canada) ○ RAMP®: single-plex assays for B. anthracis and ricin + required reader (no manual read option) (70 μL/test) • New Horizons Diagnostics (Baltimore, MD) ○ SMART™-II: single-plex assays for B. anthracis and ricin (100 μL/test) • GenPrime™ (Spokane, WA) ○ Toxin Screen: 3-plex assay includes ricin (250 μL/test)
All of the above technologies are based on visible dyes using a lateral flow assay format, except the CANARY® Zephyr and the RAMP®. The CANARY uses an engineered B-cell format and requires a suitcase-sized platform, which contains a laptop, small centrifuge, and luminescence optical reader. The RAMP® uses fluorescence detection, rather than visible dyes, so a fluorescence optical reader is required.
Results and Discussion
General Biological Indicator Tests
General biological tests detect a broad range of biological and organic materials, but do not confirm the presence of a specific biothreat agent. In general, biological indicator tests have low specificity (ie, potential for a false-positive result), and many have low sensitivity (ie, potential for a false-negative result). Therefore, they are best suited to be used as a screening tool when sufficient material is present and in conjunction with more specific tests. Indicator tests are often designed to measure a solid sample directly, though most can also measure a liquid sample.
Protein tests are based on color-changing indicator dyes and are read by eye. These tests detect any type of protein or amino acid, which are present in all cells (eg, anthrax, human cells, and nonpathogenic bacteria) and also are present in some common consumer products (eg, coffee creamer and powdered infant formula). Some kits include separate tests for pH or starch to further help identify the sample type.
Adenosine triphosphate (ATP) tests can determine if any type of cellular material is present and alive, since ATP is a cellular metabolite present in all living cells. The amount of ATP in a sample is typically proportional to the number of living cells. Because ricin is a protein, it is not detected by this method unless residual live cells from the castor plant (Ricinus communis) associated with ricin are present. These tests require an optical reader.
Deoxyribonucleic acid (DNA) tests detect any type of DNA (eg, human, plant, or animal) and some types of RNA using an optical reader that measures the fluorescence of a dye that binds only to nucleic acids.
Fourier transform infrared (FTIR) spectroscopy is primarily used to rapidly identify the chemical composition of an unknown sample. Proteins (contained in most biological materials) give a unique FTIR spectrum that should be detected in samples if they have a protein content of at least 10%. Identification of a sample is based on comparison of the sample's composite spectrum to a library of known individual component spectra.
Immunoassays
Immunoassays use antibodies, which are proteins designed to bind to a specific agent such as anthrax or ricin. Most immunoassays use a lateral flow assay format similar to a home pregnancy test. A lateral flow assay includes an assay strip containing all the assay components encased in a plastic cartridge. The cartridge has a sample window where the sample is applied to the assay strip and a results window where the results are read manually (with your eye) or using an electronic optical reader. Depending on the specificity of the immunoassay, false-positives can occur if closely related agents are present (like nonpathogenic relatives of B. anthracis).
Limit of Detection Estimation
The limit of detection can be determined in various ways and has a wide range of formal definitions;12,13 here we simply are using the term to indicate the lowest concentration that consistently produces a positive result (in 3 out of 3 replicates). We performed testing to estimate the limit of detection for general biological indicators and immunoassays with B. anthracis and ricin. This testing was not intended to precisely determine the limit of detection in a statistically rigorous fashion, but rather to provide an estimate to guide selection of a concentration level for inclusivity testing (which should be done at a concentration above the limit of detection). Five concentrations over 5 orders of magnitude were tested in triplicate. If 3 of 3 results were positive at the lowest concentration, successively lower (10-fold) concentrations were tested in triplicate. Because of the 10-fold difference in test sample concentrations, it is likely that the sensitivity of the assay actually lies between the lowest concentration that consistently generated positive results and the next lowest concentration tested.
Impact of Spore Strain Type and Growth Conditions on Immunoassay Limit of Detection
Because initial limit of detection results were several orders of magnitude higher for multiple products than manufacturer claims, we examined several variables that may have affected our results. These variables included strain type and media conditions.
B. anthracis Strain Testing
Four B. anthracis strains were obtained from BEI Resources: Ames35 (NR-10355), Sterne 34F2 (NR-1400), Weybridge (NR-10350), and UM23 (NR-10351). Each strain was grown in identical conditions (in New Sporulation Medium), washed 5 times, and stored wet. We consulted ASTM E2800-11 for guidance on B. anthracis spore characterization and storage. 14 One immunoassay product manufacturer communicated that their testing indicated that the limit of detection was affected by dry (higher LOD) versus wet B. anthracis storage, and the time after initial suspension of B. anthracis (ie, longer wet storage times) also resulted in higher limit of detection (data not shown); therefore, it is possible that storage condition and storage time will affect the estimated limit of detection.
Testing was conducted with 3 of the BEI strains (UM23 did not sporulate and was not used). Three different manufacturer immunoassays were used that had been compared in a previously published study. 15 Although this prior study used only 2 to 6 replicates for each test concentration and did not identify the B. anthracis growth medium or whether spore washing was performed, it did provide some independent comparative test data for the immunoassay spore limit of detection. Our limit of detection testing was done in triplicate starting at 107 spores/mL. If 107 spores/mL was negative, only then was 108 spores/mL tested. Successively lower concentrations (10-fold dilutions) were then tested until negative results were obtained on all 3 replicates. The diluted spore samples were also plated and cultured in triplicate to verify the concentration of spores that were actually applied to the test strips. Phase contrast microscopy was performed on the spore preparations to verify purity and that minimal spore clumping was occurring. A summary of the lowest concentration detected (at least 1 of 3 replicates) for the BADD™, Smart™-II, and BioThreat Alert® immunoassays using the 3 B. anthracis strains tested is given in Table 3. Based on the results (lowest LOD for the greatest number of products) and the ease of obtaining the strain, B. anthracis Ames35 was chosen for all subsequent B. anthracis testing. It should be noted that all of the B. anthracis strains we tested are BSL-2 strains and are not as virulent as BSL-3 strains because they are lacking 1 or both of the virulence plasmids. While it is unlikely a BSL-2 strain would be used in a terror attack, working with BSL-2 strains is far less costly, and these strains are typically used by other researchers when testing immunoassay products. As our initial data show, the sensitivity of the products may be different depending on the B. anthracis strain.
LOD estimates for immunoassay B. anthracis spore detection using 3 different strains of B. anthracis spores, all grown in New Sporulation Medium, using 3 different immunoassays. Ames35 was chosen for all subsequent B. anthracis LOD and inclusivity testing.
Inconsistent results are not uncommon at concentrations near the LOD.
B. anthracis Growth Media Testing
As another quality check and to assess the influence of growth media, we tested the impact of different growth media used to generate the B. anthracis test samples on the immunoassay limits of detection for B. anthracis Ames35. Three different immunoassay products (the same products that were used to test the difference in LOD for B. anthracis strains) were used in triplicate for B. anthracis. 15 Test sample concentrations ranged from 105 to 109 spores/mL.
While there was significant dependence of B. anthracis limit of detection on the type of growth media (up to 3 orders of magnitude), the results did not show that one growth medium resulted in lower limits of detection for multiple products. The results summarized below are for instances where 3 of 3 replicates were positive.
• BADD™ detected 107 spores/mL in NSM, and 108 spores/mL in BHI, blood agar, and NB with CCY salts;
• Smart™-II detected 106 spores/mL in NB with CCY salts, 108 spores/mL in blood agar, and was unable to detect 108 spores/mL in BHI or NSM; and
• BioThreat Alert® detected 108 spores/mL in blood agar and NB with CCY salts, and was unable to detect 108 spores/mL in BHI or NSM.
We ultimately chose to use NB with CCY salts based on the good results across platforms with this medium, as well as good results using NB with CCY salts reported in the literature. 10
General Biological Indicator Tests Limit of Detection Estimation
A small, concentrated liquid sample (50 μL) was used for B. anthracis, pure ricin toxin, and powder testing rather than a solid sample to enable the application of a precise amount of test material. Therefore, the amount applied to the test is listed in spores or μg, not a concentration. A 2 × 109 spores/mL stock suspension was used to prepare serially diluted B. anthracis standards. For pure ricin, the stock 5 mg/mL solution was used as received from the manufacturer.
Bacillus anthracis
B. anthracis was tested using the following amounts per test: 108, 107, 106, 105, and 104. These amounts of spores were obtained by using a 50-μL test volume and B. anthracis concentrations ranging from 2 × 109 spores/mL to 2 × 105 spores/mL. The PROFILE® 1 ATP test, which was designed for screening suspicious powders and includes an incubation step to germinate spores and increase their number, was the most sensitive of the biological indicator tests, with an estimated limit of detection of 105 spores, although this test required multiple manual manipulation steps, making it more cumbersome and time-consuming to perform than the other biological indicator or immunoassay tests. The BioCheck™ and INDIPRO protein tests could both detect 108 spores, which was the inclusivity test concentration for biological indicator tests. In addition to the commercially available BioCheck™ test kit, a prototype variant of the BioCheck™ test kit that included a bead-beating step to disrupt the spores was determined to have an estimated limit of detection of 107 spores. The TASKit™ BioScreener™ protein test was able to detect 109 spores. The Clean-Trace™ ATP test, which was not designed for this application but rather for verifying the cleanliness of food contact surfaces without any incubation step, was unable to detect 109 spores. The Prime Alert® DNA test also could not detect 109 spores. Our compiled estimated limit of detection test data for B. anthracis is shown in Table 4, along with manufacturer limit of detection claims (where available) and limit of detection data from a peer-reviewed literature publication. 16 Poore et al found similar limits of detection as we did for the biological indicator tests that were assessed and also showed that increased spore washing can raise the limit of detection for protein and DNA tests by more than an order of magnitude. 16
LOD estimates for B. anthracis spore and ricin detection using general biological indicator tests
A kit that included a bead-beating step to disrupt spores was also tested and achieved an estimated LOD of 107 spores (not currently commercially available).
100 μg is ∼108 spores.
NP = Not performed.
N/A = Not applicable.
Ricin
Pure ricin toxin was tested at the following concentrations: 250 μg, 62.5 μg, 15.6 μg, 3.9 μg, and 0.98 μg. Only the protein tests were evaluated with pure ricin toxin, since no DNA or ATP is expected to be present in pure ricin. The BioCheck™, INDIPRO, and TASKit™ BioScreener™ protein tests all had an estimated limit of detection of 62.5 μg of ricin toxin, which was the inclusivity test concentration that we used for biological indicator tests. The ricin limit of detection test data are also shown in Table 4.
Immunoassay LOD Estimation
The test sample volumes applied to each product varied from 70 μL/test to 2.5 mL/test, depending on manufacturer instructions. (Volumes are listed in the Materials and Methods section.) A B. anthracis stock concentration of 108 spores/mL was used to prepare 10-fold serially diluted standards. For pure ricin, the stock 5 mg/mL solution was used as received from the manufacturer. Some immunoassays had optional optical readers, in which case both visual (manually read by eye) and optical reader results were reported. The RAMP® and CANARY® Zephyr both required optical readers and could not be read manually. Based on our studies showing up to 3 orders of magnitude impact to limit of detection depending on the type of B. anthracis growth medium and strain, it is not surprising that limit of detection values reported by manufacturers and in the literature vary widely. However, our limit of detection results are in much closer agreement with published literature values compared to manufacturer-reported specifications. Manufacturer-reported limit of detection values were typically 2 to 3 orders of magnitude lower than we found for B. anthracis and up to 2 orders of magnitude lower for ricin. The compiled test data, manufacturer limit of detection claims, and other published data are shown in Tables 5A and 5B.
LOD estimates for B. anthracis spore and ricin detection using immunoassays
2/3 positive at 107 spores/mL with reader.
2/3 positive at 0.1 μg/mL; 1/3 positive at 0.4 μg/mL (inconsistent results are not uncommon at concentrations near the LOD).
Value shown in table has been modified from published value (# spores) to reflect actual concentration tested.
N/A = not applicable.
NR = not reported by manufacturer.
R = reader; M = manual.
LOD estimates for B. anthracis spore and ricin detection using immunoassays
While LOD was estimated to be 106 spores/mL during growth media testing, during B. anthracis strain testing and later actual LOD testing the LOD was found to be 107 spores/mL.
2/3 positive at 0.4 μg/mL.
Value shown in table has been modified from published value (# spores) to reflect actual concentration tested.
N/A = not applicable.
NR = not reported by manufacturer.
R = reader; M = manual.
Bacillus anthracis
Immunoassay test sample concentrations ranged from 108 to 104 spores/mL. If an assay result was still positive at 104 spores/mL, successively lower concentrations were tested in triplicate (ie, 103, 102 spores/mL). If an assay failed to give 3 positive results at 108 spores, then the limit of detection was reported as >108 spores. The CANARY® Zephyr achieved an estimated limit of detection of 103 spores/mL, which was 4 orders of magnitude lower than any other immunoassay tested. Of the other 11 immunoassay products, 6 had limit of detection estimates of
Ricin
Each of the immunoassays was also tested with pure ricin toxin at concentrations ranging from 1,600 to 6.25 ng/mL using 4-fold serial dilutions. If a test result was still positive at 6.25 ng/mL, successively lower concentrations were tested in triplicate. The CANARY® Zephyr achieved the lowest limit of detection (3 ng/mL). Six products had estimated limits of detection of 25 ng/mL, while 4 of the 13 ricin immunoassays had estimated limits of detection >0.1 μg/mL, which was the inclusivity test concentration for immunoassays.
Inclusivity Testing
Immunoassays and biological indicators were tested with B. anthracis, pure ricin toxin, and common powders using the PNNL-developed test plan to determine if a given assay met the lower confidence bound of the probability of detection of 0.95 at the test concentration with 95% confidence, or fell within the range of performance of 0.95 LCB/95% CL to 0.85 LCB/80% CL. To achieve this, 47 samples had to be tested with no failures, or up to 79 samples were tested if between 1 and 9 failures occurred during the course of 79 tests. If there were 10 failures during any point in the testing, testing was halted for a given product. Because we tested 22 different suspicious powders, we chose to test each powder in triplicate (66 samples) at a minimum. If there were any failures (up to 10 failures allowed before testing was stopped), 79 samples were tested. In addition, we conducted limited replicate testing of 3 different crude ricin preparation test samples, which allowed demonstration of a 0.85 LCB/80% CL for that product if no failures resulted.
The exception to this testing plan was the general biological indicator testing of powders. We tested each of the 22 powders in triplicate with each of the indicator tests for a total of 66 tests per product to characterize the biological indicator test performance (ie, to understand which common powders generated positive vs negative results), rather than evaluating the products using a more statistically rigorous test plan that specifies confidence bounds and levels. As an example, because some of these products test for protein, and some of the powders (eg, milk powder) contain protein, a “positive” result is expected. However, although the kits are performing as intended, the outcome in those cases is essentially a false alarm for first responders.
For inclusivity testing, we chose to use test concentrations that were appropriate for visible powder screening and for the 2 technology “families” (general biological indicator tests and immunoassays). We also considered the sensitivity of the technology based on results of the limit of detection estimate study, manufacturer claims, and published literature. We chose inclusivity test concentrations of 107 spores/mL for B. anthracis and 100 ng/mL for pure ricin toxin for immunoassays. We arrived at these values by considering the following information and possible scenario:
• 1-10 mg is the typical amount of residual powder remaining for field screening assays after the bulk sample is collected for Federal Bureau of Investigation (FBI) or Laboratory Response Network (LRN) use per ASTM E2458-10 (data not shown).
22
• 109 pure spores weigh approximately 1 mg.
23
• Assume that a benign suspicious powder is “spiked” to contain only 1% by weight of a biothreat agent (∼10 μg of biothreat agent in 1 mg of powder). • Assume that only 1 mg of powder (containing 1% biothreat agent) is collected, suspended in 1 mL, and used for field screening. • The resulting sample would have a concentration of 107 spores/mL.
In the case of ricin we make similar assumptions, except a crude ricin mash is spiked into a benign suspicious powder, which would equate to ∼100 ng of active ricin in 1 mg of powder. Crude ricin preparations contain low percentage amounts of active ricin, and the ricin content can vary based on the castor cultivar and preparation method. As a general guide, crushed seed mash contains approximately 1% active ricin, acetone extraction contains approximately 2% active ricin, and acetone precipitation contains approximately 4% active ricin. As a result, crude ricin preparations were tested using much greater amounts than pure ricin. Ten mg/mL crude ricin was used for biological indicator tests, while 0.1 mg/mL was used for immunoassays. Pure ricin was used to estimate the limit of detection and for the inclusivity studies. The scoping studies using crude ricin preparation included a limited number of replicates of each preparation, to obtain general information about the ability of the platforms and tests to detect of the types of ricin samples that could be encountered in the field.
For biological indicators, we tested quantities of material (rather than concentrations) that were determined based on the results of the limit of detection estimate studies. These levels (108 spores [∼100 μg] or 62.5 μg pure ricin toxin) are still reasonable amounts of material to be expected for a visible suspicious powder.
Inclusivity Testing: General Biological Indicator Tests
No spore testing or pure ricin testing was performed with the HazMatID™ 360 FTIR. The 2 ATP tests and the DNA test were not tested with pure ricin because these components (ATP and DNA) are not expected to be present in pure ricin. The FTIR system was tested only with dry suspicious powders and crude ricin preparations, and ∼1 mg of material was used, which was just enough to cover the sampling optics. Summaries of the general biological indicator inclusivity data are shown in Tables 6 and 7.
Inclusivity results for general biological indictor B. anthracis spore and ricin detection. Unshaded cells met the 0.95 LCB/95% CL criteria. Shaded cells did not meet 0.85 LCB/80% CL minimum performance criteria. Data are reported as number of positive results/total tests performed.
N/A = not applicable.
Limited screening of 3 crude ricin preparations using biological indicator tests. Unshaded cells met the minimum 0.85 LCB/80% CL performance criteria. Shaded cells did not meet this criterion. Data are reported as number of positive results/total tests performed.
Approximately 1 mg of material was used for FTIR system testing, which was just enough material to cover the sampling optics.
Bacillus anthracis
A quantity of 108 spores (50 μL of 2 × 109 spores/mL) was used. The BioCheck™ and INDIPRO strip protein tests and the Profile® 1 ATP test achieved a 0.95 LCB/95% CL with no failures for 47 replicates of the B. anthracis test samples. The TASKit™ BioScreener™ protein test, Clean-Trace™ ATP test, and Prime Alert® DNA test all failed to meet the minimum criteria of 0.85 LCB/80% CL for B. anthracis test samples.
Ricin
The pure ricin toxin inclusivity test amount was 62.5 μg. Both the BioCheck™ and the INDIPRO strip protein tests achieved a 0.95 LCB/95% CL with no failures in 47 replicates, while the TASKit™ BioScreener™ protein test failed to meet the minimum criteria of 0.85 LCB/80% CL.
A limited number of 3 different crude ricin preparations were analyzed using 50-μL aliquots of 10 mg/mL suspension. All 3 of the protein tests and the FTIR resulted in positive detections for 6 of 6 samples in each of the 3 crude ricin preparations and met the 0.85 LCB/80% CL criteria. The Clean-Trace™ ATP test produced positive results for 6 of 6 replicates of acetone precipitation and crushed seed mash ricin preparations (achieving the 0.85 LCB/80% CL criteria) but did not meet these criteria for the acetone extraction preparations (only 5 of 6 replicates positive). The Profile® 1 ATP test and the Prime Alert® DNA test did not have any positive detections for any replicates of the 3 different ricin preparations. The results of the ricin testing for the general biological indicators are summarized in Table 7.
Inclusivity Testing: Immunoassays
Bacillus anthracis
Inclusivity testing was performed using a test concentration of 107 spores/mL. Seven of 12 products met the 0.95 LCB/95% CL criteria, with only 1 product (Smart™-II) requiring 79 tests due to 1 assay failure. The other 5 products failed the first 10 replicate assays and, therefore, did not meet the minimum criteria (0.85 LCB/80% CL). Data are shown in Table 8A.
Inclusivity results for B. anthracis spore (A) and ricin (B) detection using immunoassays. Unshaded cells met the 0.95 LCB/95% CL performance criteria. Shaded cells did not meet 0.85 LCB/80% CL minimum performance criteria. Data are reported as number of positive results/total tests performed.
R = reader; M = manual.
Ricin
Inclusivity testing was done using a pure ricin concentration of 0.1 μg/mL. Nine of 12 products met the 0.95 LCB/95% CL criteria and had no failures in the first 47 assays. The other 3 products failed to meet the minimum criteria of 0.85 LCB/80% CL. BADD™ and Toxin Screen failed the first 10 replicate assays, while Smart™-II failed 10 of 35 assays. Data are shown in Table 8B.
We also tested the immunoassays using a limited number of the 3 different crude ricin preparations at a concentration of 0.1 mg/mL. As shown in Table 9, all but 2 of the 12 products tested produced positive results for all 6 replicates of each of the 3 crude ricin preparations. The RAID™ 8 and the Toxin Screen both produced 6 of 6 positive results for the acetone precipitation and acetone extraction samples, but for the crushed seed mash sample, the RAID™ 8 generated 5 of 6 positive results and the Toxin Screen generated 4 of 6 positive results.
Limited screening of 3 preparations of crude ricin using immunoassays. Unshaded cells met the minimum 0.85 LCB/80% CL performance criteria. Shaded cells did not meet this criterion. Data are reported as number of positive results/total tests performed.
R = reader; M = manual
Suspicious Powder Testing
While the assumption of equivalency for powders does not at first consideration seem to be valid, for relatively specific assays like immunoassays, suspicious powders can be considered a class of material that should not generate any false-positives (ie, a type of exclusivity panel). Assuming equivalency in a statistical sense allows the powder samples to be considered as a single group of test samples. Twenty-two powders were initially tested in triplicate to yield a total of 66 tests per immunoassay. If a false-positive result occurred in any of the initial 66 tests, then additional tests were conducted, up to a total of 79 tests per immunoassay, and the additional tests beyond 66 were selected at random. If more than 10 failures (false-positives) occurred, testing of the powders was halted.
Because general biological indicator tests have poor specificity, the powders cannot be grouped together for statistical considerations when testing these products. Nevertheless, triplicate tests were conducted for each of the 22 powders to provide general information about indicator test performance and limitations with these sample types.
Suspicious Powder Testing: General Biological Indicator Tests
The BioCheck™ and TASKit™ protein tests also include another test for pH and starch, respectively, with the goal of providing additional information to guide interpretation of the protein test result. While suspensions of biothreat agents are often a neutral pH, an acidic or basic result does not necessarily exclude the presence of a biothreat. And while a positive starch result is indicative of certain foodstuffs, it does not necessarily mean there is no biothreat agent present.
Most of the indicator tests generated positive results with powders, many of which are to be expected (eg, protein tests generating positive results for milk or flour products, ATP or DNA tests generating positive results for brewer's yeast). However, some powders with no known protein content and inorganic powders also generated positive results. While the number of “false” positive results ranged from 3 of 66 (Prime Alert® DNA test) to 32 of 66 (INDIPRO Protein test), these results cannot be technically considered false-positives from an analytical perspective because many of them are detecting the component they are designed to detect. However, if a product generates a positive result for a suspicious powder it can be considered a false-positive from an operational perspective. The data are summarized in Table 10.
Biological indicator testing of commonly encountered suspicious powders. It is not appropriate to apply performance criteria to indicator powder tests due to their inherently nonspecific nature, but the results are informative to understand limitations when screening commonly encountered suspicious powders. Data are reported as number of positive results/total tests performed.
A = acid; B = base; N = neutral; I = indeterminate.
Suspicious Powder Testing: Immunoassays
No false-positive results were generated for any of the 8 single-plex B. anthracis immunoassay products, which all achieved 0.95 LCB/95% CL (Table 11A). Of the 8 single-plex ricin immunoassay products that were tested, 6 had no false-positives and achieved 0.95 LCB/95% CL. The ENVI ricin immunoassay generated 1 false-positive for instant pectin (only for the optical reader, not for manually reading the same assay), so 13 additional powder samples were tested at random. Because there was only 1 false-positive in 79 samples, the ENVI met the 0.95 LCB/95% CL. The BioDetect™ immunoassay generated false-positive results for 3 of 3 kaolin test sample replicates with both the optical reader and manually reading the result, and additional samples were analyzed for a total of 79. With 3 false-positives out of 79 test samples, the BioDetect™ achieved 0.91 LCB/95% CL per the values listed in Table 1. The single-plex ricin immunoassay data are shown in Table 11B. Of the 4 multiplex immunoassay products, only 1 generated any false-positive results (Table 11C). The IMASS™ produced a false-positive result for 3 of 3 kaolin test sample replicates. With additional sample testing (79 total), the IMASS achieved 0.91 LCB/95% CL.
(A-C). Immunoassay testing of commonly encountered suspicious powders using B. anthracis immunoassays (A), ricin immunoassays (B), and multiplex assays (C). Unshaded cells met the 0.95 LCB/95% CL criteria. Shaded cells achieved a 0.91 LCB/95% CL. Data are reported as number of positive results/total tests performed.
a3/3 kaolin test sample replicates resulted in false-positive results for ricin.
Summary
Our estimated limits of detection for spores and pure ricin toxin for biological indicator products were higher than manufacturer claims. None of the immunoassay product's claimed limits of detection were achieved with our spore testing, although our ricin immunoassay results were comparable to other published studies.19,21 As demonstrated by our preliminary studies on B. anthracis growth media and strain type, the differences between PNNL B. anthracis limit of detection estimates, manufacturer claims, and published data could be due to variations in sample preparation or spore strain type. For products that had optional optical readers, we found that the use of an optical reader does not always improve sensitivity.
B. anthracis spores and ricin were detected with a 0.95 LCB/95% CL by several products, including general biological indicators. Two of the protein tests (BioCheck® and INDIPRO) met these test criteria for B. anthracis and ricin. The PROFILE® 1 ATP test met these criteria for B. anthracis. All other biological indicator test kits fell below 0.85 LCB/80% CL for both B. anthracis and pure ricin. Initial screening of 3 crude ricin preparations showed promise for protein tests and 1 of the ATP tests, but not the DNA test. Additional testing of these sample types is needed.
In general, the immunoassays performed slightly better for ricin detection than for B. anthracis detection. Seven out of 12 immunoassays met our most stringent criteria for B. anthracis detection, while 9 of 12 met our most stringent test criteria for ricin detection. Five out of 12 B. anthracis immunoassays and 3 of 12 ricin immunoassays failed to meet the minimum criteria of 0.85 LCB/80% CL. All of the immunoassays detected all forms of crude ricin in 6 of 6 replicates except the RAID™ 8 (5 of 6) and Toxin Screen (4 of 6). Additional testing would be needed to determine if any of these technologies can achieve 0.95 LCB/95% CL for detection of ricin in crude castor seed preparations.
It is important to note that as more equipment and assays become commercially available, it will be necessary to quickly and cost-effectively compare their performance using an objective testing framework that measures performance relative to a standard. Likewise, as manufacturers continue to modify their existing instruments and assays, these new products will need to be reevaluated to determine if they still meet acceptable performance criteria. Our confidence interval–based testing approach enables efficient instrument performance testing to defined confidence levels, and our testing results allow users to make better informed decisions regarding the purchase and use of biodetection equipment.
Footnotes
Acknowledgments
The Department of Homeland Security, Science and Technology Directorate, provided funding for this research through contracts HSHQDC-08-X00843/6 and HSHQPM-12-X-00169/6 to Pacific Northwest National Laboratory (PNNL). PNNL is operated by Battelle Memorial Institute for the US Department of Energy under contract DE-AC06-76RLO. This evaluation does not indicate an endorsement or adoption of any of the technologies by PNNL or the US Department of Homeland Security. The authors declare no conflicts of interest related to this work.
