Abstract
Hazard analysis critical control point (HACCP) programs have been endorsed and implemented globally to enhance food safety. Our objective was to identify, assess, and summarize or synthesize the published research investigating the effect of HACCP programs on microbial prevalence and concentration on food animal carcasses in abattoirs through primary processing. The results of microbial testing pre- and post-HACCP implementation were reported in only 19 studies, mostly investigating beef (n=13 studies) and pork (n=8 studies) carcasses. In 12 of 13 studies measuring aerobic bacterial counts, reductions were reported on beef (7/8 studies), pork (3/3), poultry (1/1), and sheep (1/1). Significant (p<0.05) reductions in prevalence of Salmonella spp. were reported in studies on pork (2/3 studies) and poultry carcasses (3/3); no significant reductions were reported on beef carcasses (0/8 studies). These trends were confirmed through meta-analysis of these data; however, powerful meta-analysis was precluded because of an overall scarcity of individual studies and significant heterogeneity across studies. Australia reported extensive national data spanning the period from 4 years prior to HACCP implementation to 4 years post-HACCP, indicating reduction in microbial prevalence and concentration on beef carcasses in abattoirs slaughtering beef for export; however, the effect of abattoir changes initiated independent of HACCP could not be excluded. More primary research and access to relevant proprietary data are needed to properly evaluate HACCP program effectiveness using modeling techniques capable of differentiating the effects of HACCP from other concurrent factors.
Introduction
Challenges to implementing HACCP programs in abattoirs include cost, manpower, and training requirements (Fearne et al., 2004; Jevsnik et al., 2006; Lupin et al., 2010). The choice of critical control points (CCPs) is crucial to the effectiveness of the HACCP program, and baseline microbial sampling of carcasses at multiple points in the slaughter process is required to identify optimal CCPs (Lammerding and Todd, 2006). A comparison of national HACCP programs' effectiveness is difficult, given the range of CCPs selected in different settings and the lack of specified microbial performance standards or baseline industry studies conducted prior to HACCP adoption and implementation in many jurisdictions (Fearne et al., 2004).
The widespread implementation of HACCP programs over the past decade has sparked numerous expert and stakeholder debates on the benefits of HACCP in terms of reducing microbial contamination at the processing level. Systematic review methodology follows a transparent, structured, replicable approach to review research on intervention effectiveness and is well suited for providing informed, evidence-based inputs to policy makers (Waddell et al., 2009). If sufficient data are reported in the existing literature, meta-analysis (MA) may be applied to data from multiple similar studies, resulting in more precise quantitative overall estimates of effect. Our objective, therefore, was to identify, evaluate, and summarize any published primary research investigating the effect of HACCP or similar structured programs (please refer to definitions below) on microbial contamination, including but not restricted to bacteria and viruses, on carcasses of food animals, fish, shellfish, or molluscs in abattoirs or processing plants through primary processing, using systematic review methodology and, where appropriate, MA.
Materials and Methods
Definitions and inclusion criteria
A list of definitions used in this review is included in Table 1.
CCP, critical control point; HACCP, hazard analysis critical control point.
Inclusion criteria for this review
• Population: Carcasses of food animals, fish, shellfish, or molluscs in abattoirs or processing plants of any size or location through primary processing.
• Intervention: HACCP or similar structured programs.
• Comparison: Same or similar population unexposed to HACCP.
• Outcomes: Microbial contamination, including but not restricted to bacteria and viruses, as measured by prevalence or concentration on carcasses.
• Study design: Any published primary research using a design employing the use of a comparison group was deemed relevant.
Studies that reported one of either the “before” or “after” measurements and compared with measurements reported elsewhere were also included.
MA inclusion criteria
Datasets consisting of two or more independent studies investigating the effect of HACCP implementation were stratified by microbial outcome and carcass species. Where more than one study sampled similar populations, in similar or overlapping time periods, both studies were included in data extraction; however, only the study reporting the most complete set of data was utilized for MA. For example, four studies reported potentially relevant data before and after HACCP implementation in Australia and all are included in this review. However, for each microbe/carcass species combination, only the study reporting the most complete data required (e.g., number of animals sampled, bacterial counts, and standard deviation) were included in MA.
Review team, search strategy, and screening for relevance
The research team included four epidemiologists and a research librarian, with extensive experience in systematic review methodology and/or food safety. A review protocol was drafted and is presented as Supplementary Material (Supplementary Material is available online at
Reference lists of each relevant primary research article were checked to identify additional potentially relevant work. The five most-published topic experts whose studies were captured by the search were contacted to request any work close to being submitted or in press, and nonresponders were contacted once more. An internet search was also conducted using the Google and Yahoo search engines, employing the same search terms used in the electronic databases, in September 2010. When several national baseline surveys were identified by this search, given the scarcity of relevant studies, the decision was made post hoc to include these paired baseline studies (i.e., pairs of studies available on the internet, with one investigating microbial contamination on carcasses in abattoirs “pre-HACCP” and a complementary study sampling “post-HACCP”) as a source of evidence in addition to the peer-reviewed studies captured.
All levels of screening, from first-level relevance screening to data extraction, were conducted independently by two epidemiologists using tools designed a priori and pretested, with reviewing at each level being initiated only when between-reviewer agreement yielded a kappa statistic of 0.8 or greater. First-level relevance screening (RS1), wherein each abstract was appraised using the RS1 screening tool, served to exclude irrelevant citations. Citations not excluded at RS1 advanced to RS2, where appraisal of the full article using the RS2 tool categorized studies by population sampled, bacteria measured, and type of program implemented as well as confirming relevance.
Methodological assessment and data extraction
All relevant studies were assessed for methodological soundness using the following criteria: whether the sampling scheme was random or systematic (yes/no); whether the sample size was justified (yes/no); whether steps were taken to ensure pre- and post-HACCP sample populations were exchangeable, meaning the exposure status of the two groups could be switched without impacting the study results (Dohoo et al., 2009) (yes/no); and whether indirectness, or lack of comparability, existed between sample and target population, or outcome measure employed versus outcome measure of interest (e.g., sampling calves when the population of interest is finished steers; or measuring meat juice ELISA to demonstrate Salmonella exposure, when the objective is to measure carcass Salmonella contamination), given the review objective and inclusion criteria. These criteria were captured for discussion purposes. However, no studies were excluded from the review based on methodological assessment criteria. Data extracted from all studies advancing from level 2 screening to methodological assessment and data extraction included parameters such as descriptors of the population sampled (e.g., species and age of carcass animals sampled, country in which investigation was conducted), methods of sampling (e.g., excision vs. swab), microbial outcomes measured (e.g., aerobic plate counts; prevalence or concentration of Campylobacter spp., coliforms, Escherichia coli, Enterobacteriaceae, Salmonella spp., Staphylococcus aureus), and estimates of effect (e.g., mean logarithmic differences in counts; odds ratios of prevalence post- and pre-HACCP). The tool used for data extraction, with a complete list of variables extracted, is included in the review protocol as Supplementary Material.
Review management and data analysis
All electronic citations were downloaded and deduplicated in Procite 5.0 (Thomson ResearchSoft, Philadelphia, PA) and uploaded into SRS 4.0 electronic systematic review management system (Systematic Reviews SRS 4.0; Trialstat Corporation, Ottawa, Canada).
Data were entered and cleaned in Microsoft Office Excel 2003 (Microsoft Corporation, Redmond, WA) and descriptively summarized. Random effects MA was performed based on the a priori assumption that heterogeneity existed across studies. Odds ratios of prevalence or logarithmic mean differences in bacterial counts between exposed (post-HACCP) and unexposed (pre-HACCP) groups and their respective 95% confidence intervals were calculated from reported raw data. A pooled estimate of the odds ratio or log mean difference in bacterial counts was calculated using the method of DerSimonian and Laird (1986). Cochran's Q statistic and I 2 (the percentage of total variation among studies due to heterogeneity) were used to evaluate heterogeneity (Higgins et al., 2003). Evidence for the presence of publication bias as indicated by evidence of small study bias in each MA dataset was appraised using visual examination of the funnel plot (a plot of standard error by logarithmic odds ratio, or logarithmic mean difference in bacterial counts, for each study) for asymmetry. Where suspected, its extent was estimated using the trim-and-fill method of Duval and Tweedie (2000). Post hoc, the decision was made that asymmetry tests for publication bias could only be appropriately applied to MA datasets meeting the following criteria: more than 10 studies were included in MA; significant (p<0.1) heterogeneity was not demonstrated across studies and at least one study in the dataset reported a significant result (Ionnidis and Trikalinos, 2007). Descriptive statistical analyses were performed in Stata Intercooled 11 (Stata Corporation, College Station, TX); meta-analyses were performed in Comprehensive Meta-analysis 2 (Biostat, Inc., Englewoood, NJ).
Results
From 752 reviewed abstracts, 53 were identified as potentially relevant at RS1, and 19 articles were confirmed relevant at RS2. Of 19 relevant studies, 15 and 4 studies were obtained from four bibliographic databases and through simple internet searches, respectively.
Four studies reported relevant data collected after HACCP implementation, which were then compared with data obtained pre-HACCP that had been reported elsewhere (Phillips et al., 2001, 2006; Sumner et al., 2003, 2004). No relevant articles published in languages other than English were identified, precluding the need for translation.
The main characteristics of 19 studies included in this review and published from 1996 to 2009 are shown in Tables 2 and 3. These reported investigations were conducted in Canada (n=1), Australia (4), Belgium (1), Greece (1), Ireland (2), Serbia (1), South Africa (1), and the United States (8). Species sampled included beef (13), swine (8), poultry (3), sheep (2), and crawfish (1). Microbes studied included indicator organisms suggestive of general process hygiene such as total aerobic counts (13) and coliform bacteria (2), which are indicators of fecal contamination, as well as potential pathogens: Campylobacter spp. (1), E. coli (13), Listeria spp. (1), Salmonella spp. (14), and coagulase-positive Staphylococcus aureus (3). Thirteen of 19 studies investigated multiple outcomes, and 5 of 19 investigated multiple carcass species.
Calculated post hoc.
Studies accessed similar datasets and are therefore not independent.
Not reported.
Levels of microbial contamination were higher post-HACCP.
Studies accessed similar datasets and are therefore not independent.
Calculated post hoc.
Levels of microbial contamination were higher post-HACCP.
Not reported.
Our search captured investigations of two types of HACCP programs in abattoirs, applying HACCP principles to either regulatory health inspections in the abattoir or slaughter/primary processing itself. The former type of HACCP program was investigated in two studies of HACCP-based health inspection programs for identification of defects such as bruising or inflammatory lesions (Cates et al., 2001; Tamplin et al., 2001). Both reported no significant difference (p>0.05) in prevalence of Salmonella spp. on pork and poultry, respectively, using HACCP-based or non–HACCP-based health inspection systems. The balance of studies captured (17) examined the effects of HACCP (n=15 studies) or similar structured control programs (n=2) on microbial carcass contamination from slaughter through primary processing.
The results of individual studies indicated that HACCP or similar structured control programs were more consistently effective in reducing indicator organisms, for example, aerobic bacterial counts, compared with pathogens such as Salmonella spp. on carcasses (Tables 2 and 3).
MA of three studies investigating aerobic bacterial counts on beef carcasses before and after HACCP implementation (Dormedy, 1999; Phillips et al., 2001; Nastasijevic et al., 2009) yielded a significant (p<0.001) logarithmic mean difference of −0.75 cfu/cm2 (−log 0.94 cfu/cm2, −log 0.55 cfu/cm2) with insignificant heterogeneity (Q=2.3, p=0.31, I 2=14) across the studies (Fig. 1). This dataset, as with all of the MA datasets captured, did not meet our criteria for application of asymmetry tests to investigate the presence of publication bias (Ionnidis and Trikalinos, 2007).

Reduction in logarithmic mean aerobic bacterial counts on beef carcasses before and after hazard analysis critical control point implementation.
MA of three studies investigating E. coli prevalence on beef carcasses before and after HACCP implementation (USDA-FSIS 1998a; Wagude, 1999; Phillips et al., 2001) yielded an insignificant (p=0.13) odds ratio of 0.59 (0.30, 1.2) with significant heterogeneity (Q=10, p=0.006, I 2=80) across the studies (Forest plot not illustrated for brevity).
MA of five studies investigating Salmonella spp. prevalence on beef carcasses before and after HACCP initiation (USDA-FSIS, 1998a; Wagude, 1999; Phillips et al., 2001; Rose et al., 2002; Ghafir et al., 2005) yielded an insignificant (p=0.65) odds ratio of 0.89 (0.53, 1.48), with insignificant heterogeneity (Q=3.4, p=0.50, I 2=0) across studies (Fig. 2).

Odds of Salmonella spp. contamination on beef carcasses before and after hazard analysis critical control point implementation.
MA of three studies investigating Salmonella spp. prevalence on pork carcasses before and after HACCP initiation (USDA-FSIS, 1998b; Rose et al., 2002; Ghafir et al., 2005) yielded an insignificant (p=0.17) odds ratio of 0.78 (0.54, 1.1) and significant heterogeneity (Q=10, p=0.007, I 2=80) across studies (Forest plot not illustrated for brevity).
MA of three studies investigating Salmonella spp. on poultry (USDA-FSIS, 2000; Rose et al., 2002; Ghafir et al., 2005) yielded a significant (p=0.02) odds ratio of 0.39 (0.17, 0.87), with significant heterogeneity (Q=76, p<0.001, I 2=97) across studies (Forest plot not illustrated for brevity).
Problems with study design were frequently noted in the literature captured and are presented online as Supplementary Material. Ten of 19 studies reported use of convenience sampling schemes, and only 5 justified sample size. However, all investigators utilized directness of comparisons as defined by the Cochrane Collaboration (2010), that is, sampling a population comparable to the target population, using outcome measures of direct interest as opposed to a proxy measure. Also, each investigation made some effort to ensure that pre- and post-HACCP sample populations were exchangeable (e.g., sampling in multiple seasons to adjust for seasonal shedding).
Discussion
Overall, a small body of publicly available data on the effect of HACCP on microbial contamination of food animal carcasses were identified; more consistent reductions were observed for indicator bacteria associated with process hygiene (e.g., aerobic bacteria and coliforms) than for pathogens (e.g., Salmonella spp.). The relatively small volume of published literature on this topic is striking, considering the widespread endorsement and initiation of HACCP programs in food animal abattoirs over the past decade. This may be due to the proprietary nature of the data collected by abattoirs in countries with specified pathogen performance standards, such as the United States, or the fact that microbial monitoring is conducted to verify HACCP, not as part of the HACCP program itself, as well as the lack of baseline microbial studies conducted prior to HACCP initiation in many jurisdictions (Phillips et al., 2001; Fearne et al., 2004). Three of the largest studies captured were obtained not from peer-reviewed articles but from the internet search, highlighting the importance of this source of publicly available data.
MA can be a powerful tool to estimate the effect of an intervention (e.g., HACCP implementation). However, robust MA was not possible in this review, given the overall small body of data, as well as the wide range of populations and outcomes studied, and heterogeneity in diagnostic and sampling methods. Using a random effects model, power for MA depends on the total number of subjects and also on the number of studies included; if study numbers are small and variance across studies is substantial, MA power may be low (Borenstein et al., 2009). Although MA for some carcass species–bacterial species combinations (E. coli on beef; Salmonella spp. on beef and pork) yielded insignificant pooled effect estimates, nevertheless, interpretation of their respective wide confidence intervals suggests that, in some settings, HACCP implementation had a protective effect on carcass contamination. Therefore, given the small number of studies available for MA in these datasets, it is likely more accurate to conclude that, given the number of studies captured and their respective sample sizes, there is currently insufficient evidence to judge whether HACCP has a significant effect (Valentine et al., 2010). Similarly, publication bias was difficult to determine in each of the datasets undergoing MA, given the small number of studies captured and the heterogeneity detected across some datasets.
Possible sources of heterogeneity across studies include variable intensity of changes in personnel hygiene and training, improvements in cleaning and disinfection through the use of zoning or designation of high/low traffic areas, air filtration, improved process control, incorporation of antimicrobial interventions, and increased microbial process control monitoring; similarly, HACCP components such as variable choice of CCPs and differences in HAACP prerequisites could also be sources of heterogeneity (I. Jensen and R. Usborne, pers. comm.) (Rose et al., 2002; Delazari et al., 2006).
Additionally, the relationship of each of these factors with HACCP implementation may vary with jurisdiction. For example, Canada's Food Safety Enhancement Program (FSEP) includes seven prerequisite programs covering areas such as personnel hygiene and ventilation, as well as HACCP itself, and also HACCP validation and maintenance, and each part of FSEP has contributed to enhanced food safety (Canadian Food Inspection Agency, 2010) (R. Usborne, pers. comm.).
All studies included in the review were before/after studies in which the outcome of interest (i.e., microbial contamination on a given carcass species) was measured pre- and postimplementation of the intervention (HACCP or similar structured programs). Appropriate analysis and interpretation of such data must take into account measurable factors other than HACCP that might affect the outcome during the same time interval, such as those listed earlier. For example, we performed an informal analysis of Australian data on E. coli prevalence on beef carcasses before the time of national HACCP implementation and for the subsequent 4 years (Sumner et al., 2004). We divided the overall time period into two blocks, pre-HACCP (1994–1998) and post-HACCP (1998–2002), and performed linear regression on the annual prevalence for both time periods. The annual incremental reduction in E. coli prevalence did not change significantly across blocks of time (i.e., pre- vs. post-HACCP). This supports the hypothesis that other factors were also reducing the level of microbial contamination of carcasses and could have been exerting an effect around the time of HACCP implementation (I. Jenson, pers. comm.).
Another factor that makes interpretation of before–after trials difficult is the phenomenon of regression to the mean: the measure of interest may have been unusually elevated immediately before the intervention and randomly returned to a more typical level after the intervention, creating the appearance of a treatment effect, because of chance. Without analysis to identify this phenomenon if present and adjust for its effect, it is possible that some of the change in microbial contamination reported post-HACCP may in fact be due to this phenomenon. Modeling techniques such as Empirical Bayes have been used in other disciplines to interpret before–after trial data, allowing both the separation of effects from concurrent confounders and regression-to-the-mean phenomena from the underlying treatment effect (Casella, 1986; Persaud and Lyon, 2007). One reason modeling has not been more extensively used in interpreting HACCP before–after trial data may be due to a lack of large datasets that can be required for this type of analysis (Persaud and Lyon, 2007). In the absence of modeling, interpreting raw mean differences in microbial prevalence or concentration as resulting solely from the effect of HACCP implementation will inevitably yield an inaccurate estimate of treatment effect. Therefore, the extent to which reported reductions in microbial prevalence or concentration on carcasses can be attributed to HACCP implementation is currently unknown.
Currently, limited data are available to identify the effectiveness of HACCP implementation in reducing the rate of attributable foodborne illness. Notifiable disease registries in both the United States and Australia similarly report a decline in the human incidence of Campylobacter cases, with little substantive change in the incidence of reported cases of Salmonella, since the time of implementation of HACCP in each country (CDC-DHHS, 2010; GWADH, 2010). Canada's 2006 report of national laboratory surveillance data for enteric pathogens reports a decline in the reported cases of Campylobacter and also in Salmonella, but an increase in reported cases of pathogenic E. coli (Public Health Agency of Canada, 2009). These before–after data contain the same challenges in interpretation as outlined earlier for before–after trials and, therefore, should be interpreted with caution. Data such as these, obtained through passive surveillance, also reflect a geographically variable degree of underreporting, which could bias comparisons between time periods of locations (Wheeler et al., 1999; Voetsch et al., 2004). Comparing the degree of concurrence between the commonly identified animal serovars of pathogens and the commonly reported human serovars of the same pathogens, since HACCP implementation, is another method of investigating the effect of HACCP on public health outcomes. Two U.S. studies investigating Salmonella spp. prevalence prior to, and during, the time of HACCP implementation, respectively, in humans and on meat both reported that the most common serotypes on tested meat products were not the most common causes of human illness (Schlosser et al., 2000; Sarwari et al., 2001). Similarly, an Australian study reported little concurrence between prevalence of Salmonella serovars derived from red meat and human cases, either before or after HACCP, although some concurrence existed between human serovars and those on poultry (Sumner et al., 2004). Comparing concurrence of serovars, at best, may identify an association, but not the direction of movement of microbes (i.e., animal-to-man, vs. man-to-animal, or mutual exposure to a third, independent, source of the pathogen). Molecular epidemiology, particularly DNA sequence analysis and phylogenetic reconstruction, offers the potential to trace the spread of microbes from food animal carcasses to human populations and thus shed more light on the effectiveness of HACCP in reducing human illness attributable to food animal consumption, as recent research has demonstrated for both Campylobacter spp. (Mullner et al., 2009, 2010; Scottish Food Standards Agency, 2009) and Salmonella spp. (Patel, 2007) infections. As more data become available over time, it might be possible to evaluate temporal trends both in bacterial contamination on carcasses in abattoirs and their potential correlation to attributable human cases of foodborne illness.
Several knowledge gaps were identified by our review. Some combinations of bacterial species and food animal species were studied more frequently than others. In general, more large studies measuring pathogenic bacteria such as Campylobacter spp., Salmonella spp., and shiga-like E. coli are needed, given the importance of these pathogens in foodborne disease. None of the studies captured measured nonbacterial outcomes (e.g., viruses) and studies including nonbacterial outcomes would be useful in informing agrifood public health policy. Estimation of the effects of other interventions occurring in abattoirs, such as staff training, on microbial contamination on carcasses would be a useful prerequisite to modeling the effects of HACCP initiation. Overall, given the documented financial burden that HACCP implementation imposes, and the trend to implement HACCP even in small abattoirs over time, there is a need to verify its effectiveness in reducing microbial contamination on food animal carcasses. Future research accessing both proprietary and government surveillance data, employing analysis such as Empirical Bayes to adjust for potential confounders inherent in before/after trials, is needed to more accurately estimate the microbiological effectiveness of HACCP. This would be in keeping with HACCP's history of private–public sector collaboration.
Conclusion
The existing publicly available data on the effectiveness of HACCP on microbial contamination of carcasses are limited. Consistent trends in reduction of bacterial contamination were observed only for indicator organisms (e.g., aerobic bacteria). Research accessing both proprietary and government surveillance data is needed to more accurately estimate the microbiological effectiveness of HACCP, and large-scale prevalence studies measuring pathogenic bacteria and nonbacterial outcomes would be useful.
Footnotes
Acknowledgments
The authors gratefully acknowledge the complementary reviewing efforts of Oliver Bucher, Ashley Farrar, and Dr. Sarah Parker as well as the financial support of the University of Saskatchewan.
Disclosure Statement
No competing financial interests exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
