Abstract
Expert elicitation is a useful tool to explore sources of uncertainty and to answer questions where data are expensive or difficult to collect. It has been used across a variety of disciplines and represents an important method for estimating source attribution for enteric illness. A systematic review was undertaken to explore published expert elicitation studies, identify key considerations, and to make recommendations for designing an expert elicitation in the context of enteric illness source attribution. Fifty-nine studies were reviewed. Five key themes were identified: the expert panel including composition and recruitment; the pre-elicitation material, which clarifies the research question and provides training in uncertainty and probability; the choice of elicitation tool and method (e.g., questionnaires, surveys, and interviews); research design; and analysis of elicited data. Careful consideration of these themes is critical in designing and implementing an expert elicitation in order to reduce bias and produce the best possible results. While there are various epidemiological and microbiological methods available to explore source attribution of enteric illness, expert elicitation provides an opportunity to identify gaps in our understanding and where such studies are not feasible or available, represents the only possible method for synthesizing knowledge about transmission.
Introduction
E
Since its inception (Helmer-Hirschberg, 1966), expert elicitation methodology has evolved into a range of methods that rely on behavioral and mathematical techniques to estimate unknown quantities, to characterize risk pathways, and to quantify uncertainty. Expert elicitation can be used qualitatively to rank pathways or build models (Tan et al., 2010; de Jong et al., 2012) or to produce quantitative estimates, such as proportions (Hoffmann et al., 2006), percentages (Cressey and Lake, 2005; Speirs-Bridge et al., 2010), probabilities (Cooke et al., 2007; Montangero and Belevi, 2007), and natural frequencies (Vally et al., 2014). Expert elicitation provides an opportunity to recruit experts across a range of disciplines to estimate measures of interest and is used to answer questions that are difficult to answer via other methods.
Producing source attribution estimates can be difficult and resource intensive. For many enteric pathogens, there is insufficient research or surveillance-based data available to perform source attribution using methods other than expert elicitation (Pires, 2013). Expert elicitation can be an excellent mechanism for gathering information and supplementing estimates obtained through other attribution methods. Such information can be used to guide the development of more effective food and water safety policies that target interventions at the transmission routes and reservoirs that cause the greatest burden in our communities. Expert elicitation has been used in New Zealand (Cressey and Lake, 2005), the United States (Hoffmann et al., 2006), the Netherlands (Havelaar et al., 2008), Canada (Ravel et al., 2010; Davidson et al., 2011), and Australia (Vally et al., 2014) to inform source attribution efforts toward informing enteric illness prevention strategies and burden-of-illness estimates (Thomas et al., 2013).
The purpose of this systematic review was to identify published expert elicitation studies from a range of disciplines, to explore the state of knowledge and practice in expert elicitation, and to identify key considerations that can be applied in the context of enteric illness source attribution.
Materials and Methods
This review is reported based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol (Moher et al., 2009). Three research questions were developed to guide this systematic review.
Research questions
1. What methods have been used in previous expert elicitations?
2. What are the strengths and weaknesses of these methods?
3. What are best practices for expert elicitation of enteric illness transmission?
Review protocol
Six databases were searched using keywords to identify expert elicitation studies across any discipline (Table 1). Searches were performed January 7 to February 4, 2013. Search limits included publication dates between 1980 and 2012.
Study selection and data collection
Initial selection of articles was based on broad relevance screening of articles pertaining to expert elicitation on any subject. Round 1 article selection criteria were based on title and abstract (Table 2) and publication in English in 1990 or later. In round 2, articles were assigned to two categories. Category 1 articles indicate the use of formal expert elicitation methods as the primary study method, indicated by the use of phrases “formal,” “Delphi,” or “structured” in describing elicitation or through reference to specific methods (e.g., Classical Model or Delphi Method). Category 2 articles included review and discussion articles relevant to expert elicitation methods—studies where expert elicitation is a secondary component of the analysis and informal expert knowledge-gathering exercises. Only category 1 articles are reviewed here (Fig. 1).

Flowchart of publication selection, adapted from PRISMA Protocol (Moher et al., 2009) for systematic review of expert elicitation studies. 1. Articles identified via branching and hand-selection. 2. Articles excluded based on duplicate publications for the same study (n=5), not meeting screening criteria (n=3) or inability to access full-text based on institutional subscription access (n=2).
A single reviewer completed initial searches and round 1 screening (AB). Round 2 screening was performed by two reviewers (AB and KT or KP). Conflicts between reviewers were resolved via discussion. References and abstracts were stored in a RefWorks database (ProQuest LLC., RefWorks 2013;
N/A, not applicable.
Synthesis and analysis
Descriptive analysis was performed on the final selected studies; frequencies and proportions for categories were reported.
Results
Study selection
A total of 13,438 articles were identified from the database searches. From prescreening, 6299 articles were selected based on meeting the relevance criteria (Table 2); after deduplication, 4130 remained to be evaluated. Thirty additional references were added via hand searches of reference lists and through branching by examining the reference lists of relevant reports, primary and review articles, and Google searches for the full text of some publications for sources that matched the original inclusion/exclusion study criteria. The search was further restricted to those that contained the phrase “elicitation” in the title or abstract. Thus, 246 articles were screened in round 1, of which 156 articles were selected for round 2 screening. Ninety articles were excluded due to inclusion and exclusion criteria (including two previously overlooked duplicates). During round 2 screening, 69 articles were assessed as category 1 (formal expert elicitation as primary study outcome), 84 were assessed as category 2 (review or discussion articles, and expert elicitation as secondary study outcome), and 3 were excluded. An additional 10 articles were excluded during critical appraisal: duplications of studies reviewed elsewhere in the critical appraisal process (n=5); the full text did not describe the methods of an expert elicitation (n=3) and the full text was not accessible under institutional subscriptions (n=2). A total of 59 articles, covering a range of subjects (e.g., nuclear safety, health and environmental risks of climate change, health economics, etc.), were included for synthesis, 5 of which explicitly explored source attribution for enteric illness (Fig. 1; Supplementary Appendix A; Supplementary Data are available online at
Themes of expert elicitation methods
Key themes identified in the review included the expert panel, the background material supplied, the elicitation model, analysis methods, and research design.
Expert panel: Recruitment, panel size, definition of expert, assessment of expertise
An expert panel is defined by the method of recruitment, the number of experts, and the definition of an expert. Of the studies reviewed, 22/59 (37%) used relevance screening to recruit the expert panel, where experts were chosen based on a priori judgment of the participants' expertise, by the researchers (Table 4). Snowball recruitment was used in 20/59 (34%) of the studies, where a seed group of experts are asked to nominate their peers, who in turn are asked to nominate their peers to build a panel of experts. Experts were also recruited based on convenience (e.g., membership in a panel or attendance at a relevant workshop [n=3]). The method of recruitment was not reported in 22% (13/59) of studies. The majority (14/22; 64%) of studies that used relevance selection used external assessment methods (e.g., publication record, membership in professional group) to evaluate expertise (Table 4). For snowball recruitment, expertise was self-assessed (10/20; 50%), externally assessed (8/20; 40%), or assessed using seed questions (2/20; 10%) (Table 4). The number of experts recruited varied from 2 (Norrington et al., 2008) to 244 (Donlan et al., 2010), with an average of 26 experts (excluding a single study that involved more than 1 expert panel [Goossens and Harper, 1998]) (Fig. 2). Few studies (4/59, 7%) failed to report on expert panel size. There were no explicit inclusion or exclusion criteria for expert selection provided in 12 (20%) studies. In the remaining 47 (80%) studies, inclusion criteria included publication record; professional affiliation (including membership in relevant research groups); technical experience in relevant topics; location or regional expertise; profession; and employer. Exclusion criteria included geographic location or regional expertise and recruitment was restricted to local or regional experts due to resources or subject matter. Seed questions were used to assess and/or calibrate expertise in 4/59 (7%) studies. Other methods for defining expertise were based on external assessment by researchers or their peers (23/59; 39%), or self-assessment (15/59; 25%).

Expert panel size as reported in 57 reviewed expert elicitation studies (as frequency), published 1990–2012.
Pre-elicitation material
Background information was provided to experts in 40/59 (68%) studies, with a briefing book most often used (Table 5). Many (23/59; 39%) studies reported training experts in probability and uncertainty (e.g., through providing worked examples); 18/23 (78%) studies provided additional background information (Table 5). Three elicitation tools were employed in the majority of studies reviewed: questionnaires (41/59; 69%), workshops (25/59; 42%), and interviews (11/59; 19%) (Table 6). Workshops were used as either the sole elicitation method (8/25; 32%), or in combination with other elicitation tools (17/25; 68%). Of those studies using workshops, 6/25 (24%) reported using a modified Delphi approach, 3/25 (12%) reported using the HENVINET elicitation framework (Bartonova, 2012), and 2/25 (8%) reported using the Cooke method (Cooke, 1991). Other methods included the SHeffield ELicitation Framework (SHELF) (O'Hagan, 2012), the ELI elicitation technique (van Lenthe, 1993), informal group processes, and probability trees. Questionnaires were administered using various methods (Table 7). One-on-one interviews included structured, questionnaire-based, and narrative approaches. Pilot testing or validation of the survey materials prior to administration was reported in 17/59 (29%) studies (Table 5). Pilot testing was conducted by co-workers or subject matter experts not part of the expert panel.
Elicitation tool: Format, number of rounds
In 33/59 studies (56%), more than 1 round of elicitation was undertaken (Fig. 3). In nearly half of these studies (n=16), different methods were used between rounds (e.g., both questionnaire and workshop) and an explicit consensus process (e.g., sharing group results from previous rounds and producing consensus statements from workshop discussions) was employed in 18/33 (55%) studies.

The number of elicitation rounds undertaken in 59 reviewed expert elicitation studies (as frequency), published 1990–2012.
Question framing
Roughly half (31/59; 53%) of the studies reviewed used expert elicitation to produce quantitative estimates, including measures of central tendency, percentiles, and probabilities across distributions (Table 8). A combination of both quantitative and qualitative measures were collected in 23/59 (39%) studies to elicit ranking and scoring, to generate model parameters, and to characterize risk factors (Table 8). Few (3/59; 5%) studies collected information using more than 1 method. Nearly half (27/59; 46%) of the studies collected information about the limits of the estimated quantities, as 5th and 95th percentiles or confidence intervals (n=13) and maximum and minimum values (n=12), and 3/59 (5%) studies collected information across a wider range, including the 5th, 25th, 50th, 75th, and 95th percentiles to produce more detailed probability distribution functions. Visualization (e.g., using dynamic graphs to represent probability distributions or drawing model tree structures) was incorporated into the elicitation process in 23/59 (39%) studies, including 8/59 (14%) that specifically used visual estimation tools to collect probabilities (n=4), percentiles (n=2), ranking (n=1), and mean values (n=1) (Table 9).
Analysis
In combining expert opinions, 18/59 (31%) studies reviewed used weighting based on seed or calibration questions (n=5), expertise (n=5) exclusion filters (n=3) or other methods (n=5); in 17/59 (29%) studies linear opinion pooling or simple averaging were used (Table 10); and in 7/59 (12%) studies, Bayesian networks were built using the elicited judgments. The types of distributions explicitly described in the reviewed studies for characterizing elicited quantities and probabilities varied, and included Bayesian modeling (n=6), β distributions (n=4), and normal and log-normal distributions (n=6). Cumulative distributions were not addressed in 23 studies, which included both qualitative and quantitative studies.
Discussion
This review explored 59 expert elicitation studies over a broad range of topics to develop a comprehensive summary of key themes and considerations for performing an expert elicitation in the context of enteric illness attribution. Expert elicitation can be performed with a behavioral or mathematical elicitation approach, to obtain an explicit consensus estimate or to explore the uncertainty in expert knowledge on a topic. The methods used in performing expert elicitations across the disciplines can include both formal and informal processes, which influence the final output. Identified sources of bias include the expert panel employed; the type of preparatory material provided to participants; the choice of elicitation model; the method of analysis; and the specific research design (Cooke, 1991; Walker et al., 2001). In designing an expert elicitation, researchers need to carefully consider and report the sources of bias (Cooke, 1991).
There are a number of biases common across expert elicitation studies. Anchoring derives from a failure to adjust viewpoints based on new information and can lead to fixation of estimates about initial values despite introduction of data to the contrary. Availability bias relates to the tendency to base estimates on information that is easily recalled. Base rate bias arises from failure to consider underlying population rates. Overconfidence relates to the tendency to produce estimates with too great a level of certainty or confidence bounds that are too narrow (Tversky and Kahneman, 1974; Walker et al., 2001; Kynn, 2008).
Recruitment methods can lead to bias through sampling from specific subsets of expert groups in recruiting expert panels, thus limiting the breadth of available knowledge and expertise (Tversky and Kahneman, 1974; Cooke, 1991). Relevance-based recruitment by experts can help to ensure that relevant stakeholders are recruited, but could potentially lead to overrepresentation of experts with whom researchers are familiar and who may hold similar opinions or viewpoints of the research questions (Evans et al., 1994). Snowball recruitment helps to ameliorate the influence of researcher bias in recruitment (recruiting who they know), but depending on the initial snowball sample being used, bias can still occur (e.g., do the experts represent relevant stakeholder groups who are qualified to address the research question?).
Some literature suggests that a panel size larger than seven experts does not improve the strength of the study (Ashton, 1986; Cooke, 1991; Winkler and Clemen, 2004). Others disagree, suggesting that larger panels increase the statistical power of the analysis and allow for better characterization of uncertainty (Clemen and Winkler, 1999; Moon and Kang, 1999; Hoffmann et al., 2007). Given the variety of subjects included in this review, a comparison of expert definitions used in the reviewed studies is not practical, but common concepts are presented herein. Transparent methodology, involving explicit inclusion and exclusion criteria, is critical (Ayyub, 2001). Without inclusion or exclusion criteria, it can be difficult for researchers to ensure that the recruited experts' knowledge is adequate. Expert selection from across disciplines (e.g., academia, industry, and public service) and broad organizational representation can help produce richer and more balanced elicitations (Ayyub, 2001).
Expertise can be quantitatively assessed using seed questions (van der Fels-Klerx et al., 2002; Cooke et al., 2007), though this is not always feasible, or through self-assessment (Hoffmann et al., 2006; Donlan et al., 2010). Seed questions are a tool for addressing overconfidence and providing an appropriate calibration for responses (Morgan and Henrion, 1992; Walker et al., 2003). Asking experts to estimate quantities that are known, or that will become known during the project timeframe, can measure an expert's ability to accurately produce estimates on the topic of interest (Cooke and Goossens, 2008). A problem with this approach lies in the ability to find questions for which answers are known or that can become known that are directly relevant. Self-assessed expertise and background information from the expert panel can be used to explore estimate weighting or use of threshold models to calibrate the estimates and reduce overconfidence bias (Ayyub, 2001; Ravel et al., 2010). Self-assessed expertise is considered more useful in characterizing experts' uncertainty with their own estimates (Hoffmann et al., 2007) than as a method for defining expertise.
Briefing books can present summaries or lists of relevant studies, outline the elicitation process, and frame the research questions. They can also provide guidance to experts on uncertainty and probability (e.g., through a worked example) (Kynn, 2008), which can reduce bias from misinterpretation of elicitation methods (Tversky and Kahneman, 1974; Cooke, 1991). These tools are similar to a pretest phase of a survey, which ensures that instructions are clear, questions are easily interpreted, and bias resulting from word choices is minimized (Knol et al., 2010). Providing experts with summaries or access to relevant current literature as part of a briefing book can help to address availability bias and base-rate bias (Cooke, 1991; Walker et al., 2001); however, it is necessary to ensure that these studies are representative of current knowledge on the topic and are not introducing a new layer of bias (Kahneman and Tversky, 1973). Base rate and representativeness bias can be avoided by providing experts information on the incidence or underlying frequency of exposures (e.g., surveillance data or toxin concentrations in the environment) (Koehler, 1996).
A variety of elicitation methods are available including workshops, questionnaires, and interviews. A key difference in these methods lies in the type of interaction between experts and facilitators, and the level of anonymity provided to participants (Brito et al., 2012). Anonymity of participants allows experts to provide estimates or judgments that may not be congruent with corporate or departmental views (Hetes et al., 2011).
Workshops and face-to-face discussion can be costly in both time and money and be difficult for experts to attend; however, they provide opportunity for experts to discuss sources of disagreement (Clemen and Winkler, 1999). A trained facilitator is recommended in workshops to reduce the potential for bias from group dynamics (Tan et al., 2010, Tyshenko et al., 2010, 2011). Nominal group technique is the most commonly used tool for structured information collection from workshops, involving the collection of input from all members, and discussion and ranking of suggested outcomes (Delbecq and van de Ven, 1971). Other forms of group decision making for workshops include the analytic hierarchy process, a hierarchical model for prioritizing outcomes or risk factors (Forman and Gass, 2001), and the RAND/UCLA Appropriateness criteria, created for supplementing scientific evidence in designing clinical process in medical care (Fitch et al., 2001). The choice of workshop method is dependent on research priorities and outcomes of interest.
More than one round of elicitation was used in over half the studies, which can provide the opportunity to resolve conflict and encourage consensus. The potential benefits of multiple rounds should be weighed against potential cost in time and money. Increased rounds of elicitation may also lead to loss of participants (dropouts) (Doria et al., 2009). The Cooke (Classical) model recommends using one round of elicitation (Cooke, 1991), reducing the risk of results becoming skewed in the direction of the “loudest” voices (Aspinall, 2010). Providing summary statistics between rounds can lead to an adjustment toward a central value (Aspinall, 2010). Including a discussion or workshop following quantitative (questionnaire) estimation can reduce the risk of anchoring bias by encouraging experts to discuss the reasoning behind their estimates (Goossens and Harper, 1998; Coppersmith et al., 2009).
Question framing depends on the outcome of interest and dictates which type of measures are collected (e.g., probability, mean, frequency, weighting), whether participants require an understanding of probability theory (e.g., probability distributions) and whether uncertainty will be explicitly quantified (e.g., quantitative measure, probability distribution) (Kuhnert et al., 2010). In eliciting quantitative measures, researchers should consider the potential benefits of using methods with easily interpreted language such as natural frequencies or means (Kuhnert et al., 2010).
Several methods have been used to aggregate elicited quantities. Linear opinion pools or simple averaging are commonly used as a straightforward method of combining expert opinions and are recommended by Cooke's (Classical) method (Cooke, 1991). Weighting of expert opinions in aggregate functions (e.g., through validation or seed questions) is recommended (Cooke, 1991). Alternatively, estimates can be aggregated on the basis of self-assessed expertise; however, this introduces the potential for overconfidence bias (Cooke, 1991).
Analytical tools chosen by researchers are highly dependent on research questions, the elicitation framework, and the elicited values. Presentation of summary statistics can help to characterize the data being collected and identify any trends or anomalies, such as a bimodal distribution (Ravel et al., 2010).
The distribution used to characterize summarized probabilities is also important. A normal distribution can often fit elicited data; however, as variation increases, other distributions can also be considered including the β distribution (Ravel et al., 2010) and the γ distribution (Cagno et al., 2000). Fuzzy distributions, Bayesian methods, and probabilistic inversion can also be used in producing probability distribution functions (Clemen and Winkler, 1999; Moon and Kang, 1999; Havelaar et al., 2008, Kuhnert et al., 2010).
Uncertainty in elicited data is a recurring theme in the literature. Researchers need to consider several sources of uncertainty in their estimates, as stated by Hoffman et al. (2007): (1) variability in expert judgment; (2) the level of agreement between experts' assessments and prior estimates based on primary data; (3) individual experts' uncertainty about their own assessments; and (4) variability in individual experts' uncertainty about their own best estimates.
Application of Expert Elicitation Methods: Enteric Illness Source Attribution
Of the 59 studies reviewed, 5 (8%) used expert elicitation to explore source attribution in (foodborne) enteric illness in Canada (Ravel et al., 2010), the United States (Hoffmann et al., 2007), The Netherlands (Havelaar et al., 2008), New Zealand (Cressey and Lake, 2005), and Australia (Vally et al., 2014) (Table 11).
Multisectoral expert panels were recruited in the Canadian (Ravel et al., 2010) and U.S. (Hoffmann et al., 2007) enteric illness studies, while the Australian (Vally et al., 2014) study used a narrower definition of experts (epidemiologists and public health professionals). Recruiting across a range of disciplines, the Canadian (Ravel et al., 2010) and the U.S. (Hoffmann et al., 2007) studies used larger expert panel sizes (54 and 42, respectively) than the Australian (Vally et al., 2014) (n=12), New Zealand (Cressy and Lake, 2005) (n=14), and The Netherlands (Havelaar et al., 2008) (n=16) studies. Panels were composed of experts from academia, public service, and industry, allowing the panels to draw from a breadth of experience (Hoffmann et al., 2006; Havelaar et al., 2008; Ravel et al., 2010; Vally et al., 2014), except in the New Zealand study, where panel composition was not described (Cressey and Lake, 2005).
Self-assessment of expertise in reference to the subject material as a whole or in the context of specific questions was employed in three of the five enteric illness elicitations (Hoffmann et al., 2006; Havelaar et al., 2008; Ravel et al., 2010; Davidson et al., 2011), externally assessed in one of the elicitations (Vally et al., 2014), or not described (Cressy and Lake, 2005) (Table 11). Generating appropriate seed questions is difficult in the context of enteric illness research and has not been reported. Other health-related expert elicitations have used surveillance data as a basis for seed questions (e.g., air pollution and mortality) (Cooke et al., 2007).
A thorough review of the current state of knowledge of the transmission of the illnesses being studied is an appropriate first step. This information helps frame the expert elicitation and provides a source of background information for the experts. The Australian elicitation provided a detailed literature review to the expert panel (Vally et al., 2014). Current illness surveillance data were provided to experts in two of the studies (Havelaar et al., 2008; Vally et al., 2014), and experts were encouraged to consider the potential sources of bias. While a useful tool for framing the research question and providing experts with the best available information from which to draw their estimates, briefing books also increase the likelihood of availability bias, by potentially providing only part of the information on a topic (Walker et al., 2001). The Canadian, U.S., and Australian elicitations provided a working example of the questionnaire as part of a briefing book (Hoffmann et al., 2006; Ravel et al., 2010; Davidson et al., 2011; Vally et al., 2014). Training experts to use the elicitation tool and in uncertainty can help address several sources of bias (e.g., anchoring, overconfidence, representativeness, and the base rate fallacy) (Cooke, 1991). Elicitation materials were only reported as being pretested in the U.S. study (Hoffmann et al., 2006). The Canadian and Australian studies (Ravel et al., 2010; Vally et al., 2014) built upon the U.S. survey tool.
Both workshops and questionnaires were employed in two of the studies (Cressey and Lake, 2005; Vally et al., 2014), and in the remaining three (Hoffmann et al., 2006; Havelaar et al., 2008; Ravel et al., 2010), questionnaires were mailed (n=2) or e-mailed (n=1) to experts. The Australian expert elicitation is the only enteric illness source attribution project that used multiple elicitation rounds to achieve consensus, and incorporated a discussion between rounds of survey administration (Vally et al., 2014).
There are many methods for measures and evaluating expertise. Researchers should consider whether the ability to predict surveillance data or other information would serve as a proxy for expertise in enteric illness transmission. In three of the studies, experts were asked to self-assess their expertise. In the Canadian elicitation study, a threshold model was created using self-assessed expertise ≥3 out of 5 as a cut-off for inclusion of estimates in the linear opinion pool. In the United States, experts were asked to produce estimates only for pathogens for which they felt their expertise was sufficient (Hoffmann et al., 2006). Linear opinion pools were used in the New Zealand, Australian, Canadian, and U.S. expert elicitations of enteric illness to combine expert opinion (Cressey and Lake, 2005; Hoffmann et al., 2006; Ravel et al., 2010; Vally et al., 2014). Cluster analysis was used to explore unexpected bimodal distributions in the Canadian elicitation data (Ravel et al., 2010). Triangular probability (n=3), normal (n=1), and PERT (n=1) distributions were produced from the individual estimates, which were combined using simple averaging (n=2), probabilistic inversion (n=1), or Monte Carlo simulation (n=2).
To elicit information about the proportion of cases of human illness transmitted by specific routes, questionnaires are beneficial for estimating natural frequencies or percentages and characterizing the surrounding uncertainty, using measures such as 90% certainty intervals. The incorporation of workshops allows experts to review the collective estimates and to discuss variation or divergence in estimated opinions. This can highlight sources of disagreement between experts and reduce the risk of unexplainable clustering such as observed in the 2009 Canadian elicitation. Readministration of a survey following discussion allows for revisions, reducing the potential bias of anchoring (Knol et al., 2009). Collecting information about self-assessed expertise can calibrate responses, to reduce the influence of overconfidence, especially when seed questions are not used.
On the basis of the reviewed literature, in designing a future expert elicitation of enteric illness source attribution, the strategies outlined in Table 12 should be used to strengthen future approaches.
Study Limitations
Sources of bias in this review included publication bias, the use of specialized terminology, and restriction of the search to health- and biological science–oriented indexing databases. Articles using the phrase “expert elicitation,” one of the primary search strings, are likely to emulate protocols outlined in relatively few seminal papers (e.g., the Cooke method) (Cooke, 1991), especially within disciplines. This can increase comparability across studies, but introduces the potential for bias toward specific practices that may not represent all established best practices or the underlying diversity of methods employed.
Due to the high volume of articles identified using the initial search string (4130), round 1 review of literature was restricted to 246 articles containing the word “elicitation” in the title or abstract. This may have eliminated several studies using alternative vocabulary such as “Delphi method,” “Analytic Hierarchy Process,” or “Nominal Group Technique.” Restriction to articles available in English may also have eliminated key studies. It is not possible to quantify the impact of this; however, the included literature comes from a variety of disciplines using a range of expert elicitation methods.
Conclusions
Expert elicitation has been used in a range of disciplines to answer questions that are difficult or expensive to answer in other ways. This review has highlighted critical aspects and considerations related to the expert panel, the pre-elicitation material, the elicitation tool or method, question framing and analysis, and how different types of bias may occur and be prevented. This review further indicates a departure from a rigid protocol and toward a flexible elicitation tool that can be adapted to fit individual research needs. Thorough consideration of these themes and following the guiding principles for conducting expert elicitation has informed the development of a framework for future expert elicitations of enteric illness source attribution.
Footnotes
Acknowledgments
Funding was provided by the Public Health Agency of Canada. Several experts in the design and analysis of expert elicitations, specifically in the context of enteric illness transmission, were consulted and their input has helped to guide the content of this review.
Disclosure Statement
No competing financial interests exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
