Abstract
Objective
To review common qualitative and quantitative methods of measuring shared mental models appropriate for use in the healthcare setting.
Background
Shared mental models are the overlap of individuals’ set of knowledge and/or assumptions that act as the basis for understanding and decision making between individuals. Within healthcare, shared mental models facilitate effective teamwork and theorized to influence clinical decision making and performance. With the current rapid growth and expansion of healthcare teams, it is critical that we understand and correctly use shared mental model measurement methods assess optimal team performance. Unfortunately, agreement on the proper measurement of shared mental models within healthcare remains diffuse.
Method
This paper presents methods appropriate to measure shared mental models within healthcare.
Results
Multiple shared mental model measurement methods are discussed with regard to their utility within this setting, ease of use, and difficulties in deploying within the healthcare operational environment. For rigorous analysis of shared mental models, it is recommended that a combination of qualitative and quantitative analyses be employed.
Conclusion
There are multitude of shared mental model measurement methods that can be used in the healthcare domain; although there is no perfect solution for every situation. Researchers can utilize this article to determine the best approach for their needs.
Introduction
Defining shared mental models
Mental models are an organized set of knowledge and/or assumptions about reality that give individuals a basis for decision making and understanding. 1 Day et al. 2 demonstrated that when individuals’ mental models were more similar to experts’ mental models, it was predictive of skill performance. Shared mental models (SMMs) are the overlap or co-understanding between two or more individuals. 3 SMMs describe the ability for all members of a team to be thinking in a similar way and performing in accordance with each other. More specifically, team members can have SMMs surrounding tasks, technology, team interactions, and team members. 4 See Table 1 for examples and definitions of different types of SMMs.
Organization of shared mental models within the context of healthcare teams (adapted from Mathieu et al. 3 ).
Measuring SMMs
The dearth of research on SMMs in healthcare could be due to the fact that they are difficult to assess. Effective measurement of SMMs has proven challenging even in experimental environments, without much consensus among experts.5,6 For instance, Burtscher and Manser 5 found that in a review of 33 published studies on SMMs, seven different methods were used for assessment and ten methods were used for analysis. Further, they concluded that “more than a decade after a seminal review by Mohammed et al. 6 was released, there is still no consensus among the scientific community regarding measurement” (p. 1351). Despite discrepancies in the appropriate form of measurement, research is consistent with regard to the importance of SMMs in excellent teamwork stating that well-established SMMs improves a team’s ability to communicate and coordinate.2–6
Because appropriate assessment is crucial for better understanding SMMs and their relationship with teamwork and patient safety, this paper seeks to provide a ground level view of how SMMs in healthcare teams can be studied and evaluated. To delineate the most appropriate SMM assessment methods within healthcare, we synthesized meta-analytic evidence 7 and a seminal systematic review. 8 The two articles are a review of team SMMs 6 and cognitive interview techniques. 10 DeChurch’s article focuses on measurement of team SMMs with a particular focus on the various ways the literature has utilized SMM measurement in teams and how, regardless of measurement technique, SMMs are positively related to team performance. This indicates that SMMs play an important role in teamwork across the extant literature even though there is a lot of variance in the way they are measured when looking across studies.
Kohnken et al.’s work instead focuses on one way to assess SMMs, particularly through use of cognitive interviewing techniques compared to standard interviews. This meta-analytic evidence found that cognitive interviews had a strong effect for increased recall of details, although they also led to an increase in errors with regard to recall. Furthermore, effect sizes were larger when participants were involved in staged events compared to video films. This indicates that the cognitive interviewing technique is efficacious for elucidating the way people think, but that positive outcomes are subject to changes as a factor of the method used to interview.
Together these articles demonstrate that SMMs are measurable, but that there is little agreement on how to measure when looking at the current literature base. Further, cognitive interviewing has some tradeoffs, although it is likely a better technique for garnering correct details, it also brings with it more error compared to standard techniques.
In synthesizing this information, we aimed to both foster an understanding regarding the most appropriate methods for assessing SMMs within the healthcare setting, and to aid healthcare researchers in selecting assessment techniques, which empirically demonstrate theorized relationships within the healthcare context. With these aims in mind, we first define mental models as well as SMMs. Next, we describe the methods that can be used to measure team SMMs in the healthcare setting. Finally, we conclude with recommendations for selection of SMM assessment methods based upon the various constraints present in the healthcare setting.
Mental model assessment techniques
Few empirical studies have focused on the concept of SMMs in healthcare. One potential reason is that recommendations for measurement of SMMs do not exist in the healthcare setting. Leveraging work outside the healthcare domain, DeChurch and Mesmer-Magnus 8 presented three important types of SMM measurement methods. The three techniques can be arranged as follows: (1) elicitation methods, (2) structure representations, and (3) representation of emergence. 9 Elicitation methods identify the concepts and content within the SMM. Structure representations unite these concepts by organizing them based upon their relationships. Finally, representation of emergence allows multiple SMM “maps” to be compared and contrasted. As an example, the elicitation method would demonstrate one’s family by including content like “mother”, “father”, “brother”, and “sister”, The structure of representation would be demonstrated by a family tree. Finally, the representation of emergence would be determined by the overlap between your family tree and a cousin’s family tree. Refer to Table 2 for a summary and description of these techniques.
Categories of shared mental model elicitation techniques.
The remainder of this paper will review common assessment methods that could be utilized in healthcare research to garner insights regarding SMMs. The methods have been arranged according to type of data analysis: qualitative and quantitative. Qualitative methods of data collection and analysis commonly focus on the use of words, gathered through storytelling, theory development, and case study. Conversely, quantitative methods focus on objective measurements commonly by conducting polls, surveys, questionnaires, or direct measurement.
Cognitive interviewing
Cognitive interviewing is a systematic approach to elicit knowledge with the aim of increasing relevant and accurate information. 10 Interviews are transcribed and analyzed using propositional or discourse analysis. 7 Typically, this involves aggregating interviewer notes and respondent answers, establishing common themes, and determining key findings. 11 Meta-analytic evidence suggests this method is particularly beneficial for improving correct detail recall. 12 The healthcare industry has leveraged cognitive interviewing for survey development,13–16 root-cause analyses, 17 and design guidelines. 18
Cognitive interviewing includes two dominant approaches: think-aloud interviewing and verbal probing.19,20 Think-aloud interviewing entails the interviewer reading a question, with minimal interjection, and the respondent “thinking aloud” as (s)he answers each question. For example, an interviewer might ask the participant to describe a process as if they were doing it, essentially having the participant “think aloud” all of the steps in first person. Because interviewers solicit information and ask only clarifying questions beyond the scripted interview, minimal training is required, and interviewer bias is limited. 21 Additionally, the open-ended format of the think-aloud interviewing is conducive for respondents who have a wealth of knowledge. 22 Conversely, respondents may have difficulty articulating their processes eloquently; therefore, some respondents may require training on how to effectively “think-aloud”. 19
Verbal probing consists of an interviewer following-up broader questions with more specific questions (i.e. probes) to gain additional insights on a particular topic. Probes can either be asked concurrently or retrospectively. Concurrent probing involves a question–answer exchange between the interviewer and respondent, while retrospective probing involves probes asked after the entire interview has been administered. Regardless of the probing technique, there are two types of probing questions: scripted and spontaneous. Scripted probes are developed a priori, and spontaneous probes are created impromptu. Overall, the advantage of using verbal probing (as opposed to think-aloud interviewing) is that the interviewer can tailor the interview, thus resulting in a more relevant, productive, and consistent interviews. 19 Unfortunately, the active role of the interviewer potentially invokes bias and leading respondents. 19
Within healthcare, these elicitation methods may illustrate differences in procedural expectations between disciplines, professions, or expertise. For example, an expert nurse and novice surgical resident may have different understandings of pertinent information to include in a handoff. The differences and similarities between these two conceptualizations may be used to adjust protocol, create useful cognitive aids, and ensure that all stakeholders are receiving the information necessary to complete their tasks safely and efficiently. However, cognitive interviewing is limited to information that a respondent can verbalize. Additionally, since interviews are typically conducted after performance, bias arises as respondents may distort information or memories may have decayed. 7 These errors in recall or verbalization may result in crucial minor details being overlooked and thus left out of any future interventions.
Although cognitive interviewing is traditionally conceptualized as one-on-one interviews to establish individual mental models, they can also be conducted in groups to conceptualize team mental models, adding a level of flexibility not available to all SMM techniques. Group discussions can generate important linkages; however, the views of the most extroverted and opinionated may dominate the discussion and misrepresent the team’s mental model. Despite this potential limitation, group interviewing may encourage teams to engage in self-correction, strengthening the quality of the SMM, not only allowing for researchers to measure the SMM but practitioners to enhance it.
Card sorting
Card sorting refers to a diverse group of techniques that involves the naming, arrangement, and ordering of concepts in accordance to a set of criteria. Card sorting has been used to study cognitive function 23 and has provided insight to participants’ mental organization of information.
Card sorting can be an effective method in eliciting important concepts of a mental model as well as provide insight into how concepts are cognitively associated. 8 In its most basic form, the card sort involves concepts written on physical cards where participants arrange and order the cards according to their own understanding and perception of the topic. 24 In general, there are two approaches labeling the groupings. 25 In open card sorting, the labels are generated by the participants as they group the concepts. In closed card sorting, the labels are provided by the investigators and are typically generated from prior research. For example, in an open card sort, participants might organize a standard deck of playing cards by color, face, number, or any other chosen manner. In a closed card sort, participants would be asked to sort the cards strictly by suit.
After the groupings are generated, they can be compared across individuals to derive general principles. However, more modern forms of card sorting employ virtual cards on computers and utilize cluster analysis to construct tree diagrams (i.e. dendrograms). These provide a visual depiction of how concepts and terms are grouped by an individual or by a set of individuals and allows for comparisons and calculation of similarities between different models. Further, online card sorting software permits card sorting be accomplished remotely with little input from the investigator, making it flexible with regard to where it can be implemented.
In healthcare, there has been a growing interest in measuring SMMs through methods like card sorting to understand their impact on areas such as the integration of healthcare organizations, 26 collaboration of clinical teams, 27 and implementation of clinical guidelines. 28 Card sorting has been employed to show significant differences between mental models among healthcare team members,29,30 as well as demonstrated how disparate mental models impact computerized decision aids design, 31 clinical team performance, 32 and the development of clinical handoff tools. 33 This particular method of elicitation may be helpful within the context of healthcare for understanding differences in role expectations or policy perceptions between disciplines, professions, or levels of tenure.
Card sorting can be a simple, time-efficient technique that allows for a large number of participants. However, the insight into the structure of concepts is relatively crude and only provides information on what ideas are related, not the relative strength of the relationships. Therefore, card sorting is effective at eliciting the content of mental models with minimal insight into the structure of the information.
Finally, card sorting specifically focuses on individual mental models, which allows the comparison of each team member’s mental model with other members but its main limitation is that it does not evaluate how those mental models interact within the team. 8 Nevertheless, it is an important tool to have in one’s arsenal, as the team SMM is directly influenced by how individual mental models overlap. Future applications may examine using the card sorting task at the group level as a method of training and solidifying team conceptualization and strengthening SMMs.
Concept mapping
Concept maps are visual hierarchical representations that examine the organization and structure of knowledge.34,35 Originally developed to elicit the knowledge and understanding of basic scientific principles in children, concept mapping has evolved into both qualitative and complex quantitative modeling techniques. By understanding an individual’s organization of knowledge, gaps of understanding and mismatches between team members’ knowledge representations can be identified and addressed.
In terms of elicitation, every map is guided by its most general concept, typically the item which you are trying to elicit the information about (i.e. an event, object, or process). 35 In concept mapping, each concept is connected by branches or lines, which describe the relationship between them. Each layer beneath this concept branches out to include other sub-concepts within this category. For example, consider the concept of teamwork. See Figure 1 for an example of concept mapping. Concept mapping is useful not only for knowledge elicitation but also as a comparison method between experts and novices, individuals within the same team, and those completing the same or similar tasks.35–38

Example of concept map of “teamwork”.
There are several quantitative concept mapping methodologies that have been identified for measuring SMMs including pairwise ratings, distance ratio formulas, multidimensional scaling (MDS), and Pathfinder.7,8 Each of these will be discussed in detail below.
Pairwise ratings
Pairwise ratings are used to identify if two paired concepts are similar, dissimilar, or identical. Participants are asked to rate pairs of concepts based upon how similar/related they are on a scale that commonly ranges from “not at all related” to “highly related”. These data are then transferred into a matrix for analysis (see Figure 2.). 39 For SMMs, the pairwise rating method is based on relationships and organization of words held in the memory, 39 connected through a hierarchical and stereotypical relationship. 39

Example of a rater response matrix for the concept map of “teamwork”.
Pairwise comparison has been used to measure patients, healthcare teams, healthcare education/training, and medical sciences and pharmaceutical research.40–43 Pairwise ratings have four advantages: time efficiency, ease of use, the ability to provide a more robust data set, and the ability to create a statistically rendered mental model. Given these advantages, pairwise ratings can be particularly useful in the context of healthcare where patients are incorporated as a valued member of the team, or at the very least, recognized as the key informant that may prevent errors. 44
Because pairs can be read to the participant and the participant can respond verbally, pairwise ratings offer a convenient method of eliciting patient mental models regardless of the education/literacy levels. Last, rather than the participant expressing or expounding upon the SMM, statistical analysis creates the data-based models. 7 The disadvantage of collecting data in this manner is that as the list of pairs is increased, the risk of the task becoming arduous to the participant increases. 7
Distance ratio formula
The distance ratio formula is an algorithm employed when using causal mapping measures (visualizing the influence or causality between variables). It can be used to study complex networks, even when variables interact with themselves directly (i.e. self-loop).45,46
The distance ratio formula builds upon the pairwise rating by requiring participants to comment on the relationship of the concept pairs as having a positive or negative interaction and the strength of that interaction. The raters’ matrices are then analyzed using simple calculations to produce a convergence ranging between 0 and 1, 0 being most similar and 1 being least similar. 47
This technique allows for the comparison of complex networks between individuals or within a group.39,45 The data set that is rendered using this technique is robust and rich, providing not just the relationship between concepts but the direction of that relationship. A broad sample is required to achieve a rich data, which becomes increasingly difficult to analyze as the matrices expand.45,47 Additionally, if a link is absent between two concepts, the formula treats this absence as if the concept itself were missing. 39
Distance ratio formula has been applied across multiple fields of study including social relations, power grids, genes, and text analysis.48–52 Specific to healthcare, this method can be applied to measure providers’ (individual or team) SMMs throughout a variety of tasks, both qualitatively and quantitatively. 7 One particularly useful domain of application for this method of elicitation is the potential causes of sentinel events. Using this method with interprofessional healthcare teams not only aids in understanding what potential causes lead to errors or near-misses, but also allows researchers and practitioners to identify differing conceptualizations of risky behaviors between professions and disciplines.
Multidimensional scaling
MDS refers to a method of creating a visual representation of similarity between individual cases by using proximity data. The main difference with MDS and distance mapping is that it requires data-driven, a priori categories, compared to post hoc categories. This indicates that a theoretical basis must be established ahead of measurement to ensure that the method is being used appropriately. In this manner, this method can be used as an alternative to factor analysis, allowing the researcher to “discover how and why variables are related”. 53 MDS is flexible in that it can be used to analyze multiple types of information, including interval, ratio, and ordinal level metrics. MDS has been applied to a multitude of differing fields including anthropology, economics, organizational behavior, political science, psychology, and sociology. 53
Data are mapped upon a two-dimensional space with each concept being represented by a marker, usually a dot. Similar concepts have smaller spaces between them and dissimilar concepts have larger spaces between them. The more information you use to create an MDS, the more reliable the model becomes as dots cluster together into related themes. However, higher densities of markers create complexity and can be difficult to interpret.
The advantage of MDS is that one can manipulate the data set so that the convergence becomes refined, expressing more detail regardless of the scale applied. Further, this method provides the ability to create a pictorial representation by clustering more similarly related concepts. MDS also allows the researcher to identify the dimensions used by the respondent to judge similarity and dominance. MDS can also compensate for missing data within a dataset. The only requirement is that the data entries must be relatively large compared to the number of relationships between them. 53
MDS does have some methodological issues, including the fact that once the data are refined to a small section to show intricacy, it becomes easy to lose the large-scale view of the data. For instance, when a researcher is scaling the data, the best technique to use is not always easily identifiable. 7 Another disadvantage of MDS is the requirement of a large number of respondents to ensure there are more data points than relationships between those points. 54 This can lead to an extraordinarily large number of ratings for even small set of concepts, leading to a time-consuming data collection process.
Within healthcare, MDS has been used to extend patient-centered marketing research by addressing patient satisfaction, patient loyalty, and hospital image with each measure being a coalescence of other factors like physical environment quality, interaction quality, outcome quality, and effect on image. 55 MDS has also been used for complex concepts such as psychiatric disorders, quality of life and quality of care, end of life care, health planning and evaluation, and development of guidelines for public health management of lower prevalence chronic conditions such as epilepsy, communication patterns, and doubt/uncertainty in clinical settings.56,57
Potential applications of measuring SMMs in healthcare can range from understanding what medicines are confused with one another, to creating associations between similar patient cases and disease states, procedure processes and anticipated complications, patient handoffs, and more. MDS allows for a very large system, like healthcare, to be analyzed in small pieces without losing site of the “big picture.”
Pathfinder
Pathfinder is a structural assessment technique that represents each concept within a map as a node, with the distance between nodes acting as a representation of the strength of the relationship between those two concepts. The more similar a concept, the more closer together the nodes. This technique visually generates each person’s mental model by comparing judgments given between raters. 58 Using Pathfinder, similarities can be represented in a network structure, where these similarities are shown as nodes. Pathfinder applies a numerical weight based upon the closeness and the similarity between concepts. Because each person has a different perception of how concepts may relate to one another, Pathfinder must compare the models created by all individuals. This is accomplished by assessing “closeness”. “Closeness” refers to whether a node is present in structures for two individuals. 59 Closeness is computed for each node, and the results are averaged, which then indicates the overall similarity of the mental models. The range of similarity is from 0 to 1, 0 being unrelated and 1 indicating identicalness. As an example, Pathfinder can be implemented when analyzing SMMs with regard to handoff protocols. Items that are passed during the handoff can be grouped together according to relatedness. Terms like “patient identity” and “name” would be placed close together by participants and most likely assigned a “closeness” rating of 1, because the terms are synonymous. However, terms such as “blood pressure” and “temperature” would be placed near each other because they are commonly understood to be vital signs, but would not have a high closeness rating because they are independent concepts.
Although Pathfinder represents the similarity of perceived relatedness, it does not depict causality of the perceived content. 60 Much like other measurement tools of mental models, there are prescribed concepts that are to be compared between individuals; therefore, Pathfinder may not be applicable to all scenarios. 60 Pathfinder differs from other assessment techniques because it represents better comparisons of judgments in each structure. 61
Advantages of using Pathfinder include the ability to compare the similarity of concepts of different networks through an algorithm, reducing bias present when scored by people. It organizes the concepts through networks or trees, making it easier to use and understand. 7 It also provides a more predictive advantage over other measurement tools because it exemplifies the comprehensive relationships of concepts rather than the direct relationships. 62 Pathfinder’s global approach, which analyzes similarity between networks, has been recognized as more reliable and valid over MDS or simple representation models.62–64
Pathfinder’s simplicity, conceptual approach, and overall comprehensive technique has been applied in a broad range of industries, including: aviation, human–computer interaction, healthcare, and education.5,7,65–67 There are also disadvantages of the Pathfinder technique in measurement of SMMs. Pathfinder does not adequately provide semantic information behind the nodes or concepts, rather it just provides the general ideas. 7 Therefore, the causality is not known between the nodes but just understood that there is similarity. 60 The Pathfinder technique is also very repetitive in nature for the rating of concepts; therefore, there is a possibility of the responses to lack continuous reliability or validity. 7
Overall, there is a lack of recent published research or work for Pathfinder and mental model research. Despite this lack of published work, the recommended use of Pathfinder is for individuals working interdependently under time pressure in a dynamic setting where performance influences serve as an input and output, which lends itself well to healthcare.7,68
Conclusion
This paper reviews multiple techniques, each with their own advantages and disadvantages that could be applied to better understanding of SMMs in the healthcare context. Although it is well established that SMMs are important for excellent teamwork, little research in healthcare effectively assesses SMMs and their relationship to relevant healthcare outcomes such as patient safety. Therefore, we aimed to provide a basis for future research in this area through summarizing the current practices used to assess SMMs and the various ways they could each be utilized in the healthcare setting. A summary of the techniques along with their advantages and disadvantages is presented in Table 3.
Summary of SMM measurement techniques.
Those wishing to conduct SMM research within the healthcare industry must take into consideration the variety of factors affecting the appropriate selection of method(s) used. Time pressures, availability of resources and technology, and access to participants can severely affect the ability to implement some of these assessment methods. Some of these methods depend on the use of software, requiring financial and budget approval/support. For others, it may be difficult to obtain time with participants to complete a cognitive interview or a lengthy pairwise rating session. In this instance, a card sort may be the easiest solution. While it is the recommendation of the authors to pair qualitative and quantitative methods of analysis in order to provide a robust and rigorous analysis of conceptual relationships, the previously mentioned factors and others can prevent this possibility of using multiple measures. Researchers must select the most appropriate measurement technique for each unique healthcare setting and study goal. A summary of assessment techniques in relation to possible setting constraints is presented in Table 4.
Preferred measurement techniques based on provider circumstance.
Easy – Can perform this technique with minimal training.
Medium – Simple mathematical understanding of the concept or software needed for analysis.
Hard – Complex mathematics required or use of software required.
Even though SMMs are believed to be an important theoretical aspect of clinical decision-making and performance, 69 there is insufficient empirical assessment of SMMs in healthcare. This paper provides a comprehensive list of assessment techniques, strengths and weakness of each technique, and a foundation for clinicians and researchers alike to utilize the correct assessment techniques for their specific healthcare setting. Future work will need to address techniques that provide empirical support for the relationship between SMMs and the complex outcomes associated with healthcare work.
Footnotes
Application
Although mental models are often mentioned as an important component of teamwork in medicine, they are measured infrequently in the healthcare environment. This article provides guidance for those interested in measuring shared mental models in healthcare.
Precis
This article reviews multiple methods for measuring shared mental models and applies them to the context of healthcare teamwork.
Authors’ contribution
Logan Gisick: Drafting of manuscript and critical revision; Kristen Webster: Conception and design, drafting of manuscript, critical revision; Joseph Keebler: Conception and design, drafting of manuscript, critical revision, guarantor; Elizabeth Lazzara: Conception and design, drafting of manuscript, critical revision; Sarah Fouquet: Drafting of manuscript and critical revision; Keaton Fletcher: Drafting of manuscript and critical revision; Agnes Fagerlund: Drafting of manuscript; Victoria Lew: Drafting of manuscript; Raymond Chan: Drafting of manuscript and critical revision.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
