Abstract
To what extent do people achieve accuracy in judging others’ situations? Based on interpersonal perception models, we propose that ex situ raters may attain accuracy by judging the psychological characteristics of a situation that in situ raters have experienced according to a normative and distinctive characteristics profile. Biesanz’ social accuracy model (SAM) provides a flexible crossed-effects random coefficient modeling framework that can be applied to situation perception data. By targeting characteristics profiles with the analytical unit of the ex situ rater-situation dyad, the extent of and variation in normative and distinctive accuracy of ex situ raters can be estimated and explained by personality correlates to quantify “the good ex situ rater.” We demonstrate an SAM approach to situational accuracy with real in situ and ex situ data (402 ex situ raters judged 10 situations on 8 characteristics) and sketch future research.
Keywords
People form impressions of situations as if they were real, coherent entities (Cantor, Mischel, & Schwartz, 1982; Edwards & Templeton, 2005; Forgas, 1976; Magnusson, 1981; Rauthmann, 2012; Rauthmann, Sherman, & Funder, 2015a, 2015b; Serfass & Sherman, 2013; Sherman, Nave, & Funder, 2012, 2013). Thus, situation perception may follow principles similar to person perception (Nystedt, 1972a, 1972b, 1981; Rauthmann, 2012; Rauthmann et al., 2014). Attending to situation perception is important because knowing others’ situations can help understand and predict their behavior better. Thus, a crucial question is how good people actually are at deciphering others’ situations. Further, most research interested in the situation as the unit of analysis or as a moderating variable will need explicit measurements of situations (Rauthmann et al., 2015a), which will most often be ratings of the psychological characteristics of target situations from different sources (e.g., in situ raters: people in and affected by the situation; ex situ raters: neither present nor affected) and thus essentially tap perceptions. This opens up the door to investigate the agreement between different kinds of situation raters (Rauthmann & Sherman, 2017).
This work serves to deepen our understanding of situational accuracy as the extent to which ex situ raters can accurately judge the characteristics of situations from in situ raters with only minimal information available. Specifically, we aim to address three questions within a flexible multilevel modeling (MLM) approach: How accurately can people generally judge others’ situations? How strong is normative and distinctive situational accuracy? Which broad personality traits are associated with individual differences in being normatively and distinctively accurate?
Situation Perception
Person and Situation Perception
Concepts and methods of interpersonal perception and person(ality) judgment literature can also be applied to situation perception (Nystedt, 1981; Rauthmann, 2012; Rauthmann & Sherman, 2017; Rauthmann et al., 2015a). Nonetheless, situation perception is in key respects different from person perception. First, perceptions are only unidirectional (i.e., no reciprocity as in interpersonal perceptions; Kenny, 1994). Second, situation perceptions could fluctuate more as situations are ever-changing, dynamic, and fleeting (in contrast to persons as lasting, physical beings). Third, situations cannot rate themselves, so raters are required to assess their psychological characteristics. This latter point invites the question of reality: To what extent is a situation a “real thing?” To cope with this question, Rauthmann, Sherman, and Funder (2015a) proposed three principles of psychological situation research: A psychologically relevant situation (1) only “exists” if at least one person processes it, (2) is grounded in three types of reality (physical, social, and personal), and (3) should be measured from different perspectives (i.e., in situ and ex situ raters). These principles require a better understanding of how in situ and ex situ raters agree in their perceptions of the same situation.
Situation Characteristics
People pervasively process environmental cues and form psychological situation representations imbued with meaning and interpretations (Argyle, Furnham, & Graham, 1981; Block & Block, 1981; Magnusson, 1981; Rauthmann et al., 2015a, 2015b). Psychological situations can be described with situation characteristics similarly to how persons can be described with traits (de Raad, 2004; Edwards & Templeton, 2005; Rauthmann et al., 2014). Rauthmann et al. (2015a, 2015b) argued that situation research should proceed in a variable-oriented way by using continuous dimensions of characteristics. For example, situations are assessed by asking participants to rate the extent that a certain characteristic applied (e.g., work has to be done) rather than just list cues (e.g., books lying on the desk) or classify the situation (e.g., work situation). Thus, we obtain data on what situations mean to people. This work concerns to what extent people agree in their assessments of situations’ characteristics.
Recently, Rauthmann et al. (2014) proposed to taxonomize situation characteristics into eight major domains, the Situational Eight DIAMONDS (Duty: Does work need to be done? Intellect: Is deep thinking required? Adversity: Is someone threatened? Mating: Is the situation sexually/romantically charged? pOsitivity: Is the situation enjoyable? Negativity: Could negative feelings ensue? Deception: Is mistrust an issue? Sociality: Can meaningful social interaction and relationships develop?). These eight domains, integrating most previously identified characteristic dimensions into a common framework and language, have been shown to be useful in understanding how people’s everyday situations look like (Brown & Rauthmann, 2016; Serfass & Sherman, 2015) and how personality, situations, and behavior work together (Rauthmann, 2016; Rauthmann, Sherman, Nave, & Funder, 2015c; Rauthmann et al., 2016; Rauthmann & Sherman, 2016a; Sherman, Rauthmann, Brown, Serfass, & Jones, 2015). Thus, we deem the Situational Eight a good starting point to examine situational accuracy.
Accuracy in Judging Situation Characteristics
When judging others’ situations, there can be two accuracy criteria: capturing what the typical situation is like (= normative profile) and what makes the judged situation unique (= distinctive profile as the deviation from the norm). People may be accurate with regard to none, both, or only one of these. This means we should distinguish between normative and distinctive accuracy (Biesanz, 2010; Furr, 2008), although achieving any form of situational accuracy is important in daily life. To understand others, predict their behavior, and coordinate own behavior with others, it is paramount to accurately judge others’ situations—both in terms of what situations are generally like (normative accuracy) and what makes a specific situation stand out from the average situation (distinctive accuracy). Being normatively accurate may aid navigating typical or recurrent social situations, while being distinctively accurate could aid understanding specific situations. Having distinctive situation knowledge (Sherman et al., 2012; Serfass & Sherman, 2013) may, in turn, translate into better perspective-taking skills and more empathy or at least underlie them. Thus, understanding situational accuracy is an important endeavor.
To date, however, there is only little direct research on situational accuracy. First, Rauthmann et al. (2014) had ex situ raters read brief vignettes describing the situations of in situ raters (e.g., “Going shopping with my boyfriend”), while both in situ and ex situ raters rated the situations on the DIAMONDS. The average agreement hovered around r = .50, which is quite sizable given that ex situ raters had only limited written information on in situ raters’ situations available. However, this study cannot tell us whether ex situ raters were normatively and/or distinctively accurate.
Second, Rauthmann and Sherman (2017) outlined how situational accuracy could be studied using different variance decompositions (Biesanz, 2010; Cronbach, 1955; Jussim, 2005; Kenny, 1994; Kenny, West, Malloy, & Albright, 2006). Despite their appeal, the showcased decomposition techniques cannot clearly distinguish between normative versus distinctive accuracy (Biesanz, 2010) and extract individual differences therein. For example, most techniques require piecemeal procedures to examine personality correlates of situational accuracy (i.e., effect scores need to be extracted first, which are then correlated with personality variables). This work employs an MLM approach by Biesanz (2010) to disentangle normative from distinctive situational accuracy and flexibly incorporate personality moderators.
A Social Accuracy Model (SAM) Approach to Situation Perceptions
Common analysis of variance-inspired decomposition techniques usually require multistep approaches (e.g., extracting effects, correlating them, etc.). As Biesanz (2010, p. 858) noted, such a “two-stage modeling approach is inelegant, inefficient, requires a complete and balanced design, and is restrictive in the questions that it allows to be asked,” while “through the use of [MLM], the entire analysis can be placed within a single model that will allow a richer set of substantively important questions to be addressed.” MLM thus offers several advantages. First, it can be used to analyze the entire data with robust estimations, handling missing values effectively. Second, ex situ raters’ personality traits may be introduced simultaneously into one model in higher order levels. Third, common decomposition techniques can be readily translated into MLM (e.g., Gelman, 2006; Hoffman & Rovine, 2007; Kenny et al., 2006). Lastly, variations in stimuli—here situations—can also be explicitly modeled by treating situations as random factors (Judd, Westfall, & Kenny, 2012).
To date, the most sophisticated MLM approach to address accuracy questions is Biesanz’ (2010) SAM. SAM is particularly elegant because it disentangles normative and distinctive accuracy in a straightforward manner. It embraces and reconciles the Cronbachian tradition, concerned with distinguishing normative from distinctive accuracy, and Kenny’s social relations model tradition, concerned with relations between perceivers and targets. Applying SAM to situation perception data, let the index p represents the pth perceiver, s the sth situation, and c the cth characteristic:
The variables as well as fixed and random effects in this crossed-random effects MLM are summarized in Table 1 (see also Biesanz, 2010, pp. 866–868). The basic rationale is that for each rater-situation dyad across all characteristics, ex situ ratings are predicted from the in situ criterion scores as well as the norm values of all characteristics (= typical/average situation: how situations are generally rated on all characteristics sampled). Thus, profile relationships across characteristics are focused on, and for these there are two kinds of accuracy (Equation 1). The parameter β1ps provides an estimate of distinctive accuracy: How ex situ ratings uniquely capture in situ ratings, controlling for the normative profile. The parameter β2ps provides an estimate of normative accuracy: How ex situ ratings capture the normative profile (controlled for in situ ratings).
Overview of Multilevel Variables and Parameters for Situation Perception Data.
Note. Perceiver = ex situ rater. See more information in Biesanz (2010).
The random effects of normative and distinctive accuracy for ex situ raters and situations, respectively, point toward differences in “the good ex situ rater” and “the good situation” (for personality judgment analogs, see Funder, 1995, 1999): Ex situ raters may attain more or less normative or distinctive accuracy, respectively (i.e., perceptive accuracy; Biesanz, 2010), and situations may be judged with more or less normative or distinctive accuracy, respectively (i.e., expressive accuracy; Biesanz, 2010). These relations are summarized in Table 2.
The “Good Ex Situ Rater” and the “Good Situation”.
Note. Adapted from Biesanz (2010, Figure 4, p. 864).
Going even further (Equation 1.1), we can attempt to explain the individual differences in perceptive accuracy. 1 Specifically, we can model to what extent personality traits of ex situ raters moderate distinctive (β11) and normative perceptive accuracy (β21). Such analyses address the traits of who is a “good” (= more accurate) or “bad” (= less accurate) ex situ rater.
Current Work
Aims
This work serves three main aims in demonstrating how an MLM implementation can be fruitfully applied to situation perception data. In doing so, we focus on Biesanz’ SAM approach because it readily grants examining the magnitude of normative and distinctive situational accuracy. First, we aim to uncover how well ex situ raters can generally judge in situ raters’ situations (impressionistic accuracy). Second, we aim to disentangle normative from distinctive situational accuracy. Third, we seek to identify trait correlates in being normatively and distinctively accurate as, within SAM, personality moderators can be added to explain individual differences in perceptive accuracy.
Hypotheses
First, we expected both normative and distinctive accuracy to be sizable, although the former should be higher than the latter (e.g., Biesanz, 2010). Second, we expected sizable individual differences in perceptive accuracy: Ex situ raters should vary in their accuracies. Substantial (and meaningful) variance in perceptive accuracy is a prerequisite to introducing personality traits as predictors (or cross-level moderators, see Note 1). Third, we expected that broad personality traits, such as the Big Five, would explain perceiver slope variances. However, we did not form any clear a priori hypotheses on which Big Five traits would be positively or negatively associated with normative and distinctive accuracy, respectively. The lone exception was Neuroticism, a trait associated with higher vigilance toward potentially negative (or ambiguous) stimuli and negative biases (e.g., Hirsh & Inzlicht, 2008; Robinson, Ode, Moeller, & Goetz, 2007) so that situations may be processed more negatively than they are which may entail less accuracy in judging situations.
Method
Participants
The data used here were detailed in Rauthmann and Sherman (2016b, 2016c). Further, Rauthmann and Sherman (2017) performed various variance decomposition techniques on the data to tease apart different forms of accuracy. However, the current SAM analyses are novel and provide unique insights into situational accuracy and possible personality correlates. The Online Supplemental Materials contain all data and R codes to reproduce findings (see also osf.io/qgu6h).
In situ raters
In situ criterion data were gathered online from a German sample of N = 547 participants (407 women, 140 men; age: M = 28.01, SD = 10.47, range: 15–77 years). Participants were first asked to think about the situation they were in 24 hours earlier and then answer five questions in an open-text field: What was happening? Who was with you? What were you doing? Where were you? What time was it (approximately)? Next, participants rated their situation on the S8-I (as well as some other measures not of relevance here). From the total pool of 547 situations, we selected 10 target situations (based on the criterion to have different DIAMONDS profiles and reflect situations people could commonly experience) to be presented to a second sample, from which we obtained ex situ ratings. Table 3 shows the 10 criterion situations, along with their raw in situ ratings on the S8-I items.
Criterion Data: Target-situations and Their In Situ Ratings.
Note. D = Duty: work has to be done; I = Intellect: deep thinking is required; A = Adversity: somebody is being threatened, accused, or criticized; M = Mating: potential romantic partners are present; O = pOsitivity: the situation is pleasant; N = Negativity: the situation contains negative feelings (e.g., stress, anxiety, guilt, etc.); D = Deception: somebody is being deceived; S = Sociality: social interactions are possible or required. Situations 1–8 were selected specifically because of their high ratings on one Situational Eight DIAMONDS dimension (see gray-shaded cells). Situations 9 and 10 were not selected on any specific basis except for being mundane, everyday situations. English translations of the German vignettes are given. The Situational Eight DIAMONDS dimensions were measured with the S8-I (Rauthmann & Sherman, 2015b).
Ex situ raters
Ex situ ratings came from N = 404 participants (300 women, 104 men; age: M = 25.2, SD = 5.51, range: 17–51 years) who rated each of the 10 situations in Table 3 on the same S8-I items as in the in situ data. Additionally, these participants completed a Big Five questionnaire.
Measures
Situation characteristics
Situations were rated in situ and ex situ on the S8-I with 1 item per DIAMONDS characteristic (Rauthmann & Sherman, 2016b), using a 7-point Likert-type scale (1 = not at all, 7 = totally). The items can be found in Table 3. The S8-I has favorable psychometric properties (Rauthmann & Sherman, 2016b) and is useful for answering substantive research questions (Sherman et al., 2015).
Personality traits
Ex situ raters’ Big Five traits were assessed with the 16-item Big Five Inventory (BFI-S16; Gerlitz & Schupp, 2005; Lang, 2005), using a 7-point Likert-type scale (1 = does not apply at all to me, 7 = applies totally to me). We obtained trait scores for openness (M = 5.50, SD = 0.95, α = .62), conscientiousness (M = 4.96, SD = 1.12, α = .68), extraversion (M = 4.65, SD = 1.32, α = .81), agreeableness (M = 5.21, SD = 1.01, α = .46), and neuroticism (M = 4.31, SD = 1.31, α = .74).
Results
SAM analyses were conducted on 32,320 observations consisting of 404 perceivers’ ex situ ratings of 10 situations on 8 characteristics. The ex situ ratings were made on a 7-point Likert-type scale from 1 to 7, with M = 3.58 (SD = 2.38). We within-characteristic centered the in situ criterion ratings on the eight characteristics by subtracting out the characteristics norm ratings on those characteristics (J. Biesanz, personal communication, March 18, 2014). These normative ratings stem from the total in situ sample (Rauthmann & Sherman, 2016b, 2016c) with the following means (also on a 7-point Likert-type scale): Duty = 3.60, Intellect = 3.48, Adversity = 1.58, Mating = 2.56, pOsitivity = 4.91, Negativity = 2.71, Deception = 1.47, and Sociality = 4.85. The norm values and personality scores were grand mean centered. 2
In total, we estimated three models: Model 1 predicted ex situ ratings from in situ ratings (Aim 1), Model 2 predicted ex situ ratings from in situ ratings and normative values (Aim 2), and Model 3 added perceivers’ personality traits to Model 2 as moderators (Aim 3). Findings from Model 3 are summarized in Table 4 and random effects of Models 1–3 in Table 5. 3
Multilevel Regression Estimates for Model 3.
Note. N = 402, N
obs = 32,160. Estimate = effect sizes of slopes; SE = standard error. 95% Confidence intervals (CIs) are based on 500 bootstrapped simulations. d = standardized effect size estimates, computed as
aThese estimates correspond to main effects. Because the intercept is the person’s average rating across all situations and all characteristics, personality predictors of individual differences in scale usage are captured. More open and neurotic ex situ raters had higher than average ratings and those more conscientious and agreeable had lower than average ratings. bThese estimates correspond to cross-level interaction effects.
*p < .05, **p < .01, ***p < .001, using estimate/SE as t-statistic on 384 degrees of freedom.
Summary of Random Effects.
Note. N = 402, N obs = 32,160. SD = standard deviation. See text for details on Models 1–3. Perceivers = ex situ raters.
Model 1: Impressionistic Accuracy
Model 1 examined the overall effect of accuracy (i.e., impressionistic accuracy) by predicting ex situ ratings only from the in situ data. Intercepts and slopes were allowed to vary across perceivers and across situations (i.e., different perceivers and different situations could vary in their accuracy levels). There was a statistically significant fixed effect (b = 0.52, 95% confidence interval [CI] = [0.23, 0.80], t = 3.40) indicating that, on average, ex situ raters accurately judged the pattern of characteristics of the situations.
The SDs of perceiver effects were 0.30 [0.27, 0.33] for intercepts and 0.05 [0.04, 0.07] for slopes. The SDs of situation effects were .72 [0.42, 1.06] for intercepts and 0.48 [0.27, 0.74] for slopes. The residual SD was 1.91. Overall, this indicates that there was sizable variation in accuracies between situations, while the variation in perceiver accuracies was not as large (see Figure 1). This indicates that there may be more differences in being a “good situation” (i.e., situations that are generally judged with more accuracy) than being a “good ex situ rater” (i.e., ex situ raters that generally judge situations with more accuracy).

Slope density distributions for Model 1.
Model 2: Normative and Distinctive Accuracy
Model 2 added the normative characteristics profile as both a fixed- and random-effects predictor of ex situ ratings. Including the normative profile allows simultaneously estimating normative and distinctive accuracy (Biesanz, 2010; Cronbach, 1955; Furr, 2009). By estimating them as random effects, we allowed both normative and distinctive accuracy slopes to vary across ex situ raters and across situations. The results showed statistically significant effects of both normative (b = 0.88 [0.61, 1.15], t = 6.56) and distinctive (b = 0.49 [0.31, 0.68], t = 5.82) accuracy. Thus, on average, perceivers were both normatively and distinctively accurate in judging the characteristics profiles of situations.
The SDs of the perceiver effects were 0.33 [0.30, 0.36] for intercepts, 0.19 [0.17, 0.21] for normative accuracy slopes, and .08 [0.07, 0.09] for distinctive accuracy slopes. The SDs of situation effects were 0.51 [0.29, 0.75] for intercepts, 0.42 [0.24, 0.57] for normative accuracy slopes, and 0.27 [0.15, 0.41] for distinctive accuracy slopes. Thus, there was substantial variation in both normative and distinctive accuracy between situations, with smaller but nontrivial variation in normative and distinctive accuracy between ex situ raters (see Figure 2).

Slope density distributions for Model 2.
Model 3: Personality Correlates
Model 3 examined Big Five traits as moderators of normative and distinctive accuracy. Centered scores of the personality variables and their interaction terms were added to Model 2 as fixed effects. The results from this analysis are summarized in Table 4 (with random effects in Table 5). As can be seen, the effects of normative accuracy were indeed associated with ex situ raters’ personality: Those higher on conscientiousness, extraversion, and agreeableness and lower on neuroticism were more normatively accurate, and those higher on neuroticism more distinctively accurate.
Discussion
This work used SAM analyses to disentangle normative from distinctive situational accuracy and additionally examine personality moderators. First, in line with our expectations, we found both substantial normative and distinctive accuracy, and the former was stronger in all three estimated models. Second, the variances in normative and distinctive perceptive accuracy were not as sizable as those of normative and distinctive expressive accuracy. Thus, on average, there were only small differences between ex situ raters in their accuracy levels (Figures 1 and 2). Because of the relatively large sample size of ex situ raters, we could nonetheless associate those small differences with the Big Five. Conscientiousness, extraversion, agreeableness, and neuroticism emerged as significant predictors, but not openness. Additionally, these traits predicted mostly only normative perceptive accuracy.
Magnitude of Accuracy
How strong was accuracy? This question can be answered by comparing the unstandardized effect estimates found here to those from the person perception literature. First, the level of normative accuracy (0.88) was similar to those found in person perception literature, while the level of distinctive accuracy (0.49) was higher (i.e., often around 0.10–0.30; e.g., Biesanz & Human, 2010; Biesanz et al., 2011; Chan, Rogers, Parisotto, & Biesanz, 2011; Human & Biesanz, 2011, 2012; Lorenzo, Biesanz, & Human, 2010). Second, because standardized effect sizes are somewhat problematic for Level 1 effects in MLM, we only report unstandardized regression coefficients for effects at this level (as is common in SAM analyses). The following is intended to aid in interpretation of these effects. The intercept is the average rating made by all raters across all situations and across all characteristics. The distinctive coefficient means that a 1-point increase in the in situ ratings yielded a 0.49 increase in the ex situ ratings (across all characteristics), controlling for all other fixed effects in the model (which includes the normative profile). In other words, the 0.49 value is an index of the average ex situ rater’s sensitivity to changes in the criterion (in situ), where 1.00 would be perfect correspondence. Again, this effect is across all situations and all characteristics. The normative coefficient means that a 1-point increase in the average situation (in situ ratings) yielded a 0.88 increase in ex situ ratings (again, across all situations and characteristics). This is close to perfect correspondence. 4 Taken together, the accuracy levels found are actually quite high, especially considering that ex situ raters had only limited information available (Table 3).
Interindividual and Intersituational Differences
Differences between persons
We did not find strong individual differences in perceptive accuracy despite a relatively large sample that should have created sufficient variation. This may be the case because people are, on average, accurate in judging others’ situations—and there is little (but still nonnegligible) room for variation. After all, it is adaptive to perceive situations as most other people do because this enables joint communication and coordination in a socially shared reality (Miller, 2007; Rauthmann et al., 2015a). Indeed, person(ality) perception literature also finds small variation in perceptive accuracy (Biesanz, 2010; Kenny, 1994), explaining why it has been so difficult to identify “the good judge.” Small perceptive accuracy differences may thus generalize across judging persons and situations.
Nonetheless, some differences between persons still emerged. For example, interindividual differences in normative (but not distinctive) perceptive accuracy could be explained by the Big Five (except for openness), with small- to medium-effect sizes (see d in Table 4). On the other hand, only (high) neuroticism emerged as a predictor of distinctive perceptive accuracy. One interpretation of this pattern of findings could be that those with a more normative personality profile (i.e., high conscientiousness, extraversion, and agreeableness, with low neuroticism) tend to use situational normativeness to achieve accuracy. This is similar to findings in the person(ality) perception literature indicating that well-adjusted individuals tend to be accurate perceivers of what others are generally like (Human & Biesanz, 2011). In contrast, those with more nonnormative personalities (i.e., high neuroticism) may tend to be more distinctively accurate, probably because they deviate more from the normative profile. Interestingly, this improvement in distinctive accuracy for those high on neuroticism is more than negated by the losses in normative accuracy (i.e., 0.01 vs. −0.02). However, individuals high in neuroticism may also be better in distinctive cue detection due to their hypersensitivity and vigilance (Allen & Badcock, 2003). Guillaume et al. (2016) found that situations around the world were typically social and mildly pleasant. Thus, as negative situations seem less normative, distinctive accuracy could be achieved by a stronger focus on negative aspects.
Together, our data suggest that personality differences may matter most when using normative situation knowledge. This is intriguing because traditionally unique and distinctive patterns of perception have been associated with personality (Sherman et al., 2012). However, such research only concerned how people (uniquely) perceive situations and not whether those perceptions align with any criterion variables. Thus, our findings provide novel insights into situational accuracy.
Differences between situations
An interesting finding is the relatively large variance in situations’ expressive accuracy: There were wide intersituational differences in being judged normatively and distinctively accurately. This finding, again, corresponds to person(ality) perception literature which also find levels of expressive accuracy larger than perceptive accuracy (Human & Biesanz, 2013). We insert the caveat, however, that we only had a limited set of situations that were not selected to be homogeneous because we wanted meaningful variance. Nonetheless, it is striking how much more situation variance there was than rater variance. If we had sampled more situations and had other information available (e.g., physicobiological cues of situations; “style” characteristics such as base rate or situation strength; situation class membership), we could also attempt to explain intersituational differences in accuracy slopes the same way as interindividual accuracy differences were explained here by traits. Examining explanatory variables of “the good (or bad) situation” as moderators of expressive accuracy may be a fruitful direction for future research.
A Note on Statistical Modeling
We have demonstrated how SAM can be applied to situation perception data. It may not have escaped the notice of those initiated in MLM that other data-analytical choices could have been made. The accuracies uncovered reside at the ex situ rater and situation level. However, one could also quantify accuracies at other levels. First, running a model where we estimate random intercepts and slopes for characteristics and for situations (i.e., not for raters) yields accuracies at the level of situations and characteristics. In other words, for every situation separately the ex situ profile is predicted from the in situ profile. So for each situation, this regression would be based on 404 raters × 8 characteristics = 3,232 pairs of scores (across all raters and characteristics). Additionally, because we have a cross-classified MLM, for each characteristic we get a regression based on 404 raters × 10 situations = 4,040 pairs of scores across all situations and perceivers.
Second, as another alternative, we could estimate random intercepts and slopes for perceivers and characteristics, yielding accuracies at the level of raters and characteristics. Thus, for each rater, we get a regression based on 10 situations × 8 characteristics = 80 pairs of scores, and for each characteristic a regression based on 404 raters × 10 situations = 4,400 pairs of scores. Findings of these models are compiled in the Online Supplemental Materials (together with R code) for interested readers.
Although these models can make interesting sense of the data (e.g., looking at normative and distinctive characteristic accuracy), they are conceptually and statistically different from SAM (J. Biesanz, personal communication). In SAM, characteristics are fixed, while in these other analyses they are random. These different sets of analyses create the problem that we cannot include moderators of normative accuracies, though distinctive accuracies are not affected. Additionally, cross-random effects need to be orthogonal, so we are not able to model the effect of characteristics changing by situation and/or rater. Hence, SAM represents a more flexible modeling approach. For the sake of completeness and to stimulate future research on MLM implementations in situational accuracy studies, we wanted to alert to different but nonequivalent modeling procedures here.
Limitations and Future Directions
The limitations of this work point toward future research that is needed to overcome them. First, we have employed a relatively limited set of target situations and characteristics which may suggest lack of stimulus sampling (Judd et al., 2012; Wells & Windschitl, 1999; cf. Biesanz, 2010). This was mainly the case to reduce participant burden. However, given that ex situ raters’ variance in accuracy estimates was not too high, future research could use fewer ex situ raters, but have them rate more situations on more characteristics (e.g., Situational Eight on 32 items: Rauthmann et al., 2014; on 24 items: Rauthmann & Sherman, 2016c; on 89 items from the Riverside Situational Q-sort: Sherman et al., 2010; Wagerman & Funder, 2009). If more situations are available, moderators of expressive situational accuracy can be examined.
Second, we used only one normative profile for the “average situation.” However, the average DIAMONDS profile may be different for different classes of situations (e.g., van Heck’s, 1984, 1989 10 types of situations). It did not make sense to distinguish different classes within our sampled situations, but future research may seek to sample different situations within different classes (and possibly derive normative profiles for each class). It will be interesting to examine accuracy slopes for different classes and whether some people show high perceptive accuracy across all or only within specific classes.
Third, future research could sample several situations per in situ rater (we used just one per person), preferably a representative set of situations in daily life. This allows examining idiosyncrasies in in situ ratings and to what extent ex situ raters might pick up on those.
Lastly, future research could use traits other than the Big Five, such as empathy, perspective-taking ability, and socioemotional competencies to explain interindividual accuracy differences. These may be more closely tied to accurately judging distinctive situation profiles and thus increase the chance of predicting distinctive perceptive accuracy.
Conclusion
We demonstrated that person perception models, such as SAM, may be fruitfully applied to situation perception data, making it possible to examine normative and distinctive accuracies concerning ex situ raters (perceptive accuracy) and situations (expressive accuracy)— a distinction that has so far not been made in situations literature. This work thus provides a first window into how strong normative and distinctive situational accuracy are and which person variables may explain them. These findings may be important for research seeking to understand the basis of good perspective-taking skills.
Footnotes
Acknowledgments
We thank Jeremy Biesanz for valuable advice on data-analytical issues concerning SAM.
Author Note
All findings and R codes as well as additional analyses can be found openly at osf.io/qgu6h as well as in the Online Supplemental Materials.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
The supplemental material is available in the online version of the article.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
