Event-Based Conformity Versus Regression to the Mean: A Comment on Kim and Hommel (2015)

Abstract

Kim and Hommel (2015) provided an intriguing alternative explanation for conformity effects. Building on the theory of event coding (TEC; Hommel, 2009; Hommel, Müsseler, Aschersleben, & Prinz, 2001), they assumed that one’s own and others’ actions are represented in comparable ways, so that people may fail to distinguish between those two action categories. As a consequence, “people’s actions that have no social meaning should induce conformity effects” (p. 484).

Building on a paradigm previously used to investigate social conformity (Klucharev, Hytönen, Rijpkema, Smidts, & Fernández, 2009; Shestakova et al., 2013), Kim and Hommel tested their hypothesis in three experiments involving facial-beauty ratings. In a first block, participants rated the beauty of 220 faces on a scale from 1 to 8 using a computer keyboard. After each rating, participants were presented with an intervening event. Participants saw either a static slide showing a number between 1 and 8 or a short movie in which another person pressed the respective number key (1–8) on a computer keyboard. These numbers were equal to, 2 to 3 points lower than, or 2 to 3 points higher than the rating given by the participant (equal, lower, and higher conditions, respectively). In a second block, all faces were again rated on the same scale from 1 to 8. The hypothesis was that numbers lower than the initial rating should lead to a negativity shift, whereas numbers higher than the initial rating should lead to a positivity shift. The authors present support for this across all three experiments.

Critically, the paradigm confounded extremity of initial ratings and assignment of experimental condition (equal vs. lower vs. higher). Since the rating scale was restricted from 1 to 8, experimental condition could not be assigned independently of initial ratings. For example, it was not possible to assign an initial rating of 7 to the higher condition, but only to the equal or lower condition. Similarly, low ratings could not be assigned to the lower condition, but only to the equal or higher condition. As a result, the higher condition occurred for low (and moderate) but not for high values, and the lower condition occurred for high (and moderate) but not for low values. Figure 1 shows the distribution of experimental conditions across initial ratings and experiments in Kim and Hommel’s study.

Fig. 1.

Distribution of experimental conditions (trial type) across initial ratings in the three experiments reported in Kim and Hommel (2015). Note that while trials for initial ratings of 3 and 6 look as though experimental conditions are relatively equally distributed, they are still problematic because only a difference of 2 (not 3) toward the nearest scale end point can be shown for these trials.

This confound is problematic because regression to the mean (Campbell & Kenny, 1999; Galton, 1886) dictates that extreme values on a variable X (e.g., first measurement) will be closer to the mean on any variable Y (e.g., second measurement) when X and Y are less than perfectly correlated. While error is on average randomly distributed across all measurements of X, high values of X will on average have a more positive error, whereas low values of X will have a more negative error. In other words, the measured values of X correlate with the measurement error of X. Because the measurement error of Y is independent of the measurement error of X, any extreme value in X will be less extreme in Y. See Shanks (2017) for an extensive discussion on this topic.

Applied to Kim and Hommel’s data, this means that we would expect initially high values to be lower in the second measurement and initially low values to be higher in the second measurement. Regression to the mean combined with the confound described above can create the original pattern of results without assuming any psychological processes.

Reanalysis of the Original Data

Kim and Hommel provided us with the original data files, allowing us to analyze the extent to which there were effects beyond regression to the mean. We present two types of analyses, for all of which change from the initial rating to the final rating was the criterion. These are multilevel analyses using random intercepts for participants and stimuli (Judd, Westfall, & Kenny, 2012). Note that these analyses are on a trial level.

First, we ran analyses similar to the original (confounded) analyses on each data set, using trial type (lower vs. equal vs. higher, coded as −1, 0, and 1, respectively)¹ and, if appropriate, type of feedback (coded as −1 vs. 1) as predictors (Analysis 1). Second, we reran these analyses, this time using only the initially neutral trials (initial rating of 4 or 5). For these trials, there is no confound, and effects found here cannot be attributed to regression to the mean (Analysis 2).

Results

Results of the analyses are presented in Table 1. Note that for the sake of brevity, we do not present main effects of feedback type. Full results and additional detailed analyses can be found in the Supplemental Material available online. As expected, when not controlling for the confound (Analysis 1), we found reliable effects of trial type (lower vs. equal vs. higher) across all experiments and conditions. Hence, differences in the following analyses are not attributable to the multilevel approach. Because of the interaction of trial type and feedback type in Experiment 2, we present separate analyses for both feedback-type conditions.

Table 1.

Reanalysis of the Original Data From Kim and Hommel’s (2015) Study

Experiment and intervening event	Analysis 1: confounded;change ~ TT (× FT); full data sets	Analysis 2: corrected; change ~ TT (× FT); only initially neutral trials
Experiment 1	TT: b = 0.212, SE = 0.025, t(2891.60) = 8.393, p < .001	TT: b = 0.089, SE = 0.032, t(1333.50) = 2.786, p = .005 (smaller effect)
Experiment 2
Overall	TT: b = 0.187, SE = 0.020, t(5870.00) = 8.992, p < .001;TT × FT: b = −0.103, SE = 0.020, t(5859.00) = −4.959, p < .001	TT: b = −0.030, SE = 0.027, t(2156.40) = −1.104, p = .269 (no effect);TT × FT: b = −0.085, SE = 0.027, t(2166.20) = −3.123, p = .002
Movie	TT: b = 0.289, SE = 0.032, t(2923.30) = 8.997, p < .001	TT: b = 0.055, SE = 0.042, t(990.00) = 1.285, p = .199 (no effect)
Number	TT: b = 0.090, SE = 0.026, t(2944.40) = 3.395, p < .001	TT: b = −0.118, SE = 0.035, t(1196.20) = −3.373, p < .001 (reversed effect)
Experiment 3	TT: b = 0.254, SE = 0.030, t(2932.70) = 8.274, p < .001	TT: b = −0.009, SE = 0.044, t(1020.80) = −0.203, p = .839 (no effect)

Note: TT = main effect of trial type (lower vs. equal vs. higher), TT × FT = interaction of trial type and feedback type (movie vs. number).

Critically, the majority of corrected analyses did not show the original effect. Out of the five analyses we ran, three showed no significant effect of trial type, and one even showed a reversed effect (see b weights in Table 1). While there was still a small effect in Experiment 1, it is clear that the confound of initial rating and trial type led to a strong overestimation of the reliability and strength of the effects in the original analyses.² When confounded trials were removed, most of the effects vanished. Researchers interested in related phenomena should therefore be very careful when using a similar paradigm.

Footnotes

Action Editor

D. Stephen Lindsay served as action editor for this article.

Author Contributions

The idea for this Commentary was originally developed by M. Ihmels. F. Ache helped throughout the entire process of further developing the idea and shaping the manuscript. The data provided by were jointly analyzed by both authors.

Declaration of Conflicting Interests

The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.

Funding

The research reported in this article was supported by an Emmy Noether grant from the Deutsche Forschungsgemeinschaft (HU 1978/4-1) awarded to Mandy Hütter.

Supplemental Material

Additional supporting information can be found at

Notes

References

Campbell

D. T.

Kenny

D. A.

(1999). A primer on regression artifacts. New York, NY: Guilford Press.

Galton

(1886). Regression towards mediocrity in hereditary stature. The Journal of the Anthropological Institute of Great Britain and Ireland, 15, 246–263.

Hommel

(2009). Action control according to TEC (theory of event coding). Psychological Research, 73, 512–526.

Hommel

Müsseler

Aschersleben

Prinz

(2001). The theory of event coding (TEC): A framework for perception and action planning. Behavioral & Brain Sciences, 24, 849–878.

Judd

C. M.

Westfall

Kenny

D. A.

(2012). Treating stimuli as a random factor in social psychology: A new and comprehensive solution to a pervasive but largely ignored problem. Journal of Personality and Social Psychology, 103, 54–69.

Kim

Hommel

(2015). An event-based account of conformity. Psychological Science, 26, 484–489.

Klucharev

Hytönen

Rijpkema

Smidts

Fernández

(2009). Reinforcement learning signal predicts social conformity. Neuron, 61, 140–151.

Shanks

D. R.

(2017). Regressive research: The pitfalls of post hoc data selection in the study of unconscious mental processes. Psychonomic Bulletin & Review, 24, 752–775.

Shestakova

Rieskamp

Tugin

Ossadtchi

Krutitskaya

Klucharev

(2013). Electrophysiological precursors of social conformity. Social Cognitive and Affective Neuroscience, 8, 756–763.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.51 MB