Abstract
When representing high-level stimuli, such as faces and animals, we tend to emphasize salient features—such as a face’s prominent cheekbones or a bird’s pointed beak. Such mental caricaturing leaves traces in memory, which exaggerates these distinctive qualities. How broadly does this phenomenon extend? Here, in six experiments (N = 700 adults), we explored how memory automatically caricatures basic units of visual processing—simple geometric shapes—even without task-related demands to do so. Participants saw a novel shape and then immediately adjusted a copy of that shape to match what they had seen. Surprisingly, participants reconstructed shapes in exaggerated form, amplifying curvature, enlarging salient parts, and so on. Follow-up experiments generalized this bias to new parameters, ruled out strategic responding, and amplified the effects in serial transmission. Thus, even the most basic stimuli we encounter are remembered as caricatures of themselves.
Caricature is the practice of exaggerating distinctive features when representing some stimulus or concept (Fig. 1a). Though often used intentionally for political or comedic effect (Perkins, 1975), human cognition sometimes engages in a caricaturing process of its own, encoding and even misremembering stimuli in exaggerated form. For example, caricatures are often judged as the best likeness of familiar faces (more so than the actual faces themselves; Rhodes et al., 1987), and participants are slower to differentiate a face from its caricature when first seeing the face and then the caricature than vice versa (suggesting that the first stimulus was encoded in exaggerated form; Mauro & Kubovy, 1992). Such exaggeration is thought to aid related processes, such as recognition and categorization (Benson & Perrett, 1991; Chang et al., 2002; Mauro & Kubovy, 1992; O’Toole et al., 1997; Rhodes et al., 1987), and may be related to neural pattern separation that protects mnemonic representations from interference by keeping them distinct (Yassa & Stark, 2011; for exaggeration effects explained by different neural mechanisms, see Zhao et al., 2021, as well as Chanales et al., 2021, and Drascher & Kuhl, 2022).

(a) Illustration of facial caricature, adapted from Rhodes et al. (1987). (b) The present work explores caricatures for basic visual shapes by manipulating their features to normalize or exaggerate them. The normalization process decreases the magnitude of the turning angles along a shape’s contour; in contrast, exaggeration increases these magnitudes. The radiating bars indicate the surprisal values of points along the shape’s contour, which increase as the shape becomes caricatured.
Mental Caricatures: From Faces to Shapes
How general is the phenomenon of mental caricature? Does it apply to any stimulus, regardless of its category, class, or context, and regardless of the demands of a given task? Indeed, it has been speculated that caricatured representation may be appropriate not just for faces, but also for other visual stimuli such as animals and objects. For example, Rhodes et al. (1987) suggest that upon encountering a new species of bird, we might encode the bird with its distinctive features exaggerated, so as to better distinguish it from other birds. Early studies provided similar evidence for simple visual patterns; for example, when a shape has a gap differentiating it from other shapes, the gap is misremembered as wider (Garner, 1962; see also Gibson, 1929). But unlike caricatured encoding in face memory, this result was more localized rather than holistic, and it was likely induced by explicit task demands (which involved differentiating a target shape from distractors). More generally, whereas caricature-like processing can be adaptive when one is explicitly required to distinguish multiple items held in memory at one time (see, e.g., Chunharas et al., 2022, on repulsion effects), it is unknown whether memories of simple, single visual forms show caricature biases in the absence of considerable external pressure to distinguish them from one another (Donderi, 2006).
Here, we explore caricatured representation for what are among the simplest visual stimuli we encounter: basic geometric shapes (Fig. 1b). Shape stimuli have a long history in research on visual perception and memory (Attneave & Arnoult, 1956), and they are interesting here too for several reasons. First, compared to faces, familiar objects, or even novel categories (e.g., Davis & Love, 2010; Gauthier & Tarr, 1997), random geometric shapes have little social significance, few preexisting associations, and few salient characteristics by which to organize them, making them well-suited to avoid contamination by high-level goals, knowledge, and other such factors. Second, despite their simplicity, geometric shapes can vary along several well-defined parameters, affording control over their features in ways that are not always possible with more naturalistic stimuli like faces and animals. Third and finally, advances in computational geometry allow for succinct and standardized measures of shapes’ information density (Feldman & Singh, 2005; Siddiqi et al., 1999), making it possible to quantify memory distortions in precise and illuminating ways.
Statement of Relevance
We often portray images in caricatured form, exaggerating their distinctive qualities for political or comedic effect. Do our memories also engage in a caricaturelike process when encoding and remembering what we see? The present work explored a caricature bias for the most basic visual stimuli we encounter: simple geometric shapes. When observers saw a shape and had to reproduce it from memory, they tended to create shape caricatures that exaggerated each shape’s qualities, even though the task itself gave them no obvious reason to do so. This work suggests that our minds encode the world around us in ways that emphasize what is distinctive, even when the relevant stimuli have no particular significance and even when there are no task demands that require it.
At the same time, exploring caricatures for such stimuli invites a further question: What does it mean to caricature a shape? How does one exaggerate a meaningless, contextless, blob? For face caricatures, it is relatively clear which features to emphasize and how to do so. For example, a caricature of Albert Einstein might make his hair frizzier, his cheeks rounder, and so on. More formally, work in computer graphics has automated caricature generation by computing the distance between a given face and a norm comprising many faces, and then adjusting the face’s features away from the norm (Brennan, 1982; see also Lee et al., 2000; Rhodes et al., 1987). But it is not obvious that such an approach even applies to geometric shapes, if only because there is no straightforward candidate for a “normal” or “average” shape (or, perhaps, there are many such candidates, and it would be unclear which to choose; see, e.g., Amalric et al., 2017).
Here, we approach shape caricaturing from the perspective of information theory. It has long been proposed that the information content of a shape is naturally characterized in terms of changes to its bounding contour, such as vertices, protuberances, curves, and other deviations from smoothness (Attneave & Arnoult, 1956; Norman et al., 2001). Along these lines, we conceive shape caricaturing as a process that increases these changes and further exaggerates this contour information so that the turns of a shape’s visible boundary appear even more salient and distinctive (Fig. 1b). With this approach in hand (see the Method section for more detail), we ask whether the mind engages in a caricaturing process even for stimuli with no obvious norm and even when there is no task-related pressure to do so (such as long retention periods, active interference from other stimuli, and so on).
The Present Experiments: Reproduce the Shape
Do people spontaneously misremember even basic visual stimuli in exaggerated form? To test this question, we asked whether participants who must reproduce a recently seen geometric shape tend to create a caricatured version of that shape. We generated a library of novel shapes whose contour information could be manipulated; then we showed these shapes to participants, one at a time, and simply asked them to reproduce what they had just seen a moment earlier. We then analyzed memory distortions in terms of changes in contour information. To foreshadow the key results, shapes were consistently remembered as exaggerated versions of themselves.
Open Practices Statement
Demos of these experiments can be viewed at https://perceptionresearch.org/caricature/. Data, experiment code, stimuli, and other relevant materials for all studies are available at https://osf.io/7grk8/. These studies were not preregistered.
Experiments 1 and 2: Shape Caricatures in Visual Memory
As an initial test of shape caricatures in visual working memory, Experiments 1 and 2 asked participants to briefly view a shape and then adjust a copy of that shape until it looked like the one they had just viewed.
Method
Participants
We recruited 50 participants for each experiment from the online platform Prolific (https://www.prolific.co/). A power analysis based on a pilot suggested that this sample would have power above 99% to reveal effects of the sort explored here. This experiment and all others reported here were approved by the Johns Hopkins University Institutional Review Board.
Stimuli
Our task requires that a shape be adjustable, such that its features can change in a continuous fashion while still preserving, in some meaningful way, a sense of its being the same object. To achieve this, we first created 30 “parent” shapes, which were highly irregular polygons that served as the maximally exaggerated shape within its shape family (i.e., the shape with the greatest amount of information along its contour). The shape-generation process started with selecting a set of randomly located points to be vertices of the shape’s edges. We then connected these points using the method of Delaunay triangulation. Facets along the boundary of this triangle mesh were removed until the resulting polygon had a predetermined number of sides. Finally, the boundary of the polygon was resampled to 1,000 sequential points and smoothed to appear natural or organic. Additional constraints included a minimum angle of at least 10° and a maximum angle of 170°, so that the turns on the contours of shapes would be discernible.
For each of these shapes, we gradually smoothed its contours to create a spectrum of similar shapes (using ShapeToolBox 1.0; Feldman & Singh, 2006). A box mask was applied to an increasing number of consecutive points on the contour of the parent shape so that the curve consisting of the points in the mask was flattened to the averaged value along each axis. We started by setting up the mask size as 4 points and then smoothed the shape with that mask twice to create the next shape; then we increased the size of the mask by one unit to create the next 25 shapes, eventually creating a series of 51 structurally similar but gradually smoothed shapes (one parent shape and its 50 children). All the images were shown at an approximate size of
We computed the contour surprisal (i.e., shape information) for all shapes (including parent shapes and derivative shapes)—a measure based on the magnitude of point-to-point turns along the contour of a shape (Fig. 1b). An intuitive way to capture this measure might be to imagine a person walking along the contour of a shape; the more often and dramatically this person changes direction (so that their next step was not easily predictable from their previous step), the less predictable and the higher the surprisal of that step. The cumulative surprisal is the sum of the surprisal of the set of points along a shape’s contour. 1 This quantification derives from information-theoretic approaches to visual perception (Attneave, 1954; Feldman & Singh, 2005), which are complemented by experimental evidence that the visual system uses such contour information to guide object recognition, feature detection, and other processes (e.g., Baker et al., 2021; Barenholtz et al., 2003; Norman et al., 2001). This information-theoretic measure is especially appropriate for our purposes, because the exaggeration of features in caricaturing a shape would amplify the overall information of the shape’s contour. As contour information increases, the shapes appear to have exaggerated features, such as amplified curvature, enlarged salient parts, and so on.
As can be seen in Figure 2b, the shapes yielded by this procedure are relatively complex, with the center of the slider representing a fairly angular or pointed shape (rather than an overly smooth one). Thus, Experiment 2 used the very same design as Experiment 1 but with twice the range of contour information by including more rounded and blunt simple shapes (so that the center of the range corresponded to a much simpler and normalized shape). To achieve this, we used the same procedure as before to further normalize the shapes used in Experiment 1, and we derived another 50 shapes continuing into the simple end of the spectrum. The new spectrum that resulted included 101 shapes from a given family—the 51 shapes used in Experiment 1 and the 50 newly derived shapes. (Note that there were still 51 steps on the responding slider, each step corresponding to one of every pair of shapes in a sequence of 101 shapes, resulting in greater change between each step of the slider).

Illustration (a) of our shape-reproduction task, seen in Experiment 1. Each trial begins with a random-looking shape appearing for 1.2 s. After a 2-s delay, participants see a copy of that shape (appearing at a different contour-information level) and are asked to reproduce the shape they just saw by moving a slider to exaggerate or normalize the test shape. The shapes in Experiment 1 are drawn from a range of entropy values within a single shape family (b); there are 30 shape families in all (only one of which is shown here). Experiment 2 doubled the range by including even more simple shapes; thus, the target shapes were sampled from a wider degree of exaggeration, and moving the slider caused more dramatic changes of shapes’ appearance. In the actual experiments, the sign of the slider was randomly assigned between participants, and the starting position of the slider was randomized on each trial. (Four additional experiments tested other ranges of information density. See more details in endnote 2.) 2
Procedure
In the task, participants briefly saw a single shape, which then disappeared; a copy of it then returned, and participants had to adjust the copy to match what they had seen.
On each trial, a shape from one of the 30 families appeared on the screen at a random level of information density between 20% and 80% of the full range. The shape remained on the display for 1.2 s. Next it disappeared, and was replaced by a blank display that lasted for 2 s. Finally, the shape reappeared in the same location with a slider located below it, this time at a different random level of information (sampled from the full range). Participants were instructed to move the slider to adjust the second shape until they thought it was identical to the shape that appeared at first (which was indeed an option available to them), with no time pressure to respond (Fig. 2a).
By moving the slider from one end to the other, participants were iterating the sequence of shapes derived from the parent shape. Though there were 51 discrete steps on the slider corresponding to 51 different shapes, these shapes varied smoothly enough that the adjustment process felt much like a continuous animation of a shape evolving back and forth until a satisfactory frame was found.
Overall, the experiment consisted of 34 trials, including one practice trial at the beginning, 30 test trials corresponding to the 30 unique shape families, and three catch trials appearing in the early, middle, and late stages of the session. (In catch trials, certain locations along the slider completely transformed the shape into a member of a different family that did not appear elsewhere in the experiment; any participant who ever answered in this region of slider space was determined not to have been paying attention.) The sign of the slider was randomly assigned across participants so that moving the slider rightward either exaggerated or normalized a shape.
Readers can experience this task for themselves at https://perceptionresearch.org/caricature/.
Results
In Experiment 1, 7 participants were excluded for failing to submit a complete data set, leaving 43 participants with analyzable data. For ease of presentation, we normalized the range of contour information from 0 to 1 (0 represents the maximally normalized shape and 1 the maximally exaggerated shape).
As predicted by an encoding-by-caricature approach, participants tended to overestimate the information density of the remembered shapes, increasing their contour information by an average of 13.76% relative to the actually presented shapes’ contour information (Fig. 3). This pattern can be seen (and quantified) in several ways. One straightforward way is to consider the average slider position of recalled shapes: Out of 51 total steps, where the average shape was presented at step 26, the bias corresponded to approximately 4 steps. Another way is to consider the average normalized information density of the remembered shapes (0.59) compared to the average normalized information density of the presented shapes (0.52). A paired t test confirmed that the information density of reproduced shapes was significantly greater than the true average information density of the target shapes, t(42) = 6.25, p = 1.72 × 10−7, d = 0.95, SE = 0.011, CIbias = 0.072 [0.049,0.095] (Fig. 3b). This pattern of results was also fairly consistent across participants, with 81% of participants trending in the expected direction (Fig. 3c). In other words, participants adjusted the test shapes to be exaggerated versions of the original shapes they had just seen seconds earlier. Figure 3a shows these results for each of the information-density levels used as targets in the experiment; most of the recalled values are greater than the true values (see Experiment 3 for evidence that even these very complex shapes are actually misremembered in exaggerated form as well).

Results of Experiment 1. In (a) is illustrated the recall bias at different levels of complexity, with presented-shape complexity on the x-axis and recalled shape complexity on the y-axis. The majority of points fall into the purple space, meaning that the information density of recalled shapes was higher than the information density of the presented shapes. In (b) we illustrate the recall bias collapsed across all complexity levels, showing that recalled shapes were significantly caricatured by 7.19% on the entire scale. Error bars depict +1 SEM of the difference between the mean of recalled values and presented values. In (c) we show how a strong majority of participants reproduced exaggerated versions of target shapes.
In Experiment 2, 4 participants were excluded for failing to provide a complete data set, and 6 participants failed to pass at least one catch trial, leaving 40 participants with analyzable data. Again, participants tended to reproduce caricatured versions of the original shapes. The averaged contour information of recalled shapes was 0.56, significantly higher than the true average information of 0.52, t(39) = 4.60, p = 4.36 × 10−5, d = 0.73, SE = 0.0088, CIbias = 0.041 [0.024,0.058]. Seventy-eight percent of participants trended in this direction (i.e., recalling shapes as having exaggerated contours). Thus, the memory distortion observed in Experiment 1 generalizes to a wider range of shapes that vary more considerably in information density, with participants again caricaturing shapes in memory.
These results thus provide initial evidence for the hypothesis that the mind encodes and stores even novel contextless shapes as more informationally dense than they really are, caricaturing even some of the most basic stimuli we encounter.
Experiment 3: Forced Choice
Experiments 1 and 2 revealed a bias wherein novel shapes were remembered in exaggerated form. However, these results could be specific to certain methodological choices, especially the response method of continuously adjusting a shape using a slider. For example, because the slider cycles through the full range of available shapes, participants were able to see many candidate answers in ways that may have biased their final choice. This response modality also likely introduced a kind of regression effect to the center of the responding slider: Unless the target shape corresponds to the very middle of the slider, the uncaricatured options and caricatured options are unbalanced in terms of available responses. For example, recalling a shape that appeared at the 80th percentile of contour information gives participants only 20% of the space to caricature it but 80% to uncaricature it, which may bias responses in that direction (and vice versa for simpler shapes).
To address this, Experiment 3 used a forced-choice paradigm, giving only two options for all target shapes across different levels of complexity. This experiment thus asks whether the caricature effect goes beyond any one response modality and also whether it is more uniform over different complexity ranges than Experiments 1 and 2 may have seemed to suggest (in particular, whether it also arises for shapes on the complex end of the spectrum).
Method
One hundred participants were recruited for this experiment from Prolific. This sample size was double that of Experiments 1 and 2 because the data collected by the two-alternative forced-choice (2AFC) paradigm are sparser than the data collected by the previous response modality.
The critical difference between this experiment and the previous two experiments is that two candidate shapes (instead of an adjustable slider) served as response options during the recall phase, and participants simply selected one of them. Both candidate shapes deviated from the true target by two steps in opposite directions on the shape spectrum used in Experiment 2. In other words, one option was a caricatured version of the target shape, and the other was an uncaricatured version of the same shape (see Fig. 4a). This paradigm precluded participants from viewing the entire shape spectrum during the recall phase and also eliminated or attenuated any independent bias toward the center of the responding range. If memories of a shape are genuinely biased toward its caricature, then participants should tend to choose the caricatured version over the uncaricatured version.

Results of Experiment 3. Participants were asked to view a shape and then select from two options (one more caricatured and one more uncaricatured) within the same shape family as the target shape (a). Participants were significantly more likely to choose the caricatured shape (b), extending our findings to another response method. Moreover, this recall bias (y-axis) emerged across the full range of complexity levels (x-axis); all points (averaged responses) fell into the “caricature” space (purple), indicating that shapes at each complexity level were more likely to be recalled as their caricatured versions. As shown in (c), a strong majority of participants more frequently chose exaggerated versions of the target shapes.
Results
Six participants were excluded for failing at least one catch trial, leaving 94 participants for analyses.
Once again, the results supported a caricature bias in visual memory. When recalling the target shape from two alternatives (a caricatured shape and an uncaricatured shape), participants were significantly more likely to select the caricatured option (62% for the caricatured option; t(93) = 8.35, p = 6.26 × 10−13, d = 0.86, SE = 0.015, CIdifference = 0.12[0.092,0.15]). Seventy-three out of 94 participants trended in this direction.
Importantly, without a slider to draw responses toward its center, it became especially clear that the caricature bias occurred across the whole range of shape complexities. Figure 4b shows results for each of the information-density levels used as targets in the experiment: Even the most information-dense shapes tended to be caricatured in recall (with no tendency for this effect to drop off with complexity), suggesting that the caricature effect does not depend on how complex the target is and indeed persists even for very complex shapes. This result thus (a) demonstrates the consistency of the caricature effect across further variations in experimental design; (b) reveals that the response modality (a bounded slider) may have masked the effect’s consistency in previous experiments; and (c) helps to rule out the possibility that regression to the center of the slider (and the range of sampled shapes) was responsible for the effects observed earlier. 3
Together with results from Experiments 1 and 2, this 2AFC task provides converging evidence for the caricature bias in visual memory.
Experiments 4 and 5: Ruling out Response Bias
We have described the present results as a bias to remember shapes in exaggerated form. However, there may be alternative explanations. For example, perhaps participants simply enjoyed looking at more complex images, and so adjusted their responses in the direction of exaggeration to match this preference. Or perhaps participants responded strategically: If they expected their memories to lose detail over time, for example, they might compensate for such expected losses by intentionally choosing a more informationally dense shape. Experiments 4 and 5 ruled out these alternatives.
Method
As in Experiment 3, 100 participants were recruited from Prolific for Experiments 4 and 5.
Experiment 4 proceeded in the same way as Experiment 2 except that the 30 shapes were divided into two testing blocks: a memory block and a perception block. In the memory block, participants were asked to reproduce shapes exactly as in Experiment 2, by adjusting a shape after the target shape had disappeared. In the perception block, the target shape remained on the screen the entire time, and participants simply adjusted another shape to match the target shape, which was still visible right in front of them (see Fig. 5a). The 30 shapes from Experiment 2 were divided into two groups of 15 shapes (one group for each block), randomly chosen anew for each participant. Block order was counterbalanced across participants. Each block started with a practice trial and included two catch trials. This design aimed to tease apart memory distortions and response biases favoring exaggerated shapes. If the caricature bias found in previous experiments merely arose from response biases, then the bias should also apply to the online viewing cases (as in studies of the “El Greco fallacy”; Firestone, 2013; Firestone & Scholl, 2014; Valenti & Firestone, 2019). However, if the perception block fails to produce a caricature bias similar to the bias found in the memory block, then the bias we observed earlier is unlikely to be explained by general response biases of this sort.

Results of Experiments 4 and 5. Experiment 4 (a) replicated our memory effect, but also included a perception condition in which the adjustable shape and the target shape remained on the screen at the same time. Under these conditions, there was only a very minimal caricature bias (0.7%), and it was many times smaller than the memory bias (3.6%), consistent with a genuine memory distortion in our previous studies. Experiment 5 asked participants to predict the results of our prior experiments. Participants were shown a toy version of the task (b) and were asked what kind of bias they thought would occur. They were almost perfectly split in their predictions, suggesting no strong or pervasive predicted bias toward either exaggerations or simplification. Error bars represent 1 SEM of the difference between conditions.
Experiment 5 aimed to rule out strategic responding as an explanation of our findings. In this experiment, we surveyed participants for their intuition about the expected results of our memory task. The procedure began with a brief introduction to the background and the research question from the previous studies, including a short video demonstrating the experimental trials in Experiments 1 and 2. After participants indicated that they understood the experiment, they were presented with two predictions about the outcome of the experiment, with example shapes depicting the possible results; they were then asked to select which prediction they thought was right (Fig. 5b). Two options were given to participants: more simple (shown by example shapes being normalized) or more complex (shown by example shapes being exaggerated); participants indicated their prediction by clicking on the corresponding button. If participants have a shared assumption about the biases that might arise in recalling these shapes, then such beliefs could interact with the effect found in previous experiments. However, if they do not, then such beliefs are perhaps unlikely to explain our results.
Results and discussion
Both experiments supported a genuine memory distortion, rather than other forms of bias.
In Experiment 4, 5 participants were excluded for not passing at least one catch trial, leaving 95 participants for analysis. As in earlier experiments, participants in the memory condition reproduced shapes in caricatured form, t(94) = 5.57, p = 2.39 × 10−7, d = 0.62, SE = 0, CIbias = 0.036 [0.024,0.048]. There was also a very small bias in the caricatured direction for perception trials, where the target shape remained on the screen,
In Experiment 5, we found that there were no consensus assumptions about the expected results of this kind of task, in ways that are encouraging for our memory interpretation. The two options (more simple and more complex) were chosen at almost identical rates: 51 participants chose “more complex,” and 49 chose “more simple” (binomial probability test,
Experiment 6: Accumulating Biases in Serial Reproduction
Although the effects in our earlier experiments were reliable and robust, their magnitude was often subtle, resulting in a caricature bias of 4% to 6% when considered in terms of the full range of information densities on the scale. One consequence of these effect sizes is that any one instance of this bias is likely to be small in practice. However, in the real world, we often recall the same objects and events multiple times, and even relay them to others who in turn recall what we have told them. Might iterated recall of this sort amplify the biases observed here?
Experiment 6 explored this question using serial reproduction, in which one participant’s responses serve as the stimuli for another participant (Bartlett, 1932). Analogous to the game of “telephone,” a message or stimulus is shown to an initial observer, who then transmits it to another observer—but not before modifying the stimulus (often unintentionally) according to their own priors or biases. By repeating this procedure several times, the transmission chain accumulates the collective bias of the group and converges on a well or attractor that serves as a kind of equilibrium point.
This method has been used to study a variety of biases (e.g., Kalish et al., 2007; Langlois et al., 2021; Uddenberg & Scholl, 2018), though often for more complex tasks or with high-level stimuli. Here, we applied this method to our very simple shape-memory task. This allowed us to explore (a) how such distortions accumulate over time and (b) whether there exists a boundary for caricature distortion over which the shapes may not be continuously exaggerated.
Method
Participants
We aimed to have 30 transmission chains for each of 30 shapes, with each chain comprising 10 steps (i.e., nine observer-to-observer transmissions). We also allowed each participant to make 30 judgments (one for each of the 30 shapes). In total, this led to a target sample of 300 participants with admissible data, which required 316 participants total (see below for more information). All participants were recruited from Prolific.
Stimuli and procedure
From the participant’s point of view, the experiment was quite similar to Experiments 1 and 2: A shape was presented on a given trial, and the participants reconstructed it after a short delay. However, in almost all cases, the stimuli they were viewing had come from a previous participant who had completed the same procedure; an exception occurred when a participant was at the beginning of a transmission chain.
From the experimenter’s point of view, there were 10 rounds of shape judgments for each transmission chain, 30 transmission chains per shape, and 30 shapes total (i.e., 900 chains of 10 steps in all). The 30 shapes to be viewed in the first round initially appeared at one of three levels of contour information: 10 shapes at a relatively low level of contour information (approximately one quarter of the way up the scale), 10 at a moderate level (middle point of the scale), and the other 10 at a relatively high level (roughly three quarters of the way up the scale). Every participant occupied the same position in every chain they participated in: For example, if a participant was in the fourth position of a given chain, he or she was in the fourth position of each of the 30 shape chains that were part of the session.
Results
The results again revealed a caricature bias, but this time with a much greater final magnitude (Fig. 6a). A sample chain from this experiment appears in Figure 6b; the shape starts off at a fairly low level of complexity, and by the end it has grown several new jagged appendages. This pattern pervaded the stimulus set, resulting in a subjectively appreciable degree of caricaturing. Compared to the original shapes, the shapes reproduced in the 10th round had much greater contour information (0.72 vs. 0.52), t(29) = 14.97, p = 3.54 × 10−15, d = 2.73, SE = 0.013, CIbias = 0.19 [1.64,0.21], so that the shapes were eventually biased approximately 20% away from their true averaged values in terms of the entire scale and nearly 40% away when considered in terms of their initial values. Average contour information increased in each round without tending to pause or reverse.

Performance on the serial reproduction task of Experiment 6, in which the recalled shapes of one participant were used as stimuli for the next participant. The three colored lines in (a) represent the average complexity values of the first step in the transmission chains, from highly normalized (red), medially normalized (blue), and highly exaggerated (green). The three lines gradually converge as the transmission chains proceed, ending with a bias toward the information-dense side of the scale. The collective error is about 20% of the entire range of information densities used in the experiment. The shaded band around each line indicates a 95% confidence interval. In (b) we show the evolution of one of the shapes in the experiment as it passed from participant to participant.
Figure 6a shows the temporal evolution of the shapes, where each line represents the average value across all reproduction chains at the three given starting levels of shape information. As shown, these chains gradually converge into the exaggeration region of space as the experiment advances. Indeed, even for chains starting at a high level of contour information (indicated by the green line), the effect of regression to the center of the slider did not drive the reproduction chain back to a more moderate level. This pattern of convergence thus demonstrated a robust bias to remember shapes as being increasingly information dense or more exaggerated than they really were, and in ways that can be easily visualized and appreciated.
Even though the caricature effects observed here may cause only a subtle distortion for any one shape on any one trial (which may be hard to recognize at the level of each caricaturing step), the present results suggest that the accumulation of such biases over time can be significant indeed. Considering that daily life often involves repeatedly remembering and recalling various objects, the biases we observed here may well accumulate in real-world contexts as well.
General Discussion
Memory rarely replicates what we see; it reconstructs past experiences with biases and distortions. What kinds of biases arise for the most basic stimuli we encounter? Here, we explored how memory engages in a process of caricature, even for simple geometric shapes, and even without explicit demand to remember or distinguish multiple objects at the same time, preexisting schematic associations or knowledge, or other contextual factors that might tend to encourage such biases.
Adding Information to Memories
It was not a foregone conclusion that the present experiments would turn out this way (i.e., with a bias toward exaggeration, or indeed with any directional bias at all). If anything, memories tend to lose detail over time—and so a natural prediction might have been that shapes like those used here would be remembered as simpler or less detailed versions of themselves (Cooper et al., 2019). At the same time, memories often add information that wasn’t present in the encoded stimulus or event, and so in certain situations they can end up being richer or more detailed than what was actually shown to participants. For example, memory may generate events that never occurred (Kominsky et al., 2021; Loftus & Palmer, 1974), add objects that were not originally in visual scenes (Bainbridge & Baker, 2020; Intraub & Richardson, 1989), complete ambiguous visual patterns (Bartlett, 1932; Brewer & Treyens, 1981), run forward events seen previously (Freyd & Finke, 1984; Hafri et al., 2022), and even enhance the quality of images (Rivera-Aparicio et al., 2021).
However, unlike memory distortions that recruit high-level knowledge or schemas (Bae et al., 2015; Hemmer & Steyvers, 2009), the caricature bias we explore here adds information to single, novel objects that have never been seen before, making it less likely that this bias relies on long-term memory or preexisting associations.
Caricatures and Memory Repulsion
Why, then, does this happen? One reason this bias may occur is to enhance distinctiveness and aid later recognition, as in studies of face caricatures (e.g., Lee et al., 2000). Recent work at the intersection of machine learning and visual perception has also shown that a kind of caricatured encoding of videos can improve human performance in detecting visual misinformation (Fosco et al., 2022).
These results may thus be related to the hypothesis that remembered objects are often repulsed from one another, so that similar items (such as colored circles) are encoded in ways that amplify the differences between them (e.g., Bae & Luck, 2017; Chunharas et al., 2022; Scotti et al., 2021). The current work could be seen as either (a) extending this phenomenon even further, by showing that even memorizing a single item can engage this process of repulsion from its possible “peers” (even when those other objects need not be remembered or recalled on a given trial), or (b) revealing a more general mechanism whereby salient object information is automatically remembered in exaggerated form. Though the present work does not fully unravel the relationship between caricature biases and repulsion effects, future work may investigate whether caricaturing makes shapes more distinctive than their counterparts and whether the present caricature bias assists later processes, such as recognition and categorization.
Caricatures Beyond Faces
Although most of the scientific (and popular) attention paid to caricatures has focused on faces and bodies, caricaturing may also arise when visually representing information in different domains. For example, adults and children who are given the task of conveying concepts through drawings tend to exaggerate those aspects of the object that best distinguish it from neighboring concepts, and they may distort the same object in different ways depending on the level of abstraction being considered (Fan et al., 2020). The present work adds to this literature as well. It may not be so surprising for the mind to amplify the distance between a new object and a norm composed of similar objects; indeed, the primary approach used in early studies of caricatures relies on norms of this sort, such as a face norm averaged from many faces, or a prototype of a species of animal (e.g., Corneille et al., 2004). What is distinctive about the present results, however, is the absence of an obvious norm or prototype for novel shapes, at least of the sort studied here (though see Sablé-Meyer et al., 2022, for a discussion of the notion of geometric primitives). Thus, the effects reported here may reflect a more general process: Even when there is no explicit pressure to do so, visual memory emphasizes and exaggerates what is salient about the world around us.
Footnotes
Transparency
Action Editor: Rachael Jack
Editor: Patricia J. Bauer
Author Contributions
