Abstract
Understanding that individuals can be mistaken, or hold false beliefs, about the world is an important human ability that plays a vital role in social interactions. When and how does this ability develop? Traditional investigations using elicited-response tasks suggested that false-belief understanding did not emerge until at least age 4. However, more recent studies have shown that children demonstrate false-belief understanding much earlier when tested via other means. In the present article, I summarize recent evidence that a robust, flexible understanding of false belief emerges in infancy and discuss why older children fail elicited-response tasks despite their ability to represent beliefs.
Much of our everyday lives involves predicting, interpreting, and responding to the behavior of agents, entities that can perceive their environment and control their actions. Adults do this by considering agents’ mental states, including their goals, knowledge, and beliefs. Developmental psychologists have long been interested in how children develop the ability to attribute mental states to agents. In particular, considerable research has focused on when children understand that agents can be mistaken or hold false beliefs about the world. False-belief understanding requires the ability to recognize that mental states are internal representations rather than direct reflections of reality and thus can be inaccurate. This ability may be uniquely human (e.g., Kaminski, Call, & Tomasello, 2008; but see Krupenye, Kano, Hirata, Call, & Tomasello, 2016) and has been argued to play an important role in many human social behaviors (e.g., Herrmann, Call, Hernández-Lloreda, Hare, & Tomasello, 2007). For instance, recognizing that others’ representations of the world may differ from our own helps us communicate more effectively and facilitates cooperation (e.g., Baillargeon et al., 2013).
The development of false-belief understanding was traditionally investigated using elicited-response false-belief tasks, in which children were asked direct questions that required them to predict or explain the behavior of an agent who held a false belief (for a review, see Wellman, Cross, & Watson, 2001). In one such task (Baron-Cohen, Leslie, & Frith, 1985), children hear a story enacted with props: Sally puts a toy in a container and then leaves (Fig. 1). In her absence, Anne moves the toy to another container. Children are then asked where Sally will look for her toy when she returns. Beginning around age 4, children correctly answer that Sally will look in the original container, where she falsely believes the toy to be. In contrast, younger children incorrectly answer that Sally will look in the toy’s current location, suggesting an inability to understand Sally’s false belief. This pattern of findings was widely replicated using several different elicited-response tasks, leading many to conclude that false-belief understanding did not emerge until at least age 4 (Wellman et al., 2001).

The Sally/Anne false-belief task. Children are introduced to Sally, who places a toy into one of two containers (i.e., the bowl). In her absence, Anne moves the toy to the other container (i.e., the box). Sally then returns, and children are asked where Sally will look for her toy.
This conclusion was challenged by the discovery that much younger children appear to demonstrate false-belief understanding when tested with tasks that do not involve direct questions about a mistaken agent’s behavior (non-elicited-response tasks). In a seminal study, Onishi and Baillargeon (2005) tested 15-month-old infants in a violation-of-expectation task, which relies on infants’ tendency to look longer at events that are inconsistent, as opposed to consistent, with their expectations. The infants saw an agent play with a toy watermelon and hide it in a green box. In the agent’s absence, the watermelon moved from the green box to a nearby yellow box. In the test trial, the agent returned and reached into either the green box (green-box event) or the yellow box (yellow-box event) and paused. The infants who saw the yellow-box event looked reliably longer than those who saw the green-box event, suggesting that they attributed to the agent a false belief that the toy was in the green box, expected her to reach into the green box in order to obtain the toy, and hence looked longer when she violated this expectation by reaching into the yellow box. Together with the results of several additional conditions, this finding suggested that by 15 months, infants realize that agents can hold and act on false beliefs.
This study launched a new wave of research into early false-belief understanding, as many researchers adopted novel approaches to studying this ability in infants and toddlers. Since 2005, over two dozen published studies have reported evidence of false-belief understanding in children between 6 months and 3 years of age (for a review, see Baillargeon et al., 2015). These subsequent studies have addressed two important, interrelated questions regarding the origins and development of false-belief understanding.
What Is the Nature of Children’s Early Competence?
There has been considerable debate regarding the nature of the competence measured in non-elicited-response tasks. Advocates of mentalistic accounts argue that like elicited-response tasks, non-elicited-response tasks assess children’s ability to attribute false beliefs to agents (Buttelmann, Carpenter, & Tomasello, 2009; Carruthers, 2013; Kovács, Téglás, & Endress, 2010; Scott, Baillargeon, Song, & Leslie, 2010; Southgate, Senju, & Csibra, 2007). On this view, positive results in non-elicited-response tasks indicate that the capacity to attribute false beliefs to agents emerges in infancy. However, some researchers have questioned this view, arguing that responses in non-elicited-response tasks are driven by limited, rudimentary abilities, such as a tendency to look longer at perceptually novel configurations of “colors, shapes, and movements” (Heyes, 2014, p. 648), learned behavioral rules for how agents typically behave in particular situations (e.g., Ruffman, 2014), or an early-developing system for tracking the information registered by an agent (e.g., Apperly & Butterfill, 2009). According to these late-emergence accounts, the capacity to represent false beliefs does not emerge until age 4, as indicated by success in elicited-response tasks.
Are infants and toddlers capable of representing beliefs, or does their success in non-elicited-response tasks instead reflect more primitive capacities? One approach to addressing this question is to examine the range of non-elicited-response tasks in which young children succeed. School-aged children demonstrate false-belief understanding in a variety of elicited-response tasks involving different types of false beliefs, and they can use an agent’s false belief to interpret and predict a range of belief-based responses produced by that agent (Wellman et al., 2001). This demonstrates that older children possess a robust understanding of belief that they can flexibly use in a variety of situations. If infants and toddlers are capable of attributing false beliefs to agents, then they should also demonstrate the “flexible use of belief understanding” that is seen in older children (Perner & Ruffman, 2005, p. 216). If they were instead limited to reasoning about an arbitrary subset of belief-inducing situations and belief-based responses, then this might suggest that their performance was driven by mechanisms other than an understanding of belief.
Accumulating evidence suggests that early false-belief understanding is indeed flexible and robust. Infants and toddlers demonstrate false-belief understanding in numerous different non-elicited-response paradigms that measure a range of responses, including children’s looking behavior, helping responses, emotional expressions, and neurological activity (Buttelmann et al., 2009; Kovács et al., 2010; Moll, Kane, & McGowan, 2016; Southgate & Vernetti, 2014). They can use an agent’s false belief to interpret that agent’s actions (e.g., Luo, 2011), predict the agent’s future behavior (e.g., Southgate et al., 2007; Surian & Geraci, 2012), and guide their own interactions with the agent (e.g., Buttelmann et al., 2009). For instance, in one experiment, 17-month-olds watched an agent hide two novel objects in two boxes and then leave; in her absence, an experimenter switched the objects’ locations (Southgate, Chevallier, & Csibra, 2010). The agent returned, pointed to one of the boxes, and said it contained a “sefo.” She then asked the infants, “Can you get the sefo for me?” Most infants approached the box that the agent had not pointed to, suggesting that they understood that she held a false belief about the boxes’ contents and thus intended to label the other object as the sefo.
Like older children, infants and toddlers can represent many types of false beliefs, including false beliefs about the contents, identity, location, presence, and properties of an object (e.g., He, Bolz, & Baillargeon, 2011; Kampis, Parise, Csibra, & Kovács, 2015; Scott & Baillargeon, 2009; Scott et al., 2010). They can reason about a variety of physical actions performed by a mistaken agent, such as where the agent will search or which object the agent will select (e.g., Scott et al., 2010; Song & Baillargeon, 2008; Surian, Caldi, & Sperber, 2007). They can use an agent’s false belief to interpret what that agent says (Southgate et al., 2010) and to predict and interpret the agent’s emotional reactions (e.g., Moll et al., 2016; Scott, 2017).
Moreover, young children can reason about complex causal interactions between false beliefs and other mental states (Scott & Baillargeon, 2009; Scott, Richman, & Baillargeon, 2015). For instance, my colleagues and I (Scott et al., 2015) examined 17-month-olds’ ability to interpret the actions of a deceptive agent who sought to lure another agent into holding a false belief about the identity of an object. In three experiments, a thief attempted to steal a desirable rattling toy without its owner’s knowledge by replacing it with a less desirable silent toy in the owner’s absence. Results suggested that the infants realized that the thief could get away with her theft only if (a) the silent object was visually identical to the rattling object that the thief stole (otherwise, the owner would detect the substitution when she saw the silent toy) and (b) the owner did not routinely shake her toy when she returned (otherwise, she would detect the substitution when she shook the toy). When these conditions were met, the infants expected the owner to be deceived by the thief’s actions and to falsely believe that the silent toy was the rattling toy. Thus, the infants understood the interaction between the thief’s goals and the owner’s false belief about the identity of the silent toy.
Finally, positive results have been obtained using non-elicited-response tasks with infants and toddlers in many Western countries, as well as in three traditional, non-Western communities: a Salar community in western China, a Shuar/Colono community in Ecuador, and a Yasawan community in Fiji (Barrett et al., 2013).
The findings reviewed here cast doubt on the notion that young children’s performance in non-elicited-response tasks reflects limited, rudimentary abilities (for critiques of late-emergence accounts, see Carruthers, 2013, 2016; Scott, 2014; Scott & Baillargeon, 2014). Instead, these findings suggest that across cultures, the capacity to attribute false beliefs to agents emerges in the first year of life. This early understanding is robust, flexible, and sophisticated, allowing young children to reason about a broad range of belief-based responses and belief-inducing situations, including complex interactions between beliefs and other mental states. Together with the large body of evidence that infants can reason about other mental states in the first year of life (e.g., Baillargeon et al., 2015), these findings support mentalistic accounts of early psychological reasoning, which propose that infants possess a psychological-reasoning system that provides a skeletal causal framework for interpreting and predicting the actions of agents in terms of their mental states, including false beliefs (e.g., Luo & Baillargeon, 2010).
Why Do Children Fail Elicited-Response False-Belief Tasks?
If the ability to attribute false beliefs to agents emerges in infancy, as the evidence reviewed in the previous section suggests, then why do children fail elicited-response false-belief tasks until age 4? Some researchers have suggested that young children fail elicited-response tasks because of the heavy processing demands that these tasks impose (e.g., Baillargeon, Scott, & He, 2010; Bloom & German, 2000; Roth & Leslie, 1998). In particular, I, Baillargeon, and our colleagues have put forward the processing-demands account, which argues that elicited-response tasks involve at least three executive function demands (Baillargeon et al., 2010; Scott & Roby, 2015). To illustrate, consider the Sally/Anne task described earlier. When children are asked the test question, “Where will Sally look for her marble?”, this initiates a response-selection process: Children must interpret the question, choose to answer, and select an appropriate response. Interpreting this question often triggers an inappropriate, prepotent tendency to answer based on the marble’s actual location, and children must inhibit this response in order to answer based on Sally’s false belief (response-inhibition process). Simultaneously holding in mind the agent’s false belief and executing the response-selection and response-inhibition processes imposes substantial demands on children’s working memory. These combined response-selection, response-inhibition, and working memory demands overwhelm young children, causing them to fail elicited-response tasks.
The processing-demands account predicts that if processing demands are sufficiently reduced, children should succeed in elicited-response tasks at younger ages. Several recent findings support this prediction (e.g., Rubio-Fernández & Geurts, 2013; Setoh, Scott, & Baillargeon, 2016). For instance, 2.5-year-olds succeed in an elicited-response task in which (a) the location of the marble is unknown, making it easier for the children to inhibit reality-based responses, and (b) the children receive practice trials designed to reduce the demands associated with the response-selection process (e.g., Setoh et al., 2016). Note that if young children failed elicited-response tasks because they could not represent false beliefs (as argued by late-emergence accounts), then reducing processing demands would have little effect on their performance: They would continue to fail such tasks. The fact that reducing processing demands improves young children’s performance thus demonstrates that they can represent false beliefs, but they are easily overwhelmed by the joint demands imposed by simultaneously doing so and answering a question about that belief.
According to the processing-demands account, non-elicited-response tasks impose fewer demands on children than do elicited-response tasks, allowing children to express their false-belief understanding at younger ages. For instance, I and my colleagues (Scott et al., 2012) tested 2.5-year-olds in a violation-of-expectation task in which they watched an adult subject participate in a classic Sally/Anne task (Fig. 2). When the adult subject was asked where Sally would think her toy was, children expected the adult to answer correctly by pointing to the container that Sally falsely believed held the toy, and they looked reliably longer if the adult instead pointed to the toy’s current location. Because the test question was addressed to the adult subject rather than the children, no response-selection or response-inhibition processes were activated, allowing children to successfully demonstrate their understanding of Sally’s false belief.

The version of the Sally/Anne task used by Scott, He, Baillargeon, and Cummins (2012). Sally (the woman on the left) places her toy into the green box and then leaves. In her absence, the experimenter (the woman on the right) moves the toy from the green box to the blue box. The experimenter then asks the adult subject (the woman in the middle) the standard Sally/Anne test question of where Sally will look for her toy when she returns. Two-and-a-half-year-olds expected the adult subject to answer correctly that Sally’s false belief would cause her to look in the green box, and they looked reliably longer if the adult instead pointed to the toy’s current location.
The processing-demands account does not claim that non-elicited-response tasks do not involve any demands or that children should always pass such tasks. Rather, this account maintains that children’s ability to demonstrate false-belief understanding in any situation should depend on their ability to cope with the demands imposed by that situation. This account thus predicts that just as decreasing processing demands improves young children’s performance in elicited-response tasks, increasing processing demands should impede young children’s performance in non-elicited-response tasks. Recent work by Roby and I (Scott & Roby, 2015) supports this prediction: Three-year-olds were tested in a non-elicited-response task involving a verbal false-belief story, and the linguistic ambiguity of the story (i.e., whether story lines were open to multiple interpretations) varied across children. When the story was unambiguous, children successfully demonstrated false-belief understanding. When the story was ambiguous, however, children’s performance depended on their language abilities, and only those children with the highest verbal abilities succeeded. These findings support the processing-demands account by demonstrating that regardless of whether a task involves an elicited response, children’s ability to succeed depends on the processing demands of the task and their processing skills.
Footnotes
Acknowledgements
The author thanks John Bunce, Erin Roby, Megan Smith, and three anonymous reviewers for helpful feedback on the manuscript.
Declaration of Conflicting Interests
The author declared no conflicts of interest with respect to the authorship or the publication of this article.
