Abstract
This research examined the influence of contrasting instructional approaches on children’s decision-making competence. A total of 764 fifth graders, mostly African Americans and Hispanic Americans, from 36 classrooms in eight public schools serving children from low-income families completed a six-week unit on wolf management, using either direct instruction or collaborative groups, or were waited-listed controls. Analysis of children’s essays on a topic unrelated to wolves revealed that students who participated in collaborative groups but not students who received direct instruction acquired decision-making strategies and transferred them to the novel task. Students in collaborative group work classrooms wrote essays that were significantly better than essays of students from direct instruction classrooms in each of the three aspects of decision making that were evaluated—considering more than one side of a dilemma, comprehensiveness of reasons, and weighing the importance of reasons. In contrast, direct instruction students performed no better than uninstructed control students.
In training a child to activity of thought, above all things we must beware of what I will call “inert ideas”—that is to say, ideas that are merely received into the mind without being utilized, or tested, or thrown into fresh combinations. Alfred North Whitehead (1929/1957, p. 193)
Decision making has been studied in a variety of disciplines, including economics and psychology (for a review, see Weber & Johnson, 2009), politics and public policy (e.g., Dye, 1992; Howlett, 2009), and health and medicine (e.g., Chapman & Sonnenberg, 2003; Hibbard, Slovic, & Jewett, 1997). Because of its universal importance in human affairs, as early as the 1960s, Engle (1960) declared that decision making should be the heart of social studies instruction for children and adolescents. She explained that social studies should go beyond simple facts to inform students of the reasoning required to reach decisions on public and private matters. Newman and Oliver (1970) maintained that when teaching social studies, teachers should foster students’ capacity for justifying or challenging decisions. However, Jonassen (2012) recently concluded that intervention programs for improving children’s decision-making competence are far from satisfactory.
The current study represents a renewed effort to improve children’s decision making. We evaluated a six-week intervention called the Wolf Reintroduction and Management Unit, during which students were responsible for making a decision about whether a community should be allowed to kill a pack of wolves that alarmed many of its citizens. The unit integrates science, social studies, and the English language arts, incorporating instruction in the ecosystem, economy, and public policy. The unit was taught using one of two instructional approaches. One approach was collaborative group work. This approach was compared to teacher-led whole-class direct instruction and regular instruction in control classrooms. The key goal of the current research was to determine whether children could transfer decision-making skills acquired from the wolf unit to a novel task far removed from issues of wolf reintroduction and management. The overarching hypothesis was that collaborative group work would lead to greater transfer than direct instruction. As we will detail later, the basis for this hypothesis is that collaborative group work positions students as active decision makers, whereas direct instruction places them in a passive role following the reasoning of their teacher.
Research on Decision Making
Decision making has been defined as the process of making choices among competing courses of action (Beyth-Marom, Fischhoff, Quadrel, & Furby, 1991; Raiffa, 1968; von Winterfeldt & Edwards, 1986). Decision making is higher-order thinking in that it “challenges the student to interpret, analyze, or manipulate information, because a question to be answered or a problem to be solved cannot be resolved through the routine application of previously learned knowledge” (Newman, 1990, p. 44).
Models of decision making generally are of two types, normative and descriptive (for a review, see Regenwetter et al., 2009). Normative models deal with logically consistent decision procedures. The purpose is to specify how a rational, informed person should identify the optimal option via an appropriate procedure, like probability and utility analysis. For example, if a choice has to be made between: (a) a sure gain of $240 or (b) a 25% chance to gain $1,000 and 75% chance to gain nothing (Tversky & Kahneman, 1981), assuming people are strictly rational, they would select the option that maximizes the expected value or utility. Utility is determined by a simple formula: the probability of each option times the value of each option. Here the utility of (b) is 250 (25% × 1,000 + 75% × 0 = 250), which is higher than that of (a); therefore, a rational person would choose option (b) under normative models of decision making.
The goal of descriptive models is to represent the actual process of how individuals make decisions. Descriptive models start with the assumption that individuals have biases that affect decision making, that they often employ rules of thumb, and that they are influenced by their emotions and personal experience (for a review, see Weber & Johnson, 2009). In fact, a substantial body of empirical research has documented that people’s behavior often deviates from the performance considered normative in reasoning and decision-making tasks (for a review, see Stanovich, West, & Toplak, 2012). For instance, individuals may assess probabilities incorrectly, violate the axioms of utility theory, and ignore alternative arguments when evaluating a decision. In the previous example, Tversky and Kahneman (1981) found that when considering options for gain, most people were risk averse and opted for the sure thing over the chance for getting a larger gain. In contrast, when similar choices were framed in a negative way—a sure small loss versus the chance of a larger loss—people became risk takers; they preferred the choice that might entail a larger loss over a smaller but sure loss. Some descriptive models acknowledge a rational component in individuals’ decision making (for a review, see Gigerenzer & Gaissmaier, 2011). These models posit, for example, that people use fast-and-frugal heuristics, which may bring positive outcomes if used in an adaptive manner.
Decision making often has been assumed to be a competence like language that develops naturally through maturation and socialization (Beyth-Marom et al., 1991). Some studies suggest that children and adolescents are less skilled than adults in several aspects of decision making, including advice seeking, weighing pros and cons of options, and adaptive goal setting (e.g., Byrnes, 2002). Halpern-Felsher and Cauffman (2001) found that adults outperformed children in making decisions about several types of dilemmas. Adults were more likely to spontaneously consider the risks, benefits, and long-term consequences associated with different decisions. Klaczynski (2001) compared the decision making of early and middle adolescents. Middle adolescents performed closer to the normative ideal than early adolescents, which suggests that the use of analytic processes generally increases with age. Findings of studies comparing people of different ages are not completely consistent, however. In some studies, adults were shown to be equally biased or even more biased than children and adolescents (e.g., Klaczynski, Gordon, & Fauth, 1997). Though there seems not to be a final conclusion yet, existing evidence suggests that, as compared to adults, children and adolescents lack skills and dispositions required for thoughtful decision-making.
Instructional Interventions to Improve Decision-Making Skills
Instructional interventions have been created with the goal of improving children and adolescents’ decision making, either through direct instruction or collaborative discussion. The interventions have been designed to teach a process of good decision making through three major approaches: (a) a pure decision-making course, (b) an issue-based socio-science curriculum with explicit decision-making procedures outlined, and (c) an issue-based socio-science curriculum with embedded decision-making elements.
Pure decision-making courses impart knowledge about probability and basic principles in decision making through direct teaching. An example is the GOFER course (Mann, Harmoni, Power, Beswick, & Ormond, 1988), which presents a five-step decision-making process: (a) goals clarification, (b) options generation, (c) fact finding, (d) consideration of effects, and (e) review and implementation. For socio-science curricula in which improved decision making is a goal, similar decision-making procedures have been taught. For example, Ratcliffe (1997) expanded the five-step process into six steps: (a) identifying possible options, (b) identifying suitable criteria for comparing different options, (c) clarifying information known about possible options, (d) evaluating the advantages and disadvantages of each alternative against the identified criteria, (e) choosing an option based on the analysis undertaken, and (f) evaluating the decision-making process undertaken. She incorporated the six-step decision-making procedure into an energy conservation unit. A similar six-step scheme was used in the Decision Skills Curriculum (Spitzhoff, Ramirez, & Wills, 1982) with the purpose of increasing students’ resistance to internal and external pressures to engage in destructive behavior.
Whether students could recognize or repeat the normative procedures that have been taught has been the major criterion for evaluating the effectiveness of these decision-making courses. For example, the Decision Knowledge Questionnaire used to evaluate the GOFER course contained multiple choice and open-ended questions like: What makes someone a really good decision maker? What is the difference between a simple decision and a thinking decision? In asking students to repeat abstract decision-making principles, the researchers made the dubious assumption that heightened awareness of normative decision-making procedures was equivalent to a thoughtful decision-making process. Moreover, although students who received training had better memory for decision-making principles in the short term (Mann et al., 1988), the positive effects soon faded away, as Ratcliffe (1997) reported that two months after the intervention, none of the participants recollected the detail of the decision-making principles. In addition to verbalizing rules, students’ self-evaluations have been used to show benefits from courses (e.g., The decisions I make turn out well. When faced with a decision, I go along with what others suggest.). It is encouraging to see improved self-efficacy, but the measures fall short of providing evidence that students can actually engage in thoughtful decision making.
An embedded issue-based socio-scientific decision-making curriculum does not teach decision-making principles in a didactic manner; instead, it utilizes an integrated, issue-based approach to guide students toward informed decision making, with the aim of promoting their conceptual understanding as well as their ability to engage in rational argumentation. For example, Lee (2007) used the Hong Kong government’s ban on smoking as the topic to engage students in argumentation and decision making. Teachers attempted to develop students’ basic conceptual understanding about smoking in multiple ways, such as introducing information about the structure and function of the human respiratory system, doing experiments, and providing evidence showing how smoking affects health. Students were encouraged to explore both sides and evaluate their views in a rational evidence-based manner. Besides smoking, other socio-scientific topics, such as animal transgenesis (Simonneaux, 2001) and bat conservation (Lee & Grace, 2010), have been employed in programs to improve decision making. Evaluations of these programs have typically had a pre- and posttest design where students were either interviewed or asked to write about their decisions; however, the topic was the same all the way through. In other words, there was not a novel transfer task, which makes it impossible to tell whether the participants had acquired generalizable decision-making skills that they are able to apply to new issues.
Collaborative Interaction and the Development of Higher-Order Thinking
Argumentative thinking inevitably accompanies deliberative decision making, and collaborative discussion has proved to be a successful method for promoting argumentative thinking (Kuhn, 1992; Reznitskaya, Anderson, & Kuo, 2007). Thus, we are among those who see collaborative interaction as a promising approach to promote decision-making skills (Braund, Lubeen, Scholtz, Sadeck, & Hodges, 2007; Lee, 2007; Simon & Maloney, 2007).
The expectation that collaborative interaction will foster children’s decision-making skills is based on the premise that it intertwines social and cognitive processes that promote higher-order thinking. According to Piaget’s (1932) socio-cognitive theory, cognitive development is initiated by disequilibrium between a child’s current understanding and unsettling new information. During collaborative interaction, a child’s existing understanding is often challenged by others, which creates a state of disequilibrium. In order to restore the equilibrium, the child needs to develop a new and integrated perspective to reconcile the conflict. In other words, cognitive conflict arising from social interaction brings children’s thinking to a higher level of sophistication, thus facilitating cognitive growth.
Vygotskian socio-culture theory emphasizes the role of dialogical experience in intellectual growth. According to Vygotsky (1962), children’s intellectual development from the actual level to the potential level relies largely on dialogical interaction with more expert partners. An expert partner can either be a knowledgeable adult or a competent peer. From the Vygotskian perspective, active exchange of ideas through verbal communication is critical in child’s cognitive growth because language serves vital functions as a cognitive tool for the child to process information, as a social tool for sharing knowledge among children, and as a pedagogic tool to provide intellectual guidance (Mercer, 1999).
Over the past few decades, an accumulating body of research has provided empirical support for the benefits of collaborative interaction in promoting children’s higher-order thinking in a variety of domains involving different age groups, including concept learning (e.g., Asterhan & Schwarz, 2007, 2009), problem solving (e.g., Fawcett & Garton, 2005), skills of argument (e.g., Dong, Anderson, Kim, & Li, 2008; Dong, Anderson, Lin, & Wu, 2009; Kim, Anderson, Miller, Jeong, & Swim, 2011; Kuhn & Udell, 2003; Reznitskaya et al., 2001), and nonverbal reasoning (e.g., Mercer, 1996; Mercer, Dawes, Wegerif, & Sams, 2004; Wegerif, Mercer, & Dawes, 1999). A number of scholars have maintained that active exchange of ideas with others is the key mechanism in children’s development of high-order thinking and reasoning (e.g., Rogoff, 1990; Teasley, 1995; Webb & Favier, 1999).
One approach to collaborative interaction is collaborative reasoning (CR), a peer-led free-flowing discussion forum intended to encourage authentic argumentation (Anderson, Chinn, Waggoner, & Nguyen, 1998). Students in heterogeneous small groups balancing school academic achievement, talkativeness, gender, and ethnicity are expected to find a resolution of an issue posed by a story that they have read (Anderson et al., 1998). They are supposed to manage their own discussions, and teachers intervene only when necessary. A series of studies have found that CR discussions foster children’s acquisition of argument skills (e.g., Anderson et al., 1998, 2001; Lin et al., 2015). Students who have participated in as few as four or five CR discussions are able to compose essays about a topic they have not discussed with a more salient argument-counterargument-rebuttal structure than comparable control students (e.g., Dong et al., 2008, 2009; Kim et al., 2011; Reznitskaya et al., 2001, 2007; for a review, see Reznitskaya, 2009).
Rationale for the Current Research
Students spent six weeks studying the Wolf Reintroduction and Management Unit. The wolf unit can be classified as an embedded issue-based socio-scientific decision-making curriculum. The key message of the curriculum was to help young students become aware of how to make a reasonable and responsible decision. Through participating in the wolf unit, it was expected that students’ decision-making skills would be fostered in at least three aspects: (a) disposition to consider more than one side of issues, (b) ability to generate different types of reasons on each side of issues, and (c) ability to weigh reasons on both sides. Children’s decision-making skills were examined in a novel transfer task that had no surface features in common with wolf management, which overcomes a serious drawback of previous research on programs to teach decision making.
The wolf unit was taught in two formats: collaborative group work (CG) and direct instruction (DI). CG was a combination of collaborative reasoning discussions and other group activities. Students were broken into groups to discuss the “big question”— whether the community should be permitted to hire professional hunters to the kill wolves—and to discuss more specific questions related to the big question. DI entailed teacher-guided whole-class activities, whole-class question-and-answer sessions, and individual seatwork. Wait-listed control classrooms continued regular instruction and received the wolf unit in the semester after the study.
Our expectation was that the collaborative groups would prove superior to direct instruction, as well as conventional instruction in control classrooms, on all three aspects of decision making. This expectation can be explained in terms of several differences in students’ perspective and behavior during collaborative group work and direct instruction. A key difference is that, as realized in the Wolf Reintroduction and Management Unit, collaborative group work positioned the student as a decision maker personally responsible for reaching the best overall decision about the wolves. In contrast, in the direct instruction classroom, the student shared the decision-making responsibility with the teacher, and it was perhaps easy for the student to slip into the role of going along with what the teacher seemed to think was best. Because they individually and collectively shouldered the responsibility for decision making, students in collaborative groups may be considered to be active rather than passive learners. With a full sense of decision-making responsibility, we suppose, came the impetus to thoroughly explore all sides of issues.
In collaborative groups, each child is a decision maker invited to argue for his or her standpoint, justify, negotiate, and try to convince others (Dillenbourg, 1999), which provides a counterweight to myside bias (Wolfe & Britt, 2008) since participants are confronted with viewpoints different from their own. Possible benefits and harms of a decision, beyond the scope of any one individual’s knowledge or imagination, may be brought to light by other participants. Hence, CG provides a context in which recurrent patterns of decision-making elements—considering the other side of the issue, appealing to different reasons supporting each side, and weighing reasons on each side—are likely to show up, which may bring children’s understanding of decision making to a higher level of sophistication.
Although direct instruction has certain benefits (Kirschner, Sweller, & Clarke, 2006; Stein, Carnine, & Dixon, 1998), it may not foster students’ higher-order thinking to the same extent as collaborative interaction. Teachers dominate talk in the conventional direct instruction classroom, and episodes of genuine dialogue between the teacher and students or among students are vanishingly small (Nystrand, Wu, Gamoran, Zeiser, & Long, 2003). Direct instruction is monological and hierarchical, limiting opportunities for students to engage in extended dialogue (Wells & Arauz, 2006), which according to Bakhtin (1981) is essential for the development of reasoning given its dialogical nature. During direct instruction, the teacher controls the topic, determines which students will have turns for speaking, and decides whether student contributions are satisfactory. When following the lead of the teacher, we hypothesize students are less likely to view themselves as real decision makers, thus attenuating engagement in the decision-making process.
Based on the idea that CG places students in the role of active decision makers, whereas DI positions them in a passive role conforming to the reasoning of their teacher, we hypothesized that CG students would show greater transfer of decision-making skills acquired from the wolf unit to a novel task than DI students. The novel transfer task had nothing to do with wolf management but addressed whether a boy should tell the teacher that a friend of his cheated in a model car competition.
Method
Participants
The participants were 764 fifth-grade students (410 girls, 354 boys), primarily African American (n = 338, 44.2%) or Hispanic American (n = 356, 46.6%), from 36 classrooms in eight public schools serving children from low-income families in two urban districts in the Midwest of the United States. From 79% to 99% of the students were registered for free or reduced-price lunch, depending on the school.
The 36 classrooms were divided into 12 triples of classrooms matched in terms of demographic composition and academic record. Classrooms within triples were randomly assigned to the three intervention conditions: collaborative groups (CG), direct instruction (DI), and wait-listed control (control). The 12 CG classrooms contained 254 children (143 girls, 111 boys), the 12 DI classrooms contained 256 (138 girls, 118 boys), while the 12 control classrooms contained 254 (129 girls, 125 boys).
Prior to the intervention, students took the Gates-MacGinitie reading comprehension test (MacGinitie, MacGinitie, Maria, & Dreyer, 2000). The normal curve equivalent–scaled scores on the reading comprehension test were below the national average (M = 38.37, SD = 16.24). Raw scores corrected for guessing, which improved reliability and validity, were employed in subsequent data analyses. Also prior to the intervention, students individually completed a speeded object naming task (Snodgrass & Vanderwart, 1980) to assess basic oral English proficiency. Students named common objects in two sets of pictures as quickly as they could; the score was number of pictures correctly named per minute. Two-level regression analyses with condition as a fixed effect and classrooms as a random effect showed no condition difference in reading comprehension, F(2, 690) = 1.08, p = .34, or basic English proficiency, F(2, 690) = .06, p = .94.
Wolf Reintroduction and Management Unit
In the wolf unit, students played the role of officials at the Wolf Management Agency. They were asked to make a decision about whether a community should be granted permission to hire professional hunters to kill a pack of wolves that concerned many of its citizens. The unit was divided into three sections, each incorporating an important perspective on the complicated issue of wolves, to cultivate students’ ability to discern different aspects of problems and understand interrelationships and trade-offs. The three sections were ecosystem, economy, and public policy. While killing the wolves may be favored by the majority of residents in the community (public policy), doing so would alter the food web (ecosystem), which would impact community businesses (economy).
Each section of the unit was explained in an information booklet and expanded in an activity booklet. Information booklets provided students with essential concepts. Unlike most readings for middle grade students, the booklets had an argument structure that contrasted opposing viewpoints. For example, the public policy information booklet compared the viewpoints of hunters who want a large elk population and nature lovers who want to hear howling wolves. The activity booklets contained problem-solving exercises that strengthened and expanded concepts that students were acquiring.
The wolf unit was designed to provide a balanced treatment, giving neither more nor less weight to granting permission to eradicate wolves or protecting them from being killed. In other words, the purpose of the curriculum was not to lead students to a predetermined best answer but rather to help young students become aware of how to make a responsible and reasoned decision that takes different perspectives into consideration. For example, wolves kill livestock such as cattle, sheep, and turkeys, which results in economic loss for ranchers and farmers. In order to protect livestock from being killed, ranchers may need to build fences and barns, which increases the cost of doing business. Therefore, allowing the wolf pack to be eradicated may seem to be the optimal choice for ranchers. However, the historical record indicates that livestock losses from wolf predation are small and, furthermore, the government compensates ranchers for losses they can document are due to wolves. Ranching is not the only business in the community. A campground, motel, and several restaurants benefit from tourism. Keeping wolves in the community may attract tourists from other parts of the country to see the wolves. On the other hand, family campers may stay away for fear that wolves will harm their children.
CG work was a combination of collaborative reasoning discussions and other group activities. Students were broken into groups to discuss the “big question”—whether the community should be permitted to hire professional hunters to kill the wolves. On a typical day during the unit, the task for a small group was to answer a specific question related to the big question, for example, “What effect would killing the wolves have on the elks?” Groups worked independently and spoke freely among themselves, with occasional assistance from the teacher. Each small group was assigned to become “experts” in one of the three domains of knowledge (ecosystem, economy, public policy). After four weeks of group work, each expert group shared what they had learned in a poster presentation to the whole class. Then new discussion groups were created, with members from the three different expert groups, to reconsider the big question in a collaborative reasoning discussion.
DI entailed teacher-guided whole-class activities and individual seatwork. Students in DI condition sat facing toward the teacher. Students were supposed to raise their hands and wait for the teacher to select them before speaking. The teacher led students through all three domains of knowledge in the wolf unit. Students read the information booklets and completed the activity booklets individually at their seats. Students discussed the policy decision as a whole class.
Teachers who implemented the CG or DI interventions attended a two-day workshop to receive a detailed introduction to the Wolf Reintroduction and Management Unit and training in the method to which they were assigned. Teachers watched videos of the wolf unit as it had been implemented in other classrooms and discussed the design and content of the curriculum. Teachers who implemented collaborative group work learned about the goal of the intervention, the research and theory supporting collaborative reasoning, how to facilitate CR discussions, and effective strategies for promoting group work. They saw videos of the wolf unit being taught using collaborative groups. Teachers who implemented whole-class direct instruction learned about the research and theory supporting explicit teaching of concepts and strategies and effective teaching strategies for direct instruction. They saw videos of the wolf unit being taught using direction instruction.
The wolf unit occupied about an hour a day, four or five days a week, for six weeks. Throughout the unit, in both CG and DI classrooms, a research assistant was onsite working with teachers to ensure fidelity of implementation, video-record lessons, make field notes, and administer tests. Wait-listed control classes completed pretests and posttests but continued to receive regular language arts instruction during the intervention period. In the following semester, control teachers were invited to a workshop and offered the opportunity to implement the wolf unit in their classes.
Outcome Measures
After the Wolf Reintroduction and Management Unit, extensive information was gathered to evaluate outcomes in CG, DI, and control classrooms. Students completed two tests assessing unit mastery: a 50-minute individually written essay in which they explained their personal decision about whether the pack of wolves should be eradicated and a 100-item sentence verification test that addressed the main concepts and information in the wolf unit. Near transfer was evaluated in a one-to-one interview about an analogue to the wolf question, whether whaling should be allowed. Far transfer tasks consisted of an oral narrative prompted by the wordless picture book, Frog, Where Are You, and an independently written essay involving a decision about a moral and practical dilemma posed by The Pine Wood Derby story. Near and far are customary although informal ways of referring to the degree of similarity between the training task and the transfer task. The distinction is informal because, as Salomon and Perkins (1989) have noted, there is no satisfactory way to scale all of the features of tasks and contexts that contribute to similarity.
The current article reports analyses of the far transfer task of reflective essay writing based on The Pine Wood Derby. In this story, Thomas is portrayed as a poor and unpopular kid who has never won anything. He finally gets his chance and wins the championship in the Pinewood Derby model car competition. But his older brother helps him make the car, which is against the rule of working independently. He tells this secret to his friend, Jack, and asks him to keep his secret. Jack is disappointed at Thomas’s dishonesty; on the other hand, he is glad his poor friend has finally won something. The decision that students were asked to make is whether or not Jack should tell on Thomas. Students had 50 minutes to compose the essay; most of them finished before the time limit.
Essays prompted by The Pine Wood Derby have provided the outcome measure in several previous CR studies (e.g., Kim et al., 2011; Reznitskaya et al., 2001); however, in these studies, writing the essay was a near transfer task because the preceding collaborative discussions were based on similar moral and practical dilemmas. In the current project, writing the essay becomes a far transfer task because The Pine Wood Derby has little in common with the wolf unit. Thus, comparisons of performance on this task by children in CG, DI, and control classrooms provides a strong basis for determining which children have acquired generalizable competence in decision making.
Coding Children’s Reflective Essays
Three aspects of the Jack and Thomas essays were coded: (a) whether two or more sides of a dilemma were articulated; that is, the essay expressed costs as well as benefits of decisions. For instance, if a child thought that Jack should tell on Thomas because of his dishonesty, did he or she also mention possible negative effects of this decision, such as Jack would be called a tattletale by his classmates or might lose Thomas as a friend? (b) Ability to generate different types of reasons. These reasons were further broken down into nine moral principles and 10 practical considerations, as enumerated in Table 1. These reasons have previously been shown to exhaust the reasons children consider in the Pinewood Derby essay (Zhang et al., 2013). And (c) explicitness in weighing different options; for example, stating that being called a tattletale is trivial compared to protecting fairness.
Definitions and Examples of Moral Principles and Practical Considerations
Children’s essays were exported into the NVivo8 qualitative research software (QSR, 2008) for coding. Raters were blind to whether students were from CG, DI, or control classrooms. They had no information about students’ gender, ethnicity, reading comprehension scores, or other personal characteristics. Both dilemma and weighing were coded as binary variables, with 1 indicating present and 0 representing absent. A second rater independently coded 20% of the essays. Interrater reliability was satisfactory (Cohen’s kappa = .87 for dilemma, .81 for weighing).
For the content analysis of types of reasons, each of the nine moral principles and 10 practical considerations was coded as present (1) or absent (0). Number of reasons is the sum of the number of moral principles and practical considerations. This measure represents the range of different reasons children appealed to when making decisions. A second rater independently coded 20% of the essays. Interrater reliability was satisfactory (Cohen’s kappa = .75 for moral principles, .76 for practical considerations).
Results
We conducted three sets of analyses of children’s decision-making competence, one for each of the three aspects of sound decision making: recognizing two sides of a dilemma, comprehensiveness of reasons, and weighing reasons according to importance. Excerpts from children’s essays, with spelling corrected (to enable keyword searches) but otherwise unedited, are presented in Table 2 to illustrate each aspect of decision making. No one will fail to notice that the excerpts contain grammatical mistakes, awkward wording, and difficult to grasp ideas. It needs to be reiterated that as a group, the children who participated in this study were well below the national average in academic attainment, represented in this study by a standardized measure of reading comprehension. Moreover, the essays were the children’s first drafts, and to do their best work, children need the opportunity to edit and improve their writing in several drafts (Graves, 1994). Table 3 presents the means and standard deviations of the three measures of decision making by instructional condition.
Coding Children’s Essays: Recognition of Two sides of a Dilemma, Comprehensiveness of Reasons, and Weighing Importance of Reasons
Means (SDs) by Condition for Recognizing Two Sides of the Dilemma, Comprehensiveness of Reasons, and Weighing Importance of Reasons
Note. CG = collaborative groups; DI = direct instruction.
Considering Two Sides of a Dilemma
Children who managed to recognize more than one side of a dilemma in making a decision about whether Jack should tell on Thomas were either able to display a rationale for each side or to see undesirable implications of their own preferred decision (for illustrations, see Table 2).
Since individuals were nested within classrooms, a two-level logistic regression model was constructed to predict consideration of more than one side of a dilemma. Individual-level predictors were gender, ethnicity, Gates-MacGinitie reading comprehension (classroom-mean centered), basic English proficiency as represented by time to name common objects (also classroom-mean centered), and condition (CG, DI, control). Classroom-level predictors were classroom mean reading comprehension and classroom mean basic English proficiency. A series of models were fitted. The initial model contained all main effects and their interactions. Nonsignificant terms were dropped one at a time if there was not a significant difference between the model with the term and the model without it, as indicated by the change in χ2 of the two models. None of the interaction terms (e.g., condition × gender, condition × reading comprehension) were significant, and retaining them led to poorer model fits; thus, interactions were excluded from subsequent models. Since neither individual nor classroom mean basic English proficiency significantly predicted the outcome measure, these terms were removed. Although not significant, ethnicity was retained; results do not change when ethnicity is removed from the final model shown in the following.
Level 1: Individual-level model is
Level 2: Classroom-level model is
As shown in Table 4, condition and gender were significant factors predicting consideration of more than one side of a dilemma. After controlling for other factors, including any effects associated with classroom, CG students were significantly better than their counterparts in DI classrooms, β = .45, SE = .23, odds ratio = 1.57, 95% CI [1.02, 2.43], t(683) = 2.03, p < .05, in recognizing different sides of a dilemma; however, no significant differences were found between CG and control classrooms, β = .25, SE = .23, odds ratio = 1.23, 95% CI [.82, 2.00], t(683) = 1.07, p = .29, or between DI and control classrooms, β = –.19, SE = .23, odds ratio = .83, 95% CI [.53, 1.30], t(683) = –.91, p = .37. Girls were better than boys in perceiving the dilemma, β = .89, SE = .17, odds ratio = 2.47, 95% CI [1.77, 6.45], t(683) = 5.33, p < .001.
Two-Level Logistic Regression Models Predicting Recognition of Two Sides of the Dilemma and Weighing Importance of Reasons as a Function of Condition, Gender, Ethnicity, and Reading Comprehension
Note. Gender coded girl = 1, boy = 0. Ethnicity coded Hispanic = 1, others = 0. CG = collaborative groups; DI = direct instruction.
p < .05. **p < .01. ***p < .001.
Comprehensiveness of Reasons
A deliberate and thoughtful decision-making process is marked not only by an individual’s ability to attend to both sides of an issue but also by the variety of reasons that he or she can consider (see Table 1 for the classification of reasons).
Table 5 presents the two-level Poisson regression analysis of the total number of distinct reasons, including both moral principles and practical considerations. Following the same model fitting and selection procedures described previously, the final models included all main effects other than individual or classroom mean basic English proficiency. The results indicate that CG students appealed to a significantly greater number of distinct reasons in their essays than DI students, β = .17, SE = .07, t(683) = 2.47, p < .05, or control students, β = .16, SE = .07, t(683) = 2.23, p < .05, whereas students from DI and control classrooms were indistinguishable from each other, β = –.01, SE = .07, t(683) = –.15, p = .88. Glass’s Δ (Glass, 1976), which is suited to effect size comparisons involving skewed outcomes measures, was Δ = .40 for CG versus DI and Δ = .43 for CG versus control. Girls included significantly more reasons in their essays than boys, β = .16, SE = .05, t(683) = 3.32, p < .001. The results were the same when ethnicity was removed from the final model.
Two-Level Poisson Regression Model of Comprehensiveness of Reasons as a Function of Condition, Gender, Ethnicity, and Reading Comprehension
Note. Gender coded girl = 1, boy = 0. Ethnicity coded Hispanic = 1, others = 0. CG = collaborative groups; DI = direct instruction.
p < .05. ***p < .001.
Weighing the Importance of Reasons
For students who were able to present two sides of a dilemma in their essays, we further investigated whether their essays contained the decision-making criterion of weighing. Relying on the same model fitting and selection procedures as before, a two-level logistic regression of the likelihood of weighing competing arguments was fitted. Table 4 shows that students in CG classrooms performed significantly better than students in either DI classrooms, β = .80, SE = .27, odds ratio = 2.22, 95% CI [1.32, 3.73], t(683) = 3.03, p < .01, or control classrooms, β = .54, SE = .27, odds ratio = 1.72, 95%, CI [1.02, 2.89], t(683) = 2.19, p < .05. DI students did not differ from their counterparts in control classrooms, β = –.26, SE = .27, odds ratio = .77, 95% CI [.44, 1.37], t(683) = –.89, p = .37. Girls were more likely to weigh reasons than boys, β = .97, SE = .24, odds ratio = 2.63, 95% CI [1.66, 4.17], t(683) = 4.10, p < .001. Children with higher reading comprehension had a greater tendency to weigh reasons than less proficient children, β = .03, SE = .01, t(683) = 2.10, p < .05. Ethnicity did not make a significant difference.
General Discussion
The major finding of the present study is that students in collaborative group work classrooms wrote essays that were significantly better than the essays of students from direct instruction classrooms in each of three aspects of decision making—recognizing more than one side of a dilemma, considering a range of reasons, and weighing the importance of reasons. In contrast, DI students performed no better than uninstructed control students.
Comparing the CG and control conditions, CG students trended better in terms of recognizing two sides of a dilemma; however, the difference was not statistically significant, which is inconsistent with findings from previous studies (e.g., Dong et al., 2008, 2009; Reznitskaya et al., 2001). A possible reason for the discrepancy is that the Pinewood Derby story was a near transfer task in prior studies but a far transfer task in the current research. In the previous studies, students in the collaborative reasoning condition discussed stories that contained similar moral or practical dilemmas, which might have increased their ability to recognize both sides of the dilemma in the Pinewood Derby story. CG students were significantly better than control students in the other two aspects of decision making, considering a variety of reasons and weighing the importance of reasons. Taken together, these findings demonstrate far transfer for students who experienced collaborative interaction, inasmuch as wolf reintroduction and management, which CG and DI students had studied for six weeks, has few if any surface features in common with the transfer task—whether or not a boy should tell the teacher that his friend has cheated in a contest building and racing model cars.
Performance on the Wolf Reintroduction and Management Unit mastery tests provides a benchmark for interpreting the transfer effects since that which is not learned cannot be transferred. After the intervention, students completed two unit mastery tests: a 100-item sentence verification test that addressed the main concepts and information in the wolf unit and a 50-minute individually written essay in which they explained their personal decision about whether the pack of wolves should be eradicated. Analysis of performance on the sentence verification test indicated that DI students grasped significantly more concepts and information from the wolf unit than CG students or control students (for details, see Morris, Anderson, & Collaborative Reasoning Research Group, 2013). Morris and colleagues (2013) also examined argument elements in the essays children wrote to explain and justify their decisions about whether the pack of wolves should be eradicated. Using a coding scheme somewhat different from the one employed here, based on an argument-counterargument-rebuttal structure, Morris et al. (2013) found that CG and DI students were equally good at considering counterarguments and rebutting counterarguments as well as producing elaborated reasons based on the concepts and information from the wolf unit. Thus, DI students were able to use knowledge acquired from the wolf unit in an essay that was a recapitulation of the unit.
However, in an essay about a novel situation, the dilemma faced by the boy in the Pinewood Derby story, DI students were unable to make use of knowledge that they had apparently acquired. The contrast of DI students’ performance on the unit mastery tests and the far transfer task seems to indicate that DI students had acquired “inert ideas,” ideas that had been “received into the mind” but which could not be “utilized . . . or thrown into fresh combinations” (Whitehead, 1929/1957).
Direct instruction students and students who participated in collaborative groups studied the same curriculum, so the differences between DI and CG on the far transfer task in the three aspects of decision making must arise from the contrasting conditions of learning. We theorize that the essential difference between direct instruction and collaborative group work lies in what students learn about the “self as agent and others as audience” (Morris et al., 2013). In DI classrooms, teachers do most of the initiating and evaluating, leaving students with circumscribed opportunities for extended reasoning and, we suppose, little cause to believe that their own thinking counts for much in making a decision about the wolves. In a collaborative learning environment, students have the chance to become active arguers and decision makers who engage in chains of reasoning themselves (Ma et al., 2015). CG students are more likely to understand that, like themselves, other people may need help in understanding the implications of ideas, so they work together with others to clarify issues and look at the issues from different perspectives. CG students, in other words, have a stronger sense that they ought to test their ideas against the competing ideas of others, either to persuade or to be persuaded.
Explaining the improved performance of CG students in the essay about the model car competition necessarily involves the assumption that CG students were able to see deep relational correspondences (Chi & VanLehn, 2012; Hummel & Holyoak, 2003) between the wolf management problem and the question about cheating in the model car race because the two situations do not share common surface features. The set of relational correspondences in common between these and other situations that call for decision making may be said to comprise a decision-making schema (Anderson & Pearson, 1984; Reznitskaya et al., 2009; Rumelhart, 1980), or overall sense of when and how to engage in a deliberate decision-making process. We put the focus on process rather than outcome because it is incorrect to equate a sound decision-making process with a favorable outcome, as unexpected or uncontrollable events may affect an outcome. However, in the long term, knowing how to make reasonable decisions should lead to an increased chance of obtaining good outcomes compared to a thoughtless or imprudent decision-making process (Clemen & Hampton, 1994).
The Wolf Reintroduction and Management Unit afforded an opportunity to learn something about making decisions about complex and controversial issues. CG students were positioned to take advantage of this opportunity. DI students were not. DI students were positioned as bystanders looking on as others, described in the instructional materials and by the teacher, considered the decision about the wolves. Rumelhart (1980) likened a schema to a script for a play, explaining that, Just as a play has characters that can be played by different actors at different times without changing the essential nature of the play, so a schema has variables that can be associated with (bound to) different aspects of the environment on different instantiations of the schema. (p. 35)
The CG student could readily bind to the role of decision maker, the DI student not so readily or not at all.
Daily engagement in dialogue with others was expected to raise CG students’ awareness of varied perspectives on the decision about wolves. Increasing awareness might be realized through the snowball process: When one child introduces a new perspective or another way of reasoning, it often spreads to other children. Anderson and colleagues (2001) found that once a child utilized a new argument strategy in a small-group collaborative discussion—for instance, linking propositions with but as a way to show disagreement and introduce a counterargument—other discussants immediately picked up the strategy and began using it. The consequence was increasing use of worthwhile argument strategies over time and among increasing numbers of discussion participants. Lin et al. (2012) reported that strategies as complicated as arguing by analogy snowball during collaborative discussions.
CG students appealed to a significantly greater number of different moral principles and practical considerations in their essays. Plausibly, many of these were generated analogically from arguments that occurred during the wolf unit discussions. For example, the principle of fairness would have been invoked in an argument for wolves’ right to live or an argument about the infringement of the interests of tourism-dependent business owners if wolves were eradicated. The Golden Rule entails taking another’s perspective, like putting oneself into the position of wolves. Based on the snowball process, if one child brings up a new perspective, other children in the group may pick it up, elaborate, and extend it. Eventually, children internalize these moral principles and develop greater insight into the circumstances in which they apply, allowing them to flexibly use the principles when they encounter a novel situation. The following is an example in which the Golden Rule was applied in a collaborative discussion by Albert. The form is echoed by Sandra, but she takes the other side, saying that she and her family would not like it if a wolf killed their pet.
[00:25:22.00] Albert: Yeah, but wolves are like humans like they have like like if you were a wolf how would you feel if somebody came and killed you? . . . You don’t like it. [00:25:32.01] [00:25:34.20] Sandra: But we like we have an animal and if it will kill it we would not (neither) like it. [00:25:41.14]
When making a decision, the weighing of options builds on the recognition of the competing sides of a dilemma and awareness of various reasons that can be advanced on each side. Piaget (1947/1976; see also, Johnson & Johnson, 2009) maintained that real learning does not occur until children reach a state of disequilibrium when newly encountered information contradicts their previous beliefs. During collaborative discussions, when a conflict between a student’s belief and others’ opinions arose, there was a need for them to reconcile the difference, and this is likely to be the time when weighing occurred. Students in CG classrooms had many opportunities to compare competing claims and hear others compare competing claims, which would introduce them to the process of weighing.
Initial analyses of classroom talk during the wolf unit are consistent with the explanation that differences in the dialogue in CG and DI classrooms figured in the differential development of decision making of CG and DI students (Ma et al., 2015; Morris et al., 2013). Morris and colleagues (2013) analyzed the frequency of use of coordinating conjunctions in video clips systematically sampled from the approximately 500 hours of lesson videos recorded in CG and DI classrooms. Conjunctions showing how ideas are connected are a low-inference indicator of reasoning (Lin et al., 2015). Because and so can mark causal connections, if and then can indicate temporal and logical relations, while the contrastive but can be used to introduce a counterargument, as is illustrated in Table 2. CG students’ rate of use of the coordinating conjunctions because, so, if, then, and, and but was more than four times higher than the rate of DI students. In contrast, interestingly, DI teachers used the set of conjunctions at over twice the rate of CG teachers. The picture that emerges of the DI classroom is one in which the teacher does most of the reasoning, with students filling in small pieces when requested, whereas in CG classrooms, students do most of the reasoning, with the teacher occasionally redirecting when students go off track.
As a specific indicator of whether students considered themselves to be active decision-making agents, Morris and colleagues (2013) searched the video clips from the CG and DI classrooms for the performative verb phrases I think and I know. CG students used these phrases at twice the rate of DI students.
Girls performed significantly better than boys in the three aspects of decision making: recognizing a dilemma, appealing to a range of reasons, and weighing reasons on the two sides, perhaps related to the general finding that girls are better writers than boys (e.g., National Center for Education Statistics, 2011). Gender did not interact with condition, meaning that CG and DI had a comparable influence on boys and girls.
The present study shows the value of an embedded issue-based socio-scientific decision-making curriculum in helping young students become aware of how to make reasonable and responsible decisions. Importantly, the curriculum was successful only when students participated in collaborative groups. Students who received teacher-led direct instruction did no better than uninstructed controls. The study is an advance over previous studies because it demonstrates that students who studied the decision-marking curriculum in collaborative groups were able to transfer what they had learned to a very different task. This implies that they had acquired generalizable skills and dispositions important for decision making.
An educational policy implication of our findings is that if children are to start on the path toward becoming thoughtful decision makers, they need more time during the school day for active reasoning about significant issues. More time for active reasoning about significant issues is critical for children of color because it is one key to “affirmative development of intellective competence and academic ability” (Gordon, 2014, p. 26). But since the passage of No Child Left Behind, the nation’s schools have been moving in the opposite direction. The race to the bottom has been swift in schools with large enrollments of minority and low-income children, who now spend most of the school day on arithmetic exercises and simple reading strategies. Science, social studies, art, and music have been stripped from the curriculum, and recess has been eliminated to make more room for basic skills instruction. Collaborative learning in small groups has given way to teacher-directed whole-class instruction and individual seatwork, which are assumed to be more efficient for transmission of basic skills. The Common Core State Standards were supposed to represent a change of direction. The standards do stress reasoning and other higher-order skills, but as put into practice, the Common Core is perpetuating a test-driven accountability system and a teacher-dominated approach to discussion that are inconsistent with the development of reasoning.
Footnotes
Notes
X
R
J
B
K
T
J
M
T
J
B
S
K
J
