Abstract
Previous research has demonstrated the importance of ensuring that programs are implemented as intended by program developers in order to achieve desired program effects. The current study examined implementation fidelity of Pathways to Health (Pathways), a newly developed obesity prevention program for fourth- through sixth-grade children. We explored the associations between self-reported and observed implementation fidelity scores and whether implementation fidelity differed across the first 2 years of program implementation. Additionally, we examined whether implementation fidelity affected program outcomes and whether teacher beliefs were associated with implementation fidelity. The program was better received, and implementation fidelity had more effects on program outcomes in fifth grade than in fourth grade. Findings suggest that implementation in school-based obesity programs may affect junk food intake and intentions to eat healthfully and exercise. School support was associated with implementation fidelity, suggesting that prevention programs may benefit from including a component that boosts school-wide support.
Introduction
Over the last decade, childhood obesity rates in the United States have increased dramatically (Spruijt-Metz, 2011). The significant psychosocial, health-related, and financial costs associated with obesity suggest a need for preventive interventions in childhood (Datar & Sturm, 2004; Koplan, Liverman, & Kraak, 2005; Oude Luttikhuis et al., 2009; Strauss, Rodzilsky, Burack, & Colin, 2001). Schools may be an ideal setting for delivering obesity prevention programs (Dietz & Gortmaker, 2001; Howerton et al., 2007; Story et al., 2000). Reviews and meta-analyses suggest that school-based obesity prevention interventions can be effective, at least in the short term, for reducing the prevalence of childhood obesity, improving protective factors (e.g., consumption of fruits and vegetables, increased physical activity) and creating a healthier environment (Dietz & Gortmaker, 2001; Dobbins, De Corby, Robeson, Husson, & Tirilis, 2009; Gonzalez-Suarez, Worley, Grimmer-Somers, & Dones, 2009). However, school-based obesity prevention programs are not universally successful (Shaya, Flores, Gbarayor, & Wang, 2008). Variation in success may be due in part to differences in program implementation, suggesting the need to focus on the quality of intervention delivery (Domitrovich & Greenberg, 2000).
Implementation Fidelity in Obesity Prevention
Implementation fidelity is the degree to which programs are implemented as intended by program developers in order to achieve desired program effects (Durlak & DuPre, 2008). Numerous school-based health promotion and disease prevention studies have shown that program implementation is associated with program mediators (Resnicow et al., 1998; Rohrbach, Graham, & Hansen, 1993; Rohrbach, Gunning, Sun, & Sussman, 2010) and program outcomes (Durlak & DuPre, 2008; Lillehoj, Griffin, & Spoth, 2004; McGraw et al., 1996; Pentz et al., 1990). Saunders, Ward, Felton, Dowda, and Pate (2006) found that a greater percentage of girls engaged in vigorous physical activity in high-implementing schools than in low-implementing schools. McGraw et al. (1996) found that the percentage of classroom sessions modified by the teacher, a proxy for implementation fidelity, was associated with positive changes in student dietary self-efficacy and knowledge. Authors hypothesized that modifications were in essence tailoring the lessons to the needs of their students, making the program more effective (McGraw et al., 1996). Resnicow et al. (1998) found an association between health knowledge and observed fidelity but no effects on fruit and vegetable intake or asking behaviors. These results are promising but inconclusive.
Contextual Factors
Contextual factors within the school structure are likely to influence the fidelity of implementation of prevention programs (Chen, 1998; Domitrovich et al., 2008). Schools are often strained to meet academic and policy-related priorities. Therefore, when a principal supports the implementation of an intervention, teachers are likely to implement the program with greater fidelity (Gregory, Henry, & Schoeny, 2007; Kam, Greenberg, & Walls, 2003; Ringwalt et al., 2003). Other factors, such as teachers’ beliefs about the benefits of a particular program and comfort implementing the program, have also been shown to increase implementation fidelity (Beets et al., 2008; Ennett et al., 2003; Kallestad & Olweus, 2003; Klimes-Dougan et al., 2009; Little, Sussman, Sun, & Rohrbach, 2013; Ringwalt et al., 2003). The influence of these types of contextual factors on the fidelity of prevention program implementation in schools is an integral, yet often overlooked, part of implementation research.
The Current Study
The present study uniquely contributes to the field of implementation research by exploring the associations between implementation fidelity, contextual factors, and program outcomes in a school-based obesity prevention program. The current study had three objectives: (a) explore the implementation fidelity of the Pathways program across the first 2 years of program implementation, (b) examine associations of implementation fidelity measures with program outcomes of the Pathways curriculum, and (c) explore the associations between teachers’ beliefs about school support for the Pathways program, benefits of the program, comfort implementing the program, and implementation fidelity. We hypothesized that children whose teachers implemented the Pathways program with a high degree of fidelity would demonstrate significantly less sedentary behavior and junk food intake and significantly greater fruit and vegetable intake, physical activity, and intentions to exercise and eat healthy.
Method
Study Design
Data for the present study are from the implementation of a newly developed school-based obesity prevention program, Pathways to Health (Pathways), for fourth- through sixth-grade children. Pathways was adapted from two nationally recognized evidence-based programs, The Midwestern Prevention Project (MMP or STAR; Pentz, Mihalic, & Grotpeter, 1997) for drug abuse prevention and Promoting Alternative THinking Strategies (Greenberg & Kusché, 1993) for violence prevention. A more detailed account of the translation process and curriculum development can be found in Sakuma, Riggs, and Pentz (2011). Data were collected from the 12 schools and 38 classrooms implementing the Pathways curriculum (Riggs, Spruijt-Metz, Chou, & Pents, 2012). Principals selected teachers who had a fourth-grade class for participation. Teachers attended a 1-day in-person training session using methodology adapted from a previous large-scale drug abuse prevention trial (Pentz & Trebow, 1997). The program was implemented in both fourth and fifth grades. Regular classroom teachers delivered the 15-session fourth-grade curriculum over the course of 7 weeks and the 10-session fifth-grade curriculum over the course of 5 weeks. Sessions lasted an average of 45 min; teachers were encouraged to implement the curriculum 2–3 times per week.
Subjects
Students
We approached 1,204 students for consent in the program schools in fourth grade (see Figure 1). Fifty-seven percent of students (N = 685) received parental consent for participation, 24% (N = 294) did not have parental consent and were treated as anonymous, and 19% declined to participate (N = 225). Anonymous students were not included in the current analyses because baseline and follow-up data could not be linked by individual. Of the 685 eligible students, 581 (85%) took the baseline survey. Five hundred and forty-two students completed the fourth-grade follow-up (93% retention rate) and 411 students completed the fifth-grade follow-up (76% retention rate). Students with incomplete teacher-level data (observations and self-reports) were excluded (N = 8 in fourth grade; N = 20 in fifth grade). The final analytic sample for the fourth-grade analyses was 534 students, and the final analytic sample for the fifth-grade analyses was 391 students. At baseline, students were 9.2 years old (SD = 0.5), on average, and a quarter of the sample received free or reduced lunch (24.9%). The sample was 51.5% female, 31.5% White, 3.6% African American, 25.9% Hispanic, 8.2% Asian, and 31.7% mixed ethnicity.

Flow of study participants.
Teachers
A total of 32 teachers were trained in fourth grade, but 2 were excluded from this study because they did not have both observation and self-report data (N = 30, 94%). In fifth grade, 40 teachers were trained, but 1 was excluded due to incomplete observation data (N = 39, 98%). Individual teacher demographic characteristics were not evaluated in order to increase confidentiality and the validity of teacher self-reports.
Data Collection Procedures and Measures
Student Assessment
Baseline and follow-up measures were collected from students using paper-and-pencil surveys. At each wave of data collection, surveys assessed food intake, physical activity, sedentary behavior, and intentions to eat healthfully and exercise. Surveys took approximately 45 min to complete. Surveys were read aloud to students by research project staff, with a second staff member available to answer comprehension questions. Due to limited time, abbreviated scales were used. Construction of abbreviated scales proceeded through an extensive pilot testing process where full scales of food intake and physical activity were pilot tested and reduced to index items representing the highest loading items for each scale (Riggs, Chou, Spruijt-Metz, & Pentz, 2010).
Food intake
Fruit and vegetable intake was assessed through items adapted from the Youth Risk Behavior Survey (YRBS; Eaton et al., 2006). Three items asked about fruit intake (e.g., How often did you eat any fruit, fresh, or canned?) and 4 items assessed vegetable intake (7 items total, α = .74; Riggs et al., 2010). YRBS does not assess high fat or high sugar items, so 5 additional items were added from a previously validated food frequency questionnaire (Willett et al., 1985), which asked how often students consumed items such as candy, soda, potato chips, French fries, and pastries. Response choices ranged from 1 = less than once a week to 6 = two or more of these a day (5 items, α = .80).
Physical activity
Children’s level of physical activity was assessed using a shorter version of the Physical Activity Questionnaire for Children (Crocker, Bailey, Faulkner, Kowalski, & McGrath, 1997) that included 3 of the original 9 items (Riggs et al., 2010). These items assessed the level of activity outside of school (i.e., immediately after school, in the evenings, and on the weekend). Items asked, “In the last 7 days, (immediately after school, in the evenings, on the weekend) how often were you very active?” Response choices ranged from 1 = none to 5 = six or more times (3 items, α = .77).
Sedentary behavior
Sedentary behavior was assessed by 3 items on watching TV, playing videogames, and using a computer, adapted from the School-Based Nutrition Monitoring Student Questionnaire (Hoelscher, Day, Kelder, & Ward, 2003; Penkilo, George, & Hoelscher, 2008). Items asked, “On a regular school day, how many hours per day do you (a) usually watch TV or video movies at home or away from school, (b) spend on a computer at home or away from school and (c) play video games that you sit down to play like PlayStation, Xbox, GameBoy, or arcade games?” (3 items, α = .60; Riggs et al., 2010).
Intentions
Four items assessed students’ intentions to eat healthier and exercise more, adapted from a previous research trial (Pentz, Cormack, Flay, Hansen, & Johnson, 1986). These items are hypothesized to lead to behavior change based on the theory of planned behavior (Ajzen, 1991). Items asked, “When you get home today from school will you (e.g., choose to watch less TV than normal)?” Response choices were as follows: 1 = no, 2 = maybe and 3 = yes (α = .59).
Implementation Fidelity
Both observation and self-report methods assessed implementation fidelity. Project staff observed teachers during two sessions (fourth-grade curriculum Sessions #7 and #13; fifth-grade curriculum Sessions #6 and #9). These sessions were chosen because they were highly interactive and thought to be more difficult to implement (Durlak & DuPre, 2008). Self-reported implementation was assessed and measured using a standard form adapted from a previous research trial (Pentz et al., 1990). Teachers received the form prior to implementation and were instructed to complete it throughout the implementation period and return it in a prestamped envelope after program completion.
Observed implementation
A multidimensional implementation fidelity index was generated using two components: participant engagement (the extent of engagement of participants within the intervention, beyond exposure) and quality of delivery (as measured by teacher’s enthusiasm, integrity, and quality) (Dane & Schneider, 1998; Durlak & DuPre, 2008; Linnan & Steckler, 2000; Resnicow et al., 1998; Rohrbach et al., 2010). Three items assessed observed participant engagement (α = .80fourth grade; α = .80fifth grade). A sample item asked, “Approximately what percentage of the students actively participated in the discussions, role-plays or activities?” scored 1 = less than 10% to 5 = 76–100%. Six items assessed observed quality of delivery (α = .76fourth grade; α = .76fifth grade). A sample item asked, “How much enthusiasm did the teacher demonstrate for the program?” scored 1 = almost no enthusiasm, just went through the motions to 5 = very enthusiastic, seemed really involved and excited. Since these two indices were highly correlated (r = .73fourth grade; r = .80fifth grade), individual items were averaged to create a composite observed implementation fidelity score (α = .84fourth grade; α = .89fifth grade).
Self-reported implementation and beliefs
The self-report implementation instrument assessed implementation fidelity through three components: participant engagement, fidelity (as measured through quality of delivery), and dose delivered (Dane & Schneider, 1998; Durlak & DuPre, 2008; Linnan & Steckler, 2000). One item assessed self-reported participant engagement, “On average, when active participation of your students was required, what percentage of your students actively participated?” scored 1 = less than 25% to 4 = 76–100%. Self-reported quality was assessed by asking for each session, “How much do you think this lesson achieved its objectives?” scored 1 = not at all to 5 = very much (α = .95fourth grade; α = .98fifth grade). These two indices were correlated (r = .29fourth grade; r = .67fifth grade), and individual items were averaged to create a composite self-reported implementation fidelity score (α = .95fourth grade; α = .98fifth grade). To assess self-reported dose delivered, teachers reported for each session “Did you teach it?” scored 0 = no to 1 = yes. These items were summed to create an overall self-reported dose delivered index. However, this index was highly skewed (see Table 1) and was not included in the overall self-reported implementation fidelity score.
Implementation Fidelity Scores and Teacher Beliefs, M (SD) or %.
Note. aScale ranges from 1 to 5. bScale ranges from 1 to 4. cThere were 15 sessions in the fourth-grade curriculum and 10 sessions in the fifth-grade curriculum; t-tests were not performed for these items.
*p < .05.
Teacher beliefs about the program
Perceived school support for the Pathways program was assessed through 2 items (e.g., “Most administrators at my school were supportive of my teaching Pathways.”) scored 1 = strongly disagree to 5 strongly agree (α = .71fourth grade; α = .78fifth grade). Perceived benefits of the program were assessed through 3 items (e.g., “Pathways is an effective way to prevent unhealthy eating.”) scored 1 = strongly disagree to 5 strongly agree (α = .62fourth grade; α = .68fifth grade). Comfort implementing the program was assessed through 9 items (e.g., “Pathways fits well with the way I like to teach.”) scored 1 = strongly disagree to 5 strongly agree (α = .88fourth grade; α = .92fifth grade).
Data Analysis
The first objective was to explore the implementation fidelity of the Pathways program in the first 2 years of program implementation. We conducted Pearson product moment correlations to test associations between self-report and observed implementation fidelity scores. Next, we utilized t-tests to explore differences in implementation fidelity between the fourth- and fifth-grade implementation. The second objective was to examine associations between implementation fidelity measures and program outcomes of the Pathways curriculum. General mixed linear modeling (Murray & Hannan, 1990) was conducted to assess the relationship between implementation fidelity and program outcomes. Classroom and school were considered random factors, which allowed us to statistically account for the intraclass correlation within clustered units on computed significance levels. Analyses adjusted for student gender, ethnicity, reduced/free lunch status, the baseline value of the dependent variable, and a propensity-for-attrition score (to be described later). Fifth-grade implementation models controlled for fourth-grade fidelity scores because the program was implemented in both grades. βs and standard errors are reported. The third objective was to test the associations between teachers’ perceived school support, benefits, and comfort implementing the Pathways program and implementation fidelity. We ran Pearson product moment correlations to explore associations between contextual factors and implementation fidelity. Next, we utilized t-tests to explore differences in teachers’ beliefs between the fourth- and fifth-grade implementation. All analyses were conducted using Statistical Analysis System (SAS) statistical package (SAS Institute Inc. SAS/C Online Doc TM, 2000).
Results
Assessment of Attrition Bias
To assess the potential sample bias due to subject attrition at the 1-year follow-up, an attrition analysis was conducted on the analytic sample (n = 391) and the sample that was lost to follow-up (n = 143) on seven key baseline measures. The measures were age, gender, ethnicity, reduced or free lunch, and the five program outcomes. The comparisons used chi-square or t-tests to explore differences between the two groups. Analyses revealed statistically significant incomparability between lost to follow-up and retained subjects on gender, ethnicity, receiving reduced or free lunch, junk food intake, and sedentary behavior. Retained subjects were less likely to be female, 48.1% versus 60.8%; χ2(1, N = 534) = 6.82, p < .01; less likely to be Hispanic, 22.5% versus 32.2%, χ2(1, N = 534) = 5.20, p < .05; less likely to receive free lunch, 16.9% versus 46.9%, χ2(1, N = 534) = 50.30, p < .001; less likely to consume junk food, 2.33 versus 2.60, t(218) = −2.44, p < .05; and less likely to be sedentary, 2.49 versus 2.81, t(529) = −2.69, p < .01. To assess whether our results are generalizable to those students not included in the current study, an analysis was conducted between the consented students (N = 581) and the anonymous students (N = 198; i.e., students who did not have parental consent and were treated as anonymous) surveyed at baseline. The comparisons used chi-square or t-tests to explore differences between the two groups. Analyses revealed statistical incomparability between the consented students and the anonymous students on gender, ethnicity, receiving reduced or free lunch, junk food intake, and sedentary behavior. Consented students were more likely to be female, 51.6% vs. 42.4%; χ2(1, N = 779) = 5.01, p < .05; less likely to be Hispanic, 25.7% versus 33.5%, χ2(1, N = 779) = 4.53, p < .05; less likely to receive free lunch, 27.1% versus 41.1%, χ2(1, N = 779) = 13.65, p < .001; less likely to consume junk food, 2.38 versus 2.56, t(772) = −2.07, p < .05; and less likely to be sedentary, 2.57 versus 2.95, t(770) = −3.50, p < .01. Due to these differences, the extent of this generalizability is restricted to a population with baseline measurement access restrictions such as those experienced in this study.
To statistically adjust for possible bias introduced by nonrandom attrition at 1-year follow-up, a “propensity-for-attrition” score was computed for each subject retained at the follow-up, and adjusted for in each of the statistical models (Rosenbaum & Rubin, 1984). This score was computed among the entire baseline sample by associating the difference in the selected baseline measures (gender, ethnicity, receiving reduced or free lunch, junk food intake, and sedentary behavior) to the actual attrition status in a multiple regression analysis, and then assuming the association is also maintained among the subjects retained at the 1-year follow-up.
Implementation Fidelity in Fourth and Fifth Grade
Results comparing implementation fidelity items in fourth and fifth grade are presented in Table 1. In fourth grade, 81.1% of teachers reported implementing all 15 sessions, and in fifth grade, 97.6% of teachers reported implementing all 10 sessions. Both self-reported participant engagement and implementation quality were significantly higher in fifth than fourth grade (ps < .05), while observed fidelity was lower in fifth than fourth grade (p < .05). There was no significant difference across years for observed participant engagement. In fourth grade, observed implementation fidelity scores were negatively correlated with self-reported scores (r = −.10, p < .05) but, in fifth grade, were positively correlated (r = .72, p < .001).
Implementation Fidelity and Program Outcomes
Table 2 shows results of the mixed linear regression models exploring the relationship between observed and self-reported implementation fidelity and program outcomes in fourth and fifth grades. There were no significant effects in the fourth-grade models. In fifth grade, both observed and self-reported implementation fidelity were significantly associated with higher intentions to eat healthfully and exercise and lower junk food intake (ps < .05). There were no significant effects on fruit and vegetable intake, physical activity outside of school, or sedentary behavior.
Parameter Estimates From Mixed Effects Models for Implementation Effects.
Note. Values are βs and standard errors. Two-tailed significance tests were used.
Random effects adjusted for in the models included school and classroom (teacher); Fixed effects controlled for in the models included propensity-for-attrition score, student gender, ethnicity, socioeconomic status, the baseline value of the dependent variable, and fourth-grade fidelity score (only in fifth-grade models).
*p < .01. **p < .001.
Implementation Fidelity and Teacher Beliefs
Perceived beliefs about school support for the Pathways program and comfort implementing the program were higher in fifth than fourth grade (ps < .05; see Table 1). There were no significant differences in perceived benefits of the program between grades. The results exploring associations between teacher beliefs and implementation fidelity in fourth and fifth grades are presented in Table 3. In both fourth and fifth grades, perceiving school support for the Pathways program, perceiving positive benefits of the program, and having comfort implementing the program were all significantly associated with increased self-report and observed implementation fidelity (ps < .05). In fifth grade, the associations were stronger than in fourth grade.
Correlation Matrix of Implementation Fidelity and Teacher Beliefs.
Note. Pearson product moment correlations of data presented. All correlations are statistically significant at p < .05.
Discussion
The current study examined implementation fidelity of the Pathways program in the first 2 years, associations between self-reported and observed implementation fidelity, effects of implementation on health outcomes, and associations between teacher beliefs and implementation fidelity. While there was a high level of dose delivered across the fourth- and fifth-grade implementation (81.1% and 97.6%, respectively), it appears that fifth-grade implementation was associated with program outcomes.
In fourth grade, observed implementation fidelity scores were negatively correlated with self-reported scores (r = −.14, p < .01), revealing discrepant perceptions between staff observers and teachers. In contrast, in fifth grade, observed implementation fidelity scores were positively correlated with self-reported scores (r = .74, p < .0001), indicating that staff observers and teachers had similar perceptions of implementation fidelity. A limited number of studies have used multiple methods to assess implementation fidelity, and within these studies a wide range of agreement between observed and self-reported implementation has been reported (Hansen & McNeal, 1999; Lillehoj et al., 2004). Self-reported adherence may overestimate adherence, compared to objective assessments, possibly due to social desirability or interviewer bias (Adams, Soumerai, Lomas, & Ross-Degnan, 1999). Self-reports may be more inaccurate compared to observed data when respondents fear consequences from the results, but self-reports may be more valid measures for less sensitive information, such as height (Donaldson & Grant-Vallone, 2002). We used different measures and assessed slightly different components of implementation fidelity in self-report versus objective assessments. Also, different teachers implemented the curriculum in fourth and fifth grades. In addition, the program was tailored based on teacher feedback following fourth-grade implementation, and changes may have enhanced teachers’ understanding and improved their ability to align program delivery with the intended curriculum. Unfortunately, because we did not assess teacher background characteristics, such as teaching experience, it is impossible to rule out which, if any, of these factors had an effect on our results. Therefore, our results should be interpreted with caution but support the use of multimethod (i.e., self-report and observed) assessments to increase accuracy.
Consistent with previous research, we found that perceived school support, belief that the program was beneficial, and comfort with implementation were associated with higher levels of implementation fidelity (Beets et al., 2008; Ennett et al., 2003; Gregory et al., 2007; Kallestad & Olweus, 2003; Kam et al., 2003; Klimes-Dougan et al., 2009; Ringwalt et al., 2003). Given the considerable pressures teachers face to prepare students for standardized tests, prevention curricula are often seen as ancillary (Ringwalt et al., 2003). Therefore, it is important to ensure that teachers receive support from the principal and school staff. Principals can be central in promoting positive attitudes toward the program among parents and the community and in fostering a sense of collaboration among teachers and other staff to assist in the implementation of a prevention program (Kam et al., 2003). Adequate training also helps ensure that teachers feel comfortable implementing interactive sessions.
Interestingly, comfort with the program and perceived school support for the program were both higher among fifth-grade teachers than fourth-grade teachers, which may explain why fifth-grade teachers reported higher levels of implementation fidelity. It is unclear why observed implementation fidelity was lower in fifth grade compared to fourth grade. Because only two sessions were observed, one potential explanation is that the sessions observed in fifth grade were more challenging to implement. Previous studies have found that implementation fidelity is often lower in sessions that are more difficult to implement (Botvin, Baker, Filazzola, & Botvin, 1990; Hahn, Noland, Rayens, & Christie, 2002; Kallestad & Olweus, 2003). Since observed fidelity was relatively high among both fourth- and fifth-grade teachers, the question remains open. Surprisingly, we found that implementation fidelity affected two of the five program outcomes in fifth grade but had no effect in fourth grade. A number of potential explanations could explain these unexpected findings. In fourth grade, teachers reported making significantly more adaptations to the curriculum compared to fifth grade (p < .05). Although researchers expect teachers to implement the program without changing the core elements and internal logic of an intervention, often programs are adapted to meet the needs of the program recipients (Dusenbury, Brannigan, Hansen, Walsh, & Falco, 2005). Although we do not have specific details regarding the adaptations made, despite asking teachers to report the details of their adaptations on the self-report survey, it appears that these adaptations negatively affected program outcomes, which could be a reason we did not observe associations between implementation fidelity and program outcomes in the fourth-grade curriculum. Similarly, McGraw et al. (1996) found that teachers who implemented their program made significant adaptations to their curriculum, altering the effectiveness of the study. In that case, however, the adaptations were positive and made the more program more effective (McGraw et al., 1996). Based on the frequency with which adaptations are made in the adoption of prevention curriculum, program developers need to be aware of the reasons teachers make modifications and develop recommendations to ensure program goals are realized (Dusenbury et al., 2005). Additionally, it could be useful to provide additional support and coaching to teachers as they deliver core elements of the intervention. The provision of technical assistance has been associated with improved implementation of prevention interventions in schools (Fagan, Hanson, Hawkins, & Arthur, 2008; Gingiss, Roberts-Gray, & Boerm, 2006; Roberts-Gray, Gingiss, & Boerm, 2007; Rohrbach et al., 2010).
Another potential explanation for the unexpected findings is that multiple iterations of program delivery are required before the effects are observed. The Pathways program focuses on changing child impulse control, decision making, and social competence through interactive classroom lessons (Sakuma, Riggs, & Pentz, 2011). Altering these cognitive functions in children takes time, and changes may not be visible immediately, possibly explaining why we did not see changes in student behavior immediately after fourth-grade implementation but did after fifth-grade implementation. However, additional longitudinal analysis will be required to test whether the curriculum was able to change health behaviors over time. Additionally, the Pathways curriculum could be too mature for fourth graders, which could explain the lack of effects. However, future studies will need to replicate the program during more controlled conditions to disentangle these possibilities.
Strengths and Limitations
The current study contributes to the field of implementation research by examining the associations between implementation fidelity, contextual factors, and program outcomes over the course of 2 years of program implementation. The Pathways curriculum was taught in both fourth and fifth grades, focusing on different skills in each grade. Using feedback from the fourth-grade teachers, the fifth-grade curriculum was refined and strengthened. These improvements, however, may have made it difficult to disentangle the longitudinal effects of the fourth-grade curriculum on health outcomes from the proximate effects of the fifth-grade curriculum. One limitation of the study design is that students received both. However, the current analysis attempted to separate effects statistically by controlling for fourth-grade implementation fidelity scores when examining fifth-grade outcomes.
Another limitation is that two contextual factors that may have been especially valuable for assessing changes in school environment were not measured. First, this study occurred during the height of the 2009 economic collapse. Four participating schools closed between fourth- and fifth-grade implementation. Although these closures placed a large burden on research program staff to track students who switched to new schools, retention remained high in fifth grade (76%). However, a climate of school closings may have affected morale at schools that remained open. Second, many of the fourth-grade teachers received pink slips during the study, which could have affected the quality of program implementation. Unfortunately, lowered morale may be a common problem that researchers face when working in schools in the current economic climate.
Finally, the use of a single item indicator to assess self-reported participant engagement could have masked important findings. However, the αs for self-reported implementation fidelity are high in both fourth and fifth grade suggesting that the items constitute one index and using both self-reported and observed ratings of implementation fidelity is more reliable that using a single method.
Conclusion
Given the enormous burdens schools are under to meet academic and policy-related priorities and significant reductions in public funds, it is crucial that future studies include measures of school and community-level contextual factors (Durlak & DuPre, 2008). In the current study, these burdens may have had a significant impact on the implementation and effectiveness of the Pathways program. The program was better received and implementation fidelity had more effects on program outcomes in fifth grade compared to fourth grade, when much of the initial turmoil of the economic collapse had settled. Additionally, school support was associated with implementation fidelity, suggesting that prevention programs may benefit from including a component that boosts school-wide support.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by grant HD 052107 (NICHD, NIDA; Pentz, P.I.), and is registered at
(#NCT00787709). ML was supported during the work on this project by a postdoctoral fellowship on grant R25 CA90956.
