Abstract
We conducted a community-based, randomized control trial with intent-to-treat analyses of Promoting First Relationships (PFR) to improve parenting and toddler outcomes for toddlers in state dependency. Toddlers (10–24 months; N = 210) with a recent placement disruption were randomized to 10-week PFR or a comparison condition. Community agency providers were trained to use PFR in the intervention for caregivers. From baseline to postintervention, observational ratings of caregiver sensitivity improved more in the PFR condition than in the comparison condition, with an effect size for the difference in adjusted means postintervention of d = .41. Caregiver understanding of toddlers’ social emotional needs and caregiver reports of child competence also differed by intervention condition postintervention (d = .36 and d = .42) with caregivers in the PFR condition reporting more understanding of toddlers and child competence. Models of PFR effects on within-individual change were significant for caregiver sensitivity and understanding of toddlers. At the 6-month follow-up, only 61% of original sample dyads were still intact and there were no significant differences on caregiver or child outcomes.
Keywords
Infants and toddlers are a growing population in child welfare. Between 2000 and 2010, the percentage of maltreated children in child welfare who were under 3 years old increased from 15.7% to 27.3% (Administration for Children and Families, 2000, 2010). In the first year of life, human infants are biologically primed to develop primary attachments to their parents, from whom they seek comfort and nurturance, and who are critical to the development of early regulatory capacity (Bowlby, 1969/1982; Siegel, 2001). Maltreatment, subsequent out-of-home placement, and multiple moves in care result in disruptions of primary attachment relationships that are critical to later social and emotional functioning (Sroufe, 1988). For infants and toddlers in child welfare state dependency, risks associated with the lack of protection in their birth family that led to child welfare involvement may be compounded by sequential separation from one or more attachment figures during this critical period of rapid physiological, neurological, and developmental maturation (Rubin et al., 2004; Rubin, O’Reilly, Luan, & Localio, 2007). In a recent plea for developmentally informed foster care for maltreated infants and young children, Zeanah, Shauffer, and Dozier (2011) emphasized that attachment must be of paramount consideration in child welfare planning, decision making, and training of foster care providers and parents. In accord with a call to action from Zeanah and colleagues (2011), we report on the results of a randomized control study of a brief intervention to help caregivers promote secure attachments and emotional regulation for toddlers in child welfare. This is the first randomized trial of the intervention and training program, Promoting First Relationships (PFR; Kelly, Sandoval, Zuckerman, & Buehlman, 2008).
Multiple Placements and Attachment Disruptions
Infants in child welfare must develop a new attachment relationship with each placement. Bowlby (1969/1982) described how very young children grieve when attachment relationships are disrupted, even when the children receive excellent care. Infants younger than 12 months tend to seek comfort and closeness from new attachment figures within a week of placement, but toddlers older than a year do not readily seek out their new attachment figures, and may be inconsolable or push them away (Stovall-McClough & Dozier, 2004). The process of attachment formation in toddlers and older children is complicated by these difficult child behaviors, which may be exacerbated if there are serial placements resulting in multiple caregiver changes. These events can compound the effects of early maltreatment and lead to mental health problems. Placement instability increases the risk of later internalizing and externalizing disorders for children with initially normative levels of these problems (Newton, Litrownik, & Landsverk, 2000). However, caregiver sensitivity and understanding of the meaning of the child’s behavior is associated with child attachment security, even for older toddlers and preschoolers in foster care (Stovall-McClough & Dozier, 2004).
Because they experienced disturbed or disrupted primary relationships, many infants and toddlers in child welfare or foster care are less able to use their caregivers to buffer or regulate stress than infants with consistent caregivers and are more likely to develop insecure or disorganized attachments which contribute to regulatory problems (Dozier, Higley, Albus, & Nutter, 2002). The pervasive effects of early separation from caregivers may be compounded by other risk factors. Infants who enter foster care as newborns have elevated rates of low birth weight, birth abnormalities, lack of prenatal care, and exposure to drugs, alcohol, and other teratogens (Klee, Kronstadt, & Zlotnick, 1997; Putnam-Hornstein & Needell, 2011). These factors compromise infants’ regulatory capacities and pose additional challenges to caregivers (Klee et al., 1997). Regulatory problems in infancy have significant costs, as these children continue to differ from peers in mood regulation, sensory integration, motor control, sleep, and behavioral control as they get older (Degangi, Breinbauer, Doussard, Porges, & Greenspan, 2000).
Foster mothers more frequently report that their children are difficult to calm or soothe than mothers of children who have never been in care (Tyrrell & Dozier, 1999). Because infants with disrupted attachments do not elicit care in expectable ways, they are regarded as difficult by caregivers (Dozier et al., 2002). Toddlers in foster care often behave in avoidant or resistant ways when hurt or frightened, and their caregivers respond “in kind“ (Stovall-McClough & Dozier, 2004) by ignoring or getting angry. Evidence-based interventions, informed by the attachment literature, are needed to help caregivers to provide care that promotes security for this vulnerable population of children.
Conceptual Framework for Interventions to Promote Attachment Security
A primary concept undergirding effective, attachment-based interventions is caregiver sensitivity. De Wolff and van IJzendoorn (1997) conducted a meta-analysis of 21 studies (N = 1,099) that assessed maternal sensitivity and child attachment security using observational measures. Results showed a combined effect size of r = .24 for the association between the two. Thus, sensitivity is an important (but not exclusive) condition of attachment security and should be a focus in interventions to promote attachment security. In a meta-analysis of 51 randomized controlled studies of attachment-based interventions, Bakermans-Kranenburg, van IJzendoorn, and Juffer (2003) found that brief interventions that used video feedback techniques and a focus on promoting positive parent–child interactions were most effective in improving caregiver sensitivity and that interventions that had a moderate to large effect on sensitivity also improved child attachment security.
Juffer, Bakermans-Kranenburg, and van IJzendoorn (2008) summarize why video feedback is a potent intervention. When video feedback is strengths-based, as in PFR, the provider delivering the intervention highlights instances of sensitive nurturance, thereby increasing caregiver awareness, confidence, and competence. When the interaction is not going well and the child is behaving in challenging ways, the provider, in a sensitive manner, helps the caregiver notice, reflect on, and interpret the meaning of the child’s behavior. From this perspective, the provider helps the caregiver acquire a deeper understanding of developmentally appropriate expectations of the child and reframe the meaning of challenging behaviors as revealing the child’s need for the caregiver’s sensitive and nurturing care to regulate or repair the interaction. Considerable research indicates that reflective video feedback improves parents’ sensitive responsiveness because it increases their ability to think about the feelings and needs underlying their child’s behavior, to attune to the child’s positive and negative emotions, and to understand and respond appropriately to attachment versus exploratory behavior (Juffer et al., 2008).
Video feedback used in reflective ways helps caregivers appropriately understand the mind of the developing child, an ability which in turn is associated with attachment security (Meins, Fernyhough, Fradley, & Tuckey, 2001). Grienenberger, Kelly, and Slade (2005) showed how the effect of maternal reflective ability on child attachment security was mediated by maternal sensitivity. Lieberman (2003) studied adoptive parents who had concerns about their children’s ability to attach to them. She found that parents missed or overlooked child cues of need for care, including misinterpreting the meaning of tantrums, defiance, and noncompliance as signs that the children did not care for them, rather than as expressions of anxiety and fear of loss. Parents often responded with disciplinary measures such as “time out” when they perceived the children’s behavior was inappropriate, instead of responding with firm but comforting behavior that would have reassured the children about the parents’ ongoing availability. Similar challenges, further exacerbated by parental guilt, anxiety, and fear of loss, are faced by birth parents being reunified with their young children after a period in foster care during which the children developed attachments to other caregivers.
A brief attachment-based intervention for infants and toddlers in the child welfare system that has been tested in a randomized control trial is Dozier’s Attachment and Biobehavioral Catch-up (ABC; Dozier & Infant-Caregiver Lab, 2003). ABC is designed to help caregivers provide nurturance when they would not naturally do so and when infants do not signal their needs. In preliminary results, foster parents who received ABC reported less avoidance on the part of their children after distressful episodes, but no differences in secure behaviors (Dozier et al., 2009). Other than Dozier’s pioneering work, to our knowledge no additional brief interventions have been tested for infants and toddlers in foster care, perhaps because the behavioral problems of infants and toddlers are less apparent and more manageable than those of older children in care. While we know infants in child welfare are at high risk of short-term and long-term effects of disruptions in attachment relationships, their mental health problems do not demand attention until they are older.
PFR
PFR was developed in the late 1990s and has been a popular infant mental health training program for workers in early intervention, community mental health, home visiting, and early care and education in Washington State. Quasi-experimental studies of PFR have shown promising increases in sensitivity (Kelly, Buehlman, & Caldwell, 2000; Kelly, Zuckerman, & Rosenblatt, 2008) and in child attachment security (Kelly, 2007) among dyads who have received the intervention. PFR includes many of the effective elements of brief attachment-based interventions summarized by Bakermans-Kranenburg and colleagues (2003). (These attachment-based interventions should not be confused with “attachment therapies” that have been criticized by Chaffin and colleagues [2006], as part of a task force of the American Professional Society on the Abuse of Children.) In conjunction with video feedback, PFR uses reflective practice principals (Slade et al., 2005; Suchman et al., 2010) to focus on the deeper emotional feelings and needs underlying difficulties in the parent and child relationship and to help caregivers think about their child’s developing mind. For this study, PFR also addressed the fact that infants and toddlers in child welfare tend to give behavioral signals that lead even nurturing caregivers to provide nonnurturing care (Dozier et al., 2002). PFR helps caregivers identify possible “miscues,” empathize with the child’s underlying distress, and reframe or understand the child’s behavior as reflecting an unmet need. This better understanding of the child’s cues should, in turn, lead to more responsive, nurturing care.
Current Study
In 2005, the researchers invited community stakeholders (child welfare workers and administrators, judges, mental health and public health agency directors, parents, and early intervention workers) to identify and discuss a problem that could be addressed in a research proposal in response to an NIH program announcement, PAR-05-026, Community Participation in Research. Community stakeholders focused on infants and toddlers in foster care. They indicated a desire for PFR to be tested for use with caregivers of infants and toddlers in state dependency. They acknowledged that a randomized control design would be optimal.
This is the first randomized trial of PFR. It fills a gap in our knowledge because there are few or no community-based trials of attachment interventions appropriate for toddlers in state dependency. In our model of change, PFR would increase caregiver awareness of children’s behavioral cues and miscues for nurturance. PFR would also improve caregiver understanding that difficult toddler behavior reflects unmet social emotional needs. In turn, caregiver sensitivity would improve and children would become more secure and engaged in their primary relationships, and more behaviorally and emotionally regulated. We hypothesized that PFR, as used by community providers, would result in improved parenting and child outcomes relative to a comparison condition in which families received home-based services that were not relationship-focused.
Method
Participants
Two hundred and ten toddlers and their caregivers were recruited into the Fostering Families Project (FFP) between April 2007 and March 2010. Recruitment into the study involved close collaboration between research study personnel, the state Institutional Review Board, and the state Department of Social and Health Services (DSHS). The study supported an authorized worker within DSHS to access DSHS records and identify infants in one county between the ages of 10 and 24 months who had experienced a court-ordered placement that resulted in a change in primary caregiver within the prior 7 weeks. Permission to contact caregivers about the project was obtained from a Department of Child and Family Services social worker, after which a research team social worker made the contact, determined eligibility, and scheduled the baseline research visit. Eligible caregivers spoke English and could be foster parents, biological parents, or adult kin. The caregiver and child were assessed at baseline, received intervention services, and then were assessed postintervention and 6 months later if they were still in the same household. (Children were also assessed with new caregivers if they had experienced a placement change, but dyads with new caregivers are not included in these analyses.) Caregivers were compensated $75 for each research visit but received no compensation for intervention visits. Research visitors were blind to condition status.
Randomization into the PFR intervention and the Early Education Support (EES) comparison condition was computer generated, blocked by caregiver type. Of the 210 infants who were randomized, 29 (13.8%; 18 in EES and 11 in PFR) experienced a placement change within 4 months after study enrollment. For these infants, the new caregiver agreed to participate in the study. A new baseline assessment was conducted and the dyad remained in the intervention condition to which the infant was initially assigned. The flow of participant recruitment, assignment, and attrition over time is shown in Figure 1. Demographic and background characteristics of the 210 toddlers and caregivers included in the analysis sample are shown in Table 1. Most toddlers in the sample experienced multiple placement changes prior to enrollment in the study, and there were substantial numbers of three types of caregivers: birth parents (five fathers), kin, and foster. There were no differences by condition on any demographic variable, with the exception that more PFR infants experienced two or more removals from their birth home, χ2(1, N = 210) = 7.31, p < .01.

Flow of participant recruitment, assignment, and attrition over time.
Baseline Characteristics by Intervention Condition
Note. *p< .05.
Interventions, Dosage, and Fidelity
The providers who delivered the PFR and EES interventions were selected by the directors of five community agencies. One agency provided three EES providers to fill a single EES position over the course of the project, and four agencies provided five PFR providers to fill five PFR positions over the course of the project. Thus, turnover occurred for EES providers but not for PFR providers. The providers’ time spent training and delivering the interventions was supported by university subcontracts to their respective agencies. All providers also had other caseloads in their agencies.
PFR
Seventy-one percent of the 105 caregivers in the PFR condition received all 10 weekly sessions of the intervention. Seven percent had none, 20% had fewer than half, and 3% had more than half. The sessions were 60–75 min long and took place in caregivers’ homes, guided by one of five concurrently working masters prepared providers. The sessions were designed for this project and covered activities and instructional content in the PFR manual, although the pace of delivery of these components was tailored to individual caregivers. There were five videotaped caregiver–child interactions used for reflective video feedback. These tapes were viewed together by the caregiver and the intervention provider, who guided discussion focusing on parenting strengths and interpretation of the child’s cues. Weeks 2–10 began with reflecting on the prior week’s content. During the course of the 10 sessions, providers and caregivers reviewed up to 15 handouts on topics such as “Staying Connected During Difficult Moments.” They discussed two short videos, “Attachment Story” and “Elements of a Healthy Relationship,” and reflected on topics such as “Memory of a Strong Emotion.”
EES
The comparison intervention condition consisted of three monthly 90-min visits and 81% of the 105 caregivers in the EES condition received complete dosage of the intervention; 4% had no EES visits and 15% had only one or two. Visits occurred in caregivers’ homes and were led by one of three serially employed bachelors prepared providers from one community agency over the course of the 42 months of the project. The EES provider helped to connect the family to resources such as Early Head Start, early intervention, housing, mental health services, and child care; the provider also suggested activities to promote growth and development.
Training PFR providers
Training involved 90 hr over 6 months and consisted of a 3-day, large-group community-based PFR workshop as knowledge building as to the underlying practice of PFR and the completion of three 10-week, mentored PFR interventions with families. The provider first shadowed, then co-led, and then led, the PFR home visits with one of two university-based masters-prepared PFR trainers. Topics in the manualized PFR curriculum include (1) theories of attachment; (2) social and emotional needs specific to the infant–toddler period and young children who have had disrupted attachments; (3) caregiving that promotes secure infant attachment and emotion regulation (individualized attention; empathy, labeling, and organizing feelings and emotions; and predictability); (4) caregiving that promotes healthy identity formation in the toddler years (managing feelings of distress; offering rituals and routines; encouraging exploration, independence, and cooperation through appropriate choices and limits); (5) intervening with challenging behaviors (assessing through discussions and observations; identifying young children’s feelings and unmet social emotional needs; identifying possible causes for challenging behaviors; reframing the behaviors for caregiver; developing individualized intervention plans); and (6) building parent’s own reflective capacity by exploring the parent’s own sense of self, emotion regulation and supports that influence caregiving. In addition, providers practiced the PFR Ways of Being (establishing an emotional connection with the caregiver; sensitive interviewing; reflective observation; and the four specific consultation strategies of positive feedback, positive and instructive feedback, reflective questions or comments, and validating responsive statements). During the mentored PFR training visits and throughout the project, the providers were guided by a weekly reflective consultation group led by a PFR trainer. The consultation group members adhered to the PFR ways of being with each other to provide a parallel process experience of the way we wanted providers to be with families. This reflective consultation model is central to PFR and infant mental health practice (Gilkerson & Kopel, 2005).
Fidelity coding of video feedback sessions
Providers videotaped or audio recorded (which providers preferred) three video feedback sessions per family. The recordings were rated for providers’ fidelity to PFR. There were three aspects to the fidelity coding. First, a rater who was also a PFR trainer coded each provider’s comments as being either one of the four PFR consultation strategies (positive feedback; positive and instructive feedback; reflective comment or question; or validating, responsive statement) or other. The rater recorded the ratio of PFR reflective consultation strategies to other comments. Second, the rater categorized the predominant focus of each minute of the segment as being on the dyadic relationship; child; or caregiver. Third, the rater assigned the segment a global rating from 1 = no PFR strategies used, provider not relationship-focused to 5 = Provider using all PFR strategies, is solidly focused on the dyadic relationship, makes the reflective dialogue rich and meaningful for the caregiver. To pass fidelity, the provider needed a four to one ratio of PFR consultation strategies to other comments, be predominantly focused on the relationship in 90% of the minutes coded, and receive a 4 or higher on the global rating. If a provider received three global ratings in a row of 3 or below, the PFR trainer engaged in weekly individualized remediation with that provider. During remediation, the provider did not receive any new cases but continued to provide services to the provider’s current caseload. Fidelity to consultation strategies was reassessed 3 times before new cases were referred to the provider. Early in the project, two PFR providers entered into this remediation process. One provider resumed cases after 2 months. The second provider, after 7 months of remediation, accepted a mutual agreement between the researchers and her agency to no longer provide services to the project. No other providers required remediation, and no provider received lower than a 4 in the second 2 years of the project. The mean ratio of PFR reflective consultation strategies to other comments across all providers and cases was 15:1 (range 3:1 to 30:1). The mean fidelity score was 4.04 (SD = .76, range = 3–5). The agency delivering EES understood that it was delivering a comparison intervention that did not have an explicit focus on the caregiver–child relationship. The EES provider submitted audio recorded EES home visits for coding according to the fidelity plan outlined above. In most cases, the majority of comments were coded as “other” and the conversation was predominantly not focused on the relationship between caregiver and child. The average EES ratio across all three providers was 1 PFR consultation strategy to 30 other comments (range 1:14–1:30). All EES fidelity sessions received a global rating of 1, indicating that no PFR strategies were used.
Fidelity to PFR protocol
The second measure of fidelity assessed the delivery of weekly PFR activities. Providers checked the completed activities after each session. For each provider, the indicator was number of activities checked divided by the total activities in the protocol for the PFR sessions actually completed. Providers followed the protocol almost all the time; 97% of activities assigned for weeks completed were checked.
Measures and Procedure
Infants and their caregivers were assessed in 2-hr research home visits at baseline (enrollment), postintervention, and a 6-month follow-up postintervention. Visits included interviews, self-report measures, and videotaped caregiver–child interactions, including a brief separation when the caregiver went outside for 3 min (or fewer if the child was distressed). When the child experienced a caregiver change following the intervention, later assessments were completed with the new caregiver. In the current study, only data from visits with infant–caregiver dyads that remained intact since baseline are used, resulting in a sample size of 175 at postintervention and 129 at the 6-month follow-up; 123 had the same caregiver at all three time points (58.6% of the original sample).
Dyads that were intact and assessed at postintervention did not differ (p < .05) from dyads that experienced a caregiver change or did not complete the postintervention follow-up with respect to intervention condition, gender of the child, caregiver type (foster, kin, or birth), whether the infant had experienced multiple removals from their birth home, or number of caregiver changes experienced by the infant prior to enrollment. Intact dyads assessed at the postintervention time point included younger children, M = 17.62 vs M = 19.98 months, t(208) = 2.74, p = .007, and were more likely to have completed all sessions of the intervention entailed by their condition assignment, 90.3% versus 2.9%, χ2(1, N = 210) = 121.25, p < .001. For the 86 PFR dyads intact at the postintervention time point, 86% received the full PFR dose, 2% received nothing, 9% received less than half, and 2% received more than half but not the full dose. At 6-month follow-up, assessed intact dyads no longer differed with respect to infant age compared to noncompleters or infants with new caregivers, but did differ with respect to caregiver type at baseline, χ2(2, N = 210) = 7.66, p = .022, with intact completers more likely to be headed by birth parents (33% vs. 16%) and less likely to have been headed by a foster parent (38% vs. 49%). Intact completers at 6-month follow-up were again more likely to have completed their intervention than were attriters or dyads with new caregivers, 87% versus 58%, χ2(1, N = 210) = 22.44, p < .001. For the 59 PFR dyads intact at the postintervention time point, 85% received the full dose, 2% received nothing, 10% received less than half, and 3% received more than half but not the full dose. The postintervention assessment was closer to baseline in the EES condition than in the PFR condition, EES: M (SD) = 2.88 (0.93) months; PFR: M (SD) = 4.12 (1.36); t(173) = 7.05, p < .001. Also, months between baseline and the 6-month follow-up differed by condition, EES: M (SD) = 8.85 (0.97); PFR: M (SD) = 10.05 (1.59); t(127) = 5.25, p < .001.
Caregiver Outcomes
Our primary outcome was sensitivity measured by a modified total score of the Nursing Child Assessment Teaching Scale (NCATS; Barnard, 1994), a videotaped interaction to assess caregiver sensitivity, stimulation of the child, and emotional responsiveness. An extensive literature supports its predictive validity of cognitive and social emotional outcomes (Oxford & Spieker, 2006) and sensitivity to intervention effects (Bakermans-Kranenburg et al., 2003). The current NCATS total score was initially developed on a nationally representative data set with over 8,000 cases using a combination of item response theory models and multidimensional scaling which reduced the set of 73 items to a set of 27 (M. Oxford, personal communication, September 16, 2011). Six of the 27 items measure the caregiver’s response to distress; because not all children displayed distress, these items were not used in the total score. The remaining items represent aspects of positive interaction and indicators of mutuality (e.g., contingency, gaze, and positive affect), caregiver verbal and nonverbal support of child, and sensitive instruction during the teaching task. Sample items include “caregiver laughs or smiles at the child during the teaching interaction;” “caregiver avoids making critical or negative comments about the child’s task performance.” Items were scored yes or no, and yes scores were summed. Cronbach’s α for the sensitivity scale ranged from .71 to .79. A single coder was trained to reliability by a certified NCATS instructor and passed quarterly reliability checks.
A secondary caregiver outcome, Support (15 items; αs ranged from .76 to .84), was rated using the Indicator of Parent–Child Interaction (IPCI; Baggett, Carta, & Horn, 2009). This measure was selected because it was developed as an observational screening and monitoring tool for early intervention use in the home. Items were on a 4-point scale (never, rarely, sometimes, or often). Sample items include “acceptance/warmth;” “descriptive language;” and “follows child’s lead.” Caregiver support was rated by two coders and averaged across three activities: free play, book reading, and a distraction task. Interrater reliability was assessed on 34% of coded episodes. IPCI interrater agreement ranged from r = .80 to r = .84.
Commitment to child was rated from caregivers’ answers to interview questions from This Is My Baby (TIMB; Bates, 1998; Dozier & Lindhiem, 2006). Sample questions include “How much would you miss (child) if he or she had to leave?” “What do you want for (child) right now? “In the future?” Indices of high commitment included expression of the desire to parent the child as long as the child remains in care or is benefitting from care; evidence that the caregiver has allowed herself to become fully attached to the child without withholding feeling or putting up barriers to limit the extent of attachment; and evidence of a commitment of emotional or physical resources to support the child’s growth and development. Scores were assigned on a 5-point scale with a higher score indicating higher commitment. Two coders were trained and reliability was monitored on 10% of data. Interrater agreement was r = .89.
Understanding of toddlers was measured by Raising a Baby (RAB; Kelly & Korfmacher, 2008), a measure of caregiver knowledge of infant and toddler social emotional needs and developmentally appropriate expectations. Caregivers rated RAB items on a 4-point scale (strongly agree to strongly disagree; 16 items; αs ranged from .73 to .77). Sample items include “The best way to help a toddler through his tantrum is to ignore him;” “A toddler’s misbehavior is a sign they need help from his/her parent.”
Parenting stress associated with the perception of having a difficult child or a dysfunctional parent–child relationship was measured with the short form of the Parenting Stress Index (PSI-SF; Abidin, 1995). The PSI was administered as part of the structured interview. Two 4-point scales (strongly agree to strongly disagree) were used: difficult child (12 items) and parent–child dysfunction (11 items). Alphas ranged from .87 to .89. Sample items include “My child smiles at me much less than I expected;” “Sometimes I feel my child doesn’t like me and doesn’t want to be close to me.”
Child outcomes
The primary child outcome of attachment security was measured with the Toddler Attachment Sort-45 (TAS45; Kirkland, Bimler, Drawneek, McKim, & Schölmerich, 2004), which was scored immediately after each research home visit. The TAS45 is a 45-item modified version of the Attachment Q-Sort (AQS; Waters, 1987), a gold standard attachment measure which has been extensively validated (van IJzendoorn, Vereijken, Bakermans-Kranenburg, & Riksen-Walraven, 2004). The first version TAS45 was used in a large, nationally representative longitudinal sample. It used a five-pile sorting procedure and had excellent psychometrics for an overall security score comparable to that produced by the AQS (Andreassen, Fletcher, & Park, 2006; Bimler & Kirkland, 2004). We used a sorting technique that the developers of the TAS45 termed trilemmas in which the 45 descriptive statements are presented in specific sets of three. The 3 items in a sample trilemma are “Child wants to be at the center of mother’s attention;” “Child is very independent;” and “Child will go towards mother to give her toys, but does not touch nor look at her.” The observer decides which one of the three statements in the set is most like and which is least like the child’s behavior during the observation just completed. Each of the 45 statements appears in two trilemmas; there are 30 trilemmas in all. The scoring results in an overall security score. Two research visitors were trained to administer the TAS45 by the first author; in 16% of visits the TAS45 was coded by the two raters on-site. Interrater reliability was r = .92.
Engagement (9 items, αs = .79 to .82) was scored from the IPCI (Baggett et al., 2009) by two coders. Items such as “positive feedback,” “sustained engagement,” and “follow through (including turn-taking)” were coded on a 4-point scale (never, rarely, sometimes, or often). Reliability was assessed by the IPCI trainer on 34% of coded episodes across all three time points. IPCI interrater agreement ranged from r = .80 to r = .84.
Competence (11 items; αs = .69 to .70) and ProblemBehavior (31 items; αs =.77 to .79) were measured by the Brief Infant Toddler Social and Emotional Assessment (BITSEA; Briggs-Gowan & Carter, 2002). Descriptions of positive social behaviors and problem behaviors in the last month were rated on a 3-point scale (not true/rarely; somewhat true/sometimes; very true/often).
At the 6-month follow-up, caregivers completed the Child Behavior Checklist for Ages 1 ½–5 (CBCL; Achenbach & Rescorla, 2000). The children were too young for caregivers to complete the CBCL at previous assessments. Descriptions of behavior in the last two months were rated on a 3-point scale (not true; somewhat true/sometimes; very true/often). Four scales were used: Internalizing (36 items; α = .80), Externalizing (24 items; α = .90), Sleep problems (7 items; α = .70), and Other problems (32 items; α = .70).
At baseline and again at the 6-month follow-up, data collectors used 1–5 scales to rate the child's behavior during administration of the Bayley-III Screening Test (Bayley, 2005) on 7 of 10 items from the Emotional Regulation factor and 6 of 9 items from the Orientation/Engagement factor from the Bayley Behavior Rating Scales (BRS; Bayley 1993). We did not administer the Bayley at the postintervention assessment because the interval from baseline was too brief. Alphas ranged from .79 to .87. Two research visitors were trained to rate the BRS scales; interrater reliability was assessed on 10% of visits. Interrater agreement ranged from r = .67 to r = .70.
Analysis
All data were analyzed with “intent-to-treat” models. Analysis of covariance (ANCOVA) models were estimated to assess differences by experimental condition in caregiver and child variables at baseline and both follow-up time points. Covariates included whether the child experienced multiple removals from the birth home (differed between conditions), caregiver type, age of child (both associated with who was retained in the follow-up samples), and months between baseline and the given follow-up assessment (differed between conditions postintervention). For postintervention and 6-month follow-up, baseline score on the given measure (when available) was also included as a covariate. This approach gives an unbiased assessment of treatment effects in a randomized design (Van Breukelen, 2006). The maximum sample size for these models was 210 at baseline, 175 at postintervention, and 129 at 6-month follow-up; in addition, some measures had further missing data (<5%) due to observational data being uncodable.
For measures available at all three time points, models of intervention effects on within-individual change were estimated using Hierarchical Linear Modeling (HLM; Raudenbush & Bryk, 2002). These models utilize data from all three time points and provide information on the timing of change. Two growth models were estimated for each outcome. The first model used two codings of time to estimate both change from baseline to postintervention and change from postintervention to 6-month follow-up. This model addressed (1) whether the intervention affected immediate within-individual change and (2) whether the differences increased or decreased in the subsequent 6-month period. The second model used just one coding of time and estimated intervention effects on linear change averaged across the three time points, thus addressing whether change began with exposure to the intervention and then continued to unfold in the postintervention period. Codings of time were based on the number of months between dates of assessments. The models included intervention condition as an individual-level variable predicting effects of time. Caregiver type (represented by two dummy variables), age of child at enrollment, and whether there had been multiple removals were included as predictors of growth model intercepts (i.e., the expected value at baseline) and of the effects of time. The HLM models used data on all participants who had data from at least one time point, with the assumption of data being missing at random after adjusting for model covariates (Raudenbush & Bryk, 2002).
Results
A correlations matrix including experimental condition, covariates, and baseline measures is shown in Table 2. There were no baseline differences on any of the outcome measures. As expected, child security was significantly associated with higher caregiver sensitivity, support, and understanding of toddlers. In the child domain, child security was also significantly associated with higher engagement, competence and emotion regulation, and lower problem behavior.
Correlations Among Experimental Condition, Covariates, and Baseline Measures of Study Outcomes
*p < .05.
Caregiver Outcomes
Results of the ANCOVA models for caregiver outcomes are presented in Table 3. All effect sizes are reported such that a positive coefficient indicates an effect in favor of the PFR condition. At postintervention, there was a significant intervention group difference on sensitivity, with a medium effect size. At the 6-month follow-up, the sample size was smaller, mainly due to changes in caregiver due to child welfare practice, and the group difference was not significant, although direction of the difference favored the PFR group with a small effect size. At postintervention, caregivers in the PFR condition scored higher on understanding of toddlers than caregivers in the EES condition, with a small effect size. The direction of difference and effect size was similar at the 6-month follow-up, although the difference was not significant. No other caregiver outcome was significant postintervention, and no differences were found at the 6-month follow-up, although all effect sizes except one were in the predicted direction.
Means and Standard Deviations of Caregiver Outcomes at Three Time Points by Experimental Condition: Test Statistic, Significance Level, and Effect Size for PFR at Postintervention and 6-Month Follow-Up
Notes: M = mean; Adj. M = mean adjusted for ANCOVA model covariates; SD = standard deviation; d = effect size (diff in adjusted means/square root of mean square error). A positive effect size (d) reflects a beneficial PFR effect. ANCOVA models adjust for baseline score, age of child, whether multiple removals, caregiver type, and time between baseline and the given follow-up assessment.
Child Outcomes
Results for ANCOVA models of child outcomes are in Table 4. There were no significant differences between intervention and comparison children on security at either postintervention or 6-month follow-up. Caregivers in the PFR condition reported greater child competence on the BITSEA at postintervention with a medium effect size, but this result was no longer significant at 6-month follow-up, when the small effect size was negative (higher scores in the EES group). Finally, there was a marginally significant difference in caregiver report of child sleep problems at the 6-month follow-up, with a small effect size. PFR caregivers reported fewer sleep problems for their children than did EES caregivers.
Means and Standard Deviations of Child Outcomes at Three Time Points by Experimental Condition: Test Statistic, Significance Level, and Effect Size for PFR at Postintervention and 6-Month Follow-Up
Notes: M = mean; Adj. M = mean adjusted for ANCOVA model covariates; SD = standard deviation; d = effect size (diff in adjusted means/square root of mean square error). A positive effect size (d) reflects a beneficial PFR effect. ANCOVA models adjust for baseline score, age of child, whether multiple removals, caregiver type, and time between baseline and the given follow-up assessment.
Growth Models
Results for the HLM growth models for outcomes measured at all three assessments are reported in Table 5 and generally mirrored the results for the ANCOVA models. The effect of being in the PFR condition on monthly change in sensitivity scores between baseline and postintervention follow-up was positive and significant, while the effect on change between postintervention and 6-month follow-up was slightly negative and nonsignificant. The HLM model of linear change across all time points showed a nonsignificant effect of PFR on monthly change in sensitivity. Scores on understanding of toddlers increased for caregivers in the PFR condition compared to caregivers in the comparison condition from baseline to postintervention, while the effect on change between postintervention and 6-month follow-up was nonsignificant. The effect of PFR on change in understanding of toddlers across all time points was positive and marginally significant. The one slight divergence from the results for the ANCOVA models was for parent report of child competence where the HLM model of growth in this measure showed a nonsignificant PFR effect on change from baseline to postintervention and a negative effect between postintervention and 6-month follow-up, reflecting that scores in the comparison condition caught up with scores in the PFR condition. Change averaged across all three time points did not differ by condition.
Parameter Estimates for PFR Effects on Monthly Change. Model 1: Estimated Effects on Change for Two Time Periods: Baseline to Postintervention and Postintervention to 6-Month Follow-Up. Model 2: Estimated Effect on Change From Baseline to 6-Month Follow-Up
Note. b = effect of PFR on monthly rate of change; SE = standard error. Model 1 used two codings of time to estimate both change from baseline to postintervention and change from postintervention to 6-month follow-up. Model 2 used one coding of time and estimated intervention effects on linear change across the three time points.
**p < .01. *p < .05. + p < .10.
Equations for Model 1:
Level-1 Model
Level-2 Model
aCoded 0 for baseline and months between baseline and postintervention for both post-intervention and 6-month follow-up; if postintervention time point missing, coded as 2.9 for EES condition and 4.2 for PFR condition.
bCoded 0 for baseline and post-intervention time point and months since baseline for 6-month follow-up
Equations for Model 2:
Level-1 Model
Level-2 Model
Caregiver Type
Caregiver type as a covariate in the ANCOVA and HLM models had few statistically significant effects. For example, of the 24 ANCOVA models, caregiver type had a significant main effect in only two. These main effects reflected the fact that foster caregivers reported fewer child competencies at postintervention and reported less commitment at the 6-month follow-up. Secondary analyses that examined caregiver type by experimental condition interactions also revealed only two statistically significant results for the 24 ANCOVA models. One interaction pointed to foster parents in the PFR condition reporting more problem behaviors for their children at the 6-month follow-up than foster parents in the EES condition, while PFR versus EES differences were in the opposite direction among birth parents and kin caregivers. Another interaction pointed to lower observed behavioral support among PFR kin caregivers than EES kin caregivers at the 6-month follow-up, while differences among birth parents and foster caregivers were in the opposite direction, albeit nonsignificant. Given the lack of power and inconsistent direction of effects, we feel that no strong inferences can be made about differential effectiveness of PFR by caregiver type.
Discussion
PFR: An Attachment-Based Intervention for Caregivers of Toddlers in Foster Care
The need to address the unique developmental vulnerability and primary attachment relationship needs of infants and toddlers in child welfare has been an increasing focus in recent years (Jones Harden, 2007; Stahmer et al., 2005; Zeanah et al., 2011). This study subjected a respected infant mental health training program, PFR, to a rigorous randomized control trial evaluation. Our aim was to improve caregiver sensitivity and caregiver understanding that difficult toddler behavior reflects unmet social emotional needs, so that caregivers could sensitively respond in ways that would help children become more secure and more behaviorally and emotionally regulated. Community-based providers with no previous infant mental health specialization were trained to deliver PFR to parents and caregivers of toddlers in child welfare dependency. Results indicate that the brief, 10-week PFR program could improve caregiver sensitivity and understanding of toddlers. Modest effect sizes for these caregiver outcomes, though not statistically significant for sensitivity and only marginally so for understanding of toddlers, persisted to the 6-month follow-up, when the sample was smaller because only 61% of the children remained with the caregivers who had received the intervention. Results from an ANCOVA model indicated differences in perceptions of child competence, with caregivers in the PFR perceiving more competencies, but this was not supported by a growth model of change. It would have been expected that competencies would continue to develop at a higher rate for children in the PFR condition, but the growth model of linear change across all three time points did not show this, and the piecewise model showed a significant increase in reported competences postintervention that was greater in the EES condition.
Effect sizes on improved caregiver sensitivity in this study are consistent with moderate effect sizes shown in randomized trials of other brief attachment-based interventions. For example, the Bakermans-Kranenburg et al. (2003) meta-analysis, reported a combined effect size of d = 0.33 for a core set of 51 randomized studies (n = 6,282). Eight of those studies (n = 1,707) used an older version of our measure of sensitivity, the NCATS total score, for which Bakermans-Kranenburg and colleagues reported a combined effect size of d = 0.25.
Contrary to our hypothesis, we did not find that PFR improved child security. We expected children in the PFR condition to show growth in attachment security greater than children in the EES condition, with rates of improvement maintained across the three assessment time points resulting in higher scores at the 6-month follow up. In the Bakermans-Kranenburg et al. (2003) meta-analysis, attachment interventions that specifically focused on improving caregiver sensitivity did have an overall significant effect on attachment security (d = .39). Although the effect sizes for our sensitivity measure were comparable, the effect sizes on attachment were not. It may be that factors other than caregiver sensitivity which were not addressed by the intervention contribute to attachment security. These could be both child and caregiver factors (Barnett, Butler, & Vondra, 1999). For example, infants older than 12 months less readily seek closeness to new attachment figures (Stovall-McClough & Dozier, 2004). The mean and median infant age in this study was 18 months; 88% of the sample was older than 1 year. Toddlers with maltreatment and foster care histories may have more complex developmental delays that influence attachment formation (Klee et al., 1997; van IJzendoorn, Goldberg, Kroonenberg, & Frenkel, 1992). They may bring behaviors and expectations to new attachment relationships that profoundly affect their trajectory (Lieberman, 2003), and some will exhibit attachment behaviors indicative of inhibited or disinhibited attachment disorder (Cyr, Euser, Bakermans-Kranenburg & Van IJzendoorn, 2010). Oosterman and Schuengel (2008) found an association between foster parent sensitivity and foster children’s attachment security only when controlling for symptoms of inhibited and disinhibited attachment disorder; the bivariate association between sensitivity and attachment security, r = .18, was not significant. The 10-week PFR intervention, as delivered in this study to families with toddlers, may not sufficiently address the child factors that affect the newly developing attachment relationship.
Similarly, caregiver factors that affect child security include atypical caregiver behaviors that are at times frightened, frightening, or dissociated. These atypical behaviors can be displayed by caregivers who are also generally sensitive with their children (Moran, Forbes, Evans, Tarabulsy, & Madigan, 2008). Despite improving caregiver sensitivity, PFR may not influence caregiver atypical behaviors, which can be unconscious and have their roots in early loss and trauma (Lyons-Ruth, Bronfman, & Parsons, 1999). However, improving caregiver sensitivity may benefit child development in other domains not measured in this study, such as problem solving, peer relations, achievement, and social competence (NICHD Early Child Care Research Network, 2003, 2004; Raikes & Thompson, 2008).
Children in the PFR condition did show a marginally significant decline in sleep problems, as reported by caregivers, with a modest effect size, a promising finding that warrants further study. Sleep is an essential self-regulatory activity (Degangi et al., 2000). Sleep problems in early childhood are implicated in several child and adolescent psychopathologies (Reid, Hong, & Wade, 2009). Tininenko, Fisher, Bruce, and Pears (2010) found preschool children in therapeutic foster care had better sleep quality than children in regular foster care. Sleep hygiene was not a focus of PFR, but may be a consequence of an increase in caregiver sensitivity, consistency, and routines.
Study Strengths and Weaknesses
The strengths of this study include its “real-world” use of community providers to deliver the interventions, use of observational outcome measures in addition to caregiver report, and the choice of PFR, a manualized attachment-based training program that has had wide appeal among service providers and administrators in Washington State. This project, by design, was consistent with Lee, Aos, and Miller’s (2008) assertion that the most convincing evidence for effective interventions that states are willing to fund are those programs that have been evaluated in “real-world” settings. We assessed community providers’ fidelity to PFR intervention approaches, and four of the five were able to consistently deliver the model with high fidelity. These strengths support the study’s relevance to real-world child welfare practice.
A weakness of the study is that this real-world approach made it vulnerable to exogenous factors, such as the heterogeneity in caregiver types and varied effectiveness of community providers. The most important threat to the internal validity of the study came from placement changes after enrollment, which were dictated by child welfare practices. These placement changes disrupted over a third of the caregiver–child dyads that enrolled in the study and reduced the sample for assessing differences at the 6-month follow-up to 129, only 61% of the original sample. This resulted in a reduction in power which decreased the ability to detect differences between the PFR and control conditions. The study was designed to have sufficient power to detect small to medium effect sizes. We expected 85% retention of the sample at the postintervention time point, which yields power of .80 to detect an effect sized of d = .42 with an α of p < .05. The actual retention of dyads that did not experience a caregiver change by postintervention was 83%. We anticipated that numerous caregiver changes would occur before the 6-month follow-up, although not as many as in fact occurred. The sample of 129 intact dyads yields .80 power to detect an effect size of d = .50 with an α of p < .05. Lack of power further limited our ability to investigate the potentially important role of caregiver type (birth parent, kin provider, or foster parent) in change in child outcomes or in the effectiveness of the PFR intervention.
Another possible weakness of the study is the version of the TAS45 that was used to measure attachment security. Other intervention studies with child welfare populations (Bernard et al., 2012; Dozier et al., 2009; Fisher & Kim, 2007; Moss et al., 2011) have shown intervention effects on child attachment security using the gold standard brief separation and reunion procedure, the Strange Situation (Ainsworth, Blehar, Waters, & Wall, 1978). Our measure of child attachment security, the TAS45, has promising features associated with another gold standard (according to van IJzendoorn et al., 2004), AQS (Waters, 1987). The five-pile version has been shown to be reliable and valid, with a range of security scores (from −1.0 to +1.0) similar to that produced by the AQS (Andreassen et al., 2006). We think the trilemma version of the TAS45 we used did not permit observers enough nuance in their ratings. As an example of the kind of results that limit our confidence in the measure, the mean security score in our sample at baseline was .48 compared to the AQS means of .32 for normative samples or .21 for clinical samples (van IJzendoorn et al., 2004). If the security scores on the TAS45 and AQS are comparable, as intended by the developers of the TAS45 (Andreassen et al., 2006), we would not expect that children in this sample would be more secure than children who have never been maltreated or separated from attachment figures.
Implications for Policy and Practice
The results from this study, by itself, provide support that PFR improves the caregiving experienced by maltreated toddlers in child welfare dependency with a history of relationship disruptions, but there is no indication that PFR can boost attachment security or self-regulation in this population. Perhaps findings from future trials, when combined with results from this trial, will provide a clearer picture regarding both caregiver and child outcomes. A consideration of the factors that might be associated with the modest results for PFR in this study could inform policy and practice recommendations regarding programs for vulnerable infants and toddlers in foster care.
First, given the rapid development of very young children and our finding that initial effects weakened over time, in the future stakeholders may decide that a program with planned “boosters” is necessary for sustained impacts in this population. Second, given the developmental risks experienced by this population, a potential point of entry for delivering infant mental health services to prevent or ameliorate socioemotional and regulatory problems for infants and toddlers in child welfare may be through early childhood intervention (ECI) programs funded under Part C of the Individuals with Disabilities Education Act (IDEA of 1997). Since 2003, an amendment to the Child Abuse Prevention and Treatment Act (CAPTA) has required developmental screening of children under 3 years in child welfare. Socioemotional and regulatory problems, as well as motor, cognitive, and language delays, qualify for ECI services. Early childhood interventionists are willing to serve the child welfare population, but they may not have the skills to effectively intervene at the level of the relationship (Herman-Smith, 2011). PFR has been training ECI providers from a variety of disciplines in this skill set for many years (Kelly et al., 2008), but given the absence of intervention effects on children’s socioemotional and regulatory problems, further research on PFR with this population is needed.
Early in this study we detected greater attrition in the PFR intervention compared to the EES group, possibly due to the greater time commitment required for PFR when families also had many appointments to keep for their children. We improved retention over the course of the study by working with the community PFR providers to be more persistent and tenacious when caregivers canceled or were not home, and to be more flexible in their scheduling of visits to meet the needs of families. Although this sort of flexibility in service provision is central to infant mental health practice, it was not initially the case for these community providers.
There is also a need to increase child welfare workers’ understanding of infant and/or toddler development and the importance of stability and consistency of attachment figures, and as well as to research the impact of placement quality and stability, and the level and type of parent–child visitation, on children’s developmental trajectories (Jones Harden & Klein, 2011). Although in this study PFR was an intervention for caregivers of children in the child welfare system, we believe that training child welfare workers in the PFR core curriculum and consultation strategies would enable them to make developmentally informed, relationship-based decisions that promote the well-being of infants and toddlers.
Relatedly, we note that there was no voluntary attrition in over 3 years among the five community mental health providers who were trained in PFR and received weekly reflective consultation for the duration of the project. In contrast, two of three EES providers, who did not receive intensive weekly reflective consultation, left during the course of the project. Reflective consultation is a core tenet of infant mental (Gilkerson & Kopel, 2005) and, based on comments the PFR providers made to the PFR trainer who facilitated the weekly meetings, one reason why they stayed with the project. The providers also told the PFR trainer that they liked the opportunity to improve their skills and receive strengths-based mentoring, which, after all, is the approach we wanted them to use with families. We believe the same opportunity should be available to child welfare workers, who experience considerable stress in the course of their work with vulnerable children and families. Future research on practice in child welfare should investigate the cost and benefits of reflective consultation for providers’ well-being, effective service delivery, and family outcomes.
Conclusions
With strong, sustained support from a regional child welfare office and the committed participation of community mental health agencies and their designated providers, we conducted the first randomized control trial of PFR, a brief, strength-based, attachment theory–based home visiting program that utilizes reflective consultation strategies and video feedback, with caregivers of maltreated toddlers adjusting to new primary caregivers in new homes. We found modest improvements in caregiver sensitivity and attitudes but no observable PFR effects on child functioning. The study demonstrates that a rigorous evaluation can be conducted in a real-world setting with a vulnerable, mobile population. The results of this study were not as strong as earlier results of quasi-experimental evaluations of PFR, but the caregiver outcomes were aligned with published results from similar clinical trials with less vulnerable populations. We are currently conducting a randomized, control trial directed toward biological parents who are under investigation by child protective services for maltreatment of their toddlers. This study, now in its third year, will add to our knowledge of PFR’s effectiveness as an intervention for this population. It is important for policy makers to consider both quality and quantity of evidence when allocating resources for “evidence-based” programs for their communities.
Footnotes
Acknowledgment
We thank Maureen Marcenko for critiquing the article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by grant R01 MH077329 from the National Institute of Mental Health. This work was also facilitated by grant P30 HD02274 from the National Institute of Child Health and Human Development.
