Abstract
Building Better Caregivers (BBC), a community-based, 6-week, peer-led intervention, targets family caregivers of those with cognitive impairments. BBC was implemented in four geographically scattered areas. Self-report data were collected at baseline, 6 months, and 1 year. Primary outcomes were caregiver strain and depression. Secondary outcomes included caregiver burden, stress, fatigue, pain, sleep, self-rated health, exercise, self-efficacy, and caregiver and care partner health care utilization. Paired t tests examined 6-month and 1-year improvements. General linear models examined associations between baseline and 6-month changes in self-efficacy and 12-month primary outcomes. Eighty-three participants (75% of eligible) completed 12-month data. Caregiver strain and depression improved significantly (effect sizes = .30 and .41). All secondary outcomes except exercise and caregiver health care utilization improved significantly. Baseline and 6-month improvements in self-efficacy were associated with improvements in caregiver strain and depression. In this pilot pragmatic study, BBC appears to assist caregivers while reducing care partner health care utilization. Self-efficacy appears to moderate these outcomes.
Introduction
There are more than 10 million family caregivers in the United States caring for people with cognitive impairments (Centers for Disease Control and Prevention, 2011). Caregivers often have high stress and poor health (Schulz & Beach, 1999; van der Lee, Bakker, Duivenvoorden, & Dröes, 2014). In 2012, the Rosalynn Carter Institute for Caregiving (2012) made recommendations, including the following:
Caregivers receive evidence-based, effective support services targeting their identified needs.
These programs be translated into community settings.
Relevant to the first recommendation, reviews by Chesla (2010) and Corry, While, Neenan, and Smith (2015) have determined that many caregiver programs are efficacious. However, the clear majority are designed for specific subgroups defined by specific cognitive-impairment causing conditions. Many target caregivers of patients with dementia (Akkerman & Ostwald, 2004; Beauchamp, Irvine, Seeley, & Johnson, 2005; Belle et al., 2006; Blom, Zarit, Zwaaftink, Cuijpers, & Pot, 2015; Coon, Thompson, Steffen, Sorocco, & Gallagher-Thompson, 2003; Eisdorfer et al., 2003; Mittelman & Bartels, 2014; Mittelman et al., 1995; Mittelman, Roth, Haley, & Zarit, 2004) or stroke (Clark, Rubenach, & Winsor, 2003; Grant, Elliott, Weaver, Bartolucci, & Giger, 2002). Despite similarities in caregiving experiences and challenges, few programs are designed to include caregivers across different cognitive-limiting conditions.
Gitlin, Marx, Stanley, and Hodgson (2015) summarized the translation evidence for dementia caregiving interventions. Only six interventions translated into wider practice were found. The authors identified several knowledge gaps: (a) differences between study subjects and the population of caregivers, limiting generalizability; (b) limited evidence about the efficacy of translated programs; (c) limited information about outcomes important to stakeholders, such as utilization; and (d) limited information about the duration of effects.
The current article addresses these gaps (including lack of interventions for caregivers of those with varying cognitive conditions and lack of translatability) by examining the effectiveness of a small-group, community-based, peer-led intervention, Building Better Caregivers (BBC), as implemented in a pragmatic multisite trial. This intervention was previously studied in an online version designed for the caregivers of veterans with cognitive impairment. That trial demonstrated improved caregiver outcomes (Lorig et al., 2012). While the Internet reaches many caregivers, other caregivers either do not have Internet access or prefer small-group, in-person learning. We chose to target caregivers of those with all cognitive conditions because of previous experience and because the problems of these caregivers are similar. We chose to conduct a pragmatic trial of an in-person, peer-led program. The strength of a pragmatic trial is that it “maximizes the applicability of the trial’s results to usual care settings” (Zwarenstein et al., 2008). The pragmatic design addresses specific knowledge gaps: (a) study subjects represented the greater population of caregivers, (b) outcomes included those important to caregivers and community stakeholders, and (c) participants were followed for 1 year, allowing the duration of effects to be assessed.
Hypotheses were as follows:
Method
Design
We chose a 1-year pragmatic, longitudinal design that allowed study of real-world effectiveness (Lorig et al., 2012; Thorpe et al., 2009). Data were collected at baseline, 6 months, and 12 months. There were few inclusion or exclusion criteria.
Conceptual Framework and Rationale
Several conceptual frameworks inform this study: (a) self-management, (b) self-efficacy, (c) caregiver stress and coping, and (d) social support.
Self-management is defined as the tasks that individuals must undertake to live with chronic conditions. These tasks include having confidence to deal with the medical management, role management, and emotional management of their conditions (Corrigan, Greiner, & Adams, 2004). Caregiving, like chronic illness self-management, takes place largely outside of professional settings and is more complex. Caregivers deal with their own and their partner’s medical, role, and emotional management.
Confidence to undertake a new task is a key self-management component. We operationalized confidence as self-efficacy, which was first discussed by Bandura (1997) and has been linked to behavior change and task completion. Bandura suggests four means to increase self-efficacy: skills mastery, modeling, reinterpretation of circumstances, and social persuasion. These were systematically applied during the intervention. Coon et al. (2003) found that self-efficacy was a mediator of caregiver improvement in anger management and depression.
Stress is the third component of the conceptual framework. Thirty years ago, Montgomery, Gonyea, and Hooyman (1985) identified that stress is caused by personal, interpersonal, and cognitive factors. Personal factors include disruption of normal life and financial burden. Interpersonal factors include role change and conflict with one’s care partner, as well as loss of meaningful relationships. Stress is largely internal and revolves around the caregiver’s interpretation and meaning of the caregiving experience and their ability to undertake this new role. Collectively, these stressors have been termed caregiver burden (Zarit, Reever, & Bach-Peterson, 1980). Because stress is a key caregiver concern, BBC is centered around breaking the stress cycle.
The fourth concern is social support. Thompson, Futterman, Gallagher-Thompson, Rose, and Lovett (1993) identified lack of social participation as strongly related to caregiver burden. Thus, BBC encourages social support by allowing and encouraging participants to learn from and help each other. See Table 1 for how each of these frameworks is integrated into the intervention.
Table 1. Building Better Caregivers Workshop Overview.
Note. SE activities designed to enhance self-efficacy through skills mastery, modeling, reinterpretation, and/or persuasion.
Items represent self-management skills.
Intervention
BBC is a 6-week group workshop taught by two trained peers who had previously completed a 4-day training in the Chronic Disease Self-Management Program (CDSMP; Lorig et al., 1999) and had facilitated that program. Most of the peer facilitators had caregiving experience, lived in the community where BBC was offered, and were not health professionals. BBC is aimed at enhancing the caregiving skills and reducing the stress of both older and younger people caring for those with cognitive impairment. The peer facilitators received an additional 1.5 days of BBC training.
The BBC workshop consists of weekly 2.5-hr sessions. Each session covered several topics and was highly interactive. When constructing the program, we considered several components: (a) content to reduce stress and enhance caregiver skills, (b) activities to enhance self-efficacy, and (c) structure to enhance social support. Content can be found in Table 1.
BBC stress-reduction content dealt with how to reduce stressful caregiver problems and teaching stress reduction techniques. The skills taught reflect the three distinctive tasks of caregiving self-management: medical management such as medication management and interacting with providers, role management such as how to get help and to accomplish important role tasks while being a caregiver, and emotional management such as dealing with both caregiver and care partner depression and anxiety.
Activities to enhance self-efficacy included (a) skills mastery, in which participants made weekly action plans and kept a diary of difficult care partner behaviors. Participants gave weekly feedback on these activities. Facilitators handled problems using a standardized problem-solving methodology. The solving of problems also leads to stress reduction. (b) Modeling was demonstrated by peer facilitators and by participants helping each other; the latter adds to social support. Cognitive restructuring or reinterpretation of meaning was emphasized throughout the workshop, especially when discussing ways of conceptualizing and dealing with care partner problem behaviors. Social support was fostered by having at least one activity per session where participants helped each other and gave feedback to each other. The investigators offered no reinforcement during the 10.5 months between the end of the intervention and the last data collection.
Settings
We invited five organizations which were currently offering other Stanford Self-Management Programs (Lorig et al., 1999) to participate. (a) Dignity Health hospitals and clinics in Sacramento, California, is part of a large two-state multihospital and clinic system. (b) Aligning Forces Humboldt is a clinic located in Eureka, California, serving rural communities within 100 miles. (c) The Health Trust, in San Jose, California, is a foundation serving a diverse ethnic population. (d) Fairhill Partners, in Cleveland, Ohio, is a community-based organization that provides direct and ancillary services to older adults, their caregivers, and others who serve them. A fifth organization, because of unforeseen staff changes, was unable to recruit participants.
Investigators conducted a day and a half of training at each site. All facilitators were current leaders of Stanford Self-Management Programs. The organizations received material for the facilitators and participants as well as US$50 for each participant completing four or more sessions. Participants, in keeping with how the program may be used in the real world, were not paid for attending. Each site did its own recruiting, scheduling, and fidelity monitoring. The latter was done by the site coordinators, who in all cases were CDSMP trainers who had attended the BBC training. The usual monitoring consisted of attending random sessions with a predetermined checklist. There is also a fidelity manual for all Stanford programs (Stanford Patient Education Research Center, 2016).
Participants
Each organization recruited participants (caregivers of those with cognitive impairment), in many cases working with local partners such as the Alzheimer’s Association, Veterans Support organizations, and Senior Day Health Centers. Recruitment varied by organization but included public service announcements, announcements in media reaching caregivers, and flyers. Because organizations did not want to exclude anyone, any family caregiver could attend. Additional screening was conducted by questionnaire at the first session. If participants met study criteria, they completed an informed consent and baseline questionnaire. To qualify for our study, caregivers needed to provide care 10 or more hours a week, have a care partner with cognitive impairment who was living with them or nearby in the community, and have a score of four or more on a 10-point visual-numeric stress scale. These criteria are similar to those in other studies of caregivers of those with cognitive impairment. Participants could not have taken part in a previous Stanford Self-Management Program. One hundred seventy-six people attended the first session, of whom 128 qualified as study participants. As seen in Figure 1, most of those not qualifying for the study did not have the required level of stress. Because of the pragmatic nature of the study, people not qualifying for the study continued in the workshops but were not considered study subjects. The study was approved by the Stanford institutional review board (IRB). At the 12-month follow-up questionnaire, the investigators removed an additional 18 participants from the study because they were no longer caregivers (partner died or moved to a residential facility, etc.). See Figure 1. Of the 128 original study subjects, 110 were still caregivers at 12 months and qualified for 12-month data collection. Seventy-five percent of these supplied 12-month data. Figure 1 also shows that participation was similar at the time of the 6-month questionnaire.

Figure 1. Participants.
Data Collection and Measures
Using self-administered questionnaires, data were collected at pre-workshop orientation sessions or at the first workshop session by local program staff members who had completed Collaborative Institutional Training Initiative training as well as training on how to follow the data collection protocol. After collection, the questionnaires were mailed to Stanford. The investigators mailed all subsequent questionnaires directly to participants. Caregiver strain and depression were the primary outcomes. The Caregiver Strain Index (CSI) measures caregiver strain and stress (Robinson, 1983) and consists of the sum of 13 items and ranges from 0 to 13. The CSI had a reliability coefficient of α = .82 in the earlier online BBC study. Depression was measured by the PHQ-8 scale (Kroenke et al., 2009). That eight-item scale had a reliability of α = .89 in the earlier BBC study. Secondary outcomes included Self-Rated Health, which was measured with a single-item scale from the National Health Interview Survey (U.S. Department of Commerce, 1985). Visual-numeric scales measured pain, shortness of breath, stress, problems sleeping, and fatigue. Visual-numeric scales have been shown to correlate well (r = .72) with same-worded visual-analogue scales and had a better completion rate (Ritter, González, Laurent, & Lorig, 2006). The Zarit Burden Inventory (ZBI) measured caregiver burden (Parks & Novielli, 2000). It consists of the mean of 12 items and has a range of 0 to 4. The reliability coefficient in the online BBC study was α = .90.
Aerobic exercise was measured as the number of minutes of self-reported exercise in the last week (Lorig et al., 1996). Exercise is a health behavior benefiting caregivers as well as a means of reducing stress. For both caregiver and care partner, we measured utilization: caregiver report of physician visits, hospital emergency department visits, and number of nights in a hospital. In a previous study, self-report of outpatient visits correlated r = .70 with chart audit data, and days in the hospital correlated r = .83 (Ritter et al., 2001). We used a nine-question version of a Caregiving Self-Efficacy scale adapted from a scale developed and validated by Steffen, McKibbin, Zeiss, Gallagher-Thompson, and Bandura (2002). The nine items addressed the workshop content and had reliability of α = .89.
In addition to demographic variables (age, gender, marital status), we also asked participants the number of hours they cared for their care partners per week, number of days care partner spent in skilled nursing and/or assisted living in the last 6 months, whether participants worked or changed work status, and the relationship between care partner and caregiver.
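The scoring rules stated above for the two multi-item scales can be sketched in a few lines of Python. This is a minimal illustration rather than the study's scoring code: the function names are hypothetical, the CSI items are assumed to be coded 0 or 1 (consistent with 13 items and a 0 to 13 range), and the ZBI items are assumed to be coded 0 to 4. The text does not describe how missing items were handled, so that is not modeled here.

```python
import statistics


def score_csi(items):
    """Caregiver Strain Index: sum of 13 items, giving a total from 0 to 13.

    Assumes each item is coded 0 (no) or 1 (yes); this coding is an
    assumption inferred from the stated item count and range.
    """
    if len(items) != 13 or any(i not in (0, 1) for i in items):
        raise ValueError("CSI expects 13 items coded 0 or 1")
    return sum(items)


def score_zbi(items):
    """Zarit Burden Inventory (12-item form): mean of items, range 0 to 4.

    Assumes each item is coded on a 0-4 scale.
    """
    if len(items) != 12 or any(not 0 <= i <= 4 for i in items):
        raise ValueError("ZBI expects 12 items coded 0 to 4")
    return statistics.mean(items)


# Hypothetical responses for one caregiver (not study data).
csi_items = [1, 1, 0, 1, 0, 1, 1, 0, 0, 1, 1, 0, 0]
zbi_items = [2, 3, 1, 2, 4, 0, 2, 3, 1, 2, 2, 2]
print("CSI total:", score_csi(csi_items))
print("ZBI mean:", score_zbi(zbi_items))
```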
Data Analysis
To ascertain attrition bias, we compared baseline characteristics of those who completed 12-month questionnaires with those who did not. Independent-sample t tests were used to compare 12-month completers with noncompleters.
Paired t tests examined whether changes over 12 months in caregiver primary and secondary outcomes were significantly different from zero (Hypothesis 1). Similarly, paired t tests were used to examine 12-month changes in care partner health care utilization (Hypothesis 2).
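As an illustration of the paired analysis described above, the following Python sketch computes the paired t statistic and an effect size from baseline and follow-up scores. It is a sketch under stated assumptions, not the authors' analysis code: the function names and the scores are hypothetical, and the effect size is assumed to be the mean change divided by the baseline standard deviation, a definition the text does not specify. The p value would be obtained from a t distribution with the returned degrees of freedom, which is not computed here.

```python
import math
import statistics


def paired_t(baseline, followup):
    """Return (mean_change, t_statistic, degrees_of_freedom) for paired scores."""
    if len(baseline) != len(followup):
        raise ValueError("paired samples must have equal length")
    diffs = [f - b for b, f in zip(baseline, followup)]
    n = len(diffs)
    mean_change = statistics.mean(diffs)
    # Standard error of the mean difference uses the sample SD of the differences.
    se = statistics.stdev(diffs) / math.sqrt(n)
    return mean_change, mean_change / se, n - 1


def effect_size(baseline, followup):
    """Mean change divided by baseline SD (an assumed definition)."""
    diffs = [f - b for b, f in zip(baseline, followup)]
    return statistics.mean(diffs) / statistics.stdev(baseline)


# Hypothetical strain scores for five caregivers (not study data).
baseline = [8, 6, 9, 7, 10]
followup = [6, 5, 9, 5, 8]
mean_change, t_stat, df = paired_t(baseline, followup)
print(f"mean change = {mean_change:.2f}, t({df}) = {t_stat:.2f}")
print(f"effect size = {effect_size(baseline, followup):.2f}")
```

A negative change indicates improvement for measures where a lower score is desirable, such as strain and depression.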
Although the focus of this study was on 12-month outcomes, we also examined 6-month outcomes using the same methodology. This allows comparison of 6-month and 12-month changes to determine if improvements that may have occurred soon after the intervention were maintained, increased, or lessened over 1 year.
The relationships between baseline self-efficacy, 6-month change in self-efficacy, and 12-month change in primary outcomes (Hypothesis 3) were tested using general linear regression models including demographic variables, the baseline measure of the outcome score, and both baseline self-efficacy and 6-month change in self-efficacy. The dependent variables were the primary outcomes: 12-month change in CSI and PHQ-8 Depression scale.
Differences among organizations (Hypothesis 4) were tested using general linear models ANCOVA. Organization was included in models with baseline measures of the outcome variable (caregiver strain or depression) and demographic variables as covariates.
Results
Participation
Fifteen workshops were attended by 176 caregivers (range = 7-15). Mean attendance was 4.6 out of six sessions (SD = 1.7, range = 1-6) with 82% attending four or more sessions. Of the 176 caregivers, 168 were potentially eligible for the study and completed baseline questionnaires. Forty were later found to have been ineligible (two had less than 10 hr of caregiving per week and 38 had a stress level less than 4), resulting in 128 initial study participants (Figure 1). The 128 study participants also attended a mean of 4.6 sessions (SD = 1.7) with 78.1% attending four or more sessions and 45% attending all six sessions.
At 12-month follow-up, 18 participants were no longer caregivers (Figure 1), making the final number of those eligible for the study equal to 110. Of these, two withdrew from the study and 83 (75%) completed 12-month questionnaires.
Baseline Characteristics
Table 2 includes caregiver and care partner demographics for the 128 baseline participants, broken down by those who completed 12-month questionnaires and those who did not. The most frequent care relationship was a female spouse or partner taking care of a male partner. This was followed by children caring for a parent or parent-in-law. Care partners were typically elderly. The mean age for completers was nearly 77, while care partners for noncompleters had a mean age of 82. This difference partly reflects that noncompleters include participants whose care partner was deceased at 1-year or in full-time care, and the likelihood of this happening increases with age. Caregivers were typically in their sixties.
Table 2. Caregiver and Care Partner Baseline Characteristics (N = 128).
Note. p values are from t tests (means) or chi-square (percentages) indicating the probability of no difference between 12-month completers and noncompleters. Standard deviations are given in parentheses for means. Noncompleters include those no longer caretaking at 12 months.
Characteristics of Those Completing Versus Not Completing 12-Month Questionnaires
Comparing baseline primary and secondary outcomes for 12-month completers versus those who did not complete 12-month questionnaires, there were no significant differences (Table 3). Comparing baseline demographic and partner characteristics, there were four statistically significant differences (Table 2). Those who returned 12-month questionnaires were more likely to be Hispanic (p = .01), had, as noted above, younger care partners (M age = 77 vs. 82, p = .02), and had care partners less likely to be veterans (p = .01). Completers were also less likely to be caring for a parent or parent-in-law: 46% of caregivers of parents/parents-in-law did not complete the 12-month questionnaire compared with 28% of other caregivers (p = .03). Those caring for spouses (n = 62) or children (n = 5) did not significantly differ from the overall sample (ps = .16 and .42, respectively). The number of sessions attended (out of six) was not strongly correlated with completing 12-month questionnaires (r = .126, p = .157). Those who did not complete 12-month questionnaires also had care partners with significantly more nights with professional care (skilled nursing or assisted living, p < .001).
Table 3. Caregiver and Care Partner Baseline Means for Outcome Variables.
Note. p values are from t tests indicating the probability of no difference between 12-month completers and noncompleters. Standard deviations are given in parentheses for means. Noncompleters include those no longer caretaking at 12 months. MD = medical doctor/physician, ED = emergency department.
Six-Month Changes
Table 4 shows 6-month changes in primary and secondary outcomes. Both primary outcomes and five secondary outcomes had statistically significant improvements among the 91 participants who continued in the study and completed 6-month questionnaires. These data are included for comparison purposes, as the focus of this study is on the 12-month outcomes.
Table 4. Six-Month Change (N = 91).
Note. ↓ indicates that a lower score is desirable (e.g., less stress); ↑ indicates that a higher score is desirable. Possible ranges of scales are shown next to measure names. Italics for effect sizes indicate that change worsened. MD = medical doctor/physician, ED = emergency department.
Twelve-Month Changes
Both primary outcomes (CSI and PHQ-8 Depression scale) improved significantly (effect sizes = 0.3 and 0.4, p = .008 and p < .001, respectively; Hypothesis 1, see Table 5). All secondary measures improved significantly (effect size = 0.22-0.38) with the exceptions of exercise and self-rated health. In particular, the visual-numeric stress measure improved (effect size = 0.56, p < .001), as did the Zarit caregiver burden inventory (effect size = 0.27, p = .001). Self-efficacy also improved significantly (effect size = 0.56, p < .001).
Table 5. Baseline and 12-Month Change Scores (N = 83).
Note. ↓ indicates that a lower score is desirable (e.g., less stress); ↑ indicates that a higher score is desirable. Possible ranges of scales are shown next to measure names. Italics for effect sizes indicate that change worsened. Intent-to-treat p values assume no change for those not completing 12-month questionnaires. MD = medical doctor/physician, ED = emergency department.
When we applied a conservative Bonferroni correction for multiple comparisons for the two primary outcomes (using a .025 significance criterion rather than .05), both primary outcomes remained statistically significant. Four rather than seven secondary measures remained significantly improved using an adjusted p value of .005 (for 11 caregiver secondary outcome measures).
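The adjusted thresholds quoted above follow from dividing the nominal significance level by the number of comparisons. A minimal Python sketch of that arithmetic follows; the function names are hypothetical, and the value .001 stands in as an upper bound for the reported "p < .001".

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def significant_after_bonferroni(p_values, alpha=0.05):
    """Flag each p value that falls below the Bonferroni-adjusted threshold."""
    threshold = bonferroni_threshold(alpha, len(p_values))
    return [p < threshold for p in p_values]


# Two primary outcomes: .05 / 2 = .025.
print(bonferroni_threshold(0.05, 2))
# Eleven caregiver secondary outcomes: .05 / 11, which rounds to .005.
print(round(bonferroni_threshold(0.05, 11), 3))
# Reported primary-outcome p values (.008 and < .001, with .001 as an upper bound).
print(significant_after_bonferroni([0.008, 0.001]))
```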
Caregiver health care utilization did not change significantly. Care partners had significantly fewer physician visits (−2.1 visits) and emergency department visits (−0.29 visits), while nights in hospital did not change significantly (Hypothesis 2).
Because help with caregiving may influence outcome, we examined the effects of residential care days on outcomes. At baseline, there were 12 care partners who had any days in skilled nursing care or assisted living in the past 6 months (M number of days = 39.2, SD = 58.7). Five of the 12 had 60 days or more. At 12 months, there were 13 care partners who had any days of living in skilled nursing or assisted care in the past 6 months (M = 58.8 days, SD = 55.3). Of these, five had 60 days or more. If we remove the five who had more than 60 days living in professional care at 12 months from the analyses, all outcomes that were significantly improved with all 83 cases were still significantly improved at 12 months.
We also examined whether there was a relationship between session attendance and 12-month primary outcomes. Correlations between number of sessions attended and both CSI and PHQ-8 Depression were very low (r = −.04 and −.04, respectively) and not significant. Similarly, t tests comparing those who completed at least four sessions with those who completed fewer were not significant for the two primary outcomes at 12 months (t = 0.87 and 0.73 for CSI and PHQ-8 Depression, respectively).
Finally, there were no significant differences in 12-month changes for primary outcomes by relationship to care partner, using either ANOVA or dichotomous comparisons of relationships (spouses, n = 44, parents/parents-in-law, n = 28, or others, n = 11). The F values from ANCOVA models were 0.09 (p = .92) for 12-month change in the Caregiver Strain Index and 0.54 (p = .58) for change in PHQ-8 Depression.
Table 5 also includes p values for intent-to-treat analyses. All 128 baseline participants are included and, as is customary, it is assumed that noncompleters would not have improved at the same level as participants. Those who did not answer 12-month questionnaires, or who were no longer caregivers, were given change score values of 0. As can be seen in the table, the p values are virtually unchanged from those for the 83 participants who completed 12-month follow-ups.
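The intent-to-treat rule described above (a change score of 0 for anyone without 12-month data) can be sketched as follows. This is a minimal illustration with hypothetical function names and made-up change scores, not the study's analysis code. A paired t test on baseline and follow-up scores is equivalent to a one-sample t test on the change scores, which is what the second function computes.

```python
import math
import statistics


def itt_change_scores(observed_changes, n_enrolled):
    """Pad observed change scores with zeros for participants lacking follow-up."""
    n_missing = n_enrolled - len(observed_changes)
    if n_missing < 0:
        raise ValueError("more observed changes than enrolled participants")
    return list(observed_changes) + [0.0] * n_missing


def one_sample_t(changes):
    """t statistic testing whether the mean change differs from zero."""
    se = statistics.stdev(changes) / math.sqrt(len(changes))
    return statistics.mean(changes) / se


# Hypothetical example: five completers out of eight enrolled (not study data).
completer_changes = [-2, -1, 0, -2, -2]
itt_changes = itt_change_scores(completer_changes, n_enrolled=8)
print("completers-only mean change:", statistics.mean(completer_changes))
print("intent-to-treat mean change:", statistics.mean(itt_changes))
print("intent-to-treat t:", round(one_sample_t(itt_changes), 2))
```

Padding with zeros pulls the mean change toward zero, so this approach is conservative with respect to the size of the estimated improvement.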
Self-Efficacy as an Outcome Mediator or Moderator (Hypothesis 3)
Self-efficacy improved significantly at both 6 and 12 months. Using general linear models, both baseline self-efficacy and 6-month change in self-efficacy were significantly associated with 12-month improvements in both primary outcomes (caregiver strain and depression, see Table 6). For linear models estimating CSI, p values were .007 and <.001 for baseline and 6-month change in self-efficacy (partial r2s = .077 and .021, respectively). For PHQ Depression, p values were .015 and .019, with partial r2 of .040 and .063, for baseline and 6-month change in self-efficacy. None of the demographic variables were associated with 12-month change scores.
Table 6. General Linear Models Estimating Primary 12-Month Outcome Variables.
Note. PHQ= Patient Health Questionnaire.
Differences Among Organizations (Hypothesis 4)
The general linear models analyses of covariance used to determine if there were differences by organization included baseline value of the outcome, age, gender, whether married and whether non-Hispanic White as covariates (Table 7). The organization was not significantly associated with either 12-month primary outcome (caregiver strain or depression). Removing the demographic variables did not change the results (all p > .15 for organization).
Table 7. General Linear Models ANCOVA Estimating Primary 12-Month Outcome Variables.
Comparison of 6-Month With 12-Month Results
When we compare the results for 6 months with those for 12 months, the effect sizes for both primary outcomes were slightly lower at 6 months. Similarly, effect sizes were slightly lower at 6 months for each of the significantly improved secondary outcomes. The secondary variables that improved significantly at 12 months also improved significantly at 6 months, with the exception of the sleep problems score (p = .070 at 6 months vs. p = .044 at 12 months).
Discussion
In this pragmatic trial, participants sustained improvements in the primary outcomes (caregiver strain and depression) for at least 12 months. Care partners had fewer physician and emergency department visits, but not fewer hospitalizations. We also found that baseline and 6-month changes in self-efficacy were associated with 12-month improvements in caregiver strain and depression and that those outcomes did not differ among community organizations. To elucidate our findings, we first place them in the context of prior studies. Second, we examine the study limitations, most importantly the degree to which temporal trends, rather than the intervention itself, were likely to account for our findings. Third, we discuss advantages of pragmatic studies using the current study as an example.
In contrast to pragmatic trials, randomized trials tend to emphasize design characteristics that strengthen internal validity (by creating more “idealized” study conditions) but may have limited external validity. The weakness of the current design is that it is not a randomized trial and may lack internal validity. The greatest threats to internal validity are probably temporal (the possibility of regression to the mean or caregiver maturation over time) and the possibility that changes may be due to other factors. The majority of prior caregiver interventions used randomized designs and had restrictive inclusion criteria, highly resourced intervention settings and personnel, and tighter investigator control. To somewhat compensate for lack of a control group, we examined past studies for their effectiveness in reducing stress and depression. Schulz et al. (2002), in summarizing dementia caregiver intervention research, found that for depression, 17 out of 22 studies had improvements in depression ranging from 0.75% to 10.5%. The current BBC study showed a 25% decrease in depression. Of the 83 participants, 32 (38.5%) had baseline PHQ-8 scores of 10 or above, generally considered an indication of clinical depression (Kroenke et al., 2009). When we examine change scores for the 32 with baseline indications of depression, the mean improvement in PHQ-8 was −4.97. A decrease of 5 points or more has been used as a general indicator of clinically significant improvement (Wells, Horton, LeardMann, Jacobson, & Boyko, 2013). Standards for clinical significance for stress are more difficult to quantify.
In a review of stroke caregiver interventions, Legg et al. (2012) found only one trial in which stress improved significantly. Gaugler, Reese, and Mittelman (2016) found a 1-year reduction in stress compared with controls. We have been unable to find a standard criterion for clinical significance in changes in stress, although Schulz et al. (2002) provided some insights. In sum, the outcomes for participants in the current study are equal to or better than the outcomes for participants in most other caregiver studies. This still leaves the possibility that controls would have had changes in the same direction and magnitude as treatment subjects.
To explore the possible behaviors of caregiving study controls over time (1 month to 3 years), we examined published caregiver randomized controlled trials to determine the direction of control-group change for our key variables, depression and stress. For depression, we found six dementia caregiving studies where, over 1 to 6 months, the depression of controls either did not change or worsened (Beauchamp et al., 2005; Belle et al., 2006; Blom et al., 2015; Coon et al., 2003; Draper et al., 2007; Gitlin et al., 2003). Four other studies were of durations of 1 year or more and found worsening depression among controls (Eisdorfer et al., 2003; Gaugler et al., 2016; Mittelman et al., 1995; Mittelman, Roth, Coon, & Haley, 2004). One of these studies demonstrated a consistent worsening of depression in the usual-care group over 3 years (Gaugler et al., 2016). We were unable to find a single study in which control-group depression improved. Stress (including caregiver strain and burden) was more difficult to assess, as this was less frequently reported. Gitlin, Winter, Dennis, Hodgson, and Hauck (2010) found that over 4 months caregiver “upset” increased in controls. In another study (Gitlin et al., 2008), control-group caregiver burden worsened slightly. Mittelman, Ferris, Shulman, Steinberg, and Levin (1996) found a 1-year increase in caregiver control-group stress, as did Draper et al. (2007) in a 4-month study. These studies suggest that it is unlikely that untreated caregiver controls demonstrate improvements in either depression or stress over time. They also suggest that our findings may be conservative, as they were independent of possible deterioration of key variables in controls. While not complete, this review is representative of studies in the field. In sum, the primary BBC outcomes are equal to or better than those of most other studies in the field and are unlikely to be due to temporal trends.
It is also possible that improvements were due to caregivers receiving other help. While we cannot rule this out, we did exclude from the study all caregivers whose care partner had entered residential care or died. As reported above, for those remaining in the study, we examined those with any, less than 60, and more than 60 days of care partner time in residential care. The lack of differences in outcomes when we remove those with 60 or more days in skilled nursing care prior to 1 year also suggests that it is unlikely that any increases in skilled nursing facility use had a significant impact on the outcomes. As one component of the intervention focused on types of help and asking for help, it may have been that part of the observed improvements were due to caregivers getting help as a direct result of the intervention. This is a matter for future studies.
In the current study, care partners experienced significant decreases in office and emergency room visits and a nonsignificant decrease in hospital nights. Caregivers demonstrated no utilization changes. In a review of interventions for caregivers of stroke survivors that was issued as a joint statement by the American Heart Association and American Stroke Association, Bakas et al. (2014) found several studies with decreases in care partner emergency room visits (Pierce, Steiner, Khuder, Govoni, & Horn, 2009), hospital readmissions (Pierce et al., 2009), and institutionalization (Shyu, Kuo, Chen, & Chen, 2010). Only one of the identified interventions noted a decrease in caregiver utilization, specifically in office visits (Gräsel, Biehler, Schmidt, & Schupp, 2005). Overall, the review authors noted that “few studies have examined caregiver service use and more studies are needed.” Our finding of reduced care partner health care utilization suggests that the intervention may be cost neutral or cost-effective, although this was not assessed. Unlike most caregiver interventions, BBC is delivered by peer leaders in community settings. This makes it less expensive than many other programs that depend on professional interventionists and/or are delivered one-on-one. A formal evaluation of cost impacts is warranted.
As in previous studies (Cheng & Chan, 2005; Coon et al., 2003; Lorig et al., 2012; van den Heuvel et al., 2002), caregiver self-efficacy increased significantly, and both baseline and 6-month changes in self-efficacy were associated with 1-year improvements in both strain and depression. This finding continues to argue for the central role of confidence or self-efficacy in mitigating the deleterious effects of caregiving.
The present study has several limitations. Foremost is the question, addressed in detail above, of whether temporal trends among caregivers could explain the findings. Because controls in the randomized controlled trials we reviewed worsened over time, it seems unlikely that temporal factors account for our findings. Another potential limitation is that utilization measures are participant-reported. However, for the recall time frame used (every 6 months), self-reported health care utilization data on visits and hospitalizations are reliable (Jiang et al., 2015). We did not collect detailed information about the diagnosis of care partners, relying on the self-identification of caregivers as those caring for someone with cognitive impairment. Thus, we cannot analyze whether the program had varying results for caregivers of those with different kinds of impairment. It would have been desirable to have data on care partners’ type of impairment, and future studies should collect this information.
The number of cases was insufficient for a full analysis of the effect of the relationship between caregiver and care partner. Preliminary analyses suggested that (a) caregivers of parents were more likely to complete the 12-month questionnaire and (b) there was no evidence of differences in outcome changes by relationship to care partner. A larger sample would have allowed fuller analyses by caregiver–care partner relationships.
We also have limited data on participants who did not complete the 12-month questionnaire. Nonetheless, baseline scores for both primary and secondary outcome measures hardly differed between participants who completed the final questionnaires and those who did not. Two of the four demographic differences that were found can be explained by inclusion of those whose care partner had died or gone into professional care. Such partners were more likely to be older and parents of caregivers. Finally, we have no explanation for the apparent lack of association between number of sessions attended and outcomes. A future study focusing on engagement with the program would be desirable.
The focus of this study was on 12-month outcomes. Six-month data are included for comparison, and the results at 6 months were similar to but not quite as strong as those at 12 months. This suggests that improvements seen immediately after the program continued and even improved further over the next 6 months. The intent-to-treat outcomes suggest that this would be true even after taking into account the slightly greater attrition (from 91 to 83) at the later period, although a study of the BBC with a randomized control group would be desirable.
Pragmatic trials have external validity. BBC was tested in real-world community organizations, with few exclusion criteria. Settings—rural versus urban, health care system versus community-based organizations—were not controlled by the investigators. Nevertheless, the findings of this study should be taken as proof of concept. Additional studies are warranted.
In conclusion, participants in BBC workshops delivered in diverse, real-world community settings achieved significant improvements in caregiver strain and depression that endured for 12 months. Care partners’ health care utilization outcomes also improved. Given the positive results of this pragmatic trial, investigators and stakeholders may want to conduct future randomized trials and formal assessments of the cost implications of widespread program adoption and dissemination.
Footnotes
Acknowledgements
Audrey Alonis and Maisoon Ayish at the Stanford Patient Education Research Center assisted with data collection and management. Stephanie FallCreek of Fairhill Partners, Melissa R. Jones of Aligning Forces Humboldt, Erika Zúñiga of The Health Trust, and Sydni Aguirre of Dignity Health administered the Building Better Caregivers workshops at their respective sites.
Author Contributions
K.L. designed the program and study and wrote and edited the manuscript. P.L.R. analyzed and interpreted the data and wrote and edited the manuscript. D.D.L. contributed to the design of the study, managed the intervention, and edited the manuscript. V.Y. interpreted the data and wrote and edited the manuscript. All authors approve the final version.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: If these analyses were to contribute to dissemination of the program from which the data are derived, K.L. and D.D.L. have the potential to receive royalties.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Archstone Foundation (Grant 14-02-42).
