Abstract
Background:
Compassion fatigue is recognized as impacting the health and effectiveness of healthcare providers, and consequently, patient care. Compassion fatigue is distinct from “burnout.” Reliable measurement tools, such as the Professional Quality of Life scale, have been developed to measure the prevalence, and predict risk of compassion fatigue. This study reviews the prevalence of compassion fatigue among healthcare practitioners, and relationships to demographic variables.
Methods:
A systematic review was conducted using key words in MEDLINE, PubMed, and Ovid databases. Data were extracted from a total of 71 articles meeting inclusion criteria, from studies measuring compassion fatigue in healthcare providers using a validated instrument. Quantitative and qualitative data were extracted and compiled by three independent reviewers into an evidence table that included basic study characteristics, study strength and quality determination, measurements of compassion fatigue, and general findings. Meta-analysis, where data allowed, was stratified by Professional Quality of Life version, heterogeneity was quantified, and pooled means were reported with 95% confidence interval. A table of major study characteristics and results was created.
Ethical consideration:
This paper contains no primary data obtained directly from research participants. Data obtained from previously published resources have been acknowledged within references. Psychological distress, particularly compassion fatigue, can be insidious, no health profession is immune, and may significantly impact the ability to provide care.
Results:
A total of 71 studies were included. Compassion fatigue was reported across all practitioner groups studied. Relationships to most demographic variables such as years of experience and specialty were either not statistically significant or unclear. Variability in reporting of Professional Quality of Life results was found.
Interpretation:
Compassion fatigue exists across diverse practitioner groups. Prevalence is highly variable, and its relationship with demographic, personal, and/or professional variables is inconsistent. Questions are raised about how to mitigate compassion fatigue.
Keywords
Introduction
Compassion fatigue is described as a healthcare practitioner’s diminished capacity to care as a consequence of repeated exposure to the suffering of patients, and from the knowledge of their patient’s traumatic experiences. 1 Compassion fatigue is a result of providing patient care, and is more often considered as the result of many events, though it could arise from the experience of caring for an individual patient or event. Compassion fatigue is closely related to the concepts of “vicarious trauma” and “secondary traumatic stress (STS),” both of which also result from exposure to the trauma experienced by patients, rather than to the trauma itself. 2 However, other factors contribute to the development of compassion fatigue, for example, burnout (BO) may develop through factors such as work hours (i.e. shift work) and work environment, impacting an individual’s capacity to care. 3
Instruments have been developed to measure the prevalence of, and examine risk factors for, compassion fatigue in care providers. One such instrument is the Professional Quality of Life (ProQOL) scale developed by Stamm. The ProQOL began as the “Compassion Satisfaction and Fatigue Test (1993),” and was developed to measure both the positive and negative elements experienced by persons who act as professional helpers. 4 The instrument uses three subscales: Compassion Satisfaction (CS), BO, and STS/compassion fatigue (depending on the version). The subscales are scored separately, and scores are not considered cumulatively; there is no accepted method of combining subscales to report an overall score. 4 Instead, combinations of high and low scores in the subscales indicate an overall level of compassion fatigue. Within each of the 3 domains, a score greater than 42 is considered as high, 23–41 as average, and 22 or less as low. CS indicates the positive feelings associated with doing a type of work, and a higher score is suggestive of more contentment. Higher scores in STS and BO would indicate more negative feelings and distress. The ProQOL does not have a scale specific control value for use as a diagnostic instrument. The scale has evolved as more is learned about the factors contributing to compassion fatigue; the current instrument represents the understood relationships between the positive and negative effects of work on care providers.
Aim and objectives
This systematic review sought to describe and summarize the prevalence of compassion fatigue in healthcare practitioners using narrative synthesis, and meta-analytic methods where data presented permitted. We asked the following: What is the frequency of reported compassion fatigue and in which care providers? What variables impact compassion fatigue? What are the common values reported in the subscales of ProQOL?
Methods
Search strategy
A broad electronic search was conducted using the MEDLINE (Ovid interface) and PubMed databases, using MeSH term “compassion fatigue,” with additional keyword searches for “secondary traumatic stress,” “secondary traumatization,” and “vicarious traumatization.” Search terms were combined using Boolean operator “OR.” Articles were included from earliest available content up to 31 December 2017. The bibliographies of articles meeting inclusion criteria were also searched by reviewers for any additional articles.
Inclusion criteria
To be included in this review, studies were required to meet the following inclusion criteria: English language, containing data that included a measurement of “compassion fatigue,” using a validated measurement tool and reporting mean or median scores, using a participant sample consisting of practitioners from a professional healthcare discipline providing frontline patient care, and originating from a peer reviewed scientific journal. Studies that were specific to an examination of interventions or mitigation strategies were excluded.
Study selection and reporting
This systematic review and meta-analysis was conducted consistent with recommendations from “The Meta-analysis of Observational Studies in Epidemiology” guidelines, and reported in accordance with the PRISMA statement. 5,6 No protocol was registered. A total of 1302 records were returned from the electronic search strategy. An additional 46 records were found through reference searches. Two reviewers (N.L.C. and C.H.) reviewed the abstracts of those articles and included them for full review if the abstract suggested they met inclusion criteria. Of the 113 abstracts identified, the same reviewers independently and in parallel stringently applied the inclusion and exclusion criteria to the full text of the articles to identify the 71 articles that remained for this study. See the PRISMA diagram (Figure 1). Fifteen studies were excluded due to an issue with data reporting (including incomplete data), 17 did not specifically report prevalence, 4 studies did not use a validated instrument, 3 studies were not presented as journal articles, and 2 did not study healthcare practitioners.

PRISMA diagram of study selection.
Data extraction
After the final list of articles for inclusion was identified, data were extracted by two authors (N.L.C. and G.C., or N.L.C. and C.D.) with all data independently verified by each author, and discrepancies resolved by consensus. Quantitative data used in the meta-analysis (e.g. mean score, standard deviation, sample size) were abstracted by two authors (C.D. and L.D.) and any discrepancies were resolved through discussion.
Full text review of included articles resulted in a list of study characteristics for extraction. These characteristics were used to develop an evidence table that included the following: the study’s demographic details (source, year, country, number of participants) and information pertaining to the overall strength and design (population, total sample size, sampling method, research design, level of statistical analysis, response rate, method of assessment, and the measurement tools utilized). Study result data were recorded as quantitative results (mean scores, risk/incidence) and narrative summary.
Assessment of quality
Compassion fatigue is most often investigated using descriptive cross-sectional survey methodology, which is appropriate for this subject matter. During full-text review, studies were assessed under guidelines from the National Health Institute Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies.
Data synthesis and analysis
As described, the ProQOL is measured on three sub scales: CS, BO, and compassion fatigue/STS. The third scale is sometimes measured as “compassion fatigue” and sometimes as “STS,” and each is scored differently. The versions of the ProQOL used in the reviewed studies varied (when reported) among versions III, IV, and V, and both “compassion fatigue” and “STS” were reported as the third subscale.
Data were reported in individual studies one of three ways: (1) a detailed measure of risk based on participant’s mean score on each of three subscales compared with established cut off scores, (2) a limited measure of risk based on participant’s mean scores in two subscales (usually the CS and BO scales), (3) a proportional analysis where the proportion of participants reporting values above (or below) established cut-off scores are reported, or (4) a combination of the above. For all studies, we applied standard content analysis of study results and discussion to create a narrative summary. In studies with detailed measure of risk, mean subscale scores (and accompanying standard deviations) were pooled using random effects meta-analysis in STATA 14. Meta-analyses were stratified by the version of the ProQOL to ensure appropriate pooling. Heterogeneity was quantified using the I2 statistic and accompanying Q p-value. Pooled means are reported with 95% confidence intervals (CIs). In cases where studies reported estimates from multiple groups (e.g. nurses, physicians, social workers), results were pooled as long as the groupings were mutually exclusive. A summary evidence table was created presenting the study results (title, author, year, country, sample population, measurement tools used, overall findings, quantitative prevalence, or risk measured by ProQOL).
Results
Study characteristics
Table 1 presents a summary of the study characteristics and major quantitative and overall findings, including narrative summary. Of the 71 articles, 50 (70.4%) were exclusive to one professional care provider group, whereas the remaining 21 (29.6%) included 2 or more care providers. Of the 50 studies restricted to one profession, nursing professionals were most commonly represented (n = 30), followed by physicians or medical trainees (n = 7), social workers (n = 3), genetic counselors (n = 3), and a mix of other care providers (n = 7). Nursing professionals included were as follows: (1) broadly representative of care environments such as acute care hospitals—including emergency departments, critical care units, and acute care wards—and varied settings outside of hospital including clinics and outpatient treatment settings; and (2) broadly representative of clinic conditions and patient groups cared including examples such as palliative care, pediatrics, geriatrics, oncology and as transplant coordinators, and oncology. Of the 21 studies that included multiple providers, these studies were not as expansive in care environments or patients served but did include mental health providers/therapists, hospice care providers, and care in hospitals or acute care.
Summary of studies.
CS: Compassion Satisfaction; BO: Burnout; STS: Secondary Traumatic Stress; RN: registered nurse; PTSD: posttraumatic stress disorder; SCAW: self care assessment worksheet; ICU: intensive care unit; ED: emergency department; ER: emergency room; NICU: neonatal intensive care unit; CFST: Compassion Satisfaction and Fatigue Test; HVIMC: Heart and Vascular Intermediate Care Unit; HVICU: Heart and Vascular Intensive Care Unit; APSS: Accumulated Pain/Stressor Scale; DASS: Depression Anxiety Stress Scale; PCAs: Patient Care Associates; STAI: State-Trait Anxiety Inventory; PSS-SR: PTSD Symptom Scale-Self Report; MBI-HSS: Maslach Burnout Inventory: Human Services Survey.
The studies had a significant focus on economically developed western healthcare systems with most (41/71, 57.8%) being from North America and 37 of these exclusively the United States, Europe being the next most frequent (10/71, 14.1%)—with 1 study comparing Brazil and Portugal, and 6 (8.5%) from Australia–New Zealand. Only 4 of 71 studies (5.6%) were from Asian rim countries, and 2 of 71 (2.8%) from the Indian sub-continent despite the large populations served by healthcare providers in these regions. There was only one study from the African continent, and no study exclusively from South America (apart from the aforementioned one study examining Portugal and Brazil).
Of the 71 articles, 67 (94.4%) used the ProQOL scale: 6 (9.0%) used version 3, 3 (4.5%) used version 4, and 28 (41.8%) used version 5. Thirty of the 67 (44.8%) studies did not explicitly identify which ProQOL version was used. All studies included some form of demographic questionnaire, and 56% (40/71) of studies used one or more additional tools to further assess domains of psychological distress. Fifty six (78.9%) of studies were published in the last 5 years, with 30 (42.3%) published since 2016, suggesting a recent increased interest in this subject. The reporting of the results from the 71 studies varied. Of the 67 studies that reported results from ProQOL, 41 studies reporting detailed measures of risk only from the 3 subscales (61.2%), 12 (17.9%) studies reported only a proportional or percentage risk, 2 (3.0%) reported limited results on only two of the three subscales, and the remaining 10 (14.9%) reported a combination of mean results from the 3 subscales and proportional values above cutoffs.
The correlations between demographic and work variables and reported values on the ProQOL were inconsistent between study results. A number of studies demonstrated that psychological distress and underlying mental health conditions were associated with worse scores, but as cross-sectional surveys, it was not possible to determine if one or the other was causal, or simply part of the same experience. CS was variably reported with some studies suggesting it increased with experience 15,20,39,41 or age, 46,47 and other studies suggesting it was inversely associated. 12,19,30,38 BO and compassion fatigue were correlated 8 –10,25,30 ; higher scores in BO scales in particular were more commonly associated with the work environment such as location, shift type, or level of oversight responsibility. 19,29,31,34 Factors associated with lower scores, or improved CS, included positive work environments (including social support networks), 25,28,50,53,66 work in private clinics, 20 and support from management/leadership. 35,37,39 Recent negative events were more likely overall to be associated with worse scores. 31,36,63
Table 2 presents a summary of those studies that used the ProQOL tool as the primary measure of compassion fatigue, and reported mean scores for all three sub scales (CS, BO, and STS/compassion fatigue). The summary shows the scores between practitioner groups (nurses vs other health practitioners) to be relatively consistent albeit, the scores for BO appeared to less variable across nurses. The results were particularly interesting in that the scores for BO and STS/compassion fatigue were in the average to high range, whereas the CS scores were not particularly “low”; this may raise the question as to the impact of different domains on the overall effect on compassion fatigue.
Summary of studies using ProQOL tool, reporting all three sub scales.
CS: Compassion Satisfaction; BO: Burnout; STS: Secondary Traumatic Stress.
Meta-analysis
In studies that used version 5 of the ProQOL (n = 28), the pooled mean CS score was 41.8 (95% CI: 40.1–43.5) (Figure 2), mean BO score was 28.4 (95% CI: 26.3–30.4) (Figure 3), and mean STS score was 25.8 (95% CI: 23.3–28.3) (Figure 4). This indicates average levels of BO and STS and average high levels of CS. Heterogeneity was above 99.5% (p < 0.00001) on all version 5 subscale analyses. For the CS subscale, there was an outlier value of 93.60; this study was conducted in a sample of 231 novice, pediatric RNs.

ProQOL-V pooled mean scores by subscale: Compassion Satisfaction.

ProQOL-V pooled mean scores by subscale: Burnout.

ProQOL-V pooled mean scores by subscale: Secondary Traumatic Stress.
In studies that used version 3 of the ProQOL (n = 6), the pooled mean BO score was 18.2 (95% CI: 13.6–22.9) (Supplemental Figure 1), mean CS was 43.3 (95% CI: 36.2–50.4) (Supplemental Figure 2), and mean STS was 16.6 (95% CI: 14.4–18.8) (Supplemental Figure 3). These scores indicate average high levels of BO, average levels of CS, and low levels of STS. There was significant heterogeneity in all analyses (I2 > 89%, p < 0.00001).
Discussion
The results of our systematic review and meta-analysis demonstrate that across diverse healthcare practitioner groups, most studies report results BO and compassion fatigue subscale results that fall into the average risk groups within the ProQOL scoring matrix. The “BO” subscale had the highest mean value (and therefore potentially most significant impact on compassion fatigue) across studies. This was also the subscale that had the highest number of studies with heterogeneous results with 10 studies significantly above the pooled mean. The synthesis of existing work as presented here demonstrates an association between personal factors such as an existing diagnosis of anxiety or depression, 7,8 and prior negative life events 3 and increased levels of compassion fatigue. Although the subscale scores for BO and compassion fatigue were correlated, it was not as clear how factors related to the work environment such as shift time and length, 9,11,19,20 and the type of caring work being done, 17 explicitly impacted compassion fatigue levels. The pooled results of this meta-analysis are important for future research as reasonable summary estimates for comparison. The high proportion of studies that were either exclusively in nursing professionals, or included nurses in a diverse group of healthcare professionals reflects the important role of nurses in our healthcare system, and their proportionally large cohort of healthcare providers. The similarity in results across healthcare providers suggests that comparisons within and between health professionals are possible and for non-nursing healthcare professionals, comparisons are still possible and relevant.
The finding that compassion fatigue exists across diverse practitioner groups can have a serious impact on professional practice and workforce. Compassion fatigue, BO, post-traumatic stress disorder, and other types of psychological/emotional distress are often associated and sometimes conflated. It is important to differentiate these conceptually, as well as to recognize overlap as approaches to mitigating and treating the issues differ. Compassion fatigue is triggered by the continual use of empathy and emotional energy, previous exposure to trauma, prolonged exposure to secondary trauma (a consequence of being witness to the trauma of others, and being in a position of having to care for those who are suffering, rather than being the primary subject of the trauma themselves), and the work environment. 16 BO is defined as a “psychological syndrome that involves a prolonged response to stressors in the workplace.” Specifically, BO involves the chronic strain that results from an incongruence, or misfit, between the worker and the job. 7 BO is primarily triggered by work-related and organizational characteristics, with some predisposing personal characteristics. 16 Considering that BO develops in large part as a result of stressors that are organizational and structural (i.e. related to working environment), changes in the work environment may potentially mitigate negative impacts to patients, and be relatively easier to implement. However, BO may also be related to personal characteristics or a disrupted care provider that is more difficult to address such as inter-personal conflicts in or outside the work place, or personal financial stresses.
Although BO and other distress may affect providers, compassion fatigue may more severely affects patients as it is the direct effect of a healthcare provider’s diminished capacity to care that results from repeated exposure to the suffering of their patients, as well as from the knowledge of their patient’s traumatic experiences. 1 Nimmo and Huggard, 1 in a review of compassion fatigue in physicians, report that issues are “often reflected in outcomes of emotional distress, pain, and suffering, and may manifest in increased rates of absenteeism, reduced service quality, low levels of efficiency, and high attrition rates and eventually, workforce dropout.” This evidence of compassion fatigue and the effect on providers and patient care raises an important question on what strategies and programs health systems should consider to prevent or mitigate its effect. As mentioned in a number of the studies included in this review, there are programs in existence, which include general wellness programs that encourage self-care, and increased social and managerial support. Education to practitioners, from students to the most experienced, about the existence and impact of compassion fatigue could assist with identification of the condition, and also mitigate stigma in practitioners about the prevalence and impact of psychological distress in health professions. Prior to the implementation of any treatment or mitigation program, organizations should attempt to understand issues that affect their practitioners, and implement programs tailored to meet specific mitigating causes and concerns. This may be one specific benefit of a measurement tool such as the ProQOL as the effect of different domains can be measured, and more focused interventions considered.
Strengths and limitations
Our study used robust methods and analysis. A limitation of cross-sectional studies in general is the inability to infer causality from observed associations, and this includes in the synthesis of data in meta-analytic techniques. There was significant heterogeneity in the pooled mean subscale scores on all versions of the ProQOL; however, the 95% CIs were narrow indicating precision in the reported estimates.
Conclusion
Compassion fatigue exists across diverse practitioner groups and specialties and can be successfully measured using the ProQOL. Compassion fatigue’s relationship to demographic, personal, and professional characteristics is unclear, as demonstrated by the variability in studies reviewed. Future research should be directed to identifying specific triggers as root causes of compassion fatigue, impact of support programs for providers, and developing education programs to mitigate the prevalence and severity of compassion fatigue.
Supplemental material
Supplemental Material, Supp_Figure_1 - Compassion fatigue in healthcare providers: A systematic review and meta-analysis
Supplemental Material, Supp_Figure_1 for Compassion fatigue in healthcare providers: A systematic review and meta-analysis by Nicola Cavanagh, Grayson Cockett, Christina Heinrich, Lauren Doig, Kirsten Fiest, Juliet R Guichon, Stacey Page, Ian Mitchell and Christopher James Doig in Nursing Ethics
Supplemental material
Supplemental Material, Supp_Figure_2 - Compassion fatigue in healthcare providers: A systematic review and meta-analysis
Supplemental Material, Supp_Figure_2 for Compassion fatigue in healthcare providers: A systematic review and meta-analysis by Nicola Cavanagh, Grayson Cockett, Christina Heinrich, Lauren Doig, Kirsten Fiest, Juliet R Guichon, Stacey Page, Ian Mitchell and Christopher James Doig in Nursing Ethics
Supplemental material
Supplemental Material, Supp_Figure_3 - Compassion fatigue in healthcare providers: A systematic review and meta-analysis
Supplemental Material, Supp_Figure_3 for Compassion fatigue in healthcare providers: A systematic review and meta-analysis by Nicola Cavanagh, Grayson Cockett, Christina Heinrich, Lauren Doig, Kirsten Fiest, Juliet R Guichon, Stacey Page, Ian Mitchell and Christopher James Doig in Nursing Ethics
Footnotes
Conflict of interest
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
