Abstract
Positive psychology is gaining credence within dementia research but currently there is a lack of outcome measures within this area developed specifically for people with dementia. Authors have begun adopting positive psychology measures developed with other populations but there is no consensus around which are more appropriate or psychometrically robust. A systematic search identified measures used between 1998 and 2017 and an appraisal of the development procedure was undertaken using standardised criteria enabling the awarding of scores based on reporting of psychometric information. Twelve measures within the constructs of identity, hope, religiosity/spirituality, life valuation, self-efficacy, community and wellbeing were identified as being used within 17 dementia studies. Development procedures were variable and scores on development criterion reflected this variability. Of the measures included, the Herth Hope Index, Systems of Belief Inventory and Psychological Wellbeing Scale appeared to be the most robustly developed and appropriate for people with dementia.
Introduction
In the last 20 years there have been efforts to explore dementia from a positive methodology perspective rather than just a narrative of decline, centred on the medical model. Historically, behavioural and psychological symptoms of dementia were viewed as being caused solely by a somatic organic illness (Dawson & Reid, 1987) and people with dementia were often viewed as dependent and unable to contribute to society (Lyman, 1989). Person-centred theory (Kitwood, 1993) represented a shift from the prevailing biomedical view point to a more holistic approach, in which social and individual factors contributed to a unique experience of dementia for each person (Kitwood, 1997).
This shifting awareness and empowerment for people with dementia was also represented through the use of outcome measurement within research. Previously, there was the view that people with dementia were unable to make accurate judgements within research and proxy rated outcome measures were often relied upon (Dawson, Welsh-Bohmer, & Siegler, 2000). Furthermore, outcome measures used were based on deficits or problem such as depression (Alexopoulos, Abrams, Young, & Shamoian, 1988), anxiety (Shankar, Walker, Frost, & Orrell, 1999) or neuropsychiatric symptoms (Cummings, 1997). Quality of life is now recognised as a desired outcome for psychosocial research (Logsdon, Gibbons, McCurry, & Teri, 1999) and people with dementia are being asked to complete outcome measures themselves, in conjunction with proxy report.
Positive psychology may be the next step within this framework and refers to the use of empirical approaches to examine human strengths and capabilities that contribute to wellbeing, sometimes called ‘flourishing’ (Seligman, 2002). This theory is beginning to be applied to dementia populations, for example, the role of hope (Wolverson, Clarke, & Moniz-Cook, 2010) and humour (Clarke & Irwin, 2016). People with dementia are capable of using these strengths to actively seek enjoyment and pleasure but there has been no quantitative research to supplement the qualitative findings. As positive psychology refers to the scientific study of wellbeing, quantative measurement of positive constructs is needed.
The use of outcome measures has long been held as a gold standard in research and they often undergo a rigorous development procedure (Moniz-Cook et al., 2008). However, currently there is a lack of positive psychology outcome measures developed for this population (Clarke, Wolverson, Stoner, & Spector, 2016).
Existing measures of positive psychology have begun to be applied within dementia research but there is no consensus as to which are more appropriate or psychometrically robust. This review aimed to identify those measures currently in use for dementia populations and conduct an appraisal of the measures’ psychometric properties including data from dementia populations in order to guide future choice of measures in research and practice.
Methods
Design
A systematic search of positive psychology outcome measures used within research for people with dementia was undertaken. Systematic principles for searching, screening and appraising studies were followed (Moher, Liberati, Tetzlaff, Altman, & PRISMA Group, 2009) and searches were then conducted to identify development information of included measures.
Search strategy
PsychInfo, PubMed and MedLine were searched for results from 1998 to 2017. Search terms were self-efficacy, life satisfaction, hope, resilience, wisdom, growth, coherence, control, autonomy, pleasure, self-realisation, sense of agency, gratitude, happiness, optimism, transcendence, positive, dignity, social participation, social inclusion, self-concept, reciprocity, connectedness, engagement, humour, creativity, flow, spirituality, love, compassion, benefit finding, community integration, opportunity, social adjustment, mindfulness, acceptance, successful aging, wellbeing, quality of life, independence and social health (Seligman, 1998). These search terms were then combined again with dementia, Alzheimer, cognitive impairment, senile and vascular. Truncations of search terms were used where appropriate.
Terms indicative of related fields were also included (quality of life; wellbeing; social health) and in-depth search strategy was employed in order to fully capture positive psychology measures. Titles were included if the study reported on a dementia population, abstracts were then screened for methodology indicative of the use of outcome measures and full texts were sought for the remaining results to identify positive psychology outcome measures. Ambigious titles or abstracts were included until a decision could be made, including research with ‘dyads’.
Inclusion criteria
Use of positive psychology outcome measure as identified within the search terms. Use of above measure(s) within a dementia population. Both development of measures and use of measures published within a peer-reviewed journal. Published between 1998 and 2017.
Exclusion criteria
Studies published in a language other than English if a translation was not available. Only used proxy-reporting. Development information for outcome measures was not freely available.
Appraisal of psychometric properties
Included measures were grouped and a quality assessment was undertaken using an established criteria (Terwee et al., 2007) (Table 1), which assesses development procedures of measures and has been used successfully in other reviews (Stoner, Orrell, & Spector, 2015; Windle, Bennett, & Noyes, 2011). For each item within the criterion, a score of two was awarded if the study was adequately designed and appropriate statistics given, a score of one was awarded if there were methodological shortfalls such as inadequate design. If, despite adequate design, the study produced results indicating poor psychometric properties or no information was reported, a zero was awarded (possible range 0–18). Two authors (CRS and JS) undertook this analysis independently and a consensus meeting was held to ensure reliability of reporting (Table 2).
Terwee criteria.
AUC: area under the curve; MIC: minimal important change (this is the smallest difference in score in the domain of interest which patients perceive as beneficial and would agree to, in the absence of side effects and excessive costs); RR: responsiveness ratio; SDC: smallest detectable difference (this is the smallest within person change, above measurement error. A positive rating is given when the SDC or the limits of agreement are smaller than the MIC); SEM: standard error of measurement.
Quality assessment of development procedures.
Results
After limitations were applied and duplicates removed, 3910 results were identified of which 2363 were included on title. The 1547 papers excluded at this stage did not report on a dementia population. At the second stage, 828 abstracts were included, for which full texts were sourced. Of these, the vast majority were excluded as no positive psychology outcome measures were identified (568) or measures were used in caregiver studies with no outcomes used for the person with dementia (78) (Figure 1). Twelve studies were subsequently excluded as development information for the measures used was not published in a peer-reviewed journal or was not freely available. This left a total of 12 positive psychology outcome measures used within 17 studies for analysis (Table 3). Of the studies included here, three used a combination of self and proxy-reporting (Cohen-Mansfeld et al., 2010; Hilgeman et al., 2014; Jolley et al., 2010). The remainder used self-report only for the measures included here. Development scores were variable, with scores ranging from 3 to 11. CRS and JS agreed on most ratings, apart from the interpretability section of the Terwee criteria. Disagreements were discussed until a consensus could be reached.

Review process.
Description of included studies.
PwD*: people/ person living with dementia.
Identity
The Self-identity in Dementia (SID; Cohen-Mansfeld et al., 2000) (3/18) was the only measure identified to have been developed within a dementia population. It was notably lacking all psychometric information on the assessment criteria apart from content validity, for which it scored two points, and construct validity for which it scored one point. This was due to the involvement of the target population and experts in item development and a clear description of the aims and domains to be measures. In a predictive analysis of variance in mood and quality of life from aspects of identity, authors suggested a model including aspects of identity could predict depression. More specifically, scores on the SID family and leisure subscales, significantly predicted depression (p<.01) (Caddell & Clare, 2012) lending evidence to the SID’s predictive validity. An indication of discriminant validity was found between identity and cognition, as Caddell and Clare (2013a) observed no significant correlation between mean SID scores and the Consortium to Establish a Registry for Alzheimer's Disease (CERAD) cognitive battery. The SID was also used in an additional study examining differences in identity of people with dementia and older adults without dementia. Both groups scored family role as the strongest aspect of their identity and occupational identity as their weakest, suggesting that dementia may not negatively affect identity, within early stages. This provides a further indication of the SID’s content validity and is supported by an additional study that reported family identity as being most important (87%) (Cohen-Mansfeld et al., 2010). Evidence of the SID’s convergent validity was also observed between self-identity, engagement duration, attention and attitude (p<.001). Healthy older adults reported significantly more distress relating to identity than people with dementia (Caddell & Clare, 2013b), possibly indicating some degree of interpretability or discriminant validity.
Hope
The Herth Hope Index (HHI; Herth, 1992) and Adult Hope Scale (AHS; Snyder et al., 1991) were the most robustly developed measures (10/18 and 11/18) and The Life Orientation Test-Revised (LOT-R; Scheier et al., 1994) scored less (7/18). All three measures failed to define a minimal important change to assess the responsiveness of the measure and the HHI reported an internal consistency indicative of multicollinearity (α=.97). The HHI was used in a small scale feasibility study for Dignity Therapy for people with dementia (Johnston, et al., 2016). Pre- and post-intervention scores, as percentage changes, on the HHI were only available for four participants and were variable. The maximum increase was 6.25% and one participant had a decrease of 18.7%. This may indicate an issue with sensitivity to change as the HHI was found to be stable over a two-week period (.91) during the development stage. However, as this was a feasibility study, the sample size was small and no firm conclusions can be drawn for the sensitivity of the HHI in people with dementia.
The AHS was the most thoroughly developed measure of all the measures included here. Notably, it had high levels of construct and convergent validity with correlations being observed between a number of scales including life orientation, self-esteem, hopelessness and depression. Furthermore, discriminant validity was established between hope and self-consciousness.
The LOT-R was reported as having a good level of internal consistency (.82) and convergent validity was established between dispositional optimism, self-mastery, trait anxiety, neuroticism and self-esteem. Both the AHS and the LOT-R were used in a study examining biological markers of allostasis (the ability to maintain stability in a changing environment through psychological or behavioral change) as an index of psychological resilience. These biological markers were compared to baseline resources including hope (Meeks, et al., 2016). However, the authors reported no significant findings in relation to allostasis and hope.
Religiosity/spirituality
The Systems of Belief Inventory (SBI-15; Holland et al., 1998) and the Royal Free Interview for Religious and Spiritual Beliefs (RFIRSB; King & Speck, 1995) scored 11/18 and 5/18, respectively. Internal consistency for the SBI-15 was excellent (.93) as was test-retest reliability (.95) for both religious and non-religious groups. Furthermore, criterion validity was adequate, with the SBI-15 correlating with other measures of religiosity and there was a significant difference between scores for religious and non-religious participants, indicating discriminant validity. A small-scale study utilised the SBI to examine spiritual beliefs in people with early stage dementia (Katsuno, 2003) and a positive correlation was observed between spirituality and quality of life (p<.05) indicating convergent validity between these measures.
The RFIRSB was developed with the use of experts and population involvement but the internal consistency analysis indicated an issue with the philosophical belief subscale (.60) and the overall internal consistency was not provided. However, test-retest reliability was adequate over a one-week period for both philosophical and spiritual subscales (.91 and .95, respectively). Furthermore, construct validity was established as there was a significant relationship between spirituality and the frequency of practice of religious faith (p<.0005) and people with dementia appeared to rate the strength of belief as most important (Jolley et al., 2010), suggesting additional content validity of the measure.
Life valuation
The Meaning in Life Scale (MLS; Krause, 2004) and the Terrible Delightful Scale (TDS; Michalos, 1980) scored 6/18 and 3/18, resepctively. The MLS was developed in a large sample of older adults. Whilst item selection did not involve target population or experts, the internal consistency (.925) and factor analysis yielded satisfactory results. No significant effects or trends of MLS were reported in a trial of advanced care planning and identity for people with dementia (p=.71) (Hilgeman et al., 2014), potentially indicating an issue with sensitivity or low sample size and study design.
The TDS was developed without the input of experts or the use of a target population and was not awarded points for content validity. Michalos (1980) undertook an extensive pathway analysis but did not report the internal consistency. However, it was one of the few papers to report the floor and ceiling effects of the measure. Predictive validity of the TDS was examined in an observational study examining the relationship life satisfaction and functional impairment (St. John & Montgomery, 2010). Within this study, people with dementia or Mild Cognitive Impairment had slightly lower overall life satisfaction than those without.
Self-efficacy
The General Self-efficacy Scale (GSE; Schwarzer & Jerusalem, 1995) and the Self-efficacy Scale (SES; Sherer et al., 1982) both scored moderately (6/18). Authors did not report on aspects of content validity of the GSE but internal consistency was .86. Subgroups were explored but no minimal important change was defined, limiting the measures interpretability. The GSE was used in an evaluation of a health promotion course for 89 people with dementia (Buettner & Fitzsimmons, 2009) but no significant differences between pre- and post-testing were reported. However, this may be an issue with intervention fidelity, as the authors reported significant findings in an earlier unpublished pilot study.
Authors of the SES did report some aspects of content validity but failed to report responsiveness, floor and ceiling effects or interpretability. The authors reported the internal consistency for both subscales (.86 and .71) but not the overall internal consistency. However, the measures demonstrated a high level of convergent validity with locus of control, ego strength, interpersonal competency and self-esteem. Clements-Cortes (2013) used the SES to assess the effectiveness of a choir group for older adults, an unclear proportion of which had dementia. It is, therefore, not possible to draw conclusions about the content validity of this measure for people with dementia. Furthermore, the authors were not able to demonstrate the measures responsiveness on either subscale following the intervention (p=.20, .37) but this may be attributable to the low sample size.
Community
The Brief Sense of Community Scale (BSCS; Peterson et al., 2008) scored moderately low (5/18) lacking information regarding test-retest reliability, responsiveness, skew of data and interpretability. There was no target population involvement or information regarding item selection. However, convergent validity was established between the BSCS and measures of community participation, depression and intrapersonal psychological empowerment (p<.01). Within a dementia setting, the measure was used to assess the efficacy of an intergenerational intervention but no significant were found (p= .168) (Low et al., 2015). Authors reported no significant findings of other measures in the study including agitation and quality of life, possible indicating issues with study design.
Wellbeing
The Ryff Psychological Wellbeing Scale (PWB) (7/18) contains six subscales that measure self-acceptance, positive relations with others, autonomy, environmental mastery, purpose in life and personal growth. It was found to have robust criterion validity and interpretability but lacked information on stability and responsiveness. One study utilised the measure in its entirety (Gonzalez et al., 2015), one used the environmental mastery subscale (Wettstein et al., 2014) and one used the purpose in life subscale (Mak, 2011). Gonzalez et al. (2015) examined the effect of reminiscence therapy within two retirement homes. The authors found significant improvements on all dimensions of the PWB, except for the purpose in life subscale, indicating the measures ability to detect change. There was a significant interaction effect of time and group for self-acceptance (p=.002), positive relations with others (p=.019), autonomy (p=.001) and environmental mastery (p=.003). The second study utilised the environmental mastery subscale of the PWB in an observational study and provided further evidence for the measures construct validity noting that higher walking distances and walking speed were significantly related to higher environmental mastery (r=.40 and r=.45, p<.05) (Wettstein et al., 2014). Mak (2011) utilised the purpose in life scale in a randomised trial and reported the internal consistency as .73, lower than the original reported alpha of .90 but still satisfactory. A positive correlation was observed between purpose in life and goal pursuit (p<.001), further evidencing construct validity.
Discussion
Twelve positive psychology outcome measures were identified within the constructs of identity, hope, religiosity/spirituality, life valuation, self-efficacy, community and wellbeing as being used in 17 research studies of people with dementia. Only one of these measures was developed for people with dementia (SID). Most outcome measures identified scored moderately on their development procedures, with hope scales developed the most robustly and identity the least robustly.
Whilst the development information was lacking for the SID, it has been used in a number of studies successfully and suggests the measure is an appropriate tool for assessing identity for people with dementia. Despite the AHS scoring slightly higher than the HHI at the quality assessment stage, it is possible that the HHI may be more applicable as hope for people with dementia appears to be more generalised in nature (Wolverson et al., 2010), rather than goal oriented (Snyder et al., 1991). Therefore, the HHI may hold more content validity for this population. Spirituality and religiousness appears to be a pervading concept, in that spirituality and religiousness hold significance in the self-concept and change, hope for the future and positive attitudes for people with dementia (Dalby, Sperlinger, & Boddington, 2011). The SBI-15 appears to be an adequate tool to detect and measure spiritual beliefs, as development procedures were robust and some psychometric properties have been found to be satisfactory in dementia populations. Both the TDS and the MLS scored moderately for their development procedures. Whilst the TDS appeared to be more successful within a dementia population, it is recommended that both measures are in need of further psychometric examination before they are routinely used. Both self-efficacy scales and the BSCS scale failed to show sensivitiy to their respective interventions. Whilst this may be due to issues with the study design, rather than the measures, it is recommeneded that a more detailed examination of self-efficacy scales and community scales for people with dementia is needed.
Of the scales included, Ryff’s PWB appeared to be the most successful, in terms of its development procedures and the studies in which it was used. The lower alpha reported within a dementia population is still within the acceptable range and is sensitive to change. It would then appear that this scale is appropriate for people with dementia and it is recommended for use within research.
Methodological problems
All measures included here failed to define a minimal important change, which is a requirement of the Terwee criteria for interpretability. This meant that it was nearly impossible to award scores for responsiveness. Reporting on reliability was mixed with only four studies reporting the test-retest reliability of measures. Inferring sensitivity of change of measures within dementia studies included here was problematic, due mostly to study design including low sample sizes. A large majority of the studies included were feasibility studies and were not powered to detect effect sizes. Additionally, obtaining the development papers of included measures was sometimes difficult and could only be accomplished by extensive searching.
Limitations
Whilst an effort was made to include search terms that were all-encompassing and indicative of positive psychology, it is noted that definitions of what constitutes this theory vary. Consequently, broad search terms including the related fields were used resulting in a large number of studies excluded. The criteria used here is one of the few comprehensives enough to cover most aspects of a measures psychometric properties. However, it may have been overly constraining as responsiveness and interpretability were rarely reported. Future authors may wish to include such information for the purpose of reviews or for measure selection.
Future research
These studies highlight the need for authors to consider psychometric analyses when designing their research. All measures included here provided no information regarding responsiveness to change. This is particularly important when considering interventional research as most studies here failed to find a significant effect of the interventions detailed. Whilst this may be due to a range of factors including low sample size or the effectiveness of the actual intervention, it is recommended that future researchers select measures that have been established as sensitive to intervention to ensure the most accurate appraisal of efficacy of their intervention.
Furthermore, only one measure identified here was developed within a sample of people with dementia, drawing on their perceptions and experiences. The majority of studies included here used measures developed for other populations and, as such, may not have been suitable to detect meaningful change for a person with dementia. It is possible that positive dementia specific measures such as those developed for quality of life (e.g. Logsdon et al., 1999) may be a more valid tool for detecting change within interventional research. It is, therefore, also recommended that dementia specific measures of positive psychology are developed to ensure valid and reliable tools are consistently used in this emerging area of research.
Whilst positive experiences have been an emerging theme for people with dementia, it is often within a ‘coping’ paradigm in which positive experiences are conceptualised as a strategy to adapt or retain normality when faced with negative experiences and loss (Clare et al., 2013). This prevailing model of positive psychology and dementia is not supported within the qualitative literature (see Wolverson, Clarke, & Moniz-Cook, 2015 for a review) and this should be reflected in future outcome measure development or adaptation. By reducing positive psychology to coping or as a means of denial (de Boer et al., 2007), potentially valuable positive characteristics are at risk of being not investigated properly. Future researchers should be wary of assuming that content validity remains the same, despite using measures with populations they were not designed for. This is exampled through the use of the AHS, despite multidimensional hope measures potentially being more applicable.
Conclusion
Twelve positive psychology outcome measures, with development information available, have been used within dementia research. However, the quality of the development procedures was variable, with authors failing to report important aspects of psychometric analyses including responsiveness and stability. The HHI, SBI-15 and PWB appear to be the most psychometrically sound and appropriate for people with dementia and it is recommended that future authors explore these concepts.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
