Abstract
OBJECTIVE:
To adapt the SS-QoL into French and test its psychometric properties.
METHODS:
Seventy-seven patients from a population-based registry were enrolled 3 months after their stroke. SS-QoL, NIHSS score, Barthel index, HAD, FSS, SF-36 scales, and MMSE were administered at enrolment. SS-QoL was re-administered at 15 days and 2 months. Internal consistency was assessed by Cronbach’s α coefficients, factorial validity by an exploratory factor analysis and external validity by Mann-Whitney test and Spearman’s correlations (ρ), comparing SS-QoL scores with those obtained from established scales. Reliability was assessed by intra-class correlation coefficients (ICC) and responsiveness by standardized effect sizes (ES).
RESULTS:
Test-retest and inter-observer reliabilities were excellent (ICC> 0.88). Internal consistency was acceptable (α= 0.65–0.91), except for the Personality domain (α= 0.58). Factor analysis individualized eight homogenous axes. SS-QoL scores were different between groups opposed by their modified Rankin score at enrolment or their overall quality of life compared with pre-stroke status (p < 0.001). Ten of the twelve domains correlated moderately (ρ> 0.35) to strongly (ρ> 0.5) with established measures. Nine domains were mildly to moderately responsive to change (ES> 0.3).
CONCLUSION:
The French version of the SS-QoL is a valid, reliable and moderately responsive instrument.
Introduction
Despite major improvements in acute care and rehabilitation in recent decades, stroke remains a disabling disease (Béjot, Rouaud, Durier, Caillier, Marie, Freysz, Yeguiayan, Chantegret, Osseby, Moreau, & Giroud, 2007; Miller, Murray, Richards, Zorowitz, Bakas, Clark, & Billinger, 2010; Kunst, Amiri, & Janssen, 2011; de Peretti, Grimaud, Tuppin, Chin, & Woimant, 2012; Koton, Schneider, Rosamond, Shahar, Sang, Gottesman, & Coresh, 2014). Currently, the evaluation of post-stroke impairment in epidemiological studies, or as an outcome of clinical trials mostly relies on functional scales such as the modified Rankin score (mRS) or the Barthel Index. Although very useful, such assessments are limited by the fact that they do not entirely reflect the real impact of stroke on patients’ lives since subjective adverse consequences of stroke including those on family, social, and occupational life, mood, or fatigue may remain unexplored by these scales (Ali, Fulton, Quinn, & Brady, 2013). Efforts are needed to create a comprehensive assessment of patients’ outcomes after stroke. Quality of life (QOL), as a multidimensional and patient-centered concept (World Health Organization, 1993), appears to be a relevant endpoint to achieve this goal. Specific scales to measure QOL are therefore needed, but only a few have been developed. Only the Stroke Impact Scale (SIS) has been validated in French (Caël, Decavel, Binquet, Benaim, Puyraveau, Chotard, Moulin, Parratte, Béjot, & Mercier, 2015), but its use in clinical practice is hampered mainly because it takes a long time to complete, especially in vulnerable patients. The Stroke Specific Quality of Life Scale (SS-QoL) (Williams, Weinberger, Harris, Clark, & Biller, 1999) appears to be an interesting alternative to the SIS. It is more synthetic than the SIS, easier to use and understand, with a smaller number of items but a wider range of explored domains as Vision, Personality and Energy domains. Also, the patient approach used in the process of item development justifies the choice of the SS-QoL. Although there are other specific scales, such as the Health Related Quality of Life In Stroke Patients-40 scale (Ojo Owolabi, 2010) or the Stroke Adapted Sickness Impact Profile-30 scale (Van Straten, De Haan, Limburg, Schuling, Bossuyt, & van den Bos, 1997), the SS-QoL has been used and quoted much more widely, and cross-culturally adapted in international stroke publications.
The aim of this study was to translate the SS-QoL, while adapting it to French cultural characteristics (linguistic validation), and to test its psychometric properties in terms of validity, reliability and responsiveness in stroke patients.
Methods
Study population
Consecutive stroke patients were recruited from the Dijon Stroke Registry, an ongoing population-based study that collects all cases of stroke occurring within the population of the city of Dijon, France (Béjot, et al., 2007; Béjot, 2015). From 1st November 2014 to 31st October 2015, patients were enrolled three months after the index stroke during a scheduled follow-up visit at an outpatient clinic or rehabilitation center. This period was chosen because in clinical trials on acute stroke treatment patients’ outcomes are usually evaluated at 3 months. Informed consent to participate in the study was collected from all of the patients.
Patients were enrolled if they were 18 years old or older and had suffered a first-ever symptomatic stroke (ischemic stroke or spontaneous intracerebral hemorrhage) between 1st August 2014 and 31st July 2015. Exclusion criteria were a history of previous symptomatic stroke, severe visual or hearing disability, severe aphasia, inability to read or speak French, dementia, psychiatric disorders, addiction to alcohol or drugs, bedridden state and other serious conditions impairing the ability to complete the questionnaire.
Linguistic validation of the French version of the SS-QOL
The SS-QoL is a disease-specific QOL measurement tool that includes 49 items across 12 domains (Williams, et al., 1999): mobility, energy, upper extremity function, work/productivity, mood, self-care, social roles, family roles, vision, language, thinking and personality. Each item is scored from 1 to 5 according to several possible answers (1 = lowest quality of life, 5 = maximum quality of life). The total score ranging from 1 to 5 is calculated from mean scores for each domain.
Permission to develop a French version of SS-QoL was obtained from the original corresponding author (Williams, et al., 1999). The translation and cross-cultural adaptation were performed according to standard recommendations (Epstein, Santo, & Guillemin, 2015). First, the SS-QoL was translated from English into French by two independent bilingual translators whose first language is French (DC and JPC). Second, a debriefing meeting including both translators (DC and JPC) and investigators (NL and YB) was held so as to compare the two translations, to discuss differences and propose equivalencies. At the end of this stage, a new French version was obtained by consensus. Third, a back translation of the last version was done by a bilingual translator, whose first language is English and who had not participated in the first translation (PB). Last, a review of the back translation and analysis of translations was organized during a meeting that included all of the members. Idiomatic, semantic or conceptual equivalencies were discussed and possible amendments were consensually adopted to reach the final questionnaire.
To assess content validity, the scale was presented to stroke neurologists of the University Hospital of Dijon so as to evaluate the relevance of questions and response modalities, the understanding of items by the target population, the non-ambiguity of the vocabulary and the overall consistency of the scale compared with the original one. A pre-test was conducted by submitting the final version of the questionnaire to a sample of 10 stroke patients. Understanding of the questionnaire was assessed by asking patients to reformulate items and possible difficulties or remarks were noted.
Data collected
At registration in the Dijon Stroke Registry, the following variables are collected: demographics, vascular risk factors, pre-stroke dependency according to the modified Rankin Scale (mRS) score, type of stroke (ischemic versus hemorrhagic), initial clinical severity according to the National Institute of Health Stroke Scale (NIHSS) score, acute treatment with intravenous thrombolysis.
In the present study, patients were interviewed face-to-face at enrolment during the three-month follow-up visit, and the following clinical scores were determined: NIHSS, mRS, Barthel index, Mini Mental State Evaluation (MMSE), Fatigue severity scale (FSS), Hospital Anxiety Depression scale (HAD), General Health Survey Short Form 36 (SF-36) score, SS-QoL.
To validate the reliability and the responsiveness of the SS-QoL, a sample of 60 patients (30 patients for the test-retest and 30 patients for the inter-observer reliability) were contacted again for a telephone interview 2 weeks after inclusion and again 2 months after enrolment (responsiveness).
Statistical analyses
All psychometric parameters were assessed in accordance with the internationally recommended taxonomy (Mokkink, et al., 2010). The analyses were performed with SAS ® software (version 9.4, SAS Institute, Inc., Cary, NC, USA) and STATA ® 10.0. The level of statistical significance was set at p < 0.05. Internal consistency was determined using Cronbach’s alpha coefficient (α), which was calculated for each dimension. An α≥0.7 was considered good (Nunally, 1978; Lance, Butts, & Michels, 2006). Structural validity was assessed by an exploratory factorial analysis with a varimax rotation on the first 12 principal components. We analyzed the factorial axes highlighted by this method. Correlation coefficients (or standardized load) of the items for each factorial axis determined the scale structure. The expected result was convergence of this factorial structure to the expected content of the scale. A standardized load≥0.5 in the correct domain was considered satisfactory. For external validity, convergent validity was determined using the Spearman correlation test (Streiner, & Norman, 1995; Polit, 1996; Fayers, & Machin, 2000; Lance, et al., 2006), and by comparing SS-QoL scores with those obtained from established scales. Spearman coefficients ρ< 0.35, 0.35–0.5 and >0.5 were considered low, moderate and high, respectively (Devilliers, Amoura, Besancenot, Bonnotte, Pasquali, Wahl, Maurier, Kaminsky, Pennaforte, Magy-Bertrand, Arnaud, Binquet, & Guillemin, 2012). Divergent validity was based on known-group comparisons. We first studied SS-QoL scores from groups defined by their mRS at enrolment (mRS = 0-1 vs mRS > 1). We then repeated the analysis, defining groups according to their self-reported overall quality of life compared with their pre-stroke status (worse versus not worse, using a Likert scale). In both analyses, we hypothesized that SS-QoL scores would be significantly different between groups. The nonparametric Mann-Whitney test was used (Polit, 1996). Test-retest and inter-observer reliabilities were assessed by intra-class correlation coefficients (ICC), by comparing SS-QoL scores at enrolment with those determined 2 weeks later. An ICC greater than 0.70 indicated very good reliability (Fayers, & Machin, 2000). Responsiveness was determined by the effect size (ES) and the standardized response mean (SRM). ES was calculated in patients who reported a change in their quality of life between the enrolment visit and their pre-stroke status. This change was identified using a –2 to +2 Lickert scale ranging from “much worse” to “much better” than before the stroke. We considered a domain not responsive for an ES <0.2, mildly responsive for an ES between 0.2 and 0.5, moderately responsive for an ES between 0.51 and 0.7 and highly responsive to change for an ES > 0.7 (Cohen, 1988).
Feasibility and acceptability were evaluated with the percentage of missing data and the distribution of responses by studying domain scores. We investigated the presence of floor or ceiling effects, defined by the proportion of patients who had a minimum or maximum score for a given domain. A score above 20% was considered high (Cruz-Cruz, Martinez-Nuñez, Perez, Kravzov-Jinich, Ríos-Castañeda, & Altagracia-Martinez, 2013).
Ethics
This study was approved by the ethics committee (Comité de Protection des Personnes Est 1, number N°: 2014-A01654-43). Patients were informed about the study and their consent was obtained at enrolment.
Results
Characteristics of patient
A total of 77 patients were included in this study. The baseline characteristics are shown in Table 1.
Baseline characteristics of patients
Baseline characteristics of patients
The mean age of patients was 64.3 years and 58.4% of them were men. The mean initial NIHSS score was 5.2±0.6. At the enrolment visit, the mean NIHSS score was 1.6±2, 67.6% of patients had a modified Rankin score of 0 to 1 (Fig. 1), and 83.1% had a Barthel index > 90 (Table 1).

Modified Rankin score in stroke patients three months after the inclusion visit.
Small differences in semantics and sentence structure were observed between the original and the back-translated versions and between the French and the English versions. To obtain a consensus, every word and expression was discussed at the synthesis meeting to choose the best equivalence. During the pre-test, the sample of 10 patients had no missing responses. The items were clearly understood and none of the patients had problems with the wording of questions. The group of experts considered the questionnaire relevant and understandable. No changes were made after the pre-test.
Psychometric validity of the French version of the SS-QoL
Mean scores of the domains of the French version of the SS-QoL are presented in Table 2. The lowest scores were found for the Fatigue (3.2), Social Roles (3.4) and Personality (3.5) domains. Conversely, the Self-care (4.7), Upper extremity function (4.7) and Vision (4.5) domains had the highest mean scores. The distribution of the domain scores ranged from 1 to 5 except for the Language, Mobility, Mood, Energy, Thinking and Vision domains, whose lowest scores were 2.2 - 1.2 - 2 - 1.6 and 1.7, respectively.
Internal consistency – floor and ceiling effects
Internal consistency – floor and ceiling effects
Cronbach’s alpha coefficients were satisfactory for seven domains (α> 0.7) and acceptable for four (α> 0.65), thus showing satisfactory homogeneity (Table 2). The internal consistency of the Personality domain was moderate (α= 0.58).
Factorial validity
The first 12 components of the factorial analysis explained 75% of the total variance of the items. This analysis did not exactly reconstitute the original structure of the scale (Table 3). Twelve groups of items organized into only eight factorial axes. Several theoretical domains overlapped on axis 1: Family role, Mobility, Self-care, Upper extremity function and Work/productivity domains gathered into one large domain. The Work/Productivity domain also overlapped on axis 4 with the Energy domain. Standardized loads were mostly higher than 0.5 (except for nine items), which indicated a good correlation between items and the domains to which they belonged.
Exploratory factorial analysis with varimax rotation
Exploratory factorial analysis with varimax rotation
Table 4 shows the correlations between domains of the SS-QoL and the corresponding scales listed. The Energy, Family role, Work/Productivity, Mobility, Mood, Self-care and Upper extremity function domains correlated strongly with the FSS scale, the SF-36 (limitations due to physical condition - physical activity), the HAD scale, the Barthel index and the NIHSS score (motor arm), respectively (ρ> 0.5). The Vision, Language, Social Role domains correlated moderately with the Visual fields and Language domains of the NIHSS score and the SF-36 scale (ρ> 0.35). The Personality domain did not correlate with the HAD scale, nor did the Thinking dimension with the MMSE.
External construct validity - convergent validity
External construct validity - convergent validity
The results of the analysis of known-group comparisons are presented in Tables 5 and 6. Nine domain scores of the 12 were significantly different between patients reporting a worse or better quality of life compared with their pre-stroke status (p < 0.05). For three domains (Thinking, Personality, Language) the difference was not significant. Furthermore, the total score for the French version of the SS-QoL scale was significantly different between these two groups (4.5 versus 3.8; p < 0.001) (Table 5). Similarly, the total score for the French version of the SS-QoL scale was significantly different between patients with an initial Rankin score less than or equal to 1 and patients whose Rankin score was higher than 1 (4.3 vs. 3.6, p < 0.001) (Table 6).
External construct validity - divergent validity
External construct validity - divergent validity
Quality of life at baseline compared with pre-stroke status. Group 1 = not worse QOL; Group 2 = worse QOL.
External construct validity: divergent validity
Quality of life according to the modified Rankin score. Group 1: mRS > 1; Group 2: mRS = (0-1).
All domains of the SS-QoL had excellent test-retest and inter-observer reliability. Intraclass correlation coefficients were all above 0.88 (Table 7).
Fidelity
Fidelity
Responsiveness was assessed in each domain for patients who had reported a change in their quality of life between inclusion and their pre-stroke status, using a –2 to +2 Lickert scale ranging from “much worse” to “much better”. Three patients were lost because of one death and two refusals to participate. Nine of the twelve domains of the French version of the SSQOL were mildly to moderately responsive (ES> 0.3) (Table 8). The Energy domain was the most responsive (ES> 0.7). In contrast, Social Role and Work/Productivity domains were the least responsive (ES< 0.2).
Responsiveness
Responsiveness
The acceptability of our questionnaire was good and there were no missing responses. The distribution of responses reflected no floor effect (Table 2). We found a ceiling effect ranging from 23.4% to 66.2% for seven domains (Family role, Language, Mobility, Mood, Self-care, Upper extremity function, Vision and Work/Productivity). It took a mean of 10.3 minutes to complete the SS-QoL.
Discussion
This study demonstrated that the French version of the SS-QoL scale appears to be reliable, valid, and moderately responsive to change to assess quality of life in stroke patients.
The feasibility and acceptability of the SS-QoL scale were good. There were neither missing data nor floor effects. Although there were moderate ceiling effects in some areas, only the Vision domain had a significant ceiling effect (66.2%), which is consistent with findings from the validation of both the original scale (Williams, et al., 1999), and its Danish version (Muus, Williams, & Ringsberg, 2007).
Seven of the twelve domains showed good internal consistency with α> 0.7, and four areas showed acceptable internal consistency with α> 0.65 (Energy, Family role, Thinking and Vision). In contrast, the Personality domain showed the lowest internal consistency (α= 0.58). Validation studies of other SS-QoL versions did not report such a low internal consistency for the Personality domain. Indeed, the lowest reported α was for the Thinking domain in the Turkish version (Hakverdioğlu Yönt, & Khorshid, 2012), and for the Family role domain in the Mexican version (Cruz-Cruz, et al., 2013). Our result may indicate poor homogeneity of the Personality domain. One hypothesis could be that the different items contained in this domain measure several concepts (evolution of the personality since the stroke or fixed aspects of personality). Another explanation could be an issue of translation and cross-cultural adaptation into French. However, after reviewing the different stages of the translation, the wording of questions did not appear to be problematic. Finally, it cannot be excluded that the low internal consistency for the Personality domain could be attributable to a sampling bias. Indeed, Cronbach’s alpha coefficient depends on the total number of items and on the size and the heterogeneity of the sample. As a consequence, a small and homogeneous group of patients decreases the α (Fayers, & Machin, 2000). Because our study was population-based, initial stroke severity was low (mean NIHSS of 5) as compared with hospital-based recruitment, and the functional outcomes of patients at 3 months were therefore good. As a result, this small sample appeared to be quite homogeneous, which may explain lower overall α values than those reported for the original (α= 0.73 to 0.89) and Danish (α= 0.81 to 0.94) versions of the SS-QoL.
Exploratory factorial analysis showed that the French version of the SS-QoL was organized into eight factorial axes. Items from the Family role, Mobility, Self-care, Upper-extremity function and Work/productivity domains seemed to gather into a new large domain. The Turkish (Hakverdioğlu Yönt, & Khorshid, 2012) and German (Ewert, & Stucki 2007) versions also highlighted a reduced number of domains, with a large domain gathering Self-care, Work/productivity, Mobility and Upper-extremity function dimensions. In the original development of the SS-QoL, the authors suggested that a smaller number of domains could be found in future factorial analyses (Williams, et al., 1999). They especially pointed out that the exact number of domains was less important than the content of items, as the final goal was to accurately assess all of the potential effects of stroke, in a patient-centered way. Although our factorial structure did not exactly correspond to the original version, the overall consistency of the scale was maintained.
The French version of the SS-QoL scale showed good convergent validity. Seven domains (Fatigue, Family role, Work/Productivity, Mobility, Mood, Autonomy, Functionality of the upper limbs) correlated strongly with validated external associated scales (ρ> 0.5). The Thinking domain did not correlate with the MMSE. This expected result could be explained by the fact that the MMSE is not sensitive enough to screen for cognitive impairment after stroke (Sivakumar, Kate, Jeerakathil, Camicioli, Buck, & Butcher, 2014; Pendlebury, Mariz, Bull, Mehta, & Rothwell, 2012), especially to explore dysexecutive syndromes. In addition, patients with cognitive impairment were excluded from the study because of their inability to complete the questionnaire. In contrast, items from the Thinking domain of the SS-QoL were very sensitive, and were better able to detect subtle functional signs (for example: “do you have trouble remembering things?”). The Personality domain did not correlate with the HAD. This result was not found in either the original or in the Danish version, in which the Beck Depression Inventory was used as the comparative tool (Beck, Ward, Mendelson, Mock, & Erbaugh, 1960). This finding suggests that items contained in the Personality domain explored different aspects from those studied in the HAD scale, i.e. more emotional elements related to the patient’s personality rather than anxiety and depression aspects. As observed in the original, Danish and Mexican versions, the Vision and Language domains correlated moderately with corresponding items of the NIHSS (ρ> 0.35). The NIHSS score is not a powerful tool to assess these clinical features exhaustively. Actually, it measures neurological impairment rather than disability; it thus underestimates the impact of a neurological symptom on QOL. The use of a specific language test and a systematic visual field examination would have overcome this limitation but was not applicable in this study.
In support of its divergent validity, the French version of the SS-QoL scale was able to differentiate groups with known differences. Scores from patients who reported a worse quality of life compared with their pre-stroke status significantly differed from those who did not, which is consistent with a previous report (Williams, et al., 1999; Muus, Williams, & Ringsberg, 2007). Similarly, using the mRS as an objective endpoint, the overall score was significantly different between patients with an mRS > 1 at inclusion and those with an mRS equal to 0 or 1.
Other positive psychometric properties were the excellent test-retest and inter-observer reliabilities for a sample of 60 patients (ICC> 0.88). Concerning responsiveness, most domains of the French version of the SS-QoL were mildly to moderately responsive, but two domains were not responsive to change: Work/Productivity and Social role. Two studies previously investigated the responsiveness of the SS-QoL. In the Danish version, based on analyses of effect size, the authors concluded that eight domains were mildly to moderately responsive (SRM = 0.33–0.56) and four were non-responsive (Muus, Christensen, Petzold, Harder, Johnsen, Kirkevold, & Ringsberg, 2011). In contrast, better results were demonstrated in the original version (Williams, et al., 1999). The use of a homogeneous sample of patients with low initial severity and good functional outcomes in our study may explain our findings, because of ceiling effects, thus limiting any further evaluation of responsiveness. Another hypothesis could be that the time between the two measurements was too short.
Psychometric properties of different scales are difficult to compare because of their heterogeneity and the extended range of statistical analysis. The SS-QoL can be compared with the SIS, that is the only other specific scale validated in French. For content validity, we noted that in the validation study for the SIS, about 25% of patients responded that they would have liked to be asked about sexuality and fatigue (Caël, et al., 2015). Despite a smaller number of items, the SS-QoL explored a wider range of domains by including areas such as Fatigue, Personality and Vision. Moreover, one SS-QoL item does assess sexuality issues. The SS-QoL test-retest and inter-observer reliabilities were better with ICC all above 0.88 versus 0.65 in the SIS test-retest. Both scales suffered from ceiling effects. As expected and previously discussed, except for the Personality domain, α coefficients ranged from 0.65 to 0.91 in the SS-QoL versus 0.89 to 0.97 in the SIS. Both scales showed good divergent validity according to different endpoints: professional activity, Boston Diagnostic Aphasia Examination, NIHSS and BI scores in the SIS, mRS and self-declared overall quality of life in the SS-QoL. Convergent validity was more difficult to analyse because SIS scores were compared with those from the BI, the HAD and the Duke profile. Conversely, SS-QoL domain scores correlated with SF 36, HAD, BI, NIHSS, MMSE and FSS scores. In both scales, Self-care and Mood domains correlated well with the BI and the HAD, respectively (ρ= –0.56 in SIS versus 0.56 in SS-QoL and ρ= 0.89 in SS-QoL versus 0.71 the SIS, respectively). Concerning responsiveness, the SRM range was quite similar in both scales: 0.2–0.86 in the SS-QoL (except for the Energy domain which was not explored in the SIS) versus 0.16 to 0.9 in the SIS. The SRM of some domains, such as the Self-care domain (SRM = 0.8 in SIS versus 0.86 in SS-QoL) and the Hand function domain (SRM = 0.57 in SIS versus 0.66 in SS-QoL), were very similar in both scales. The SRM of the Language domain was better in the SS-QoL (0.75 versus 0.26) whereas the SRM of the Mobility domain was higher in the SIS (0.34 versus 0.71).
Our study has several limitations. As discussed above, the relatively small study sample may have influenced the internal consistency of the scale. Similarly, the homogeneity of our patients may have limited its psychometric properties. For example, two thirds of our patients had a low residual handicap (mRS < 2), which is not representative of the general stroke population. This situation can be explained by the fact that we excluded patients with a history of previous symptomatic stroke, and patients who could not be tested because they had severe aphasia or dementia, or were bedridden. However, because of the population-based setting of our study, patients were representative of those seen during follow-up in clinical practice, thus reinforcing the usefulness of this tool by clinicians.
Conclusion
The French version of the SS-QoL scale has satisfactory psychometric properties and thus appears to be a useful tool for measuring QOL in stroke patients. QOL is an emergent outcome that will be addressed in future epidemiological studies and clinical trials.
Conflict of interest
Yannick Béjot received honoraria or consulting fees from AstraZeneca France, Daiichi-Sankyo, Pfizer, BMS, Covidien, Bayer and MSD France. Other authors: none.
