Abstract
The Ruminative Thought Styles Questionnaire (RTS) conceptualizes rumination as repetitive, recurrent, intrusive, and uncontrollable thinking. This article outlines the development and validation of a Chinese language version of the RTS, the RTS-CH. Following independent translation, back translation, and final translation checking, the factor structure, convergent and divergent validity, and item-level congruence of the RTS-CH was examined and improved. The resultant scale showed equivalence to the RTS and had attractive psychometric properties. The RTS-CH is the first Chinese language rumination measure that does not have inherently negative or depressive content.
Keywords
Rumination is repetitive, recurrent, intrusive, and uncontrollable thinking (Brinker & Dozios, 2009). The act of rumination is transitory (Riley, 2010), but the tendency to ruminate is stable within an individual across time (Francisco, Puterman, DeLongis, & Pomaki, 2010). A high level of rumination is associated with an array of potentially negative psychological constructs, including anxiety, depression, and negative mood (Calmes & Roberts, 2007; Hughes, Alloy, & Cogswell, 2008; Muris, Roelofs, Rassin, Franken, & Mayer, 2005; Wilkinson & Goodyer, 2006). Some studies have found that rumination invariably affects negatively on mood (Joormann & Tran, 2009), while others indicate that while repetitive thought per se is a general concomitant of negative mood, rumination predicts negative mood above and beyond this in a depressed sample (Segerstrom, Tsao, Alden, & Craske, 2000). A tendency toward ruminating is also associated with nonsuicidal self-injury (Tait, Brinker, Moller, & French, 2014; Voon, Hasking, & Martin, 2013), and suicidal ideation (Surrence, Miranda, Marroquín, & Chan, 2009).
A measure of rumination needs to be broad and context agnostic. The widely used Response Styles Questionnaire (RSQ; Butler & Nolen-Hoeksema, 1994) and its later iteration, the Ruminative Responses Scale (RRS; Treynor, Gonzalez, & Nolen-Hoeksema, 2003) have been criticized for containing explicitly depressive content in its quantification of rumination (Brinker & Dozios, 2009). This could erroneously inflate the apparent relationship between rumination, depression, and associated constructs such as anxiety. An example of this is where R. Y. Hong (2007) uses both the brooding and reflective subscale of the RRS to operationalize rumination in general, where it should only be used as a transdiagnostic measure of rumination, depression, and anxiety (Liao & Wei, 2011).
The 20-item self-report Ruminative Thought Styles Questionnaire (RTS; Brinker & Dozios, 2009) is distinguished from other rumination instruments in that it does not limit repetitive thought to a single domain, specifically depressive thought. It instead measures a dispositional ruminative thought style, regardless of the temporal orientation (past or future focused thought) or type of rumination (e.g., brooding or reflective). This allows for a clearer investigation between rumination and its concomitants without being purely concerned with (and thus confounded by) explicitly depressive content, either due to implicit negative content within the instrument or inappropriate application of an instrument with negative subscales.
Continually demonstrating excellent internal consistency reliability (Cronbach’s α typically above .90), the RTS has primarily been used in research in English-speaking countries, such as Australia (i.e., Pgdippsych & Mclaren, 2011; Tanner, Hasking, & Martin, 2013; Tanner, Voon, Hasking, & Martin, 2013; Voon et al., 2013) and America (i.e., Armour et al., 2012; E. Hu, Koucky, Brown, Bruce, & Sheline, 2014). Its international applicability has been expanded by recent translations into Thai (Vatanasin, Thapinta, Thompson, & Thungjaroenkul, 2012) and Turkish (Karatepe, Yavuz, & Turkcan, 2013).
Chinese researchers have shown a recent interest in exploring rumination (i.e., W. Hong et al., 2010; Liu, Tian, Zhou, & Lu, 2012; Wu, Sun, Miao, Yu, & Wang, 2011; X. Zhou & Wu, 2016; X. Zhou, Wu, Fu, & An, 2015). A Chinese translated version of the RSQ is the most widespread, either in its full form (Guo & Wu, 2011; Liu et al., 2012), or in its later form, the RRS (as in W. Hong et al., 2010). Other rumination measures translated into Chinese include Conway’s Rumination Sadness Scale (as in Guo & Wu, 2011), the Dispositional Rumination Scale (as in Wu et al., 2011), the Chinese Anger Rumination Scale (Maxwell & Siu, 2008), and the Event Related Rumination Inventory (as in X. Zhou & Wu, 2016). Research using these measures in Chinese samples have uncovered significant associations between rumination and negative outcomes parallel to findings in Western samples (W. Hong et al., 2010; Liu et al., 2012).
Note that the instruments used in the above research may invariably include clearly negative content as a major component within the assessment of rumination. This is particularly problematic in a Chinese context, because the multifaceted concept of rumination deeply embedded in Chinese tradition has more positive connotations than its Western counterpart (Guo & Wu, 2011). An example of this positive view of rumination is “moral rumination” (as discussed in J. Zhou, 2004), a socially desirable process by which repeated thinking, evaluation, and learning about moral concepts leads to moral balance. This socially positive conceptualization of rumination is arguably incompatible with operationalization of rumination which assumes negative content. It should be noted that a positive social perception does not necessarily correspond to a positive emotional experience, hence there may be tension between social desirability effects and social values viewing rumination as positive, and the negative emotional state associated with rumination. For these reasons, Guo and Wu (2011) call for better understanding of rumination in a Chinese sample in their review of rumination in the Chinese context. This would require an instrument that does not include inherently negative or positive content, which are specific characteristics of the RTS.
A Chinese translation of the RTS (RTS-CH) is the first measure of rumination to be translated into Chinese that is not arguably confounded by negative or depressive content. Despite a burgeoning interest in China for cross-cultural research on themes relating to mental health, research comparing rumination in Western and Chinese samples (i.e., Jose, Kramar, & Hou, 2014) is in its infancy. It is an opportune moment for the RTS-CH to foster cross-cultural research into rumination, where the same instrument is appropriate in both Western and Chinese samples. This article outlines the development and validation of the RTS-CH, beginning with investigation of conceptual equivalence, and then focusing on psychometric equivalence. The ethical aspects of all constituent studies were approved by the Australian National University Human Research Ethics Committee.
Study 1
Cross-language statistical comparability, such as similar factor structures, is a necessary but not sufficient indicator of conceptual equivalence (Byrne & Campbell, 1999). The conceptual equivalence of the instrument and its meaning needs to be established before focusing primarily on item wording and internal consistency reliability. This can be threatened by cultural differences regarding the construct itself, or other factors such as a culturally based tendency to select less extreme scores on Likert-type items (Candell & Hulin, 1986). These threats can be clarified by the use of a bilingual sample (Sperber, Devellis, & Boehlecke, 1994). This first study sought to uncover any such potentially confounding differences by administering the English language version of the RTS to an Australian sample, and Chinese (English-Chinese bilingual) sample. Major discrepancies would be taken to indicate that the RTS may not be suitable in a Chinese context, and thus a translation of the scale would not be useful.
Method
Participants
Four hundred ninety-two psychology students in Australia were recruited via posters, and participated in return for course credit. Of this sample, 369 were Australian, and spoke English as their first language, and 123 were Chinese individuals currently residing in Australia. The Australians were aged 17 to 60 years (M = 21), 66% were female. The Chinese participants were aged 18 to 29 years (M = 20), 72% were female.
Materials
Alongside demographic questions, participants completed the English language version of the RTS.
The RTS
Respondents rated how well each of the 20 statements described them on a 7-point Likert-type scale, with 1 indicating not at all and 7 indicating very well (Brinker & Dozios, 2009). An example item is “I tend to replay past events as I would have liked them to happen.” Scores range from 20, indicating no trait rumination, to 140, indicating high trait rumination. The Chinese language version of this instrument is the subject of this validation process. The items were presented in a fixed order as shown in Table 1.
Within-Subjects Item-Wise Comparisons Between RTS and RTS-CH, Study 3.
Note. r is the correlation between RTS and RTS-CH for each item. t and the associated p column refers to item-wise t test comparing RTS and RTS-CH scores. RTS = Ruminative Thought Styles Questionnaire; RTS-CH = Chinese version of RTS.
p < .01. ***p < .001.
Procedure
All participants completed the RTS via an online survey.
Statistical analysis
Total score and internal consistency reliability was compared across samples using ANOVA and Cronbach’s alpha. To compare the measurement model across samples, we first conducted confirmatory factor analysis (CFA) using diagonally weighted least square estimation (DWLS) in model estimation to the English sample to confirm the factor model. DWLS is suitable for responses with categorical and ordinal nature (Bandalos, 2014). Next, we run a multigroup CFA to fit the factor model to both samples. We fitted a first model that allowed all parameters being freely estimated, and a second model that fixed the factor loadings to be equal for both groups. Metric invariance is satisfied if the fit indices of the second model do not significantly differ from the first model. The estimation was performed in R (version 3.0.1) and by using the lavaan (version 0.5-17) package. The latent variables are determined by fixing their variances to 1.0, with allowing for the free estimation of other parameters.
Results and Discussion
ANOVA revealed RTS scores in the Chinese (M = 88, SD = 21) and English (M = 87, SD = 20) samples were not significantly different. Results of the CFA indicated that a single-factor model with allowing 10 correlated residuals pairs 1 achieved the optimal model fit in the Australian sample, χ2(160) = 767.60, p < .001, comparative fit index (CFI) = 0.980, Tucker–Lewis Index (TLI) = 0.976, root mean square error of approximation (RMSEA) = 0.102, ω = 0.89; as well as in the Chinese sample, χ2(160) = 201.20, p < .001, CFI = 0.988, TLI = 0.986, RMSEA = 0.085, ω = 0.91. This model achieved sufficient model fit for both samples when parameters were allowed to be different across samples, χ2(320) = 1,068.79, p < .001, CFI = 0.982, TLI = 0.979, RMSEA = 0.098. The model fit statistics for a model that constrained the factor loadings to be equal across groups were χ2(339) = 1,106.7, p < .001, CFI = 0.982, TLI = 0.979, RMSEA = 0.096. The model fit measured by χ2 changed significantly, Δχ2 = 37.89, p = .006. However, there was no substantial decrease in other model fit indices, for example, ΔCFI = 0 and ΔRMSEA = 0.002.
The scores, internal consistency reliability (measured by coefficient ω), and factor structures for both samples were similar to those reported by Brinker and Dozios (2009) in the initial development of the RTS. This analysis revealed no substantial difference in the RTS between the Chinese and Australian samples, supporting the validity of subsequent Chinese language translation.
Translation Procedure
The initial version of the RTS-CH was developed in three stages, by three native Chinese speakers who were also fluent in English, working independently. In the first stage, L. Wan translated the instrument from English into Chinese. Following the gold standard outlined in Brislin (1970), back translation into English was carried out by a different person (Stage 2, Y. Shou), and the congruence between the original, translated and back-translated versions was explored by a third person (Stage 3, H. Lin). Any discrepancies were then discussed with the authors of the article, including J. Brinker (the author of the original RTS) and the translation was adjusted. During this initial translation, and later modifications, we followed the recommendations of Harkness and Shoua-Glusberg (1998): Where a literal translation conflicted with the intended meaning of the question, a translation according to the meaning was chosen. Across subsequent studies, the RTS-CH was then iteratively improved by comparison with the English language RTS, and examined in terms of convergent and divergent validity, and internal scale structure (this process can be followed in the supplementary materials). Simplified and traditional Chinese character variants of the RTS-CH were created. This was done to reach as wide an audience as possible; though simplified and traditional Chinese characters do not have different meanings, people from different regions may be expected to be more fluent in one or the other.
Study 2
This study focused on the factor structure of the RTS-CH, and convergent and divergent validity. In both the Western and Chinese literatures, there is an established link between rumination and negative affect (W. Hong et al., 2010), but not positive affect. Convergent and divergent validity of the RTS translation was ascertained by correlating the total RTS-CH scores with the Chinese language version of the Positive Affect Negative Affect Schedule (PANAS). The expectation was that the RTS should correlate significantly and positively with the negative axis of the PANAS (PANAS-NA), but not significantly with the positive axis (PANAS-PA).
Method
Participants
One hundred sixty-one members of the general public in China were recruited via social media, and participated via a Chinese survey website (sojump.com) which offers chances to win prizes as an incentive. Participants were required to be age 18 or above to participate in this study. Aged 18 to 60 (M = 31), 52% of participants were female. Most (78%) were living in the top four most developed provinces in China (Beijing, Shanghai, Guangdong, and Zhejiang).
Materials
Alongside demographic questions, participants completed the Chinese language versions of the RTS (fixed item order) and the PANAS.
The PANAS
This consists of a series of 20 adjectives (Watson, Clark, & Tellegen, 1988). Respondents rated how much they experienced like those adjectives on a 5-point unipolar Likert-type scale, with 1 = very slightly/not at all, and 5 = extremely, for the past month. The PANAS yields a positive affect score (PANAS_PA) and a negative affect (PANAS_NA) score. Both scores range from 10 (does not at all describe the participant) to 50 (very much describes the participant). Sourced from Huang, Tingzhong, and Ji (2003), the Chinese language version of the PANAS has been validated and achieved acceptable reliability in Chinese samples (Guo & Gan, 2010).
Procedure
Participation was via online survey. Participants provided demographic information, then completed the RTS and PANAS, in that order, without item order randomization. The remaining section of the survey was not relevant to the current study, so is not discussed here.
Statistical analysis
CFA with DWLS estimation was used to measure the fit of the one-factor model identified in Study 1. Correlations between the total scores of RTS with other scales was used to establish whether the pattern of convergent and divergent validity was consistent across English and Chinese versions.
Results and Discussion
Model fit statistics of a CFA indicated that the factor model in Study 1 was sufficient for the RTS-CH, χ2(160) = 445.97, p < .001, CFI = 0.969, TLI = 0.964, RMSEA = 0.106, ω = 0.89. Cronbach’s α for PANAS-PA was 0.90, and for PANAS-NA was 0.90. The RTS-CH total scores correlated significantly with the Chinese language PANAS-NA (r = .33, p < .001) but not the PANAS-PA (r < .02, p = .831), suggesting appropriate convergent and divergent validity. This indicates the RTS-CH has comparable psychometric and theoretical properties as the English language RTS. Though the overall questionnaire performed well, three items has low loadings. These unsatisfactory items were reworded by the team involved in the initial translation.
Study 3
Having established the RTS-CH performed well in terms of overall factor structure, this study sought to more closely examine item-wise equivalence. As per Sperber et al. (1994), Chinese-English bilingual participants were used to allow a clear within-subject comparison of the RTS and RTS-CH.
Method
Participants
Twenty-nine Chinese-English bilingual students currently studying at the Australian National University completed both the English RTS and the Chinese RTS. Aged 19 to 36 years (M = 25), 76% of participants were female.
Materials
Participants completed both the RTS and the RTS-CH (simplified characters).
Procedure
Participants completed the RTS and RTS-CH in pencil-and-paper format. The order (English or Chinese language first) was randomized, and participants were required to have a minimum of 30-min interval between the two language versions of RTS. Subjects were allowed to do self-directed distraction tasks during the interval.
Statistical analysis
Item-wise correspondence was established by paired correlation and t tests examining the within-subject similarity between RTS and RTS-CH responses for each item. If correlation or t tests indicate differences between scales (nonsignificant correlation or significant t test at α = .05), item rewording will be discussed.
Results
RTS total score means were 82.56 (SD = 17.16) for English RTS, and 83.61 (SD = 16.19) for RTS-CN. Cronbach’s α was .90 for both English RTS and Chinese RTS. Paired correlation and t tests were used to examine the similarity between RTS and RTS-CH responses for each item (Table 1). All items had strong, significant positive correlations across the two language versions, except for the unsatisfactory Item 11, which was reworded to better reflect the original RTS Item 11’s connotations of distraction. Turning to t tests, Items 3, 7, and 13 had weak differences in means. However, all three items also had strong, significant positive correlations with their English language counterparts, and no clear unintended wording connotations. For these reasons, these three items were not altered.
Study 4
Having established construct validity, convergent and divergent validity, and acceptable item-level equivalence in previous studies, this final study established both the construct and psychometric validity of the final version of the RTS-CH. Convergent validity was examined here by way of the Patient Health Questionnaire–9 (PHQ-9; Kroenke, Spitzer, & Williams, 2001) as a measure of anxiety and depression, and the Suicidal Ideation Attributes Scale (SIDAS; Van Spijker et al., 2014), as depression, anxiety, and suicidal ideation have been related to rumination in the literature (Hughes et al., 2008; Surrence et al., 2009).
Method
Participants
As part of a larger study, 255 students at the Central Normal University were notified via teachers’ invitation in class, and participated in an online survey in return for course credit. Participants were aged between 18 and 27 years (M = 19.54), and 80% were female.
Materials
In addition to the RTS-CH (simplified characters; randomized item order), participants completed two other scales in Chinese to establish convergent validity.
The PHQ-9
This was used to measure depression and anxiety (Kroenke et al., 2001). The PHQ-9 is the depression module of the PHQ that includes nine Diagnostic and Statistical Manual of Mental Disorders (4th ed.; DSM-IV; American Psychiatric Association, 1994) criteria. Each item is scored from 0 (not at all) to 3 (nearly every day). The total score of PHQ-9 was used to represent the severity of depressed mood.
Generalized Anxiety Disorder–7 items (GAD-7)
This was used to measure anxiety (Spitzer, Kroenke, Williams, & Löwe, 2006). The GAD-7 scores seven common anxiety symptoms. The total scores of the two scales were used as measures of depression and anxiety.
The SIDAS
This is a five-item scale that measured the severity of suicidal ideation (Van Spijker et al., 2014). The five items assess the frequency, controllability, closeness to attempt, distress, and interference with daily activities. Each item was rated on a 10-point scale. The total scores of the five items were used to indicate suicidal tendency.
Procedure
Subjects accessed the survey via the online survey platform LIME. As part of a larger study, participants completed the PHQ-9, General Anxiety Diagnosis-7, SIDAS, and RTS, in order. To balance fatigue effects, all instruments were administered with randomized question order.
Results and Discussion
The mean RTS-CH mean score was 90.33 (SD = 22.53). The measurement model of the RTS-CH was explored, before turning to convergent validity. Model fit statistics of a CFA indicated that the factor model in Study 1 was sufficient for the RTS-CH χ2(160) = 330.91, p < .001, CFI = 0.995, TLI = 0.994, RMSEA = 0.069, ω = 0.94. As shown in Table 2, all items were significantly loaded on the single factor.
Standardized RTS-CH Factor Loadings for the One-Factor Model, Study 4.
Note. The item number corresponds to the item number in Table 1. Confidence intervals refer to λ.
A correlation analysis using the total scores of RTS was conducted to measure the convergent validity of RTS with PHQ-9, GAD-7, and suicidal tendency. Cronbach’s alpha for PHQ-9 was .83, and for GAD-7 was .90. The RTS-CH was significantly and positively correlated with general depression (r = .23, p <.001) and anxiety (r = .25, p <.001). The RTS-CH was not correlated with suicidal tendency (r = 0.5, p = .460), likely because very few (only 6%) of the sample indicated they had thought about suicide.
General Discussion
This article outlined the development and validation of the RTS-CH, a Chinese language version of Brinker and Dozios’s (2009) rumination questionnaire, the RTS. Translation to create the RTS-CH was undertaken following the procedure outlined in Brislin (1970), with initial translation, back translation, and checking undertaken by three independent bilingual individuals. As per Harkness and Shoua-Glusberg (1998), translation proceeded with an emphasis on translating meaning rather than literal wording, with consultation with the original RTS developer to ensure meaning was preserved.
The resultant RTS-CH performs comparably with the English language RTS, with a clear single-factor solution, and excellent internal consistency reliability. Similarity in within-subject administration, and patterns of convergent and divergent validity, suggests the RTS-CH also demonstrates construct validity. It does not correlate with positive affect, and is significantly positively associated with negative affect, anxiety, and depression, in line with what may be expected given both the Western and Chinese literature (W. Hong et al., 2010; Hughes et al., 2008; Muris et al., 2005; Wilkinson & Goodyer, 2006). Together, these results suggest that the scale is capturing the construct of rumination well, without inadvertently tapping unexpected psychological phenomena.
Given the relationship between suicidal ideation and rumination in the Western (Surrence et al., 2009) and Chinese (Liu et al., 2012) literatures, we had expected the RTS-CH would be significantly associated with suicidal thoughts and behaviors. This was not the case, perhaps because too few participants reported suicidality. Alternatively, a possible confound between negative thought and depression in extant Chinese language rumination measures may have biased previous conclusions. Either explicitly or implicitly, current Chinese language rumination questionnaires tend to measure negative content more akin to the “brooding” subtype. The way in which the RTS-CH is significantly associated with negative mood and anxiety, but not suicidality, suggests that the suicidality literature may be skewed due to the depressive content of rumination instruments being used. This is an important avenue for future suicidality research in both English- and Chinese-speaking countries.
The primary limitation of the development of the RTS-CH was sampling. The majority of participants were sourced from nonclinical settings, such as the general population and university student populations. Future research should investigate how the RTS-CH performs among individuals from a clinical population, as more extreme scores may reveal undetected psychometric problems with the RTS-CH. In addition, the majority of the Chinese participants came from urban, relatively wealthy areas. There are marked cultural differences across China, particularly between urban and rural areas (Hesketh & Ding, 2005), so accordingly the applicability of the RTS-CH in rural China should be evaluated in future research.
The RTS is a measure of rumination, operationalized as repetitive, recurrent, intrusive, and uncontrollable thinking. Its attractive psychometric properties and lack of inherently negative or depressive content make it a useful tool for rumination research. Due to its links with depression and suicidal thoughts and behaviors, there is growing interest in rumination in Chinese samples. This article outlined the translation and development of the RTS-CH, a Chinese language version of the RTS. The RTS-CH is the first Chinese language rumination measure that does not have inherently negative or depressive content. It also has attractive psychometric properties in terms of internal consistency reliability, and convergent and divergent validity. The RTS-CH is consequently well placed to meaningfully contribute to future research into rumination in a Chinese context.
Footnotes
Acknowledgements
The authors would like to acknowledge H. Lin and L. Wan for their help with translation. Thanks to Y. Shou and X. Zhu for collecting Chinese data in study, and W. Tam for collecting Chinese data in Study 2.
Authors’ Note
The ethical aspects of this research have been approved by the Australian National University Human Research Ethics Committee.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
