Abstract
The current research investigated the psychometric characteristics of the Turkish adaptation of the Teacher Emotions Scale (TES). Study I included a sample of 164 Turkish middle school teachers, and Study II included a sample of 222 mathematics teachers. The three-emotion factor model was confirmed in both studies with satisfactory reliability estimates. This model was invariant across gender. In addition, the latent mean structure analyses revealed no significant differences in teacher emotions by gender. Regarding convergent validity, teacher emotion scores showed a significant relationship with teacher self-efficacy. Thus, the Turkish version of the TES demonstrated sound psychometric characteristics for measuring teachers’ academic emotions in educational contexts.
Emotion is a complex and controversial construct, to comprehensively explain which several scientific and constitutive definitions have been proposed in the literature. For example, Rosenberg (1998) defined emotions as “acute, intense, and typically brief psychophysiological changes that result from a response to a meaningful situation in one’s environment” (p. 250). Hochschild (1990) proposed a more extended characterization of emotion as “an awareness of four elements that we usually experience at the same time: a) an appraisal of a situation, b) changes in bodily sensations, c) the free or inhibited display of expressive gesture, and d) a cultural label applied to specific constellations of the first three elements” (p. 118–119). More recently, Feldman Barrett (2012) described emotions as people’s socially constructed and subjective interpretations of experiences and the expression of these experiences via different channels. Depending on the valence of emotions, these experiences may induce various physiological, cognitive, and behavioral changes in people. For example, positive emotions may promote empathetic thinking, problem-solving ability, and acceptance of responsibility (Isen, 2008). On the other hand, negative emotions might lead to a decline in motivation and employment of rigid or superficial learning strategies rather than productive use of cognitive resources (Pekrun, 2006; Pekrun & Linnenbrick-Garcia, 2014). Whatever their effect, the critical role of emotions in people’s lives cannot be denied.
From this perspective, teaching is an emotion-laden job, particularly with regard to teacher–student, teacher–teacher, teacher–parent, and teacher–administrator interactions, and teachers experience various emotional states during their professional careers. Bahia et al. (2013) identified 160 teacher emotions experienced by teachers in Portuguese classrooms, ranging from joy and love to sadness, fear, anger, as well as surprise. In a study of teachers’ journal writing, O’Toole et al. (2010) found 113 emotions within a period of 5 days of entries, most expressed by tertiary teachers. The differential nature of teacher emotions was also explored by Prosen et al. (2014). Although primary school teachers experienced both positive and negative emotions, the latter seemed to outweigh the former, especially anger, which was highest on the list. Anttila et al.’s (2016) examination of teacher candidates’ emotions corroborated previous findings of many distinct emotions, including enthusiasm, satisfaction, and interest, but also disappointment and inadequacy. The experience of varying emotions in academic settings is found to be unavoidable for both pre-service and in-service teachers. However, the intensity and the valence of these emotions may differ across several conditions. Accordingly, teachers enjoy positive teacher-student relationships; feel gratified by high levels of student motivation, engagement, and success (Becker et al., 2015; Hagenauer et al., 2015; Hosotani & Imai-Matsumura, 2011; Khajavy et al., 2018; Sutton & Wheatley, 2003); take pride in students’ accomplishments, whether expected or unexpected; and be encouraged by students’ positive feedback (Buechner et al., 2016; Khajavy et al., 2018). On the other hand, teachers may experience anxiety due to inadequate teaching preparation, feel ineffective when unable to respond to students’ questions, or lack confidence as novices in the profession (Khajavy et al., 2018; Sutton & Wheatley, 2003). Teachers may also feel anger because of classroom management and discipline problems, student disengagement, and colleagues’ unsupportive behaviors (Becker et al., 2015; Chang, 2009; Hagenauer et al., 2015; Hargreaves, 2000; Hosotani & Imai-Matsumura, 2011; Khajavy et al., 2018; Sutton & Wheatley, 2003); sadness about students’ failures (Hosotani & Imai-Matsumura, 2011); shame due to feeling responsible for their students’ failures (Khajavy et al., 2018); and disengagement due to students’ lack of interest and motivation.
In addition to these potential sources, teacher emotions may be related to a wide range of cognitive and psychological constructs such as teachers’ pedagogical content-knowledge formation (Brigido et al., 2013), their self-efficacy (Brigido et al., 2013; Borrachero et al., 2013; Buric & Macuka, 2017; Chen, 2018; De Mauro & Jennings, 2016; Stephanou et al., 2013), their teacher identity (Bair et al., 2010), their teaching quality (Chen, 2019; Frenzel et al., 2015; 2016; Hagenauer & Volet, 2014; Klassen et al., 2011; Sutton & Wheatley, 2003; Taxer & Frenzel, 2015), and their interactions with students (Becker et al., 2014; Chen, 2019; Hagenauer et al., 2015; Yan et al., 2011).
It is important to investigate both potential sources and consequences of teachers’ academic emotions. Toward this end, several scales have been developed to measure teachers’ emotions, many of which are adapted to different languages and cultures. For example, Chen (2016) developed the Teacher Emotion Inventory (TEI) and validated it in Asian contexts. The TEI included the emotions of sadness, anger, fear, and love. Exploratory and confirmatory factor analyses resulted in five emotional dimensions for primary school teachers. Positive emotions were mainly related to interactions with students and colleagues. In contrast, negative emotions were linked to educational policy changes and teachers’ difficulty preserving a balance in their lives. Therefore, the TEI seems less comprehensive and may provide a limited measure of teachers’ emotions while interacting with students. Frenzel et al. (2016) adopted a discrete trait-based approach in developing the Teacher Emotions Scale (TES), taking into account the frequency and saliency of real-life emotions. Accordingly, the instrument consists of 24 items with a four-point Likert scale in two sections, addressing teacher enjoyment, anger, and anxiety in general and with regard to a specific group of students. To assess construct validity, Frenzel et al. (2016) tested the TES with a single-emotion factor model, a two-factor (positive vs. negative affect) model, and a three-emotion factor (enjoyment, anger, and anxiety) model with different groups of participants. The factor analyses supported the three-emotion factor model, which yielded satisfactory internal consistency estimates for each dimension. Hong et al. (2016) revised the TES by adding pride and frustration and validated the revised form with Japanese and Korean teachers. However, frustration items cross-loaded with anger dimension and displayed low reliability.
Prevalent research on discrete emotion categories (e.g., Henschel & Roick, 2018; Skiffington et al., 1998) stresses the complex structure of emotions, so there is a need to explore the structure of teacher emotions with accurate measures for different cultures. As Frenzel et al.’s (2016) scale takes into account the frequency and saliency of teachers’ real-life emotions only in interactions with students, the TES seems quite specific. Accordingly, the present research aimed to adapt the TES to the Turkish language and obtain evidence concerning its psychometric characteristics in Turkish educational contexts.
Methods
Translation and Adaptation Process
Merenda (2006) described three basic steps to satisfy item and test equivalence in the scale adaptation process. Following those steps, the TES items and response scales were first reviewed from emic and etic perspectives before translation. The TES was then translated into the Turkish language by the researcher and two bilingual experts. The “decentering” method rather than a literal translation of each item was employed. The Turkish translation of the scale was then back-translated to English by three experts who had a good command of English. The original and back-translated versions were compared to ascertain that the latter corresponded to the original scale. Problems were found with the equivalency of the translated versions for the phrases “feeling annoyed” (item 3), “being frustrated” (item 5), and “having much fun” (item 6). Merenda emphasized that some items may not “be directly transferrable” (p. 310) and may need modification or replacement. Thus, three experts in Guidance and Psychological Counseling were asked to clarify the meanings of those words to make a well-informed recommendation for their translation. The experts agreed on the best expressions to match the original scale. In addition, before finalizing the instrument, cognitive interviews were carried out with two former mathematics teachers, in which the interviewees were asked to identify unclear words, phrases, or sentences in the scale. They also commented on the format and the design of the instrument. They found no problematic items and considered the scale’s length and appearance appropriate. Therefore, no changes were made to the final version after the interviews.
Participants
Cluster sampling, in which intact groups are randomly selected (Frankel et al., 2019), was employed in recruiting a different sample of middle school teachers. Two independent samples were selected: the first sample was used for Study I—pilot study, whereas the second sample was used for Study II—main study.
Sample for Study I
The Teacher Emotions Scale (TES) was administered to 164 subject areas (Turkish, math, science, and social studies) teachers (67.7% female) from 16 public middle schools in four central districts of a metropolitan city in Turkiye. Eighty-two (50%) had ten or fewer years of teaching experience, 49 (29.9%) had 11–20 years, and 25 (15.2%) had 21 or more years of teaching experience. Eight teachers did not mention their years of teaching experience. Regarding the faculties from which they graduated, 121 (73.8%) were graduates of Education Faculties, and 33 (20.1%) were Faculty of Arts and Science or Open University graduates with a Pedagogical Formation Certificate for teaching. 10 teachers did not mention their graduation faculty. Besides, 140 teachers had a bachelor’s degree (85.4%), while 16 teachers completed a graduate-level program and had a master’s or doctoral-level degree (9.7%). Eight teachers did not specify their level of education.
Sample for Study II
Because teachers in different subject areas might experience different emotions, the second study sample was restricted to one subject area, mathematics, to control for such variation. It was assumed that mathematics teachers might experience particular issues because of the abstract nature of the discipline, so the scales were administered to 222 middle school mathematics teachers (66.7% female) from 53 public middle schools in 11 central districts of a metropolitan city in Turkiye. The majority of the teachers had bachelor’s degrees (n = 195, 87.8%), 26 (11.7%) had master’s degrees, and one (0.50%) had an Associate degree. Most had graduated from a Faculty of Education (n = 142, 64%), while some had graduated from a Faculty of Arts and Science (n = 55, 24.8%) and other faculties (n = 3, 1.4%). The remaining 22 teachers (9.9%) did not provide information about their faculty of graduation. Participants had 11 years of teaching experience on average (SD = 7.76), ranging from 1 month to 41 years.
Measures
Demographics information
In both study samples, demographic information included the teachers’ gender, age, latest education degree, faculty from which they graduated, and years of teaching experience.
Teacher Emotions Scale
Teacher Emotions Scale (Frenzel et al., 2016) is a multidimensional self-report instrument measuring three emotions: enjoyment (4 items, e.g., “I teach these students with enthusiasm”), anger (4 items, e.g., “I often have reason to be angry while I teach these students”), and anxiety (4 items, e.g., “I feel uneasy when I think about teaching these students”). Teachers respond using a 4-point Likert scale ranging from “strongly disagree” (1) to “strongly agree” (4). The questionnaire has two sections, 12 items addressing teaching in general and 12 addressing teaching a specific student group, to allow for differentiation between overall emotions about teaching and emotions while teaching a specific group. The instructions for responding in the second section direct teachers to think about the students in one of the classes they are currently teaching. The items are the same in both sections except for the addition of the phrase “these students” at the end of each item in the second section. In the present study, only the second section was used.
Frenzel et al. (2016) constructed a three-emotion factor structure based on the discrete emotion approach. For the English version, the three-emotion factor model of the general scale has the following goodness-of-fit indices: Comparative Fit Index (CFI) =.943, Standard Root Mean Square Residual (SRMR) = .044, and Root Mean Square Error of Approximation (RMSEA) = .068. The English version of the student-group specific scale has the following goodness-of-fit indices: CFI = .976, SRMR = .030, and RMSEA = .052. Thus the three-emotion factor model has satisfactory goodness of fit. Also, Cronbach’s alpha coefficients for the general emotions sub-scale were .73 for enjoyment, .80 for anger, and .81 for anxiety, and for the student-group specific emotions sub-scale, they were .80 for enjoyment, .87 for anger, and .87 for anxiety, all above the .70 threshold.
Teachers’ Sense of Efficacy Scale
Teachers’ Sense of Efficacy Scale was developed by Tschannen-Moran and Woolfolk Hoy (2001) to measure in-service and pre-service teacher self-efficacy beliefs. TSES was designed to have a nine-point rating scale, with responses varying from 1 (nothing) to 9 (a great deal). The scale has long (24 items) and short (12 items) versions, including both of which are composed of three dimensions: self-efficacy for student engagement (SE; 8 items, e.g., “How much can you do to help your students value learning?”), self-efficacy for instructional strategies (IS; 8 items, e.g., “How well can you implement alternative strategies in your classroom?”), and self-efficacy for classroom management (CM; 8 items, e.g., “How much can you do to get children to follow classroom rules? ”) In study II, the Turkish adaptation of the long version of TSES (Çapa et al., 2005) was used. The Cronbach’s alpha estimates were .85 for SE, .89 for IS, and .93 for CM.
Procedure
The permissions for the study were granted by the Human Subjects Ethics Committee (HSEC) of the host university and the Ministry of National Education in Turkiye. Study I was carried out in the Fall semester, and study II in the Spring semester of the 2018–19 academic year. Informed consent to participate was obtained from the teachers, who were informed of the purposes and procedures and their rights as participants before data collection. They were requested to fill out the scales with one of their classes in mind.
Data Analysis
Confirmatory factor analyses (CFA) were performed through Mplus 8.1 (Muthen & Muthen, 2017) to validate the factorial structure of TES in Turkish educational contexts. Consistent with the original scale (Frenzel et al., 2016), three models were tested in both studies. Accordingly, one-emotion factor model (Model I), a two-factor model (positive vs. negative affect, Model II), and the three-emotion factor models (enjoyment, anger, and anxiety, Model III) were tested (Figure 1). To evaluate the model fit, Kline (2016) recommends giving estimates of model Chi-Square test value, Steiger-Lind Root Mean Square Error of Estimation (RMSEA), Bentler Comparative Fit Index (CFI), and Standardized Root Mean Square Residual (SRMR) values. In the present study, chi-square values (χ2) and SRMR as absolute fit indices, CFI and NNFI values as relative fit indices, and RMSEA values as evidence of non-centrality fit indices were utilized (Kline, 2016). To adjust for non-normality, the Satorra–Bentler corrected χ2 values (Satorra & Bentler, 1994) were calculated. Reliability analyses were conducted using SPSS 22 (IBM SPSS Corp., 2013). CFA Models for Teacher Academic Emotions. Upper part: Model I (one-emotion factor model), Middle part: Model II (two-factor model), Lower part: Model III (three-emotions factor model). Enj represents enjoyment, anx=anxiety, ang=anger.
In addition, multigroup CFA was conducted to examine measurement invariance of the scale with female and male teachers using the sample of 222 middle school mathematics teachers in Study II. This analysis involved several steps starting with testing the model for male and female groups separately (Byrne, 2013). The following steps included models ordered in an increasingly restrictive manner and these nested models were compared using chi-square difference test. A non-significant difference was an indicator of invariance across groups. Difference in the CFI (ΔCFI) was also used, where difference less than <.010 showed invariance (Chen, 2007). In Model 1 (configural), no equality constraints are imposed, and this model serves as a reference model. Model 2 (metric invariance) imposes invariance of factor loadings across gender. In Model 3 (scalar invariance), both factor loadings and intercepts are treated as invariant across gender. Model 4 (strict invariance) adds the constraint of invariant residuals in addition to factor loadings and intercepts. Meredith and Teresi (2006) described last three steps as weak, strong, and strict tests of invariance, respectively. Testing strict invariance is considered unnecessary in practice (Wang & Wang, 2019). Therefore, strict invariance was not tested in the present study. Finding the metric and scalar invariance model tenable allows for researcher to compare group mean differences on the latent constructs (Meredith & Teresi, 2006). Next, equivalence of latent means across gender was tested via Mplus 8.1 to provide construct validity evidence for multigroup differences.
A canonical correlation was performed using the data collected in Study II to provide convergent validity evidence through SPSS 22. In light of the proposed relationship in the literature between emotion and self-efficacy, the relationship between the dimensions of TSES (i.e., SE, IS, and CM) and TES (i.e., enjoyment, anger, and anxiety) was investigated. Also, meaningful canonical variate(s), canonical loadings, and the redundancy indices were inspected for further validity.
Results
Factorial Validity and Reliability
Confirmatory Factor Analysis Results for Study I and II.
In Study II, CFA with Satorra–Bentler corrected chi-square test yielded significant results for three models. As displayed in Table 1, Model II and Model III fit the data better than Model I, corroborating the Study I results. However, Model III fit slightly better when CFI, NNFI, and RMSEA values were compared. Therefore, Model III was chosen, and the standardized estimates are presented in Appendix 2. All of the loadings were .40 and above.
Cronbach’s alpha coefficients were examined for reliability. As Model III fit better than the other models, internal consistency estimates were computed for each emotion dimension. Cronbach’s alphas were .92 and .90 for enjoyment, .75 and .75 for anger, and .87 and .87 for anxiety in study I and II, respectively.
Measurement Invariance
Measurement Invariance of the TES across Gender.
Note. None of the ∆χ2 was significant at 0.05. CFI = Comparative Fit Index.
To compare latent mean differences by gender, the strong invariance model was examined by setting the means of the three factors as 0 for men and free for women. Analysis showed that men and women did not differ on enjoyment (z = 1.01, p = .31), anger (z = 0.16, p = .87), and anxiety (z = −0.19, p = .85).
Convergent Validity
For convergent validity evidence, a canonical correlation was also performed with the sample of study II to determine the potential relationships between the dimensions of TES and TSES. The relationships between sets of multiple dependent and independent variables were examined through canonical correlation analysis (Hair et al., 2019). The first set consisted of three-emotion dimensions (i.e., enjoyment, anxiety, anger), and the second set included self-efficacy for student engagement (SE), self-efficacy for instructional strategies (IS), and self-efficacy for classroom management (CM). In accordance with the literature, a positive relationship was expected between enjoyment and TSES dimensions, and a negative relationship between either anxiety or anger and TSES dimensions.
Canonical Correlation Results.
According to the variances within and between sets for the first canonical variate, 69% of the variance was explained by the teacher emotions set. In comparison, the teacher self-efficacy set accounted for 57%. Based on the redundancy results, while the teacher emotions set explained 9% variance in the teacher self-efficacy set, the teacher self-efficacy set explained 8% variance in the teacher emotions set.
Discussion
For this research, the TES was translated into the Turkish language. Confirmatory factor analyses and reliability analyses were performed to determine the scale’s psychometric properties in two studies. In line with the original scale, three models were tested, among which Model II (the two-factor model or positive vs. negative affect model) and Model III (the three-emotion factor model) fit the data better than Model I (one-emotion factor model) in both studies. Frenzel et al. (2016) reported similar findings, particularly the high fit indices for Model III. Although the fit indices seemed to be at the edge of criteria for both German and Canadian samples, Model III results best fit the data. Also, in the current research, Model III seemed to fit the data better when indices were compared in both studies. The reliability coefficients were also above the cut-off criterion (above .70) for each emotion dimension (Nunnally, 1978).
Although significant differences in teachers’ academic emotions regarding gender have been found in some studies (e.g., Borrachero et al., 2014; Kimura, 2010; Lee et al., 2016), many have indicated no difference (e.g., Hargreaves, 1998; Mattern & Bauer, 2014; Sutton, 2004; Yin, 2015). Given these mixed findings, no conclusions can be drawn in the literature about gender differences in teachers’ academic emotions. Strong measurement invariance across gender was observed in the present study that indicated the TES was assessing the same latent constructs in men and women. Therefore, mean values can be effectively compared across genders. No significant difference was found based on latent mean structure analysis, adding additional support for the use of the TES.
Also, convergent validity was examined by looking at the relationship between teacher emotions and teacher self-efficacy. According to canonical correlation results, enjoyment had a positive relationship with teacher self-efficacy dimensions, whereas anxiety and anger had a negative relationship. This finding was in line with the literature, confirming that self-efficacy was positively associated with positive emotions and negatively associated with negative emotions (e.g., Brigido et al., 2013; Borrachero et al., 2013; Buric & Macuka, 2017; Chen, 2018; De Mauro & Jennings, 2016; Stephanou et al., 2013).
Overall, the Turkish version of the TES displayed appropriate psychometric characteristics for measuring teachers’ academic emotions in Turkish educational contexts. Validity and reliability findings supported the assumption that the scale reflected the three dimensions of the original scale: enjoyment, anger, and anxiety (Frenzel et al., 2016). As the first instrument to systematically measure teachers’ academic emotions, TES assesses teachers’ trait-based retrospective judgments about their emotional experiences in non-obtrusive, economical, and easy ways (Frenzel et al., 2016). Although there is still a debate on differentiating trait- and state-based emotions (Nett et al., 2017), the trait-based scale in this study could also be converted into state-based assessments for future studies. However, the limited number of emotions (i.e., anger, anxiety, enjoyment) in the scale might cause confusion by focusing on the valence of emotions as positive or negative rather than highlighting their distinctive nature. That possibility might help explain the higher index values in Model II. Indeed, a bulk of the research in the literature stressed the differential nature of teacher emotions (e.g., Frenzel et al., 2015; Hagenauer et al., 2015; Šarić, 2015; Sutton & Wheatley, 2003). To explore this direction further, the number of emotions might be increased in future studies.
In the present study, the adapted version of the TES was administered in Turkish educational contexts. Thus, the participants had different socio-cultural characteristics from those of the teachers who participated in administering the original scale. The results were similar in the two settings, providing some evidence about the universal nature of emotions. The availability of the scale in different languages might advance research in this field to better understand emotions and to examine how teachers regulate and cope with their emotions, which, in turn, might suggest solutions to the problems related to emotional exhaustion frequently experienced by teachers (Karakelle & Canpolat, 2008; Yıldız Kırılmaz et al., 2003; Seferoğlu et al., 2014). Furthermore, the scale might be used to measure teacher emotions associated with a specific group of students to avoid overgeneralization of their emotions. As emotions are unstable and highly specific for each person and condition, general emotion scales might not reveal teachers’ actual emotional states in particular settings and situations. In this regard, the TES-student group-specific scale can enable researchers to detect teachers’ academic emotions with a maximum level of specificity. Moreover, the scale might be administered to teachers in different subject areas and at different grade levels if the instructions are adapted accordingly.
Emotion, especially teacher emotion, is a nascent field. Further development requires more research in this area to fully explore possible antecedents and consequences of teachers’ emotions and their relations with the emotions of other stakeholders in education (i.e., students, administrators, and parents). In future studies, in addition to convergent validity, the divergent validity of TES should be examined to provide sufficient evidence for the external validity of the instrument. Also, test–retest reliability evidence could be provided to confirm the stability of the findings.
Footnotes
Author Note
This study was just a part of the Doctoral Dissertation of the corresponding author under the supervision of the second author at Middle East Technical University.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the Middle East Technical University-Scientific Research Projects Coordination Unit (BAP) [METU-BAP-YLT-502-2018-2766] and Turkish National Science Foundation (TUBITAK) 2214-A International Research Fellowship Programme.
Appendix I
Standardized Estimates of Items across Model II and Model III for Study 1
Model II
Model III
Dimension
Items
Standardized Estimates
Dimension
Items
Standardized Estimates
Positive
Item 11
.89
Enjoyment
Item 11
.89
Item 6
.89
Item 6
.89
Item 7
.84
Item 7
.84
Item 1
.81
Item 1
.81
Negative
Item 12
.87
Anxiety
Item 12
.88
Item 5
.83
Item 10
.80
Item 10
.81
Item 8
.65
Item 9
.79
Item 2
.43
Item 4
.78
Anger
Item 5
.84
Item 3
.76
Item 4
.80
Item 8
.68
Item 9
.79
Item 2
.43
Item 3
.77
Appendix II
Standardized Estimates of Items across Model III for Study 2
Dimension
Items
Standardized Estimates
Enjoyment
Item 11
.91
Item 6
.86
Item 1
.83
Item 7
.76
Anxiety
Item 12
.84
Item 10
.69
Item 8
.59
Item 2
.48
Anger
Item 5
.87
Item 4
.83
Item 3
.75
Item 9
.74
