Abstract
Several studies have focused on identifying the significant behavioral predictors of learning performances in web-based courses by examining the log data variables of learning management systems, including time spent on lectures, the number of assignments submitted, and so forth. However, such studies fail to quantify the impact of emotional, motivational, behavioral, and cognitive–metacognitive factors simultaneously. This research was an attempt to understand the relations between students’ motivation, cognitive–metacognitive strategies, behavior, and learning performance in the context of blended courses in higher education. Then, relevant predictors are used to obtain a model to classify the students’ performance and to identify those who are at risk of failing the course. The authors conducted an empirical study in a higher educational course with 137 Mexican students. Nineteen variables related to emotions, motivation, cognitive–metacognitive strategies, and behavior. Only six were found to be significant. These variables explain approximately 67% of the variance between each student’s overall grade. The model, based on those variables, correctly classifies 96% of the students.
Keywords
Introduction
Efficiency levels in higher education have been criticized for their graduation and abandonment rates. The Internet has become an important tool to increase the number of enrolled students in higher education. Thus, online and blended learning are two solutions to the issues of graduation and abandonment rates. Approximately 504,000 Mexican students pursue a bachelor’s degree by means of online or blended programs (Secretaría de Educación Pública, 2015). However, only 42% of students conclude their career on time (Asociación Nacional de Universidades e Instituciones de Educación Superior, 2015). Students fail or drop out of the courses. This delays their graduation or, in the worst-case scenario, causes them to abandon their studies.
Most universities around the world offer online courses and careers. Learning management systems (LMS) share learning resources through a system to monitor student learning progress (You, 2015). Several empirical studies have examined LMS log data variables (in the context of distance learning), including time spent on lectures, the number of assignments submitted, and so forth. However, the studies fail to quantify the impact of emotional, motivational, and cognitive–metacognitive factors at the same time.
Research has recognized the importance of emotions in educational settings. Several studies have focused on emotions’ relationships to learning achievements. According to Pekrun (2006) and Pekrun, Frenzel, Goetz, and Perry (2007), emotions related to learning activities and outcomes can impact learning achievements. Based on the control-value theory, Pekrun (2006) proposed a theoretical framework to describe the effects of these emotions on cognitive resource, motivation, self-regulation, strategy, and achievements. Motivation plays an important role in learning achievements. It is also a variable influenced by the educational context, subject area, and task (Linnenbrink-Garcia, Patall, & Pekrun, 2016). Motivation can deactivate negative emotions. On the contrary, negative emotions, including test anxiety, can prevent failure by increasing intrinsic motivation and the effort. The relationship between emotions and motivation is ambivalent and complex (Hembree, 1988; Pekrun et al., 2007).
The goal of this research is to understand relationships between students’ emotions, motivation, cognitive–metacognitive strategies, behavior, and learning performance in the context of blended learning in higher education. Relevant predictors are to obtain a model to forecast students’ performance and identify those who are at risk of failing the course. In the next section, the authors provide a description of relevant literature. Later, the method and results are presented. Finally, the discussion, conclusions, and further topics for investigation are considered.
Related Works
Blended Learning on LMS
Blended learning combines online and face-to-face instruction (Rooney, 2003; Young, 2002). Thus, this method emphasizes the central role of computer-based technologies. Face-to-face learning environments rely on human–human interaction. Distributed learning environments place an emphasis on learner–material interactions (Graham, 2005). Learner–material interactions can easily be tracked by means of LMS. Thus, several distance learning research studies have focused on these interactions. On the other hand, emotions, motivations, and cognitive–metacognitive strategies have not been considered.
Most universities around the world have adopted LMS (such as Blackboard and Moodle) in blended courses to facilitate content distribution and monitor students’ learning progress (You, 2015). Also, the teacher can create a learning sequence (Hirumi, 2002). This means that the sequence in which students are invited to review the course material modules is suggested but not required (Graham, 2005). Other advantages include the cultivation of students’ self-discipline (Harding, Kaczynski, & Wood, 2005) and the enhancement of students’ motivation in comparison with traditional face-to-face classes (Woltering, Herrler, Spitzer, & Spreckelsen, 2009).
Emotions, Motivation, and Regulation
Emotion is a feeling directed toward a real or unreal person, thought, or situation (Pons, Rosnay, & Cuisinier, 2011). Matthews (1997) stated that emotion may be conceptualized as a dependent variable (influenced by processes such as appraisal) or an independent variable (influencing information processing and cognition).
Research shows the importance of emotions in educational settings. Pekrun (2006) and Pekrun et al. (2007) identified a set of emotions related to educational activities and outcomes. These emotions (or achievement emotions) are the rationale of the control-value theory. Achievement emotions are organized in a three-dimensional taxonomy (Pekrun, Goetz, Titz, & Perry, 2002): (a) valence (positive or negative), (b) object focus (activity or outcome), and (c) level of activation implied (activating or deactivating). According to the control-value theory, achievement emotions are instigated by the appraisal of both control and value. For example, a student will enjoy a topic if they are interested in linear regression and feel capable of handling the work. However, the student will be frustrated if the valued topic has obstacles without a sufficient control.
Activating positive emotions (e.g., task enjoyment) preserves cognitive resources and promotes the interest and intrinsic motivation. Conversely, deactivating positive emotions (e.g., relaxation) reduces task attention and leads to superficial information processing. Activating negative emotions (e.g., test anxiety) reduces cognitive resources available for task purpose and intrinsic motivation. On the other hand, they can produce a strong motivation. Deactivating negative emotions reduces cognitive resources and both extrinsic and intrinsic motivation (Pekrun & Stephens, 2010). This complex relationship between positive and negative emotions may explain the low correlation between test anxiety and academic achievement (Hembree, 1988; Ma, 1999). It allows us to speculate about the circular causal relationship between emotion and cognition (Kim, Park, & Cozart, 2014; Pekrun, 2006; Pons et al., 2011).
Traditional approaches to the analysis of online learning have been conducted for the purpose of explaining emotional ambivalence (Baker, D’Mello, Rodrigo, & Graesser, 2010; S. D’Mello, Lehman, Pekrun, & Graesser, 2014). For example, frustration was found to be less associated with poor learning. Also, confusion can be beneficial to learning if appropriately regulated and resolved. When monitoring emotions in different computer-based environments, several studies found that poor learning-related negative emotions (i.e., frustration, boredom, and confusion) less frequently impact performance (Baker et al., 2010; Craig, Graesser, Sullins, & Gholson, 2004; S. K. D’Mello, Craig, Witherspoon, McDaniel, & Graesser, 2008). Students in cognitive disequilibrium (i.e., incongruity, dissonance, and internal conflict) can create opportunities for the deep learning of difficult content (S. D’Mello et al., 2014). However, if they fail to restore equilibrium, this state triggers frustration and eventual boredom (S. D’Mello & Graesser, 2012; S. D’Mello et al., 2014; Lehman, D’Mello, & Graesser, 2012).
Motivation is a teleological process whereby goal-directed activity is instigated and sustained (Schunk, Pintrich, & Meece, 2008). Academic motivation is determined by both social factors and learners’ cognition (Anderman & Dawson, 2011). Cognitive emotion theories (e.g., control-value theory) state that emotions are related to motivation, learning strategies, cognitive strategies, self-regulation, and academic achievement (Pekrun et al., 2002). In addition, emotions are closely and reciprocally linked to motivational antecedents and their emotional effects (Pekrun, 2006).
Linnenbrink (2007) and Pekrun, Elliot, and Maier (2006) discussed two theoretical models that integrate emotions and motivation. Linnenbrink’s (2007) model depicts a relation between motivation and engagement mediated by emotions. It supports the hypothesis that a reduction in unpleasant emotions for motivated students may help to explain higher levels of learning. The model proposed by Pekrun et al. (2006) suggested that mastery goals are positively related to hope. It also noted that pride was negatively related to boredom and anger. While performance-approach goals were positive predictors of pride, performance-avoidance goals were positive predictors of anxiety, hopelessness, and shame. The theoretical relations of this model were extended to include academic performance. Empirical evidence shows that performance-approach goals were positive predictors of performance, and performance-avoidance goals were negative predictors of performance (Pekrun, Elliot, & Maier, 2009).
Several online and face-to-face empirical studies examined emotions, motivation, and academic outcomes. For example, Kim et al. (2014) found that motivation and emotions explain approximately 37% of the variance of students’ final score. Similarly, Elias, Mustaf, Roslan, and Noah (2011); Stegers-Jager, Cohen-Schotanus, and Themmen (2012); Mega, Ronconi, and De Beni (2014); and Kim, Park, Cozart, and Lee (2015) found that motivational variables explain approximately 25% of students’ performances. González, Fernández, and Paoloni (2017) found that hope and anxiety are mediators between motivation and learning performance. Hope was found to be a positive predictor of the latter.
The influence of metacognitive self-regulation on performance has been studied in both online and face-to-face courses. Kim et al. (2015) detailed a significant difference in emotion, motivation, and metacognitive self-regulation between students with low and high performance. Kramarski and Gutman (2006) and Azevedo, Moos, Greene, Winters, and Cromley (2008) found that metacognitive self-regulation is closely related to students’ achievements. On the other hand, Lynch (2010); Al-Harthy, Was, and Isaacson (2010); and Cho and Heron (2015) found different levels of correlation (modest correlation, low correlation, and noncorrelation, respectively) between organization, metacognitive self-regulation, and each student’s final grade.
Data Mining to Forecast Learning Performance
Educational data mining is an interdisciplinary research area dealing with the development of method to explore data originating in an educational context (Peña-Ayala, 2014; Romero & Ventura, 2010). Educational data mining is useful in student modeling, particularly in the prediction of performance. The forecast can be carried out by a regression data mining task (numerical/continuous variable) or classification task (discrete/categorical variable; Romero & Ventura, 2010).
Data mining techniques have advantages over statistical techniques, including linear and logistic regression (LR). Neural networks (NNs) and decision trees (DTs), unlike statistical techniques, address data sets with a mixture of numerical and discrete attributes. The data to be analyzed with statistical techniques must satisfy several assumptions, including normality, nonmulticollinearity, linearity, and homoscedasticity. Some of these assumptions are often violated, and the fit of the model is decreased or adjusted. On the other hand, data mining techniques are less susceptible to the effect of violations. Moreover, most data mining techniques deal with missing values or are used to replace it (e.g., expectation maximization) and enhance regression models (Horton & Kleinman, 2007).
There has been an increasing interest in data mining in online learning. The subject has been considered in several articles, most of which deal with the construction of models based on behavioral indicators extracted from LMS logs to predict students’ performance. Cerezo, Sánchez-Santillán, Paule-Ruiz, and Núñez (2016) found that three out of six variables are strongly related to students’ final marks: (a) time spent on practical tasks, (b) words posted in forums, and (c) delays in the completion of tasks. Similarly, You (2016) modeled self-regulated behavior using total viewing time, number of sessions, and number of late submissions as behavioral indicators. You (2016) found that the regression model explained 58% of the variance in the overall score for the course. The models for predicting students’ success or failure in a course are based on data extracted from online forums (Romero, López, Luna, & Ventura, 2013), activities (i.e., quiz views, assignment views, etc.; Lara, Lizcano, Martínez, Pazos, and Riera, 2014), and the historical grades of activities (Burgos et al., 2018).
In the context of blended courses, Zacharis (2015) used 29 variables extracted from LMS and found that four were good predictors: (a) the number of messages read and posted, (b) content creation, (c) the number of quizzes viewed, and (d) the number of files viewed. The predictors explained 50% of the variance in the course grade. The binary classification model (base on those predictors) correctly classified 81.3% of students as having failed or not failed the course. On the other hand, other factors have not been considered in these models, including emotion, motivation, cognitive–metacognitive strategies, and behavior. Traver, Volchok, Bidjerano, and Shea (2014) established that demographic, status, social presence, teaching presence, cognitive presence, and learning presence variables have not been correlated with dropout from or completion of a blended course. In face-to-face courses, socioeconomic, social, and family variables, along with historical grades, are used to build classification models. The model with an overall accuracy of 97% was selected (Marquez-Vera, Morales, & Soto, 2013).
This research was intended as an attempt to understand the relation between students’ motivation, cognitive–metacognitive strategies, behavior, and learning performance in the context of blended courses in higher education. Thus, the following research questions are posed:
What is the relationship between emotions, motivation, cognitive–metacognitive strategies, behavior, and learning performance in blended learning? Which variables (ranging from emotions and motivation to cognitive–metacognitive strategies and behavior) are most effective for the construction of the best model for classifying student learning performance?
Method
Participants and Research Context
The data sets used in this research were gathered from a blended course in applied computing in biological sciences. The course enrolled 137 undergraduate university students. Included in a chemical biology degree program, it combined online and face-to-face sessions. The online sessions contained learning materials and learning activities delivered throughout the semester via the Moodle platform. The face-to-face sessions were intended for student feedback and deep explanations of learning activities. Thus, communication via forums and chats rarely took place.
A weekly online session was scheduled. Thus, the delivery of learning activities was provided the same week through Moodle. The students received feedback for all uploaded learning activities. The interactions between the students and Moodle were recorded in a log file. It provided information about the name, the time of each online session, the date of delivery of an activity, and the interaction with learning materials. The learning materials were designed to comply with the shareable content object reference model (SCORM) standard (Advanced Distributed Learning, 2004). Therefore, the completion of a given material and the number of topics reviewed could be tracked by Moodle. The learning materials or SCORM objects were organized in a hierarchical structure of topics and subtopics. Students were required to browse all topics and subtopics to consider a given learning material as reviewed.
Students needed to log into Moodle each time to review current session or previous session learning materials. They were not permitted to download learning materials or browse offline. The course contained three offline examinations, learning activities, and an end-term project. These were considered when computing each student’s overall grade. Hence, the overall grade represented a measure of learning performance.
Instruments
Students’ emotions and behaviors were assessed using adaptation of the questionnaire (Skinner, Furrer, Marchand, & Kindermann, 2008). It measured student engagement and disaffection in school (SEDS; Wellborn, 1991). It was administered at the end of the second face-to-face session and after the midterm and end-term examinations. Twelve items referring to negative emotions (i.e., boredom, anxiety, frustration, etc.) measured emotional disaffection. Similarly, five items relating to positive emotions (or behavioral engagement) measured emotional engagement, and positive behaviors (i.e., attention, effort, etc.) were measured by five items. Finally, the behavioral disaffection was measured by five items.
Students’ answers were based on a 5-point Likert scale of agreement where 1 meant strongly agree and 5 meant strongly disagree. The emotional disaffection questions involved negative emotions. Therefore, low emotional disaffection values meant that they experienced negative emotions. For example, a 1-point answer to the question “when we work on something in class, I feel bored” meant that a student experienced boredom. The reliability of scores on these subscales ranged from .871 to .96. The results of the three instruments administered throughout the semester computed the average of the four subscales.
The motivated strategies for learning questionnaire (MSLQ) assessed motivation (i.e., self-efficacy, test anxiety, etc.) and cognitive–metacognitive strategies (i.e., organization, metacognitive self-regulation, etc.; Pintrich, 1991; Pintrich & de Groot, 1990). The questions in this survey were answered using a Likert scale ranging from 1 not at all true of me to 7 very true of me. The instrument was administered midsemester. The reliability of scores on self-efficacy, test anxiety, metacognitive self-regulation, and organization ranged from .739 to .896.
Finally, Moodle’s log file computed the number of delivered learning activities, the number of learning materials reviewed, time spent interacting with Moodle, the number of partially viewed learning materials, and the number of missed learning activities.
Data Gathering and Data Analysis
The data used in the current study were collected from the second semester of 2016 scholastic year. At the end of the semester, the logs of students’ interactions with Moodle, results of SEDS and MSLQ, and overall grade of each student were used to build a data mining view with 19 variables, including 3 from Moodle, 4 from SEDS, 11 from MSLQ, and the overall grade. The data analysis process is illustrated in Figure 1.
Data analysis process.
First, a correlation analysis was conducted to select the variables that correlated highly with the overall grade of each student. Then, hierarchical regression analysis was performed. This analysis had two goals: (a) to explain the relationship between emotions, motivation, cognitive–metacognitive strategies, behavior, and learning performance in blended learning and (b) to identify significant predictors (see Figure 1). Thus, from a data mining point of view, the hierarchical regression analysis acted as a feature selection task to improve the classification performance (Fayyad, Piatetsky-Shapiro, & Smyth, 1996). Finally, several classification models were built based on the full (F) data mining view (19 variables) and reduced (R) data mining view (6 variables selected by hierarchical regression analysis) in conjunction with the naïve Bayes (NB), NN, LR, and DT techniques implemented in Waikato Environment for Knowledge Analysis (WEKA; Hall et al., 2009) as NB, multilayer perceptron, logistic, and J48, respectively. In building classification models, an overall grade of less than six was considered as failing the course.
Classification Models Based on Two Views Using Naïve Bayes, Neural Network, Logistic Regression, and Decision Tree Techniques.
Results
Correlation Analysis
Descriptive Statistics of LMS Variables and Subscales of SEDS and MSLQ.
Note. LMS = learning management systems; SEDS = student engagement and disaffection in school; MSLQ = motivated strategies for learning questionnaire.
Correlation Analysis of LMS Variables and Subscales of SEDS and MSLQ.
Note. LMS = learning management systems; SEDS = student engagement and disaffection in school; MSLQ = motivated strategies for learning questionnaire. 1 = Missing learning activities; 2 = Partially viewed learning materials; 3 = Behavioral engagement; 4 = Emotional engagement; 5 = Behavioral disaffection; 6 = Emotional disaffection; 7 = Spent time (s); 8 = Extrinsic goal orientation; 9 = Intrinsic goal orientation; 10 = Task value; 11 = Self-efficacy; 12 = Test anxiety; 13 = Control of learning beliefs; 14 = Metacognitive self-regulation; 15 = Rehearsal; 16 = Elaboration; 17 = Organization; 18 = Critical thinking; 19 = Overall grade.
*p < .05. **p < .01.
The condition index, variance inflation factor, and tolerance were checked among the predictors to determine the existence of multicollinearity issues. This analysis shows multicollinearity issues between returned and missing learning activities and fully viewed and partially viewed learning materials. Thus, returned learning activities and learning materials are not included in the hierarchical regression analysis.
Hierarchical Regression Analyses
To perform the correlation analysis, those variables with a significant correlation with the student’s overall grade were selected (see Table 3). First, the missing learning activities variable was entered in Step 1. Extrinsic goal orientation, intrinsic goal orientation, task value, self-efficacy, test anxiety, and control of learning beliefs were entered in Step 2. Emotional disaffection was added in Step 3. Metacognitive self-regulation, rehearsal, elaboration, organization, and critical thinking were added in Step 4. Finally, behavioral disaffection was entered in Step 5.
The analysis strategy was based on the following considerations. First, the learning activities represent a percentage of the overall grade of each student. Second, emotions are often considered a result of motivational factors despite their bidirectional influence (Kim et al., 2014; Marchand & Gutierrez, 2012; Pekrun, 2006; Pekrun et al., 2002). Third, metacognitive self-regulation influences organization, elaboration, and so forth (Al-Harthy et al., 2010; Cho & Heron, 2015).
Summary of Multiple Regression Analysis for Variables Predicting Students’ Overall Grade (n = 137).
*p < .05. **p < .01.
Multicollinearity is presented with conditions index values above 20 and variance proportions values above 0.90 (Meyers, Gamst, & Guarino, 2006). It is indicated for tolerance values less than 0.1 and variance inflation factor values above 10 (Meyers et al., 2006). The condition index of the fourth model predictors ranged from 1 to 17.6; variance proportion values associated with each predictor ranged from 0.001 to 0.87. Similarly, tolerance values ranged from 0.145 to 0.77. Variance inflation factor values ranged from 1.29 to 6.91. The results suggest that multicollinearity is not presented in the final model.
In brief, the missing learning activities, test anxiety, and emotional disaffection (or negative emotions) negatively affected students’ overall grades. Noticeably, the overall grade of a student who experiments a high level of emotional disaffection (low Likert value) will decrease. Thus, β value of emotional disaffection is positive (see Table 4). The SEDS instrument was collected after the midterm and end-term examinations. This fact may have affected students’ answers. One-way ANOVA was performed among emotional disaffection measures collected at the end of the second face-to-face session (M = 2.30, SD = 0.82) and after the midterm (M = 2.36, SD = 0.92) and end-term examinations (M = 2.21, SD = 0.84). The result shows that the effect of the examinations on emotional disaffection measures was not significant (F = 0.874, p = .418). As one might have expected, an increase in the number of nondelivered learning activities decreases the students’ overall scores. On the other hand, self-efficacy, organization, and metacognitive self-regulation positively correlated with students’ performance.
Data Mining Models
Descriptive Statistics of the Classification Models (M, SD).
Note. NB = naïve Bayes; NN = neural network; LR = logistic regression; DT = decision tree.
The Scheirer–Ray–Hare test results were evidence that the effects of data mining view (H = 7.050, p = 7.92E-03), data mining technique (H = 55.098, p = 6.54E-12), and Data Mining View × Data Mining Technique (H = 21.690, 7.57E-05) are significant (Scheirer et al., 1976). Thus, the models that relied on the R data mining view outperformed the classification accuracy of those models that relied on the F view. Finally, pairwise comparisons among data mining techniques were performed by means of Mann–Whitney–Wilcoxon, with Bonferroni correction being used to adjust the significance level (Rovai, Baker, & Ponton, 2013).
Pairwise Comparison Among Data Mining Techniques.
Note. NB = naïve Bayes; NN = neural network; LR = logistic regression; DT = decision tree.
*p < .05.
Discussion
Relationship Between Emotions, Motivation, Cognitive–Metacognitive Strategies, Behavior, and Learning Performance
In this report, the authors provided a comprehensive identification of the relationship between emotions, motivation, behavior, and cognitive–metacognitive strategies, as well as their effects on students’ performance. Negative emotions (or emotional disaffection) were reported by the participants in this research. Although emotional ambivalence was reported (Baker et al., 2010; S. D’Mello et al., 2014), the presented results show that each students’ overall grade was negatively affected by both negative emotions and test anxiety. Also, positive emotions (or emotional engagement) were not significantly correlated with overall grade. Thus, they were not included in the hierarchical regression analysis. This suggests that negative emotions play a more important role than that of positive ones.
Recent efforts by several researchers in the field of online learning have demonstrated that motivation plays an important role in learning performance. The results show that self-efficacy (motivational factor) positively affect the overall grade. On the other hand, extrinsic goal orientation, intrinsic goal orientation, task value, and control of learning beliefs are not significant predictors of learning performance. In this research, emotions were considered a result of motivational factors despite their bidirectional influences (Kim et al., 2014; Marchand & Gutierrez, 2012; Pekrun, 2006; Pekrun et al., 2002). The correlation analysis shows that motivation (extrinsic goal orientation, intrinsic goal orientation, task value, self-efficacy, test anxiety, and control of learning beliefs) is significant and negatively correlated with emotional disaffection. Thus, motivation may reduce negative emotions. Similarly, motivation may increase the use of cognitive and metacognitive strategies.
Differences between students with low and high performances in metacognitive self-regulation have been found in previous studies. Contrary findings of other investigations have yielded modest to low correlation or noncorrelation between metacognitive self-regulation and students’ overall grades. It suggests complex relationships. The results of the authors show that metacognitive self-regulation and organization significantly correlate with students’ performance and play an important role in blended learning. These results support the findings of previous research studies (Kim et al., 2015).
In this study, Moodle logs were analyzed to extract indicators of students’ behaviors. The variable measure of whether learning activities had been missed (missing learning activities) was included in the model to explain 19% of the variation in the overall grade (Zacharis, 2015). Other studies revealed that several LMS indicators were highly correlated with learning performance (Cerezo et al., 2016; Lara et al., 2014; Romero et al., 2013; You, 2016; Zacharis, 2015). Yet, the variables of partially viewed learning materials and time spent were excluded from the model at the first step of the hierarchical regression analysis. Finally, students’ self-reported behaviors (behavioral engagement and behavioral disaffection) were not good predictors of learning performance. In blended learning, these results suggest that negative emotions, motivation, and cognitive–metacognitive factors are more important than LMS indicators of students’ behaviors.
Classification Models
Several classification models were compared based on two data mining views and four classification techniques. The Scheirer–Ray–Hare test shows a significant difference between R and F views. The average of R view is higher than F view. Thus, R view enhances the classification performance of DT and LR (see Table 5). This evidence shows that the hierarchical regression analysis is a valuable feature selection task to improve classification rate. A feature selection task aims to select attributes with a high correlation value with the class (in this case the learning performance) and a low correlation value among them. In the hierarchical regression analysis, these goals are accomplished by correlation analysis and eliminating multicollinearity issues.
The averages of DT data mining technique outperform the others (see Table 5). The interaction Data Mining View × Data Mining Technique (H = 21.690, 7.57E-05) was significant; thus, the best model is RDT. It was built by the DT technique and regression model predictors. The model correctly classified the pass/fail status of 96% of students. This indicates the adequacy of using data regarding missed learning activities, along with self-efficacy, test anxiety, emotional disaffection, organization, and metacognitive self-regulation to discriminatively predict students’ performance.
Limitations of the Present Study
The correlation analysis shows that relationships between motivation and cognitive–metacognitive strategies and motivation and missing learning activities were significant. However, the causation cannot be established by this analysis. Thus, structural equation modeling is appropriate. The hierarchical regression analysis identifies direct relations from the predictors to overall grade. To avoid multicollinearity, the predictors with low correlation are preferable. Variables that play a mediator role may be neglected.
Conclusion and Future Work
In this article, empirical evidence supports the hypothesis that emotions, motivation, cognitive–metacognitive strategies, and missed learning activities (as an indicator of student behavior) predict students’ performance. Self-reported behaviors (time spent on LMS and partially viewed learning materials) are unrelated to students’ overall grades. Thus, empirical evidence suggests that emotions, motivation, and cognitive–metacognitive strategies in blended learning are closely correlated with student performance.
These findings, as well as the need to identify students at risk of failing the course, encouraged the building of a data mining model to correctly classify 96% of the students. This model paves the way for enhanced LMS use in blended learning by monitoring emotions, motivation, and cognitive–metacognitive strategies to forecast the learning performance.
The present research shows the importance of emotion, motivation, and cognitive–metacognitive strategies in predicting students’ learning performance. However, the causal relationships among them have been undervalued. The outcome of investigating more complex relationships in the context of blended learning in higher education remains an open question. For example, the authors plan to examine the relationship between emotions, motivation, and personality. The impacts of a learning environment (face-to-face, blended, and online) on emotions, motivation, and cognitive–metacognitive strategies will be evaluated.
Footnotes
Acknowledgments
The authors thank the referees for comments that greatly improved the article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was partially supported by grants numbers 20170922 and 20171422 from SIP programs at Instituto Politécnico Nacional.
