Abstract
We propose a heuristic learning method forecasting future performance of stock market indices based on high-order fuzzy-trend jump rules generated from historical training data. Firstly, the training time series (TSs) are fuzzified by equal intervals referencing to the whole mean differences of historical training data. Then, it generates the groups of nth-order fuzzy logical relationships (FLRs). With the knowledge of the generated relationship groups, it summarizes the probability of the jumps of the nth-order “down”, “equal” and “up” trend rules, respectively. Finally, it performs the forecasting based on the nth-order FLRs and the probabilities of their corresponding jump rules. To evaluate the outcome of the presented model with the performances of the others, we use the presented model to predict the Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) dataset. The outcomes show that the presented model outperforms the other models using single factor and point-wise one-step ahead forecasts. Moreover, it is easily to realize by software computing without artificial participation and can be extended to deal with multiple years of dataset. We use this model to predict Shanghai Stock Exchange Composite Index (SHSECI) as well to analyze its effectiveness and universality.
Keywords
Introduction
Since Song and Chissom [17] originally presented the fuzzy modeling for time series (TSs) [17–19], many fuzzy forecasting models, approaches and procedures have been presented based on fuzzy time series (FTSs) [28–30]. These models are introduced to forecast the enrollment at Alabama University [25, 26], project cost [4], electricity load demand [20] and stock market behavior [8, 24]. Because of the dynamic and complex environment [3, 33], stock market prediction has been regarded as the most challenging application of time series prediction [16]. Authors developed models to approximately forecast the uncertainty of the future trend by analyzing the change of corresponding stock indicates over a time-window [15]. In order to provide accurate forecasts for stock market indices, some authors combine fuzzy time series with heuristic optimization methods [12, 35], and other authors combine neural networks and heuristic procedures to forecast stock market time series [21].
Concerning the early stages of FTSs methods, the most significant points are (1) the determination of the universe of discourses and suitable intervals to fuzzify TSs into FTSs, (2) the construction of fuzzy logical relationships (FLRs), (3) the defuzzification to get crisp number of the future. According to the first step, researchers usually observe the min-max values of the TSs and determine the universe of discourse and the interval-lengths. Existing fuzzy forecasting models can be divided into two categories-fixed interval-lengths [11, 34] and variable interval-lengths [5, 14]. All of these methods require each data be in the scope of universe of discourse. Based on the second step of existing models, they can be divided into first-order fuzzy logical relationships (FLRs) [6, 32], high-order FLRs [14], and two-factors high-order FLRs [24], etc. According to the third step, most of researchers employ the interval-lengths to forecast the crisp number [23, 24]. Although existing forecasting models can perform the future stock market indicates, however, the forecasting error rates of current approaches are pretty high. Therefore, to develop the accurate forecasts, Rubio et al. (2017) proposed a mixed method to determine the defuzzification number [8]. Other researchers propose hybrid approaches for fuzzy time series [7, 24]. These new methods introduce covariates in the FTS analysis and enhance the stability of the time series. However, the determination of covariate requires more extensive research.
This paper introduces a new heuristic learning model based on nth-order FLRs and jump rules for FTSs. Firstly, real numbers of training TSs are fuzzified into FTSs and high-order FLRs are summarized. Then, it analyzes the FLRs to obtain their fuzzy-trend and jump rules. It summarizes the probabilities of jump rules by a point-wise heuristic learning procedure. Finally, it performs the forecasting based on the current high-order fuzzy-trend and its corresponding probabilities of jump rules. To test the utility of this method, we apply it to forecast the TAIEX and compare the forecasting accuracy with existing methods. We also apply it to SHSECI to illustrate its universality.
The rest of this paper is described as the followings: Section 2 introduces basic ideas of fuzzy sets (FSs) [10], fuzzy time series (FTSs) [17–19], fuzzy logical relationships (FLRs), fuzzy-trend and jump rules. Section 3 describes a new method of forecasting with high-order fuzzy-trend jump rules. In Section 4, the presented method is used to forecast the stock market using TAIEX and SHSECI datasets, respectively. Conclusions are assembled in Section 5.
Preliminaries
Song and Chissom [17–19] originally proposed the theories of FTSs. This part we will briefly look for some related concepts of FTSs.
Groups of fuzzy-trend logical relationships for n = 4
This paper introduces a new model based on FTSs and high-order fuzzy-trend jump rules. To compare with other researchers’ work [1, 32], this section uses authentic Taiwan Stock Exchange(TAIEX 1999) to represent the forecasting procedure. All data for the first ten months of year 1999 are treated as training dataset and the rest data for the last two months as testing dataset. The initial steps are as follows.
In our proposal, we define U = [0, Xmax + U0], Xmax is the maximum value of original datasets and U0 is a positive real number such that enables the universe cover the possible value of the historical data. The lengths of each adjacent intervals len are equal and determined by the whole mean of the training data.
Suppose x is a real number in the training dataset and x ∈ u
i
, where
trunc(y) means the integral part of y and u i = [len × i, len × (i + 1)), the real number x is fuzzified to D i . For example, let len = 80 and x = 6052.45, according to Equation (2), i = 77, then x is fuzzified to D77. By this way, the historical TSs can be fuzzified to FTSs.
For each fuzzy set in the fuzzy time series of training dataset H (t), suppose its fuzzy-trend belongs to group p
t
and the jump is s
t
according to Def.3 and Def. 4. The jump rules for the high-order fuzzy-trend groups can be computed as follows:
The pseudo-code of the rule generation algorithm is shown in Fig. 1(taking n = 4 for example).

Pseudo-code of rule generation algorithm.
Locate the left-hand sides (LHS) of the last obtained data point H(t-1) to obtain its nth-order fuzzy-trend group number. Then use the corresponding jump rule to perform the forecast as follows:
Many researches use TAIEX1999 as an example to evaluate their proposed forecasting methods [1, 32]. We will use TAIEX1999 as an example as well. Then we collect the stock market time series datasets of SHSECI (Shanghai Stock Exchange Composite Index) to analyze its effectiveness and universality.
Forecasting TAIEX
Let the order number n = 4, interval len = 80 (approximately equal to the whole mean of the training dataset). Based on the model and forecast steps described in Section III, we use TAIEX1999 as an example to illustrate the forecasting performance.
[Step 1] According to Equation (2), the actual time series are fuzzified into fuzzy time series shown in Table 2.
Actual dataset and the Fuzzified results of TAIEX1999
Actual dataset and the Fuzzified results of TAIEX1999
[Step 2] According to the fuzzified training data from January 5, 1999 to October 30, 1999 in Table 2, the fuzzy logical relationships, fuzzy-trend groups and the corresponding jumps can be generated, as shown in Table 3.
Logical relationships, fuzzy-trend groups and jumps of the training fuzzy datasets
Based on the fuzzy-trend groups and the corresponding jump rules generated from training data, we can perform the forecast. For example, the 4th-order fuzzy-trend group number of November 1, 1999 is (D97, D97, D97, D99), belongs to group 15 (see Table 1). From Table 3, the jump rule for group 15 is 0.625 (the jumps are –1, –1, 3, 0, 1, 0, 3, 0, respectively). Then according to Eq.(4), the forecasting value of the TAIEX on for November 1, 1999 is obtained as follows:
The other outcomes are shown in Table 4.
Predicting results from November 1,1999 to December 30, 1999
We can evaluate the forecasting performance by some common difference comparison methods. The following indicators are usually applied in the comparisons of TSs forecasting models, including mean squared error (MSE), root of the mean squared error (RMSE), mean absolute error (MAE), mean percentage error (MPE), etc. The definitions of these indicators are detailed by Equations (5–8):

The stock market fluctuation for TAIEX test dataset (1997–2005).
RMSEs of forecast errors for TAIEX 1997 to 2005
The following Table 6 describes the RMSEs of different methods when predicting TAIEX1999. Obviously, the accuracy of presented method outperforms many existing methods. The result of Chen & Chen’s method [13] is slight better than the proposed method in this paper. However, Chen & Chen’s method employed complex discretization partitioning method and extra adjustment deviation methods. The method presented in this paper is more easily to be automatically carried out by a computer.
A comparison of RMSEs for different methods for forecasting TAIEX1999
In this paper we use SHSECI (Shanghai Stock Exchange Composite Index) as our second example. The following section are composed of using the presented method to forecast the SHSECI from the year of 2001 till 2015 during which in each year the actual datasets from January to October are training data and the rests of the year are the testing data. The forecasting results and the RMSEs of forecast errors are shown in Fig. 3 and Table 7, respectively.

The stock market fluctuation for SHSECI test dataset (2001–2015).
From Fig. 3 and Table 7, we can realize that the proposed method can successfully predict the stock market. The maximum RMSEs of forecast errors occurred when forecasting the November to December stock market in 2007 and 2008. In these two years, because of the significant changes in economic environment, the fluctuation rules of stock market from January to October are not the same as from November to December. For example, in 2007, the SHSECI kept rising from January to October and remained consolidation from November to December. For 2008, as everyone knows, the financial crisis caused the abnormal fluctuation of stock market and led to a more RMSE of forecast errors.
RMSEs of predict errors for SHSECI from 2001 to 2015
In this paper, we have described a novel method for the prediction of the TAIEX and SHSECI based on high-order fuzzy-trend and the corresponding jump rules. The jump rules forecasting the change of future are generated by heuristic learning from the high-order FLRs and their fuzzy-trend of the training FTSs. The experiment results show that the proposed method both outperforms the existing method and it is effective in stock market forecasting. In this paper, jump rules are generated from the stock indicate and the training datasets are always from January to October for each year. In the future, we will use all of the historical data to generate the jump rules. Also, we will consider the influences of surrounding stock markets and other factors of the stock market itself.
Footnotes
Acknowledgments
Thanks reviewers for providing so insightful comments and suggestions, which led to a better quality of this paper. This work is supported by the Fund of China Nation Tourism Administration (15TACK003), the Fund of Ministry of education of Humanities and Social Sciences (14YJAZH025), the Foundation program of Jiangsu University (16JDG005), and the Natural Science Foundation of Shandong Province (ZR2013GM003).
