Abstract
Stock market volatility exhibits characteristics such as clustering and time-varying fluctuations. This paper proposes a two-stage method for addressing these concerns. The involved procedure is as follows: First, a fuzzy system is used to analyze clustering regimes according to the size of fluctuations. Second, the clustering regimes of Stage I are used to establish a support vector regression (SVR) model, which is used to reduce the time-varying complexity. However, the fuzzy-SVR model combines the parameters of membership functions and SVR models, further complicating the problem. Thus, this paper presents parallel research based on a genetic algorithm (GA) for estimating the parameters of the membership functions and SVR model. Data from four stock markets—the Taiwan Stock Exchange weighted stock index (Taiwan), the NASDAQ Composite index, the Hang Seng index (Hong Kong), and the Shanghai Composite index (Shanghai)—were analyzed in this study to illustrate the performance of the proposed model. According to the simulation results, the forecasting of out-of-sample volatility performance was significantly improved when the model accounted for the behavioral effect of both clustering and time-varying fluctuations.
Introduction
Volatility is a measure of risk in financial markets [22, 38] and is known to be a major financial tool for describing stock markets. Although the autoregressive conditional heteroscedasticity (ARCH) model, developed by Engle in 1982 [8], has been proven useful in several economic applications, some problems continue to affect its forecasting accuracy. Therefore, using the ARCH model is limited by the fixed lag structure of the conditional variance. Bollerslev used the autoregressive moving average to expand the ARCH model, creating the generalized autoregressive conditional heteroscedasticity (GARCH) model [4], which uses volatility clustering and the fat tail features of financial time series to estimate volatility.
Volatility clustering is a common feature of various financial series and refers to the phenomenon in which large changes tend to be followed by large changes and small changes tend to be followed by small changes. Based on the weighted average of the past squared residuals, the GARCH model features declining weights that never entirely achieve zero. Therefore, the leverage effect that frequently appears in financial markets decreases the forecasting accuracy of the GARCH model. The leverage effect indicates that volatility responds differently to positive and negative information, and negative returns of financial assets tend to result in greater volatility. Over the past few years, several studies have developed many improved versions of the GARCH model to resolve this problem; for example, the exponential GARCH (EGARCH) model was introduced by Nelson [34], the fuzzy switch GARCH model has been proposed in [2, 18], and the Glosten–Jagannathan–Runkle GARCH (GJR-GARCH) model was introduced by Glosten et al. [13]. These models can capture the leverage effect in financial markets; however, they do not simulate the time-varying fluctuations according to the volatility of stock markets effectively.
To overcome the time-varying characteristics of volatility, many studies have proposed using robust methods, such as neural networks and support vector regression (SVR), for financial forecasting [12, 42]. Various volatility forecasting methods are combined through simple averaging based on the artificial neural network by Xiao et al. [41]. The volatility of foreign exchange data are forecast using the support vector machine (SVM) proposed by Gavrishchaka and Ganguli [11]. A novel neural network technique with an SVM is used in financial time-series forecasting [37]. The Bayesian framework is combined with least squares SVMs in the nonlinear regression method by Gestel [12]. These methods can efficiently capture volatility under small fluctuations; however, they degrade capture volatility when the fluctuations are large.
SVR, a variation of the SVM proposed by Vapnik et al. [39], is a powerful machine-learning method based on statistical learning theory. The method involves a structural risk minimization principle, instead of the usual empirical risk, which aims at minimizing a bound on the generalization error of a model, rather than minimizing only the mean square error over the data set [16]. Thus, SVR is useful for constructing data-driven nonlinear empirical process models and is effective in forecasting the time-varying characteristics of volatility [42]; however, SVR loses its efficiency by overfitting when volatility fluctuates greatly. The SVR model describes random errors or noise instead of the underlying relationship. Therefore, this paper proposes a fuzzy-SVR model, in which a fuzzy system is used, for analyzing the clustering regimes on the basis of the size of fluctuations; the robust characteristics of SVR are combined to forecast the volatility of stock markets.
In recent decades, fuzzy systems have been extensively applied in a wide variety of industrial systems because of their model-free approach [1, 43] and the excellent approximation ability of fuzzy systems [10, 26]. In generally, fuzzy systems are universal approximators to uncertain nonlinear systems (i.e., they can approximate any behavior related to complexity dynamics within a predefined range of desired accuracy), such as nonlinear discrete-time systems with backlash [27] and multiinput multioutput nonlinear systems [28, 29] by an adaptive fuzzy controller to approach the desired control performance. The fuzzy systems combine the ease of implementation and the convenience of linear models with the ability to capture and approximate a wide range of functions. This paper proposes fuzzy systems as a judicious choice for analyzing the size of fluctuations that feature time-dependent variances. In the proposed method—the fuzzy-SVR model—SVR models and fuzzy systems are combined and applied to forecast the volatility of stock markets. The process of optimizing the parameters of fuzzy systems and SVR models is highly complex and nonlinear. Therefore, a genetic algorithm (GA)-based parameter estimation algorithm is proposed for deriving the optimal solution for the fuzzy-SVR model.
The GA is a parallel search method for obtaining the global optimal solution of complex optimization problems that emulate natural genetic operations such as reproduction, crossover, and mutation [7, 14]. The GA applies operations to a population of binary strings that represent potential solutions. At each generation, the GA explores different areas of potential solutions and then directs the search to the region in which a high probability of determining improved performance exists. Because the GA simultaneously evaluates many points in a parameter space, it can ultimately converge on the global solution. In particular, the algorithm can iterate several times on each data point. Accordingly, it is suitable for addressing the parameter problem of the fuzzy-SVR model.
The remainder of this paper is organized as follows. Section 2 describes the SVR and fuzzy-SVR models. Section 3 presents the GA-based optimization of the fuzzy-SVR model and an adaptive forecasting algorithm. The experimental results that illustrate the effectiveness of the proposed method are explained in Section 4. Finally, Section 5 presents the conclusion.
Fuzzy-SVR model
The empirical and theoretical show the fluctuations of finance market volatility is asymmetries, general falling stocks to returns give rise to higher volatility than do equivalent rising stocks to returns [13]. Moreover, the volatility clustering characteristic that large changes tend to follow large changes and small changes tend the follow small changes [13]. In this study, we used the SVR model to capture the clustering characteristic of volatility. The SVR model involves obtaining widespread acceptance in data-driven nonlinear modeling applications [15]. The SVR model is an extraction of the training data that serves as a support vector and therefore represents a stable characteristic of the data. Moreover, it entails a structural risk minimization principle—instead of the usual empirical risk—that aims at minimizing a bound on the generalization error of a model, rather than minimizing only the mean square error over the data set [30]. Thus, SVR is useful for forecasting time-varying clustering data [37]. To consider the differential effects of the propagation of volatility caused by a rising or falling stock market, we applied fuzzy systems to address the fluctuation asymmetries by falling/rising stock to returns. The general expert information can represent the fuzzy terms (e.g., low, high); this representation may be used for convenience because of a lack of more precise knowledge. In this study, we used the IF-THEN rules of fuzzy systems to appropriately simulate the fluctuation asymmetries of the stock market. The following subsections describe the proposed method indetail.
SVR model
Consider an in-sample data set {
Equation (2) is a quadratic optimization problem with inequality constraints; according to the Karush–Kuhn–Tucker optimality conditions [39], the SVR training procedure amounts can be used to solve the convex quadratic problem by using the Lagrange multiplier. Thus, Equation (2) can be translated into a dual problem as follows:
Fuzzy systems are universal approximators [26, 43] that can approximate the behavior of a system in which analytic functions or numerical relations do not exist. The empirical and theoretical appeal of the SVR model is due to it minimizing the regression error based on the structural risk minimization principle and capturing the small fluctuations of time-varying data. However, the model uses the insensitive error tolerance that contains a fixed and symmetrical margin, thus failing to accommodate the large fluctuations and sign asymmetries: Negative shocks to returns give rise to higher volatility than do equivalent positive shocks to returns. Ignoring this fact can lead to poor prognostic characteristics. According to Fama [9], stock market volatility exhibits the property that large changes tend to follow large changes and small changes tend to follow small changes. To account for the differential effects of the fluctuation size of stock market volatility, this study incorporated fuzzy systems into the proposed SVR model. The resulting fuzzy model is described using the IF-THEN rules and is used to ensure that the SVR model appropriately addresses the problem of forecasting stock market volatility. The basic configuration of fuzzy systems consists of three components: a fuzzy rule base, a fuzzy inference engine, and a defuzzifier. The fuzzy rule base consists of a collection of IF-THEN rules [18, 43]. The lth rule of the fuzzy system for SVR is described as follows:
For the functional fuzzy system, defuzzification [18] can be written as
and
Thus, the following expression [18] is obtained:
According to Equations (5)–(11), the system output can be expressed as
A reasonable success criterion is the minimization of the mean square error, the objective function of which is defined as
GA-based fuzzy-SVR model
The GA is composed of probabilistic heuristic search processes based on natural genetic systems. It is highly parallel in searching for the global optimal solution of complex optimization problems [24]. The range in the daily return of the stock market is generally approximately –10% to 10% hence, the realized volatility in this paper is defined as , where r (i) is the daily return of the stock market price at time i. Therefore, the parameter range is clearly bounded by the fuzzy-SVR model, as shown in Table 1. We use a GA to solve the fuzzy-SVR problem in Equation (12). This method can explore several points in the search space simultaneously, thereby increasing the opportunity to locate new points in the search space with an expected improvement in performance.
The GA is composed of three operations: 1) selection, 2) crossover, and 3) mutation. These operations are implemented by performing the basic tasks of copying strings, exchanging portions of strings, and changing the state of bits from 1’s to 0’s or vice versa. These operations ensure that the “fittest” members of the population survive, and their information is preserved and combined to generate “fitter” offspring. The result is an improvement in the next generation’s performance. The GA is described in the following subsections.
Coding
The GA featuring a population of strings and binary coding was used in this study. According to the binary coding method, each parameter of the membership functions (m
lj
, δ
lj
) and SVR-determined parameters (C
l
, ɛ
l
, σ
l
) have their own string length, which consists of 0’s and 1’s. The choice of the bit number B
i
for each parameter depends on the desired resolution R
i
, which is calculated as
The degree of fitness depends on the performance of the possible solution represented by that particular string. In this study’s design problem, locating the minimum in Equation (13) is equivalent to obtaining a maximal fitness value by using the genetic search process. A chromosome that has a lower objection function should be assigned a higher fitness value. A simple linear relationship between a fitness function and an objective function is expressed as
Selection, based on the principle of survival of the fittest, is a process by which individual strings are copied and placed in a mating pool for further genetic operations consistent with their fitness value. The probability PR (j) of the jth string with a fitness value Fit (j) selected for mating and reproduction in the next generation is
Crossover is the primary exchange of information for a chromosome. This study used the one-point crossover method [7], which is conducted in three steps. First, the two newly reproduced strings are selected from the mating pool created through selection in the previous generation. Second, a position including the two strings is selected at random. The third step involves exchanging all the characters by following the crossing sit. The crossover operation occurs only with a probability p c (crossover probability). The choice of p c is known to critically affect the performance of a GA. In general, the value of p c ranges 0.5 to 1.0. When combined with selection, crossover is an effective means of exchanging information and combining various elements of high-quality solutions.
Mutation
Selection and crossover yield most of the processing power of GAs. However, mutation, the third operation, enhances the ability of GAs to search for the optimal solution. Mutation is the occasional flip of each bit at a particular string position with a low probability of a chromosome from 1 to 0, or vice versa. The mutation operation is used to change some elements in selected individuals with a probability p m (the mutation probability). In general, the p m should be used sparingly because it is a random search with a high mutation probability.
Volatility forecasting
Adaptive volatility forecasting
The power of a model in forecasting volatility is crucial because volatility is a measure of risk in financial markets. A common approach involves using in-sample data to construct a model, and then making one-step-ahead predictions to obtain the future solution [35, 45]. Realized volatility [31] was used to estimate the performance of forecasting volatility. For improved forecasting results, in this study, the recursive least squares (RLS) [35] error formula was used for forecasting volatility. First, the forecasting objective function is defined in this paper as
The RLS of the following formulas are expressed as
The mean square error measure of Equation (13) is used to derive the forecasting model in the in-sample data process; however, it cannot be used alone as a conclusive measure for comparing different forecasting models [38]. Therefore, various statistics have been used to compare forecast errors: these include the mean square forecast error (MSFE), mean absolute forecast error (MAFE), and mean percentage forecast error (MPFE).
According to the aforementioned analysis, the design procedure for the fuzzy-SVR model applied to forecast stock market volatility by using GA is divided into the following steps:
Figure 2 shows the pseudocode of the fuzzy-SVR model, and Fig. 3 shows a flowchart of the design procedure for the fuzzy-SVR model.
Simulation
The data consisted of daily closing values for four stock indices from January 1, 2010, to December 31, 2013. This study focused on the Taiwan Stock Exchange weighted stock index (Taiwan), the NASDAQ Composite index, the Hang Seng index (Hong Kong), and the Shanghai Composite index (Shanghai) to illustrate the performance of the proposed method. The results were compared with those of other models, including the GARCH [4], GJR-CARCH [13], EGARCH [34], dynamic evolving neural-fuzzy inference system [21], SVR based on the GA [5], and various forecasting methods with multilayer perceptron (Combing MP) [41]. All the methods were implemented in Matlab and evaluated on an Asus desktop PC with a 3.4 GHz i7-4770 CPU, RAM 16G, and Windows 8. The data were divided into two parts: the first half (January 1, 2010–December 31, 2012) of the sample comprised the in-sample data of the training set (N = 748, 762, 750, 716 of Taiwan, NASDAQ, Hong Kong, and Shanghai, respectively) and the second part (January 1, 2013–December 31, 2013) out-of-sample data of the test set (M = 240, 259, 243, 254 of Taiwan, NASDAQ, Hong Kong, and Shanghai, respectively). The GA optimizes parameters of the SVR model and proposed model. Practically, the generalization capability and accuracy of the SVR model and proposed model are determined according to searched problem parameters. In this study, the simulated results were obtained by averaging the results of 20 independent Monte Carlo (MC) runs to train the SVR model and proposed model for deriving a more reliable result [15]. For time-series forecasting, a cross-validation statistic was obtained using in-sample data to construct a model, and one-step-ahead predictions of out-of-sample data were then made to obtain the future solution [18].
Bollerslev [4] indicated that the GARCH (1,1) model accurately describes the volatility of financial data. Therefore, in this study, n = 1. Figure 4 presents histograms of the in-sample period volatility for the four financial markets, showing the concentrated clustering properties with few distribution regimes. The results indicated that volatility was concentrated in regimes with small fluctuations and few value distributions of the larger regime. The forecasting results according to various fuzzy rules are presented in Table 2. The results showed that L = 2 was more satisfactory than L = 3, but the Shanghai index has more distribution regimes than those of the other three financial markets. Therefore, the number of regimes is more than that of the other financial markets; the results are the same as those shown in Fig. 4. Moreover, the L = 3 and L = 2 performance is almost the same as that of the Shanghai index; thus, L = 2 was used. In general, the value of the daily stock return r (t) is between – 10% and 10%. Therefore, the upper and lower bounds of the parameters related to the fuzzy-SVR model are defined as
The GA parameters [7, 24] are defined as
Table 3 lists the parameter estimates associated with the Taiwan, NASQ, Hong Kong, and Shanghai indices for the proposed method. As shown in Table 3, the two SVR models are distinct. The empirical results show that the volatility of the four markets exhibits characteristics such as clustering and exhibits the leverage effect. Table 4 depicts the various forecast statistics of the models. The results indicate that the volatility forecast from using the asymmetric GARCH models (e.g., the GJR-GARCH and EGARCH models) is superior to that of the GARCH model in the Hong Kong and Shanghai indices. This is because the realized volatility copes with the residual powers of past influence and negative shocks to returns give rise to greater volatility than equivalent positive shocks to returns [32]. Furthermore, the empirical results indicate that the Hong Kong and Shanghai stock market data are asymmetric. The information in Table 4 shows that the performance of the Taiwan, NASDAQ, and Hong Kong markets by SVR based on the GA is superior to that of the others models except for the proposed method because of the constructing data-driven capturing time-varying characteristics and the training model expending considerable time. The characteristic of the SVR is to minimize the generalization error bound for achieving generalized performance, rather than minimizing only the mean square error over the data set [30]. The Shanghai realized volatility distribution regimes is more than those of the other three financial markets. Therefore, the based on the GA performance in MAFE and MSFE of Shanghai is worse than the persistence of volatility models (GARCH, GJR-GARCH, EGARCH) but the MPFE performance is almost better than the persistence of volatility model. The characteristic of SVR is to minimize the generalization error bound for achieving generalized performance, rather than minimizing only the mean square error over the data set [37]. In addition, the MPFE performance of the proposed method is superior to that of other models and the performance of the MAFE and the MSFE was more satisfactory than that of the other models, despite the training model expending considerable time. The proposed model led to an improvement exceeding 30% in the forecasting performance of the MPFE compared with that of the other models under is not taken training time. Therefore, both clustering and adaptive methods were used to capture time-varying data.
The convergence of the objective function E f by using a GA-based estimator corresponding to a single run is shown in Fig. 5. Note that the cost functions of the GA-based estimator have exponential and rapid convergence at the beginning of generation and converge within eight generations. Figure 6 illustrates the volatility forecasting for the four markets according to the proposed method. The model involved using fuzzy rules to generate two SVR models and an adaptive method to forecast volatility. Therefore, the model could capture market volatility clustering and time-varying characteristics but not large change fluctuations, particularly for the part of Shanghai. Figure 7 shows the forecasting volatility and zoomed part for the Taiwan market obtained using the proposed method and the GA-based SVR model. According to the zoomed part, the proposed method could capture irregular behavior and fluctuations more effectively than the GA-based SVR model could, especially the fluctuations that rapidly change. The empirical results of this study indicated that the stock market data consist of clustering and time-varying fluctuations. The proposed fuzzy-SVR model outperformed the other models, as shown by the various rules concerning the forecast errors.
Volatility exhibits clustering and time-varying characteristics; moreover, many complex factors influence volatility. Therefore, this paper proposes a fuzzy-SVR artificial intelligence method. The predictive abilities of the volatility models were examined by comparing the forecast volatility measures defined in Equation (23). The proposed method appeared to be advantageous when we attempted to model both clustering and time-varying volatility. A GA-based design method was used to estimate parameters for the fuzzy-SVR model, and an adaptive algorithm was employed to forecast the volatility of various financial markets. The simulation results indicate that the proposed method afforded substantial improvements in forecasting performance. Moreover, the GA simultaneously evaluates many points in the search space. Therefore, specifying initial conditions is unnecessary for achieving improved results. The main disadvantage of the proposed method is that it cannot effectively capture large fluctuations of market volatility. The persistence models (GARCH, GJR-GARCH, EGARCH) involving conditional mean and conditional variance are frequently used to investigate the volatility in financial time series. They demonstrate the ability to capture the persistence of volatility but do not effectively capture clustering and time-varying characteristics. Future research problems entail designing a multifuzzy-SVR model or combining fuzzy rules to switch the persistence models or the SVR model to manage the varying fluctuation effects of forecasting volatility. Moreover, the optimal of the design model will is complexity problem and many local optimal may exist. In a future study, we will vary the meta-heuristics algorithm to evaluate the performance.
