Abstract
This paper focuses on improving the accurate prediction of tourist capacity, which is the key to solve the contradiction of tourism economy development and ecological environment protection. EMD-BP integrated predictive model is proposed and uses Empirical Model Decompose method to decompose time-series data of visitors into several TMF, then forecasting each component by BP neural network. In order to shown the effective of our method, an empirical study of Jiuzhaigou is conducted, and the average error rate of EMD-BP integrated prediction model is 8.8%, among which the error within 1% accounts for 31.3% of the total predicted amount, the error within 5% accounts for 41.7%, the error above 10% accounts for 30.2% in slack seasons and 69.8% in busy seasons.
Introduction
In the development of tourism, the predication of tourist capacity is of great importance. An accurate prediction of tourist capacity can greatly improve the quantitative level of tourism economy, which is helpful to the formulation of the development plans and policies in tourist areas as well as solving the problems existing in scenic spots (especially the damage to ecological environment caused by tourism in the development of tourism) [1].
The research on tourism demand prediction made by western scholar’s dates from the 1960s, and it experienced a rapid development in the 80s. Most of their research literature lays particular emphasis on tourism demand model and empirical analysis. Relevant research in China didn’t start until the late 20th century, the existing literature is mainly the theoretical introduction and discussion based on the research in the west, and empirical studies are quite few [2]. At present, the prediction methods of tourist capacity can be divided into two types: qualitative method and quantitative method, among which the major research method is the quantitative method. The quantitative prediction method can be further classified into four types, including non-causal time series model, causal measurement model, artificial intelligence model and combined model.
Time series model believes that the dynamic state observed from historical data may sustain for a period of time in the future, so we can recur future according to the past [3], thus saving data collection cost. Major time series models include moving average method, exponential smoothing method, autoregressive moving average method and seasonal autoregressive moving average method, among which the most widely used in tourist capacity prediction is Autoregressive Integrated Moving Average (ARIMA) [4–7]. However, there still exist disputes over whether ARIMA model is superior to other models. For example, Coh mentioned that ARIMA model was much better than other time series [8], and Smeral and Wuger made empirical analysis, which showed that SARIMA model or ARIMA model were not better than Native (no-change) [9]. Among the time series models, Generalized Autoregressive Conditional Heteroskedasticity (GARCH) is widely used in tourism demand prediction, such as Liang [10].
Relative to time series model, the superiority of measurement model is that it analyzes the causal relationship between tourist capacity and its influencing factors, which can provide reference to relevant measures and policies of administering authority of the scenic spots. However, traditional measurement models may experience spurious regression, so in order to break through this defect, Autoregressive Distributed Lag Model (ADLM), Error Correction Model (ECM), Vector Auto-regression Model (VAR) and Time Variable Parameter (TVP) Model appear successively and are widely used. For example, Fung-Thai [11], Koonnam [12] all use ECM to predict tourist demand; Tran [13], as well as Tsaur [14] use VAR model; in literature, the author also uses TVP model, and proves that TVP model is superior to other measurement models.
With the rise of artificial intelligence technology, artificial intelligence technology has also been introduced to the prediction of tourist capacity. It breaks through the limitation of linear prediction of the previous prediction method. In the middle and late 1990s, Rob Law introduces artificial neural network to the prediction area in order to predict the Japanese tourists’ tourism demand towards Hong Kong [15], which was widely used soon afterwards.
Various single prediction models have achieved relatively high prediction accuracy, but influenced by many factors, all single prediction methods usually can’t cover the overall and effective information of tourism demand, thus greatly reducing the accuracy of the prediction result [16]. Therefore, various combined prediction models become the research focus of many scholars. Through combination, the advantages of various single models are integrated, thus improving the effect of the prediction. For example, Liao et al. firstly used ARIMA combined model to predict the initial predicted value of Chinese inbound tourists, then used BP neural network to correct the error of the initial predicted value, thus improving the prediction accuracy [17] as well as Shao [18]. Wang Yong et al. used LOGISTIC model and regression iterative model to make combined prediction, thus improving the accuracy of the predicted result [19]. In order to break the limitation of quantitative method and improve the accuracy of prediction, some scholars combine the quantitative method with qualitative method, constituting the quantitative-qualitative combined model. But usually, in terms of the combination method, summing and weighting are made after the prediction with different prediction methods. However, there are many factors influencing tourist capacity and existing the feature of loud noise; in addition, various emergencies will also influence the result, so their relationship doesn’t just present a simple linear relation. Therefore, it exists limitations in the combination methods of the combined model, and its prediction accuracy also remains to be improved.
Accordingly, this thesis adopts the method of empirical model decomposition to decompose the time series of tourist capacity into multiple IMFs which can represent its internal features, and then makes use of BP neural network to predict the components.
EMD-BP network prediction model
EMD-BP integrated prediction idea
The basic idea about the prediction of tourist capacity in Jiuzhaigou based on EMD-BP integrated model is to make use of EMD to decompose the historical time series of tourist capacity in Jiuzhaigou into multiple IMFs which can represent its internal features, and then make use of BP neural network to predict the components [20].
The realization process based on EMD-BP integrated model prediction is shown in Fig. 1.

Flow chart based on EMD-BP integrated model prediction.
According to the EMD method mentioned above, the tourist capacity time series x (t) (t = 1, 2, …, n) is divided into n IMF, c i (t) (i = 1, 2, …, n) and a residual term r n (t).
Frequency grouping stage
The extracted intrinsic model functions are classified from high to low. The intrinsic model functions with high frequency mean longer learning process while those with low frequency mean shorter learning process. Therefore, in order to construct network prediction model with short cycle and high accuracy, the classification of time series is very important.
Network prediction stage
Based on the preparation in step 1 and step 2, BP neural network model is respectively constructed in very component to do training and prediction, and finally all prediction results are added linearly to get the final predicted value.
Empirical model decomposition
Empirical Model Decomposition (EMD) is put forward by a Chinese American Huang in 1998. Its main idea is to make use of Hilbert transform to select the non-linear and unstable data series until the final data series is stable. Every data series is called Intrinsic Model Function, IMF for short. The definition of IMF is as follows:
In the whole time series, the difference between the extreme value amount (the maximum and the minimum) and the zero crossing point amount is no more than 1;
In any area of the time series, the mean value of the upper and the lower envelope line is zero.
The decomposition course of EMD is as follows:
Step 1, work out all local minimum and maximum of the original times series x (t);
Step 2, work out the lower envelope line composed of all minimum and the upper envelope line composed of all maximum (the upper and the lower envelope line should cover all data), which is respectively denoted by u0 (t) and v0 (t), and their mean value is denoted by m1 (t); the difference of the signal x (t) and the mean value m1 (t) is h1 (t). Thus,
Step 3, judge whether h1 (t) can satisfy the definition of IMF above. If so, h1 (t) is IMF; otherwise, h1 (t) is denoted by the next iterative signal x (t), so
Step 4, repeat step 1 to step 3 until the K iteration finds the first IMF, which is denoted by c1 (t);
Step 5, the residual term r1 (t) = x (t) - c1 (t) is denoted as the new time series. Repeat step 1 to step 4 until we find the next IMF, which is denoted by c2 (t), when the remainder term is r2 (t) = r1 (t) - c2 (t). Repeat step 1 to step 5 until the residual term is less than the set threshold value, when the loop is ended. Finally, it is decomposed into nIMF and 1 residual term r
n
(t). The original series x (t) can be indicated as
From the above iterative process, it can be seen that the original sequence can be decomposed into the sum of n intrinsic model functions and a residual term. The decomposition process is shown at Fig. 2.

Decomposition process of EMD.
BP neural network is also called error back propagation network, which is currently the most widely used neural network [21]. BP neural network is composed of input layer, output layer and hidden layer (usually one layer, or multiple layers). BP neural network structure is shown in Fig. 3.

BP neural network structure.
Take the BP neural network structure in Fig. 3 as an example. The network is composed of N input layer nerve cells, K hidden layer nerve cells and M output layer nerve cells. Suppose O2 pm and O1 pk are respectively the output value of the output layer and the input layer, W2 km and W1 nk are respectively the connection weight from the hidden layer to the output layer and from the input layer to the hidden layer. Suppose the learning sample is x pn and the corresponding target output value is t pm .
The process of BP algorithm is shown at Fig. 4. The detailed procedure is shown as follows [22]:

BP Algorithm.
Step 1, initialize the weight. Suppose the learning rate is α, the error is ɛ, the maximum cycle index is max, and i = 0;
Step 2, feed forward calculation: Input x
p
(x
p
= { xp1, …, x
pn
}) into the network, calculate O2
pm
and O1
pk
according to the formula below:
The transfer function f usually uses sigmoid function.
Step 3, calculate the mean square error MSE. If MSE ≤ ɛ, the loop will end. Otherwise, it switches to step 4;
Step 4, reverse calculation: calculate the variable quantity of the weight as below:
In the formula:
Reset the weight:
Step 5, make i = i + 1, go back to step 2.
Though BP algorithm has been widely used, there are still some defects. For example, it needs lots of training samples, the learning efficiency is low and the rate of convergence is slow, the selection of parameters is sensitive and it is easy to get into local minimum value. BP neural network learning process includes BP network connection weight learning and network structure learning. Therefore, improvement can be done in those two aspects. Major improvement methods include simulated annealing algorithm, genetic algorithm, training sample normalization method, logical selection of initial weight value and threshold value as well as adjustment of network structure.
BP algorithm usually uses s shape function as the transfer function, whose range is [0,1]. The actual sample data may have great difference, making all training samples have no comparability. Therefore, in order to avoid the circumstance that the small value is “submerged” by the big one, normalization processing is adopted to normalize the sample input to the interval [0,1], and in the calculation process, use formula 19 to do data preprocessing:
In the formula, α, β are constant, xmin, xmax respectively denote the minimum value and the maximum value in each sample group; x
i
and

Procedure of improved BP algorithm.
Data source
Jiuzhaigou is a famous natural scenic spot in China and has attracted millions of tourists at home and abroad. Therefore, to have an accurate prediction of its daily tourist capacity can provide basic data for the administrators of Jiuzhaigou, help decision-makers make overall arrangement of proper manpower resources, offer optimal configuration of relevant supply of catering and hotels, as well as formulate the regulation scheme of tourist transportation in advance.
In this thesis, the author takes the daily tourist capacity of Jiuzhaigou as the prediction object, and there are 365 data samples (from January 1, 2010 to December 31, 2010) from the access control system of Jiuzhaigou. In the earlier stage, we’ve cleared up the ticket checking data of the access control system and got the data table 1 of daily tourist scale in 2010 including data from January, 2001 to December, 2007, which is shown in Fig. 6.

Time series data of daily tourist capacity of Jiuzhaigou in 2010.
To ensure the validity of the evaluation model, this thesis sets the error evaluation method of tourist capacity prediction and uses average deviation rate as the statistics of error evaluation.
In the formula,
In this thesis, the author firstly carries out EMD towards the original data. After decomposition, we get 11 intrinsic model functions and 1 residual term. Then, the three-layer neural network is used to construct the model so as to have an empirical prediction study of the tourist capacity in Jiuzhaigou. In the process of using EMD-BP prediction model, because of the huge number of 365 training sample data all year round, the number of neuron nodes in the hidden layer n2 is supposed to be as many as possible, so the thesis chooses, n2 = 150. Here the number of neuron nodes in the hidden layer is not fixed, which should be modified according to actual training. The number of neuron nodes in the input layer is n1 = 1, and the number of neuron nodes in the output layer is n3 = 1. This thesis adopts MATLAB test, the transfer functions both from the input layer to the interlayer and from the interlayer to the output layer are tangent functions, the learning rate is set as 0.5, training time 5000 and training target error 1 × 10-10.
To use MATLAB to do neural network training and get the output data after normalization processing, we should firstly restore the data of the output layer and get the restoration formula according to the normalization formula.
In the formula, Y is the output data of the restored neural network, y* is the output data of the neural network.
Figure 7 is the comparison diagram of the predicted value of tourist capacity by making use of EMD-BP prediction model. From it we can find that the curve of predicted value of tourist capacity by means of EMD-BP prediction model is very close to the actual value curve of tourist capacity. These two curves are almost coincident.

Comparison between EMD-BP predicted value and actual value.
In order to further verify the validity of this prediction method, this thesis compares EMD-BP prediction model with the simple BP neural network prediction model. Figure 8 is the comparison diagram between predicted value and actual value of neural network. Figure 9 is the comparison of the prediction error of these two methods.

Comparison between the predicted value and the actual value of neural network.

Comparison between EMD-BP prediction error and BP neural network prediction error.
According to BP neural network prediction and EMD-BP integrated prediction model prediction, the variation trend of the predicted value and the actual value are basically consistent. From the comparison in Figs. 8 and 9, it can be seen that: 1. The actual touristy capacity of Jiuzhaigou every year presents multimodality; 2. The peak value of the actual tourist number and the predicted tourist number are basically consistent; 3. The comparison diagram of the two shows that the prediction of the two in number is basically correct. Besides, in terms of EMD-BP integrated model prediction (i.e. using empirical model decomposition to process data and then use BP neural network prediction), the predicted value and the actual value of tourist capacity are closer. Through calculation, we can find that after the correction of the predicted value through the neural network model, the error becomes smaller significantly. Through calculation, the average error rate of EMD-BP integrated prediction model and the actual value is 8.8%, among which the error within 1% accounts for 31.3% of the total predicted amount, the error within 5% accounts for 41.7%, the error above 10% accounts for 30.2% in slack seasons and 69.8% in busy seasons. Bigger error mainly lies in busy tourist seasons, when the predicted value is basically smaller than the actual value, probably because that at that time, factors influencing tourist amount are more complex.
Scholars’ research on the prediction of tourist capacity both at home and abroad dates back to the 1960 s. In recent years, domestic and overseas scholars have done lots of research on the prediction of tourist capacity, but most of them lay more emphasis on the prediction of yearly tourist capacity and there are quite a few studies on daily tourist capacity prediction. In this thesis, by constructing EMD-BP integrated prediction model, taking daily tourist capacity of Jiuzhaigou as research object, the author carries out empirical analysis and reaches the following conclusion: compared with simple BP neural network, the integrated prediction model of BP neural network based on empirical model decomposition has better prediction effect, which can accelerate the rate of convergence and improve the prediction accuracy. However, there are still some defects in this thesis. It doesn’t consider the influence of emergencies on tourist capacity. Thus, in future studies, the author will try to apply techniques like text mining and consider the influence of emergencies so as to seek for a more vivid prediction method.
Footnotes
Acknowledgments
The humanistic and social science of Sichuan Education Department (Tourism Studies) project “Research on Sichuan Tourism Enterprises’ Competitiveness from the Perspective of Industry Integration” (LYC15-35); Social Science Planning Fund Program of Sichuan Province “Research on relationship embedded model between agricultural e-commerce enterprises and customer under the trend of industrial convergence (SC16C022)”.
