Abstract
Compared with coarse-grained forecasting, fine-grained tourism demand forecasting is a more challenging task, but research on this issue is very scarce. To address this issue, a decomposition ensemble deep learning model is proposed by integrating CEEMDAN, CNNs, LSTM networks, and AR models. The CEEMDAN can decompose complex tourism demand data into multiple components with simpler characteristics, thereby reducing the complexity of forecasting. The CNNs and LSTM networks can fully capture the locally recurring patterns and the long-term dependencies of the components obtained by CEEMDAN. The AR model can capture the scale of tourism demand data, which can overcome the problem that the output scale of the deep neural networks (i.e., CNNs and LSTM networks) is not sensitive to the scale of the inputs. The effectiveness of the proposed model is verified by comparing with five benchmark models using real-time data on tourist volumes at two attractions.
Introduction
Tourism products are inherently perishable, and this makes accurate forecasting of the demand for these products a necessary part of business operations (Jiao and Chen, 2019; Li, et al., 2022; Zhang, et al., 2020). Consequently, many tourism demand forecasting models have been proposed (Song et al., 2019). These models are discussed in detail in the next section. It should be noted that most of these produce relatively coarse-grained forecasts for large regions. Such forecasts are required for the formulation of macro policies (Li et al., 2020). However, smaller areas (e.g., tourist attractions) generally require high-frequency and fine-grained forecasts (e.g., daily, hourly, and every minute) (Zheng et al., 2021). Such forecasts allow managers to formulate customized operational strategies, such as emergency plans, real-time crowd management strategies, staffing arrangements, discounts and special packages (Bi et al., 2020). For example, if the forecasts suggest that there will be a peak in the number of tourists at a particular time, then the manager can, for instance, adjust the staffing arrangements; or, if the demand is expected to be low, then managers can increase tourist numbers by offering discounts (Bi et al., 2020). However, studies on fine-grained tourism demand forecasting are still very scarce.
Compared with coarse-grained forecasting, fine-grained tourism demand forecasting is more challenging, for two reasons. First, the exogenous variables commonly used in coarse-grained forecasting may not be applicable in fine-grained forecasting, or the relevant data may not be readily obtainable. For example, search engine data is commonly used in coarse-grained forecasting and has been empirically verified to be an effective prediction variable (Sun et al., 2019; Li et al., 2018). However, the minimum granularity of search engine data is days, and it cannot be used to produce forecasts with a smaller granularity (e.g., hours) (Bi et al., 2020; Li et al., 2017; Pan and Yang, 2017). Additionally, the data of fine-grained tourism demand has more complex characteristics, such as non-linearity, non-stationarity, high volatility, time variability, etc. These two reasons make most of the existing tourism demand forecasting models unable to produce accurate fine-grained forecasts.
To be able to use data with the above-mentioned complex characteristics, one approach is to decompose the non-stationary tourism demand data with high volatility into several components with low volatility through the data decomposition method (Li et al., 2017). Compared with the original tourism demand data, each component obtained by decomposition has more simple characteristics (Zhang et al., 2021). By integrating the forecasting results of each component, the final (overall) forecasting results can be obtained. A variety of methods can be used to decompose tourism demand data, and these are discussed in detail in the next section. Among these, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) can accurately reconstruct tourism demand data (Xie et al., 2020), thereby effectively reducing the complexity of fine-grained tourism demand data, and thus has the potential to improve the forecasting accuracy. Therefore, the CEEMDAN is used to decompose the fine-grained tourism demand data to reduce their complexity in this study.
To produce accurate fine-grained forecasts, it is necessary not only to reduce the complexity of tourism demand data, but also to capture the locally recurring patterns (e.g., the sensitivity and fluctuation of tourism demand) and the long-term dependencies (e.g., the seasonality and periodicity) of the data at the same time. Convolutional neural networks (CNNs) and long short-term memory (LSTM) networks are two kinds of deep learning models that have good performance in data feature extraction, among which CNNs are very suitable for capturing locally recurring patterns of data (Bi et al., 2021; Wang and Li, 2022), while LSTM networks are well suited to capturing the long-term dependencies within data (Bi et al., 2022; Law et al., 2019). Therefore, CNNs and LSTM networks are powerful tools to capture the locally recurring patterns and the long-term dependencies of the components obtained by CEEMDAN, respectively.
Unfortunately, due to the non-linear characteristics of CNNs and LSTM networks, one of their main shortcomings in forecasting is that the scale of their outputs is not sensitive to the scale of the inputs (Bi et al., 2021; Weytjens et al., 2021; Höpken et al., 2021). The actual fine-grained tourism demand data usually constantly changes in a non-periodic way. This will greatly reduce the forecasting accuracy of CNNs and LSTM networks, especially in multi-step forward forecasting (Lai et al., 2018). Nevertheless, some linear models (such as autoregressive and autoregressive integrated moving average models) can well capture the scale of the input data (i.e., the overall trend of tourism demand) (Li et al., 2020). Therefore, if a linear model and the deep neural networks (i.e., CNNs and LSTM networks) can be organically integrated, then the above shortcomings of the deep neural networks can be compensated and their forecasting accuracy improved. Thus, if CEEMDAN can be used to decompose tourism demand data into multiple components that are more suitable for use in forecasting, a linear model, CNNs, and LSTM are respectively used to capture the scale, the locally recurring patterns, and the long-term dependencies of those components, and if these results can be effectively integrated, then more accurate forecasts can be expected.
To make fine-grained demand forecasting, we proposed a decomposition ensemble deep learning model, which is composed of four modules: module 1 decomposes the data; module 2 captures non-linear patterns; module 3 captures linear patterns; and module 4 integrates the results. Module 1 uses CEEMDAN to decompose tourism demand time series data into multiple components. In module 2, both the locally recurring patterns and the local dependencies between components are captured by CNNs, and the long-term dependencies of the components are captured by LSTM networks. In module 3, the linear pattern (or skeleton) of each component is captured using the autoregressive (AR) model. In module 4, through the optimization and integration of the results obtained in module 2 and module 3, the final forecast is produced. The proposed model is applied to forecast real-time tourist demand of two tourist attractions (the Ming Tombs and the Temple of Heaven) in Beijing, China. The results show that the performance of our model is significantly better than the selected benchmarks in different forecasting horizons.
Literature review
The present study concerns the forecasting of tourism demand through the use of a deep learning model that analyzes data that has been decomposed into components that are more amenable to forecast. Accordingly, the literature on decomposition methods and that on models for tourism forecasting tourism demand is reviewed below.
Decomposition methods for tourism demand forecasting
The divide-and-conquer strategy is the core idea of using decomposition methods for tourism demand forecasting. Specifically, the time series data is first decomposed into several components with simple characteristics. Each component is used separately to produce a forecast, and then the forecasts produced by all of the components are integrated to obtain the final (overall) forecast. The use of decomposition methods in tourism demand forecasting not only reduces the complexity of the model but also significantly increases accuracy. Therefore, these methods have recently gained attention (Xie et al., 2020). The main decomposition methods used for tourism demand forecasting include spectral analysis, wavelet decomposition, Fourier decomposition, the structural time series model (STS), singular spectrum analysis (SSA) and empirical mode decomposition (EMD). The STS, the SSA and the EMD family are more often used than wavelet decomposition and Fourier decomposition (Chen et al., 2012). A brief literature review concerning these three kinds of models is given below.
Structural time series (STS) model
The basic idea of the STS models is to decompose the tourism demand time series into four components that are regarded as random variables, that is, trend, season, cycle, and irregular components (Chen et al., 2019). The final tourism demand forecast is obtained by integrating the separate forecasts produced from these components. For example, Song et al. (2011) proposed a new model for forecasting tourist arrivals based on the time-varying parameter regression and the STS model. By introducing a novel data restacking technique, Chen et al. (2019) proposed a multiseries structural time series method to forecast seasonal tourism demand. In the proposed method, the quarterly tourism demand sequence is divided into four components, each of which represents the demand in a specific quarter of each year, then a multiseries structure time series model is built by repackaging the obtained components. The empirical evidence on inbound tourism demand forecasting in Hong Kong shows that, compared with traditional models, the proposed method can increase the accuracy of seasonal tourism demand forecasts.
The STS models is very suitable for dealing with the stochastic seasonality in tourism demand by decomposing the time series into trends, seasons, cycles, and irregular components. The obtained trends, seasons, and cycles all have direct interpretations, which can be allowed to vary with time and explanatory variables are included wherever possible (Greenidge, 2001). However, the STS model assumes that the disturbances of the measurement and transition equations are mutually independent, which implies a multiple source of error for these equations (Chen et al., 2019). Additionally, the STS model requires relatively strict statistical tests on the original data and the decomposed components.
Singular spectrum analysis (SSA)
SSA is a powerful method for analyzing nonlinear tourism demand time series data (Silva et al., 2019). It constructs a trajectory matrix according to the observed tourism demand time series. Then, by decomposing and reconstructing the trajectory matrix, the signals representing different components of the original time series can be extracted, such as long-term trend signals, periodic signals, and noise signals. Based on these obtained signals, the structure of tourism demand time series can be analyzed, and tourism demand can be forecast. For example, Hassani et al. (2015) used SSA to analyze tourist arrivals in the United States over the period 1996–2012 and found that the SSA can significantly improve tourism demand forecasting compared with ARIMA, exponential smoothing, and neural networks. Silva et al. (2019) proposed a denoised neural network for tourism demand forecasting based on SSA and an automated neural network autoregressive algorithm. Its effectiveness was verified in forecasting the international tourism demand for 10 European countries.
The SSA models are very suitable for processing tourism demand data with strong nonlinear characteristics through filtering the noise and forecast the tourism demand signal (Hassani et al., 2015). However, the SSA models also have some limitations when they are applied in tourism demand forecasting. For example, when the SSA is used to process the tourism demand data with great fluctuations, the decomposed components may contain a lot of noise, and may undergo over-decomposition phenomenon; The filtering effect and calculation efficiency of SSA are seriously affected by the embedding dimension of trajectory matrix (Duan and Liao, 2022); The embedding dimension of each iteration is determined by the empirical formula, which may lead to the problem of mode mixing and over decomposition (Mao et al., 2020).
Empirical mode decomposition (EMD) family
EMD is very suitable for nonlinear and non-stationary tourism demand data analysis. It can adaptively represent the local characteristics of the given data (Chen et al., 2012). Using EMD, tourism demand data can be decomposed into a small number of intrinsic mode functions (IMFs). These IMFs have simpler frequency components and stronger correlations, which makes it easier to produce a more accurate forecast. Compared with STS and SSA, the EMD family relaxes the strict criteria (e.g., regarding stationarity) that have to be met for the use of the traditional methods (e.g., STS and SSA), and for this reason it has been widely employed in tourism demand forecasting in recent years. For example, to obtain accurate tourism demand forecasts, Chen et al. (2012) proposed a method through EMD and neural networks and found that it outperformed single neural networks without EMD and traditional ARIMA in forecasting the numbers of international visitors to Taiwan. Lin, Chen, and Liao (2018) proposed a tourism demand forecasting method that integrates EMD and neural networks, and the effectiveness of the proposed method was verified through forecasting tourism demand at Jiuzhaigou.
Although EMD increases forecasting accuracy in most cases, the IMFs obtained by EMD may be mixed and superimposed. To overcome this problem, ensemble EMD (EEMD) and complete ensemble EMD (CEEMD) can be used to decompose the time-frequency space of the original time series into different scale components by adding Gaussian white noise. In recent years, EEMD and CEEMD have been adopted in tourism demand forecasting. For example, Zhang et al. (2017) proposed a novel method for forecasting the daily occupancy of hotels by combining EEMD and ARIMA. Using the daily occupancy data of an individual hotel, the new method was compared with ARIMA and it was found to be effective.
Although EEMD and CEEMD can be used to overcome the mode-mixing problem associated with EMD, the Gaussian white noise that has been added to make the process work is difficult to eliminate from the final forecasts. To this end, the CEEMDAN algorithm can be used, which adaptively adds white noise to the EEMD process. CEEMDAN has recently been used in tourism demand forecasting and has been shown to be more effective than EMD and EEMD. For example, Xie et al. (2021) proposed a decomposition ensemble approach based on Elman neural networks and CEEMDAN. The proposed method outperforms the benchmark models in terms of both point and interval forecasts with respect to different prediction horizons for tourism demand in Hong Kong.
Although some scholars have begun to develop forecasting models through CEEMDAN, there have been few such studies. More importantly, the existing studies mainly focus on coarse-grained forecasting and ignore many factors that need to be considered in fine-grained forecasting, such as the locally recurring patterns of the components, the local dependencies between components, and the long-term dependencies of the components. Therefore, fine-grained forecasting methods based on CEEMDAN still need further research.
Models for forecasting tourism demand
A variety of forecasting models have been proposed for tourism forecasting tourism demand (Li et al., 2021; Song et al., 2019). These can be mainly divided into four categories, that is, (1) time series models, (2) econometric models, (3) artificial intelligence models, and (4) hybrid models.
Time series models
Time series models are widely applied in non-causal time series tourism demand forecasting (Athanasopoulos et al., 2011). Among these, the naïve model, the simple autoregressive model, and the exponential smoothing model are the commonly used benchmarks in studies of tourism demand forecasting. The autoregressive integrated moving average model (ARIMA) and its improved versions (e.g., SARIMA and ARARMA) are more commonly used than these models and can achieve good forecasting results in most cases (Goh and Law, 2002). In addition to these models, some other more advanced time series models are also applied in tourism demand forecasting, such as generalized autoregressive conditional heteroskedastic models (Chan et al., 2005), structural time series models (Chen et al., 2019), and state space models (Beneki et al., 2012).
Econometric models
Time series models ignore the key determinants of tourism demand, which may mean that they do not take account of some of the information that is nevertheless important for decision-making (Wong et al., 2007). Such key determinants have, though, been used in econometric models, including the vector autoregressive model (Cao et al., 2017), the autoregressive distributed lag model (Song et al., 2003), the error correction model (Wong et al., 2007), the time-varying parameter model (Song and Wong, 2003), the autoregressive distributed lag model (Li et al., 2011), the almost ideal demand system (Saayman et al., 2018), and the structural equation model (Ko and Stewart, 2002).
Artificial intelligence models
Compared with the above models, artificial intelligence models have some unique advantages, for example, they have strong nonlinear fitting ability and do not need to make assumptions about data distribution (Kon and Turner, 2005). They have been increasingly used in tourism demand forecasting in recent years (Sun et al., 2019), in the form of, for example, support vector machines, artificial neural networks, and deep learning models. When there is sufficient training data, artificial intelligence models can often achieve higher forecasting accuracy than time series models and econometric models (Law et al., 2019). However, they provide little theoretical insight into the associations between tourism demand and its key determinants.
Hybrid models
All three of the above types of models are usually employed on their own. Some scholars have tried to develop hybrid models to forecast tourism demand, but there have been few such studies. Hybrid models combine two or more different types of model to forecast tourism demand, and this allows them simultaneously to take advantage of the different strengths of the individual models, and thereby produce better forecasts.
For example, Chen (2011) combined a linear and a non-linear model to forecast Taiwan’s outbound tourism demand. The hybrid model significantly outperformed the benchmark models in terms of the mean absolute percentage error and the normalized mean square error. Bi et al. (2021) proposed a hybrid deep learning model for forecasting tourism demand with time series imaging, where CNNs and LSTM networks were adopted to capture the features from the images encoded through tourism demand and the temporal dependence of extracted features, respectively. The validity of their model was verified through multiple experiments.
Forecasting model
In this section, a fine-grained tourism demand forecasting model is proposed. The framework of the proposed model is shown in Figure 1. It is composed of four modules: module 1 decomposes the data; module 2 captures non-linear patterns; module 3 captures linear patterns; and module 4 integrates the results. The framework of the proposed model.
Module 1. Data decomposition
Let (1) Calculate the first IMF, (2) Calculate the first residue (3) Decompose (5) Repeat the above process until the residual sequence can no longer be decomposed; that is, when the number of extreme points of the residual signal is at most 2, the algorithm terminates. At this time, K IMFs have been obtained, and the final result of the residual sequence can be determined by equation (6):
Through the above process, the original tourism demand time series data,
Module 2. Non-linear pattern capture
After the data has been decomposed in module 1, the non-linear patterns embedded in the original tourism time data
Capture of the locally recurring patterns and local dependencies between components
If the obtained decomposition result
Capture of the long-term dependencies
The obtained
Let
According to the obtained
Module 3. Linear pattern capture
Although CNNs and LSTM networks have very strong nonlinear fitting capabilities, there is an important deficiency when using them for actual prediction, that is, the scale of their outputs is not sensitive to the scale of the inputs (Bi et al., 2021; Weytjens et al., 2021; Höpken et al., 2021). It should be noted that in actual tourism demand data, especially the high-frequency data, the scale of data usually constantly changes in a non-periodic manner. For data with such characteristics, if the LSTM and CNNs are directly adopted to make predictions without paying attention to the scale of the data, the forecasting accuracy may be lower. To address this deficiency, tourism demand data is considered to have both non-linear patterns (the non-linear part of the data, containing recurring patterns) and linear patterns (the scale of the data). As mentioned above, the non-linear patterns can be captured well by CNNs and LSTM networks. For the linear patterns, the classical autoregressive (AR) model is adopted in this study.
Let
Module 4. Model integration
Given the forecasts from the non-linear patterns,
Based on the obtained
To determine the parameters in the model, the optimization objective function needs to be constructed according to the difference between the predicted results
To solve the above optimization problem, the stochastic gradient decent optimization strategy is adopted in this study. The training samples are fed into the model, and then a trained predictor with actual predictive ability can be obtained. On this basis, the final tourism demand can be forecast.
Experimental study
The data used in the experimental study is first introduced in Section. Then, the experimental setup is explained in Section. Finally, the experimental results are given in Section.
Data
This study takes two famous tourist attractions (the Ming Tombs and the Temple of Heaven) in Beijing, China, as examples to verify the effectiveness of the proposed model. These two attractions are world cultural heritages, 5A-level attractions in China, and national key cultural relics protection units, attracting a large number of tourists from all over the world every year. In addition, the tourist volume of these two attractions fluctuates greatly within a day, which brings great difficulties and challenges to the forecast of the tourist volume of these two attractions. Therefore, the two selected attractions are highly representative and very suitable for verifying the performance of the proposed model. The real-time tourist volume data of the two attractions from November 27, 2018 to January 20, 2020, were collected from the Beijing tourism website (http://www.visitbeijing.com.cn/). This website records the number of real-time visitors to the tourist attractions every 15 minutes. Therefore, there are 38,846 observations in the collected data for each tourist attraction, as shown in Figures 2 and 3. The vertical axis represents the actual number of tourists, and the horizontal axis represents the timestamp, where the first point corresponds to the first observation on November 27, 2018, and the last point corresponds to the last observation on January 20, 2020. Thus, each point in each figure represents the actual number of tourists in the tourist attraction at the corresponding time. The real-time tourist volume of the Ming Tombs from November 27, 2018, to January 20, 2020. The real-time tourist volume of the Temple of Heaven from November 27, 2018, to January 20, 2020.

According to equations (1)–(6), the tourist volume of the two attractions can be decomposed into several IMFs and a residual sequence. The results are shown in Figures 4 and 5, where the horizontal axis of each sub-figure represents the timestamp, and the vertical axis of each sub-figure represents the decomposed results corresponding to the original tourist volume (i.e., the IMFs and residual sequence). It can be seen that the decomposition results for the Ming Tombs have 14 IMFs and a residual, and for the Temple of Heaven have 15 IMFs and a residual. Decomposition results of tourist volume data in the Ming Tombs. Decomposition results of tourist volume data in the Temple of Heaven.

Experimental setup
There are four aspects to the experimental setup: (1) data set partition, (2) forecasting horizons, (3) benchmark models, and (4) performance measures. A brief description of each is given below.
Data set partition
The tourist volume at each attraction has to be divided into three parts, that is, a training set, a validation set, and a test set. The ratio of these three data sets is 8:1:1. In other words, the first 31,076 observations are selected as the training set, the following 3885 observations are selected as the validation set, and the last 3885 observations are selected as the test set.
Forecasting horizons
For fine-grained tourism demand forecasting, one-step-ahead forecasting and multi-step-ahead forecasting are equally important. For example, managers of tourist attractions not only need to know the tourist volume in the next hour, but also the tourist volume at multiple time points in the future (e.g., in the next 3, 6, and 9 hours). Therefore, for fine-grained forecasts, to verify the effectiveness of the proposed model, it is necessary to compare not only the performance of different models in the case of h = 1 (1 hour’s time), but also the performance in the case of h > 1. Ideally, it would be better to verify the model performance for every value of h within a certain range. However, due to space constraints, it is almost impossible to do so. Consequently, selecting several representative h values to verify the performance of the model is a good alternative. Following Bi et al. (2021), Zhang et al. (2020), and Zheng et al. (2021), the value of h was set to 3, 6, and 9 in multi-step-ahead forecasting in the experiment.
Benchmark models
Five commonly used tourism demand forecasting models are selected as the benchmarks: the naïve model, the ARIMA, the back propagation neural networks (BPNN), the support vector machine (SVM), and the LSTM networks.
Performance measures
To compare the forecasting performance of the models, the following three performance measures are used: the mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean square error (RMSE). These are calculated by equations (18)-(20), respectively:
Experimental results
The results from different models for the two tourist attractions with respect to different forecasting horizons (i.e., h = 1, 3, 6, and 9) are given below.
The results for h = 1
The absolute error and average absolute error of different models for the Ming Tombs and the Temple of Heaven in the case of h = 1 are, respectively, shown in Figures 6 and 7, where the horizontal axis of each sub-figure represents the timestamp, and the vertical axis of each sub-figure represents the forecasting error. The meanings of the horizontal and vertical axes in Figures 8–13 are similar to these two figures. It can be seen that the absolute errors of the five benchmark models fluctuate greatly, and the maximum absolute errors of the five benchmark models for the Ming Tombs and the Temple of Heaven are all greater than 3000 and 6000, respectively. In contrast, the proposed model has relatively small fluctuations in absolute errors, and the maximum absolute errors are less than 800 and 2000, respectively. Errors of different models concerning the Ming Tombs in the case of h = 1. Errors of different models concerning the Temple of Heaven in the case of h = 1. Errors of different models concerning the Ming Tombs in the case of h = 3. Errors of different models concerning the Temple of Heaven in the case of h = 3. Errors of different models concerning the Ming Tombs in the case of h = 6. Errors of different models concerning the Temple of Heaven in the case of h = 6. Errors of different models concerning the Ming Tombs in the case of h = 9. Errors of different models concerning the Temple of Heaven in the case of h = 9.







The forecasting results of the different models for the two tourist attractions in the case of h = 1.
The DM test results for the two tourist attractions in the case of h = 1.
Note: *** indicates statistical significance at the 1% level.
The results for h = 3
The absolute error and average absolute error of the different models for the two tourist attractions in the case of h = 3 are shown in Figures 8 and 9, respectively. It can be seen that the absolute errors of the five benchmark models in the case of h = 3 fluctuate more than in the case of h = 1. The maximum absolute errors of the five benchmark models for the Ming Tombs and the Temple of Heaven are all greater than 3500 and 7500, respectively. In contrast, the proposed method still has relatively small fluctuations in absolute errors. The maximum absolute errors of the proposed method for the two tourist attractions are less than 1200 and 2500, respectively.
The forecasting results of the different models for the two tourist attractions in the case of h = 3.
The DM test results for the two tourist attractions in the case of h = 3.
Note: *** indicates statistical significance at the 1% level.
The results for h = 6
The absolute error and average absolute error of the different models for the two tourist attractions in the case of h = 6 are shown in Figures 10 and 11, respectively. The absolute errors of the five benchmark models in the case of h = 6 fluctuate more than in the cases of h = 1 and 3. The maximum absolute errors of the five benchmark models for the Ming Tombs and the Temple of Heaven are all greater than 4000 and 8000, respectively. In contrast, the proposed method still has relatively small fluctuations in absolute errors. The maximum absolute errors of the proposed method for the two tourist attractions are less than 2000 and 4000, respectively.
The errors of the models for the two tourist attractions in the case of h = 6.
The DM test results for the two tourist attractions in the case of h = 6.
Note: *** indicates statistical significance at the 1% level.
The results for h = 9
The absolute error and average absolute error of the different models for the two tourist attractions in the case of h = 9 are shown in Figures 12 and 13, respectively. The proposed method outperforms the benchmark models in terms of the mean and variance of the forecasting error.
The forecasting results of the different models for the two tourist attractions in the case of h = 9.
The DM test results for the two tourist attractions in the case of h = 9.
Note: *** indicates statistical significance at the 1% level.
Comparison of the results for different forecasting horizons
For fine-grained tourism demand forecasting, multi-step-ahead forecasting is equally important for tourism-related decision-making as one-step-ahead forecasting. The actual fine-grained tourism demand data is usually constantly changing in a non-cyclical manner, which makes it difficult to accurately predict it, especially in the case of multi-step-ahead forecasting. To verify the advantages of the proposed model for multi-step-ahead forecasting, the results of different models with respect to different forecasting horizons are further compared. To be able to compare the results more intuitively, a ratio that reflects the change of forecasting error with respect to different forecasting horizons is defined as follows:
According to equation (21), the values of The values of The values of 

To further observe the difference of the forecasting results of the proposed model under different forecasting horizons, the forecasts of the proposed model for the two tourist attractions in terms of different values of h are given, as shown in Figures 16 and 17, respectively. It can be seen from these two figures that the difference between the forecasts of the proposed model in terms of different forecasting horizons is not obvious, which further supports the above conclusion. The forecasting results of the proposed model for Ming Tombs in terms of different values of h. The forecasting results of the proposed model for Temple of Heaven in terms of different values of h.

Conclusions
To increase the forecasting accuracy of fine-grained tourism demand, a novel decomposition ensemble deep learning model is proposed by integrating CEEMDAN, CNNs, LSTM networks, and AR models. The proposed model is composed of four modules: module 1 decomposes the data; module 2 captures non-linear patterns; module 3 captures linear patterns; and module 4 integrates the results. The first module decomposes complex tourism demand data into multiple components with simpler characteristics, thereby reducing the complexity of forecasting. In the second module, the CNNs and LSTM networks are used to capture the locally recurring patterns (e.g., the sensitivity and fluctuation of tourism demand) and the long-term dependencies (e.g., the seasonality and periodicity) of the components obtained by CEEMDAN, respectively. In this way, the advantages of CNNs and LSTM networks can be fully utilized. In the third module, an AR model is adopted to capture the scale of tourism demand data (i.e., the overall trend of tourism demand), which can be considered as the skeleton of the tourism demand data. In the final module, through the optimization and integration of the results obtained in module 2 and module 3, the final forecast is produced. In brief, by integrating CEEMDAN, CNNs, LSTM and AR models, the proposed model can well capture the overall changing trend, the seasonality and periodicity, and the local volatility of tourism demand. The proposed model is applied to forecast the real-time tourist volume at two famous tourist attractions (the Ming Tombs and the Temple of Heaven) in Beijing, China. The results show that the proposed model significantly outperforms the benchmark models over different forecasting horizons.
The theoretical contributions are as follows: (1) A novel forecasting framework is proposed. The proposed framework has good generalizability and expansibility, and can be used as a basic framework for forecasting the time series data with complex characteristics; (2) To fully capture the non-linear patterns in the components obtained by CEEMDAN, a hybrid method composed of CNNs and LSTM networks is proposed. In the method, the multiple components obtained by CEEMDAN are regarded as a two-dimensional image, and CNNs are used to capture the locally recurring patterns and local dependencies between components; (3) to overcome the problem that the output scale of the deep neural networks (i.e., CNNs and LSTM networks) is not sensitive to the scale of the inputs, a new idea of integrating an AR model and deep neural networks is proposed. The organic integration of an AR model and the deep neural networks gives full play to their advantages in forecasting.
Additionally, this study also provides several management implications. On the one hand, the tourism demand forecasting results are helpful for the government to formulate macro policies such as tourism planning, traffic route design, and release relevant early warning information. For example, the government or relevant departments can release the early warning information of tourist volume in advance according to the forecasting results to avoid the occurrence of adverse events such as tourist detention and stampede. On the other hand, relevant micro-management decisions in the tourism industry can also benefit enormously from accurate fine-grained forecasts of demand. For example, if the forecast suggests that the tourist volume will be very large, then the manager can arrange to have extra staff, formulate scientific real-time crowd management strategies, or managers may wish to make a public announcement regarding expected crowding at a particular time, to deter some visitors and keep numbers down. Conversely, if the forecast suggests that demand will be low, then managers can use discounts to increase numbers.
This study does have some limitations. First, compared with traditional tourism demand forecasting models, the computational complexity of the proposed model is high. However, with the advancement of computer hardware and computing technology, this limitation will gradually disappear. Second, some important parameters have a particularly large impact on the forecasting accuracy, and it will be important to research how to quantify these parameters quickly and accurately. Finally, only the historical data of tourism demand are used in the proposed model. In future research, other factors that may affect the fine-grained tourist demand (e.g., special events, holidays, weather, and specific access patterns of attractions) need to be considered.
Footnotes
Acknowledgments
This work was partly supported by the National Natural Science Foundation of China (project No. 72101124) and the Liberal Arts Development Fund of Nankai University (project No. ZX20210067).
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (project No. 72101124)and the Liberal Arts Development Fund of Nankai University (project No. ZX20210067).
