Abstract
The modeling and prediction of short-term traffic flow can reflect the prediction results of the traffic state and traffic flow data. In this paper, first, we use a high-dimensional tensor to represent the multi-mode characteristics of traffic flow data, and we make use of the basic operations properties of tensors, such as Tucker decomposition, to study the methods for filling in data, such as ITRM. Additionally, we preprocess the lost traffic flow and abnormal data. At the same time, we study the short-term traffic flow based on the “week-day-time” multi-mode of the traffic flow data. Using the grey model (GM (1, 1)) to predict the same period of the weekly mode, the scrolling grey model (SGM) of the same time period is predicted. For the time mode, a neural network time series of wavelet analysis is used to predict the traffic flow forecast during the same period. Then, the prediction results of the three different models are weighted by the grey correlation analysis method, and then, the coupling prediction model of the three models is obtained. In the end, according to the traffic flow data of the main road of Shaoshan road in Changsha, Hunan, China, we first preprocess the lost data by using the filling algorithm for the tensor data, and then, we make the traffic flow data complete, use the three tensor data modes of traffic flow, and analyze the results. The experimental results show that the coupling prediction model with the tensor model is much better than the single GM (1, 1) model, the SGM and the neural network prediction model.
Keywords
Introduction
The traffic control system, traffic management system and traffic guidance system are important parts of an Intelligent Transportation System (ITS). Traffic prediction based on real-time traffic information is the premise of real-time control and induction, and it is an important theoretical basis of an ITS. The effective method of studying short-term traffic flow modeling and prediction theory can reflect the prediction results of the traffic flow data. It is directly applied to the advanced traffic information system (ATIS) and the advanced traffic management system (ATMS), and it provides travelers with real-time and effective travel information, such as stroke data, expected delay and path information. This information helps travelers to better choose paths, to realize path induction, to save time, to alleviate short circuit congestion, to reduce pollution, to save energy and so on. Traffic flow data from time and space contains rich multi-mode features [1]. It is still a difficult problem to make full use of the multi-mode characteristics of the traffic data in a unified framework. A tensor is a multidimensional array. It is a generalization of vector and matrix models to multidimensional data. It can represent the characteristics of multi-modal traffic data. According to the different modes of traffic flow data over a week, day, time and space, a multidimensional tensor can be used to overcome the issue that vector data flow and matrix data flow are difficult to characterize in terms of multidimensional features [2]. The traffic flow tensor mode [3] is constructed, and the short-term traffic flow prediction theory and method under the tensor multi-mode framework can be studied, which means that the traffic data can be better excavated in over a week, day, time and space. The characteristic of having multiple modes serves to the benefit of short-term traffic flow prediction.
Most of the functions of the subsystems of intelligent transportation systems are based on the complete and available traffic data. However, due to the breakdown of the detection equipment, transmission equipment and so on, the original traffic data often have the phenomena of data loss and data anomalies, which brings adverse effects to the analysis of traffic data and the deep mining of the actual traffic information system. In addition, the lost and abnormal traffic data have substantial impacts on the traffic information system and other intelligent transportation systems based on this system [4]. Traffic prediction, which is the key technical basis of intelligent transportation systems, requires complete historical data to obtain more accurate predictions. In the case of loss of traffic data, a traffic prediction model becomes especially sensitive, which leads to large error prediction. Therefore, according to the data characteristics of the tensor multi-mode, the traffic data are restored [5–8]. The clean, accurate, and structured traffic data information is the basis of the efficient operation of the intelligent transportation system.
Autoregressive Integrated Moving Average Model [9] is widely used in short-term traffic flow prediction and is based on vector data flow. The local chaos prediction model [10], support vector machine (SVM) [11] and others have better prediction effects on the link travel traffic flow. The short-term traffic flow prediction in the form of matrix data flow mainly includes a multivariable time series prediction model [12] and multi-section short-term traffic flow prediction, which are applications from Yibing Wang et al. [13] that have been popularized by the Kalman filter method to predict the real time traffic flow on an expressway. At the same time, researchers have studied the significant periodicities in traffic flow data, especially in the week-mode and day-mode [14], which improve the accuracy of short-term traffic flow prediction. Guo et al. [15] considered seasonal characteristics in the traffic flow data to predict the traffic flow in real time. Zhang and others [16] analyzed traffic data cycle trends, certainty and volatility to study traffic flow prediction, and all of these approaches had good results.
At the same time, the grey model is adaptable, and it can handle mutation parameters. Since the founding of [17] in 1982, the grey prediction model has been a core part of grey system theory. It has been widely applied in various fields [18–21]. Among them, short-term traffic flow prediction is an important application aspect. Xiao et al. [22] proposed a seasonal grey GM (1,1) rolling prediction model based on a periodic truncation accumulating operator (CTAGO). Guo et al. [23] established a grey nonlinear delayed GM (L, l) model to pretest the short time traffic flow of urban roads. Bezuglov and Comert [24] improved the Fourier error by improving the GM (L, l) model and grey Verhulst model, and they applied them to traffic flow prediction. On the other hand, wavelet analysis has good time frequency loal-ization. It is applied in the field of traffic flow prediction with a combination of wavelet theory and other algorithms, and a combination of wavelet theory and neural networks [25–27] can obtain better results.
However, in view of the above short-term traffic flow forecasting methods, it is found that most of the prediction methods are based on a single prediction model, while a single prediction model cannot guarantee absolute good performance indicators, and the combined model can obtain different information from different perspectives and different models. After learning from each other, the model can improve the accuracy and stability of the model prediction. The MA model is used to forecast the trends of the week by Tan et al. [28], forecasting the intraday trends by using an ARIMA model, while a neural network is used for data aggregation. Chan et al. [29] introduced the hybrid exponential smoothing method and the least square method into a neural network model. Their experiment proves the effectiveness and feasibility of the combined prediction model for traffic flow prediction. Wang et al. [30] integrated the Bayes ensemble method, ARIMA model, Kalman filter and three other models. In addition, the researchers used ARIMA [31, 32] combined with other models for short-term traffic flow prediction to achieve better results.
However, the above combined predictive models concern only time information and only pay attention to the characteristics of nonlinearity, volatility and periodicity in the single time series change of the target section, and they fail to make full use of the time information features of traffic flow data; thus, the prediction results are affected in this way. Therefore, to better excavate the characteristics of traffic data in the three modes of week, day and time, this paper uses a high-dimensional tensor model to represent the multi-mode data characteristics of the traffic flow data. First, we study the lost and abnormal aspects of the traffic flow data, and we use the basic operational properties of tensor Tucker decomposition to study the method of tensor filling data using ITRM and other approaches. The traffic flow loss and abnormal data are pretreated effectively to obtain clean, accurate and structured traffic data. We discuss the tensor “week - day - time” mode, and for the same time cycle mode, the classical GM (1,1) model is used for the prediction. For the day mode, we use the SGM. We forecast the traffic flow data at the same time over two weeks; we obtained the forecasting data, removed the old information and added new information to predict the next data, and then, we repeated the previous process and completed the prediction of the subsequent part of the data. For the time mode, a prediction method with good time frequency localization properties and neural networks is applied. Then, the prediction data over the same time period obtained by the three models are correlated with the original data by the grey correlation analysis method, and the different correlation coefficients are obtained. According to the correlation coefficients, the different weights of the three models are obtained, and the final prediction results are obtained according to the different weights. Finally, we take the 7:00–9:00 traffic flow data of the main road of the Shaoshan road in Changsha, Hunan as an example. According to the multi-mode characteristics of the data, three typical tensor models of “week - day - time” are established. First, we use the filling algorithm of the tensors for the missing data, and we make an empirical analysis of the data filling technology. Then, for the traffic flow prediction at the same time, three different tensor models are compared, and the experimental results show that the simulation and prediction effects of the coupling of the three models for the tensor is better than the GM(1,1) model, SGM model and wavelet analysis of neural network.
The sections in this paper are arranged as follows: The second section introduces the multi-modal characteristics of the traffic flow data tensor and introduces the filling algorithm that includes a tensor based on the operational properties of the tensors. In the third section, we use different tensor models to express the characteristics of the multimodal data of the traffic flow, and we introduce three different models and grey relational analysis. In the fourth section, we preprocess the traffic flow data and analyze the results of the coupling model and individual method. The last section is the conclusions.
High-dimensional tensor pattern data filling method
Decomposition model of tensors
A tensor is a multi-dimensional array that is composed of data. It can be seen as a further extension of vectors and matrices in high-dimensional space. It is composed of multi-dimensional arrays or multi-dimensional matrices. For example, a vector can be regarded as a 1-order tensor, and a matrix can be regarded as a 2-order tensor. Here are some related concepts [33]:
This process is called tensor n-mode matrixing.
is called the inner product of two N-order tensors.
The Frobenius norm is also called the tensor modulus.
which is called the outer product of two tensors.
Iterative tensor Tucker decomposition data recovery method
Tucker decomposition is proposed by Tucker in 1963 about Principal Component Analysis of higher order. It expresses a tensor as a core-tensor multiply by a matrix along each mode. Afterward, tucker decomposition is mainly used to matrix singular threshold decomposition in high dimensional space which called Higher Order Singular Value Decomposition (HOSVD). To learn more about tucker decomposition, we introduce two common decomposition model, CP decomposition and Tucker decomposition [32].
The CP decomposition model is used to decompose the N-order tensor χ ∈ RI1×I2×⋯×I
N
into a number of rank-1 tensor sums:
The factor matrix of tensor χ as follow:
R is an integer.
For three order tensor χ ∈ RI×J×K, its CP decomposition as follow:
The corresponding element of tensor χ:
Figure 1 shows a schematic diagram of a third-order tensor decomposition.

CP decomposition of third-order tensors.
Tucker decomposition of tensors aims to decompose the N-order tensor χ ∈ RI1×I2×⋯×I N into low-dimensional kernel tensors χ3 ∈ RR1×R2×⋯×R N and the N factor matrix product model
The factor matrix A(1), A(2), ⋯ , A(N) always orthonormal. The element of tucker decomposition as follow:
The matrix form of tucker decomposition as follow:
For instance, for three order tensor χ ∈ RI1×I2×I3, if
Figure 2 shows the Tucker decomposition of the third-order tensor.

Tucker decomposition of a third-order tensor.
CP decomposition can be considered as a special form of tucker decomposition. If core tensor
According to the principle of tucker decomposition, we introduce the process of recovery by tucker decomposition:
The tensor χ and the sheltered tensor Omega which means that some data of tensor are missing are superimposed to obtain a tensor with missing data B = χ· * O
mega
. In the process of an iteration, we first decompose the original tensor B′ = B to obtain the kernel tensor L and a series of singular value matrices U1(1), U1(2), ⋯ , U1(N), where 1 in front of the bracketed bracket indicates the number of iterations; then, there are
From the Tucker decomposition principle and process introduced above, it can be known that the tensor n-th order singular matrix is the left singular matrix that corresponds to the tensor n-th order matrix, and thus, the N singular value matrix of Tucker decomposition of N-order tensors is calculated. The process can be seen as calculating N times the size of I n = I1I2 ⋯ In-1In+1 ⋯ I N (1 ≤ n ≤ N).
With Tucker decomposition of the different matrices of the matrix in such a way that a singular matrix can be found U1(1), U1(2), ⋯ , U1(N), nuclear tensors can pass B′, and we have a series of singular matrix calculations:
Then, select k1, k2, ⋯ , k
N
singular vectors in the front of each singular value matrix, and the number of singular vectors can be chosen based on experience. Then, we obtain a matrix B1 similar to B′. Additionally, let
At this time, we have the nuclear tensor
We extract the missing part of the tensor and restore it to the original missing data tensor B. The missing data part of the tensor
Next, using Tucker decomposition for the new tensor
When the iteration process satisfies the convergence condition, the missing data recovery process completes the evaluation of the data recovery. The following error calculation formula was used:
The numerator represents the sum of the absolute value error in the sheltered part, and Nmis represents the number of sheltered elements in the tensor. Because the main basis for the algorithm is the known elements in the tensor and the value of the known elements do not change, the part that must be evaluated is only the missing data part.
The above are the process of tucker-decomposition, it aims to figure out a new tensor
ITRM algorithm
ITRM algorithm realize the tensor reconstruction by tensor Tucker threshold iterative method. In the algorithm, the data are in the form of tensor. Through tucker decomposition we get the potential structure of those items of tensor, then extract the principal component of tensor to realize the reconstruction of missing data.
The data used in this study are obtained from the Urban Transportation Research Institute of Central South University [34]. The selected data were derived from four straight lines in the direct direction of Shaoshan Road from south to north in Changsha, China. The collector collects the traffic flow every five minutes, and there are a total of 63 days of traffic flow from September 17, 2013 to November 18, 2013.
To verify the convergence of the algorithm, we make a comparative analysis from three aspects. First, a tensor χ was randomly generated on Wednesday and Saturday’s five-minute data, according to the continuous 63-day week. The missing data make up 50% (these data we don’t know). The mean absolute percentage error (MAPE) and iterative times are used in the algorithm’s convergence process, as shown in Fig. 3.

The convergence graph of the 50% loss rate.
Figure 3 shows that the algorithm can converge after approximately 10 iterations. It shows that the convergence speed of ITRM is better at filling in the initial incomplete traffic flow data, and the following experiments were conducted according to the traffic flow data. For different loss rates, when the loss rate increased from 0.1 to 0.9 with a step length of 0.1, the experiment was compared and analyzed. The following is a comparison of the data from the data of Friday in 63 days.
Figure 4 shows that when the loss rate is between 0.1 and 0.6, there is almost a smooth linear increase between the MAPE and the loss rate. Additionally, the error is relatively small for the five-minute interval traffic flow data, within acceptable limits. The figure shows that the ITRM algorithm in this range can achieve better experimental results in implementing the data recovery. Because the algorithm can successfully restore the unknown data in the traffic flow tensor, the equipment that currently tests the transportation equipment is also advanced, and the data that are lost in the experimental section of 63 days are less than 10%. Therefore, we choose this method to preprocess the data. Figure 5 represents the 16th day of the nine weeks (October 2nd). The experimental comparison diagram of the lost data and recovered data is shown below:

Comparison of the loss rate from 0.1 to 0.9.

Data-filled diagram for 20-minute intervals on October 2nd.
This section mainly introduces the GM(1,1) prediction model, GM (1,1), the SGM prediction model, and the model based on the wavelet network, in addition to the grey relational analysis.
GM (1,1)model
Set the original sequence to be
An accumulative generation sequence of the original sequence is
In addition, set Z(1) = (z(1) (2) , x(1) (3) , ⋯ , z(1) (n))
T
as the adjacent mean equal weight-generating sequence of X(1)
which is a grey system prediction model for first-order equations with a variable, referred to as the GM (1,1) model [17]. The equation
is called whitening equation of x(0) (k) + az(1) (k) = b which called grey differential equation The parameter estimation of grey differential equation as follows:
The GM (1,1) grey model has the following conclusions
(1) The time response function (solution) of the whitening equation
(2) The time response sequence of the GM (1,1) grey differential equation:
(3) Then, the restored values of x(0) is given by
GM(1,1) model is classical model of grey forecast. It studies specially on poor data system which is uncertain and is well-adapted to handle better variational parameter, and it can be used in traffic tensor flow forecast at week-mode.
which is called the cycle truncation accumulated generating operation (denoted CTAGO), where q is the number of elements in each cycle. Setting r = n - q + 1, the sequence
becomes the cycle truncation accumulated generating operation (denoted as CTAGO).
Here, y(1) (k) is called the 1-AGO sequence of the CTAGO sequence y(0) (k).
In addition, we regard
the grey system prediction model with a first-order equation with one variable, which is called SGM [21]. Its parameter estimation is as follows:
and
SGM provides the following conclusions:
(1) The time response function (solution) of the whitening equation
(2) The time response sequence of the
(3) The recuperative values of x(0) (k) as follow:
The grey rolling prediction SGM(1,1) prediction model is generalized form of classical grey model of GM(1,1). It applies the AGO of GM(1,1) to CTAGO of SGM(1,1). Each item of the y(0) (k) is sum of cyclic data. Grey rolling SGM(1,1) uses former q item of data to forecast the (q+1) th data. Then, replace the oldest data of x(0) (k + 1) with the predicted data, and so on.
This model can be used to forecast the traffic seasonal flow data. Through studying the cyclic raw, we can transform seasonal traffic flow data to mild sequence by means of limited past data and confirm the timeliness of forecast. Likewise, it can be used in day-mode forecast at the same time of each day.
The wavelet neural network is a neural network that is based on the topology of the BP neural network, which takes the wavelet basis function as the transfer function of the hidden layer node, and the signal propagates forward while the error is propagated in reverse. When the input signal sequence is x
i
= (i = 1, 2, ⋯ , k), the output formula of the hidden layer is
Here, h (j) is the output value of the j nodes of the hidden layer; ω ij is the connection weight of the input layer and the hidden layer; b j is the shift factor of the small wave base function h j ; a j is the expansion factor of the small wave basis function h j ; and h j is a small wavelet basis function.
The wavelet basis function is the Morlet mother wavelet basis function, and the mathematical formula is
The output calculation formula of the wavelet neural network is
Here, ω ik is the weight of the hidden layer to the output layer; h (i) is the output of the i-th hidden layer node; l is the number of hidden layer nodes; and m is the number of nodes in the output layer.
Wavelet neural network is better than neural network because its element and structure is decided by wavelet analysis theory, it avoids the blindness of BP neural network on structure design. It owns better learning skill and has faster, more accurate rate of convergence. Likewise, wavelet neural network has a better effect on short-time traffic flow forecast. It can forecast the whole trend of traffic flow. Consequently, it can be used to short-time traffic flow forecast of time-mode data.
Coupling of three models in tensor mode data
In traffic flow data, the traffic data between different weeks is correlated, and additionally, there is a stronger correlation between the weekday and weekend of a continuous week. Traffic flow data can be built into various forms of traffic flow data tensor models through a coupling mode. According to the characteristics of multiple modes of tensor data, a hybrid algorithm is used to predict the traffic flow data.
This paper collects traffic flow tensor data. Focusing on the correlation of traffic flow data, we adopted the “week-day-time” model. For the week model, we extracted data from the same time period in each week to build the sequence and adopted the classic GM(1,1) model for the prediction. The data of the day mode is seasonal, and the rolling seasonal SGM model based on the periodic truncation operator is used for the prediction. Focusing on the time-series data over time, we used the model of the neural network with wavelet analysis for the prediction. The predictive value of the data in the same time period on the same day of the week mode id was defined as
The coupling algorithm is used to identify the weight of Deng’s grey relational degree [15]. The basic principle is to use the model fitting value and the actual value of the correlation degree. The higher the correlation degree of the model is, the higher the reliability, and the greater the weight in the coupling model. Conversely, the lower the correlation is, the lower the reliability, and the smaller the weight in the coupling model.
The definition of Deng’s grey correlation is as follows:
These consequences have the same length. We define the grey relational degree of the vector as follows:
The coupling prediction model algorithm is as follows:
We extracted the fitting value and the real value sequence of the GM(1,1), SGM and Wavelet Analysis model. According to formula (3.11), the corresponding close correlation degree The corresponding weighting coefficients in the coupling model are determined according to the degree of correlation:
4) According to formula (3.10), the time series and the cross-sectional data coupling predictive value are obtained:
The method of predicting the above is called a coupling model of tensor multi-mode data, and the prediction flow diagram is shown in Fig. 6

A flowchart of the data coupling model for the traffic flow in tensor mode.
This section focuses mainly on the real-time data of traffic flows and fills in the lost data through the ITRM algorithm and through the properties of tensor operations. After filling in the traffic flow data, a tensor multi-mode representation is performed. For data of different tensor modes, the coupling model is empirically analyzed.
Establishment of traffic data model of tensor
Among traffic data, the traffic data of same day of different weeks have correlation, and especially the weekend of adjacent weeks. Hence, the correlation between several mode and traffic time-spatial data should be considered in traffic data model. We can express different forms of traffic data. Such as the tensor in location-week-day-time mode can be used to a traffic flow of some roads. We can get that:
Thereinto, L means loop, this model has 11 collectors, W means Weeks which contains 11 weeks data. D means Day which on behalf of that there are 7 days in a week. T means Time which shows that 288 times in a day.
Above model can be described as a week-day-time mode, and it also can be described as form like χ ∈ RW×D×T; If set T = 288 = 24 × 12, denote the collect data each five minutes, 12 times an hour, then a five order tensor χ ∈ R11×11×7×24×12 can be derived. For time series, we can build different tensor model according different collected time. Such as collect data each ten minutes, than a five order tensor χ ∈ R11×11×7×24×6 can be derived.
After the traffic flow data are preprocessed, the data are collected by the collector every 5 minutes, and 288 data points can be collected in one day in section 2.3. Therefore, the third-order tensor form of the traffic data “week-day-time” can be obtained: χ ∈ R9×7×288 tensor model, or fourth-order tensor, χ ∈ R9×7×24×3, χ ∈ R9×7×24×2, χ ∈ R9×7×24×1, and so on. Here, 9 represents 9 weeks of data, 7 represents 7 days per week, 24 represents 24 hours per day, 2 represents two collection times per hour, 3 represents three collection times per hour, and 1 represents one collection per hour.
Analysis of the results of the tensor multi-mode data coupling model prediction
A comparative experiment for the first tensor model χ ∈ R9×7×24×3 was compared to the experimental data with the data of 6 traffic flows every 20 minutes from 7:00 am to 9:00 am on Wednesday, November 14th. Here are the specific steps of the experiment: Choose September 19, September 26, October 10, October 17, October 24, October 31, November 7, and November 14, removing the holiday on October 3. Data from 7:00–7:20 in the 8th week are forecasted using the GM (1,1) grey prediction model, and the 8th data point is predicted from the first 7 data points. The same method is used to predict the data for the other 5 periods. For traffic flow from 7:00 to 7:20 on the 22nd day of November 14th, the real-time data from October 31 to November 14 are 14 days in length, and using the grey rolling prediction model SGM to predict the data for November 14th, the other time will also use the same method. Using the 20-minute data from November 11 and November 13 for three days, a wavelet neural network model was used to predict the traffic flow data for 20 minutes between 7:00 and 9:00. The data traffic predicted by the three models is correlated with the original data using formula (3.11), and then, the weights of the models are obtained by (3.10). The three models are coupled to obtain the final predicted values. Compute and compare the predictive values and errors
Define the mean absolute percent error as follows:
Here, x(0) (k) represents the original data, and
The predictive values and errors of the coupling model, GM(1,1), SGM and wavelet analysis neural network models are depicted in Table 2.
Simulation values and errors of the amount of the χ ∈ R9×7×24×3 model
In Table 2, the effect of the coupling model depends on χ ∈ R9×7×24×3 and is better than a single gray GM (1,1) model, gray rolling SGM model and wavelet neural network model. Among them, we have the GM (1,1) model predictive effect of the week-model and the day-mode gray rolling model SGM model, and the prediction results are comparable, while both are better than the wavelet neural network model. According to Table 1, the prediction percentage errors of the above three models for the traffic flow are illustrated in Fig. 7.

Experimental comparison of coupling models and three single models.
As seen from Fig. 7, the effect of the coupling model depends on χ ∈ R9×7×24×3 and is better than the three single models, which indicates that the coupling model is effective.
Traffic data provide an important data foundation for the research and development of intelligent transportation systems. The data contain rich multi-mode features. Tensors are a generalization of vector and matrix models, which can represent multidimensional data and exhibit multi-mode features. In this paper, multi-mode traffic flow data are represented by high-dimensional tensors, and a coupling model suitable for multi-mode state traffic flow is proposed. For comparison experiments, good results are achieved, specifically in the following areas:
To mine the characteristics of traffic data better in several modes such as week, day, and time, we attempt to fill in the lost traffic data and make use of Tucker decomposition and other basic computing properties; then, we preprocess the traffic flow data according to the ITRM algorithm to obtain whole and accurate traffic flow data. We study the multi-mode characteristics of the traffic flow data and use different high-dimensional tensor models to represent different traffic flow data. According to the multi-mode correlation of traffic flow data, we use a classical gray GM (1,1) model for forecasting the week-mode data, use a grey rolling SGM prediction model to predict day-mode data and use a wavelet neural network for the time model prediction for the time-mode. Finally, three models are coupled according to the grey correlation technique. Finally, we use the traffic flow data of 7:00–9:00 in the early peak period of the main road of Shaoshan Road in Changsha City, Hunan Province, China as an example for analyzing the data, and we establish the “week-day-time” mode according to the multi-mode characteristics of the data. Through comparative analysis, the experimental results show that the coupling model of the multi-mode data is better than the single model in terms of prediction effect.
Footnotes
Acknowledgments
This work is supported by the National Natural Science Foundation of China (No. 71871174, 71271226, 11671001). Project of Humanities and Social Sciences Planning Fund of Ministry of Education of China (18YJA630022).
