Abstract
Based on the conditional heteroskedasticity of following vehicle acceleration fluctuation in car-following behavior, this paper combines the idea of driving force, Intelligent Driving Model (IDM) and Generalized Autoregressive Conditional Heteroskedasticity (GARCH model) to establish a novel car-following model. First, the concept of safety driving force and efficiency driving force are recommended, and both of them are defined by the idea of IDM, and the dynamic balance of them are the reason for acceleration fluctuation. Then, according to the heteroskedasticity of acceleration sequence, the classic GARCH model is introduced to establish the relationship among the variance of acceleration fluctuation items, driving force items and fluctuating memory items. On this basis, a novel car-following model is established, and the relational properties are studied. Finally, an example is used to verify our model, the results show that the novel car-following model is more accurate than IDM model, and the forward prediction result is close to the actual acceleration change value, which can predict the occurrence of dangerous driving behavior.
Introduction
Acceleration is one of the most fundamental parameters to describe the motion of a vehicle, and building the relationship between acceleration and other parameters of the car-following behavior is also an important requirement of theory research. Many experts and scholars have made outstanding contributions in this direction. For example, linear car-following models were first proposed by Chandler [1] and Herman [2], and then developing to nonlinear car-following models [3, 4], optimal velocity models [5, 6], and IDM model [7]. These models were proposed by different ideas, but they all successfully described acceleration of following vehicle to a certain degree, and laying the foundation for the research of micro-traffic simulation and macroscopic road access.
There is a lot of practical significance to study the change rule of acceleration in car-following behavior, and one of the most important purposes is to study the driver’s safe driving behavior [8]. Traditionally, some researchers have divided drivers into two categories: aggressive and cautious [9]. An aggressive driver takes a driving behavior characterized by changing velocity large and fast together with numerous and sudden changes of instantaneous acceleration or deceleration. However, a cautious driver always tries to maintain a constant moderate velocity. The dramatic changes in velocity are closely related to the acceleration, when acceleration fluctuates violently, a rear-end traffic accident may occur. Therefore, studying the fluctuation of acceleration may be more important than studying the value of acceleration itself.
In general, the fluctuation of acceleration changes with time, so the acceleration sequence is a time series. There are many factors that result in a change of acceleration, on the one hand, it is influenced by objective conditions such as traffic conditions and weather. On the other hand, it is affected by the driving state of preceding vehicle and the driving habits of driver. Therefore, there may be obvious heteroskedasticity, which means that acceleration fluctuates violently at some point and gently at other times. In order to describe the heteroskedasticity of acceleration fluctuation, this paper attempts to introduce GARCH model. For the heteroskedasticity of financial time series fluctuation, the autoregressive conditional heteroskedasticity model (ARCH model) [10] was first proposed by Engle in 1982, and then developed into the generalize autoregressive conditional heteroskedasticity (GARCH model) [11] by Bollerslev. The basic formula of GARCH model can be divided into the mean equation and the conditional variance equation.
In the process of studying the theory of car-following, Yu [12] and Liu [13] have proposed and verified the memory effect. The memory effect is that drivers of vehicles make decisions relying on external stimuli for a period of time, which is consistent with the fundamental idea of the GARCH model that fluctuation relies on prior volatility information; on the other hand, the fluctuation of acceleration is influenced by drivers’ driving habits, and the fluctuation memory which GARCH model describes is to fit the previous characteristics of fluctuation series, therefore, in the application of GARCH model to study the fluctuation of acceleration, the previous characteristics of fluctuation series can also describe drivers’ habits. This is the other two important reasons for studying the fluctuation of acceleration by GARCH model.
In the car-following behavior, the other factor that effects acceleration fluctuation of following vehicle is the preceding vehicle. Considering that IDM model is a good example that can accurately describe the behavior of drivers and retain complex macroscopic traffic phenomenon [14]. The specific form is as following.
This paper defines the efficiency driving force and safety driving force based on the idea of IDM model, and the dynamic balance of the two driving forces are introduced into the GARCH model as exogenous variables to establish a novel car-following model.
The content of this paper is arranged as follows: In Section 2, a novel car-following model is established, and the related properties and parameters estimation methods are discussed. The third section uses actual data of NGSIM to fit the model, and the accuracy is analyzed. Section 4 summarizes the paper.
Fluctuation characteristics of acceleration
Acceleration is one of the most basic variables in the process of driving. Its fluctuation characteristic can not only reflect the driver’s personal driving habit, but also can be used as an important index to study the car-following behavior. Therefore, it is very important to study the fluctuation characteristics of acceleration.
Influenced by functionality of vehicles, acceleration and deceleration is always a continuous process, and acceleration at time t fluctuates based on acceleration at time t - 1, and considering the acquisition frequency of NGSIM data is 0.1 s, so t - 1 in this paper means 0.1 s earlier. All data of NGSIM is studied, and Fig. 1 is a set of acceleration data randomly selected from our research, when the acceleration a (t - 1) is determined in the preceding period, the acceleration a (t) in the latter period changes in the range of (a (t - 1) - u
t
, a (t - 1) + u
t
), where u
t
is the fluctuation item affected by factors such as spacing, velocity of preceding and following vehicles, and driver’s driving habit. Figures 2 and 3 show the frequency distribution of u
t
and Q-Q figure when acceleration is in the range of (–2, –1) (m/s2) and (1, 2) (m/s2). It can be seen from the Q-Q figure that the sample scatter is close to the line described by the normal distribution, so in general it can be assumed that u
t
obeys a normal distribution. At the same time, effected by precision of driver’s control, u
t
usually fluctuates in a certain range under the same external conditions [15, 16]. Therefore, this paper uses equation (5) to describe the fluctuation of acceleration.
where

Fluctuation characteristics of acceleration.

Frequency distribution and Q-Q figure of u t when acceleration is in the range of (– 2, – 1) (m/s2).

Frequency distribution and Q-Q figure of u t when acceleration is in the range of (1, 2) (m/s2).
In the process of studying car-following behaviour, Xing [17] and Chen [18] believe that there is a desired distance between the following and preceding vehicles, that is, the spacing is always within a certain range when velocity is determined, because the driver of following vehicle always try to maintain a suitable distance which is called desired spacing. As shown in Fig. 5, when velocity is about 11 m/s, the spacing is slightly adjusted within a small range, and Weidman calls this behaviour as Oscillating processes [19]. Because drivers usually cannot accurately judge the current driving environment, and also cannot accurately maintain the pressure applied to the throttle, resulting in the actual spacing always oscillating near the desired spacing. Therefore, the desired spacing can be considered as the mean in this range and the actual spacing is evenly distributed on both sides of the desired spacing. In the actual driving process, drivers often adjust spacing according to velocity. When velocity becomes large, the driver tends to stay away from the preceding vehicle for security considerations in light of the security concerns; in contrast, when velocity becomes small, the driver tends to be close to the preceding vehicle. Therefore, velocity is the most important factors which effects spacing. The relationship between velocity and desired spacing is discussed below by the data which we chose in Section 3.1.
Average spacing under various velocity conditions is shown in Fig. 4. It can be seen from the Fig. 4 that there is a clear positive relationship between the desired spacing and velocity, which means when velocity increases gradually, the average spacing would also gradually increase. This is consistent with the reality that the driver controls spacing according to velocity. Therefore, this paper attempts to find a best equation to fit the curve (or straight line), and the results are shown in Table 1.

Desired spacing.

Oscillating processes of spacing when velocity is in the range of (10, 12.5) (m/s).
Fitting equation of Fig. 4
In fact, we have made a lot of attempts to choose a better fitting equation to fit the relation of velocity and spacing, but limited by space of paper, we can’t put all results in Table 1. According to Table 1, we can see that quadratic function can fit this relation best, and the fitting curve equation is as Equation (6).
And we will use Equation (6) as the desired spacing in the following research.
After recognizing the law of acceleration fluctuations in Section 2.1, we need to know why acceleration fluctuates. Based on the idea of classic IDM model, this paper considers the concept of the driving force and establishes the relationship. Delpiano [20] and Lin [21] have applied the concept of the driving force to the transportation field. The driving force refers to the force that helps drivers to achieve some purpose. It doesn’t act on a person’s body, but is a quantity that describes his/her motivation to take actions. In the traffic discipline, the driving force can only affect traffic participants (including drivers, riders) rather than traffic tools. Note that the driving force is very similar to social force, since both of these force correspond to social interactions. We specifically put forward the driving force concept but not borrow the generalized or social force concept directly because the driving force is more appropriate for this paper which emphasizes a person’s different motivations. The most important kinds of driving force are the efficiency force and safety force [22], which represent the driver’s motivation to improve his/her safety and efficiency respectively. However, drivers cannot know the objective value of their safety and efficiency, but they can perceive the degree to which they are safe and efficient. If the driver believes that driving is not safe enough, the safety force increases, and if the driver believes that driving efficiency is low, the efficiency force rises. The two kinds of driving force are recorded as P
efficiency
and P
safety
; the car-following behaviour is the dynamic equilibrium process under the combined action of the two driving forces.
The driving force acts on the acceleration of the vehicle. When P efficiency > P safety , the vehicle speeds up; when P efficiency < P safety , the vehicle slows down; and when P efficiency = P safety , the vehicle travels at an approximately constant acceleration.
In car-following behaviour, the efficiency force of the vehicle is primarily determined by the velocity of preceding and following vehicles. The following vehicle will always try to reach the destination as soon as possible. If the velocity of the preceding vehicle increases or the velocity of the following vehicle decreases, the efficiency force will increase, and then, the following vehicle would tend to accelerate consistent with the preceding vehicle; if the driving force decreases, the vehicle will slow down. The efficiency force can be defined as
The driver’s perception of safety is primarily affected by the spacing between the following and preceding vehicles. When spacing is less than the desired spacing, the driver of the following vehicle would feel less safe, so the safety force would be smaller; on the other hand, the safety force would be greater, so the safety driving force can be defined as
In the above formulas, both c1 and c2 are sensitivity coefficients, and they have the effect of adjusting the dimensions and sizes of the efficiency force and safety force. S (t) is desired spacing used in this paper, and it is different from S*, we have conduct a detailed study of it in Section 2.2. Therefore, the driving force can be defined as
Further considering the basic characteristics of the car-following behaviour, the conditional heteroskedasticity
The meaning of the novel conditional variance equation is as follows: The conditional heteroskedasticity Regardless of safety force P
safety
or efficiency force Pefficiency becomes large, which results in the increasing of fluctuation of the acceleration.
Based on the acceleration equation and the conditional variance equation, the novel car-following model based on velocity fluctuation is as follows.
Before applying the GCF model, we need to verify the established model conditions. For the GARCH model, we need to know the parameter condition when the GCF model is a stationary process. Stationary process means that characteristics of its probability distribution don’t change, so we can predict
E (X
t
) is a constant independent of t; For each k, cov (X
t
, Xt+k) and t are independent.
From
Then
Let
Since
It is easy to see that E (Ft-k)< ∞, and for a particular vehicle, E (Ft-k) is a constant c n .
If and only if 0 < α + β < 1,
Therefore,
At the same time, both E (u t ) =0 and cov (u t , u s ) =0 (t ≠ s) hold. Therefore, theorem 1 is proven. The condition for the GCF model to hold is 0 < α + β < 1.
To better use GCF model to describe and determine traffic conditions, we will further explore the properties of the model and its parameters.
From Property 1, we can see that the variance of the acceleration fluctuations is mainly affected by the current and past driving force and the value of the acceleration fluctuation. We know that 0 < β < 1 in Theorem 1, showing that the influence of the driving force and acceleration fluctuation on the velocity variance is smaller as time elapses. Property 1 reflects the gradual decline of the memory of vehicle acceleration fluctuation.
That is,
Therefore, the corresponding conditional heteroskedasticity
Empirical analysis
Data collection
The data source is the next-generation simulation project NGSIM by the United States Federal Highway Administration. The data are collected at Highway 101, Los Angeles, California, USA, from 8:05 to 8:20 am. The target is the driving condition of all the vehicles that meet the car-following behaviour of the section.
The length of the road is approximately 640 metres, and there are five main lanes and an auxiliary lane. As the acquisition time for the traffic is from the early peak to peak recovery, traffic density is relatively moderate, with the occurrence of car-following behaviour more universal. The acquisition method is to use a camera near the road that detects video images. A continuous state of motion information of individual vehicles is recorded and entered into the database. A total of eight cameras are installed along the way and numbered 1 to 8, and their distribution is shown in Fig. 1. Image data are collected by these eight cameras; the acquisition frequency is 10 frames per second. Numbering of the vehicle is automatic when the vehicle enters the range of the 1/8 camera (according to the time of entering the observation area). The vehicle operating conditions are recorded until the vehicle is out of range of the 8/1 camera. The system automatically records vehicle information, including the velocity and acceleration of the following vehicle, number of preceding and following vehicles and other parameters.
In order to ensure that the data is reasonable and the research is accurate, this paper uses the following rules to select the appropriate data: Select all vehicles of the third lane which possess more car-following characteristics, except for the first vehicle; During the period of data collecting, both of the following and preceding vehicles don’t change lane and overtake; To ensure that the data used in this paper satisfy the car-following characteristic, time headway should be less than 5 s, while spacing should be less than 50 m; The time which both of the following and the preceding vehicles pass the test section shouldn’t be less than 70 s; Maximum acceleration, maximum deceleration and maximum velocity of the preceding and following vehicles are in a reasonable range.
Through the above criteria of data screening, a total of 72 sets of data are selected, the following research will focus on these data.
Heterogeneity test
The basic assumption of the model (5) is that the conditional variance of the fluctuation term of acceleration is affected by the preceding vehicle and varies with time, in other words, there is heteroskedasticity. Therefore, before fitting the data with (12), we need to verify the existence of heteroskedasticity (ARCH effect).
Use equation (5) to fit the acceleration of the following vehicle, and obtain the fluctuation term in Fig. 6. It can be seen that there is an aggregation effect of the fluctuation, which means that the fluctuation is small in some time periods and very large in other time periods, indicating that the fluctuation term may meet heteroskedasticity. To confirm the existence of heteroskedasticity, the Lagrange multiplier rule is used to test the heteroskedasticity.

Residual term u t .
The hypothesis of the Lagrange multiplier rule is that there is no ARCH effect until order p of the residual sequence. The value of P in (Table 2) is 0 significantly, so the hypothesis is rejected and the sequence {u t } has significant heteroskedasticity.
Heteroskedasticity test results
For GCF model, after verifying the existence of heteroskedasticity of the acceleration fluctuation term series, the car-following data can be further fitted by Equation (12). According to the Property 3, we can predict the square of one order difference value of acceleration by
The comparison of actual values and predicted values of IDM model and GCF model is as following Fig. 7 (the length of data is limited to 400*0.1 s to ensure recognition of Fig. 7). In Fig. 7, there are three groups of experiment fitting results selected by fluctuating frequency and amplitude. After comparison, the peak time and the frequency of fluctuation of GCF model are both found to be in basic agreement, and the amplitude of fluctuation is closer than IDM model. Therefore, it can be preliminary concluded that the GCF model established by the dynamic balance of two driving forces can effectively fit the fluctuation of acceleration. It can also be found that the predicted value of GCF is often lower than the actual value at the time when the acceleration fluctuation is abnormally increasing, mainly because when velocity of the preceding vehicle falls sharply, driver of the following vehicle will overreact, which means that driver will take a bigger deceleration, but the GCF model cannot take it into account. On the other hand, it can be seen from Fig. 7 that it doesn’t affect the overall prediction of acceleration of GCF model, it is mainly because the memory of GCF model can simulate the driver’s driving habits.

Comparison of predicted and actual values of
Taking into account that the predicted results of the GCF model is good, so we think that it can play the role of security warning. Such as the vehicle 1748, its acceleration fluctuation is gentler than the vehicle 1602 and the vehicle 1668 for most of time, but at 65*0.1 s, the predicted value and actual value are rising violently, and there is a big security risk in this situation, thus, it is possible to provide an early warning to the driver when the GCF model predicts a large increase in fluctuation and then prevent further increase. On the other hand, after comparing the fluctuation feature of vehicles above, it can be found that the fluctuation of the vehicle 1602 and the vehicle 1748 is gentler than vehicle 1668, and the times that the fluctuation value of the vehicle 1668 is greater than 6 is more, so it is obvious that the driver of the vehicle 1668 may be an aggressive driver. Therefore, we can conclude that drivers should be reminded to drive carefully when the GCF model predicted bigger fluctuation continuously.
In order to compare the accuracy of GCF model with it of IDM model, we need to select an error analysis function. Considering that acceleration fluctuation could be 0, we use MSE (Mean Squared Error) function to analyse the accuracy of the two models.
The two models are used to fit the data respectively which we screen out in Section 3.1, and the corresponding acceleration fluctuation term is obtained. Finally, compare the accuracy of the two models respectively based on the MSE function.
From the results of Table 3, we can find that the MSE error function value of GCF model is obviously lower than that of IDM model and most of the values is less than 1, so it can be indicated that GCF model can describe the fluctuation of acceleration better. It is mainly affected by the form of IDM, because IDM model is set to avoid impractical acceleration and deceleration, in other word, both of the maximum acceleration a and comfortable deceleration b of IDM are less than actual maximum acceleration and deceleration [23, 24], so it results that the acceleration fitted by IDM model to be significantly lower than actual acceleration, and cannot reflect the real-time change of acceleration. But the GCF model proposed in this paper is based on the idea of IDM model and considering the personal driving habit of the driver in the past, so it is more realistic to reflect the fluctuation characteristics of acceleration. Thus the fitting effect of acceleration fluctuation may be obviously better than that of IDM model.
Accuracy of GCF model and IDM model
Based on conditional heteroskedasticity of acceleration and the idea of IDM model, this paper establishes a novel car-following model. Empirical analysis shows that the GCF model can describe the fluctuation features of acceleration more accurately than IDM model. On the other hand, taking into account that predicted results of the GCF model is good, so we think that it can play the role of security warning. Finally, we should also pay attention to the shortcomings of the GCF model. Limited to its form, the GCF model can only be used to study the square of the acceleration fluctuation item, but cannot accurately determine the actual positive and negative value of acceleration fluctuations, so this part need to be studied further.
Footnotes
Acknowledgments
This work was jointly supported by the National Nature Science Foundation of China (61403288, 71540027), China Postdoctoral Science Foundation funded project (2014M562076), and the Fundamental Research Funds for the Central Universities (WUT: 2017IA004).
