Abstract
The 10,000-hour rate of civil aviation incidents is an important index parameter to measure flight safety. Predicting the development trend of the 10,000-hour rate of civil aviation incidents plays an important role in aviation accident prevention and safety decision-making. Many complex factors influence the occurrence of civil aviation incidents, so the 10,000-hour rate of civil aviation incidents changes randomly and volatilely. This study proposed the idea of prediction by combining the grey GM (1, 1) model and the Markov model. Specifically, the grey GM (1, 1) prediction model was constructed using the statistical data on the 10,000-hour rate of civil aviation incidents in China during 2005–2020. On this basis, a grey Markov prediction model was established. The prediction of the 10,000-hour rate of incidents in 2021 based on the two models showed that the grey Markov model displayed higher prediction accuracy than the grey GM (1, 1) model and conformed to the change laws of the 10,000-hour rate data of civil aviation incidents better. Moreover, the grey Markov model could effectively improve the accuracy of the grey prediction model, compensate for its deficiencies, and facilitate the mastery of the change laws of civil aviation incidents, providing a reliable basis for aviation safety management and incident prevention.
Introduction
In recent years, COVID-19 has had a huge impact on the global civil aviation industry. Thanks to the precise and scientific epidemic prevention and control measures, China’s civil aviation industry has become the world’s fastest-recovering and best-running aviation market under the COVID-19 epidemic. According to the statistical bulletin on the development of the civil aviation industry, from 2020 to 2022, the transportation turnover of civil aviation decreased significantly compared with that in 2019, with an average annual growth rate of
Total turnover of civil aviation transportation during 2018–2022.
Global aviation accident rate during 2010–2022.
The statistical data from the Civil Aviation Administration of China revealed 440 transportation aviation incidents and four severe incidents in 2020, 559 transportation aviation incidents and six severe incidents in 2021. What is more noteworthy is that on March 21, 2022, flight MU5735 from Kunming to Guangzhou of China Eastern Airlines Yunnan Co., Ltd. crashed in Wuzhou, Guangxi, and disintegrated after hitting the ground, killing all 123 passengers and nine crew members on board, which also terminated the record of 100 million hours of continuous and safe flight time of China civil aviation transportation.
With the recovery of civil aviation transportation production capacity and the increase in flight number and flight hours, the subsequent flight operation safety problem is still the focus of civil aviation operation safety. During aircraft operation, flight accidents or flight incidents may occur due to defects in aircraft design and manufacture, human factors, and weather and environmental factors [3, 4]. The 10,000-hour rate of civil aviation incidents refers to the probability for serious unsafe behaviours or unsafe states of civil aircraft to cause aircraft damage, personal injury, etc., during the flight for 10,000 hours but not cause serious flight accidents or even disaster events. To realize the continuous safety of civil aviation and master the development trend of the 10,000-hour rate of civil aviation incidents [5], accurately predicting the 10,000-hour rate of civil aviation incidents plays a vital role in accident prevention and safety decision-making of relevant units.
Commonly used accident prediction methods include BP neural network prediction method [6, 7], regression analysis prediction method [8, 9], grey prediction method [10, 11], and Markov prediction method [12, 13]. Therein, the BP neural network prediction model needs sufficient sample data, which has a large calculation workload and a slow convergence speed and easily falls into the local minimum. The accuracy of the regression analysis prediction method is highly correlated with the number of original data and samples, and it is not suitable for prediction in case of a small data size. The grey prediction method’s accuracy will decline if the data fluctuates greatly. The Markov prediction method applies to series with large data fluctuations, but more data should ensure accuracy. Given the differences in the applicable conditions of different prediction methods, combined prediction methods have also become the choice of many scholars to make up for the shortcomings of the methods. A combined method means integrating different prediction models and comprehensively using the information provided by various prediction methods to establish a combined prediction model, which can effectively improve the accuracy of the prediction model. According to the characteristics of sampled data, the aviation incident rate would be predicted using the grey Markov prediction model, expecting to provide a reference for aviation safety risk management and decision-making.
The 10,000-hour rate of civil aviation incidents is an important index parameter to measure flight safety. The greater the 10,000-hour rate of civil aviation incidents, the greater the possibility of civil aviation accidents. The existing literature mostly focuses on the research on civil aviation incidents, including incident influencing factors, incident prediction, and risk assessment. Aiming at the randomness and uncertainty of factors inducing flight accidents, Chen et al. [14] established an improved entropy and grey correlation algorithm for importance identification and identified key causes for flight incidents. Silagyi and Liu [15] evaluated and ranked the importance of accident factors to predict the severity of the consequences of aircraft approaching and landing accidents, established a support vector machine prediction model based on the RBF kernel function and verified the model’s prediction accuracy. Claros et al. [16] proposed a risk data modelling method based on a negative multinomial distribution. They integrated it into the FAA safety management system’s Safety Risk Management (SRM) plate, realizing the quantitative analysis of safety risks. To improve the ability of airlines to control flight operation risks, Zheli. [17] put forward an environmental risk assessment method of flight operation control based on support vector regression and verified its effectiveness. Fan [18] explored the change laws of China’s aviation incidents using the grey prediction theory. They established a grey prediction model, providing decision reference for civil aviation safety and disaster warning.
Grey prediction and Markov prediction have been widely investigated and applied by predecessors, involving various fields such as transportation, fault diagnosis, accident prediction, and aviation safety. To realize short-term traffic flow prediction, Comert et al. [19] proposed several new grey system models, expounded on the advantages of each model in reducing errors by comparing with the grey GM (1, 1) benchmark model, and verified the method’s feasibility. Liu et al. [20] proposed a new method to predict transformer faults by predicting the variation trend of dissolved gas content in transformer oil. They optimized the system state division method of the Markov model by using the Fibonacci sequence. Wang et al. [21] established three grey prediction models of hazardous chemicals considering the small size and information of accident data samples regarding hazardous chemicals. By comparison, it was concluded that the unbiased grey model achieved the highest prediction accuracy, which provided a new idea for the accident prediction of hazardous chemicals. Edem et al. [22] put forward a grey-fuzzy-Markov time series model for industrial incident prediction, which could predict accidents according to a small amount of or incomplete data information. Wang and Lv [23] established a grey Markov prediction model for incidents based on the grey prediction model, verified the feasibility and accuracy of the model with an example, and improved the prediction accuracy of the variables with large fluctuations. Ni et al. [24] integrated the deep belief network (DBN) and principal component analysis (PCA) in combination with the characteristics of big data to put forward a method to predict the serious flight accident rate of unsafe events based on deep learning. The results obtained by this combined prediction model are consistent with the reality.
To sum up, the current research on civil aviation incidents mainly focuses on analyzing influencing factors, predicting the number of incidents, and risk assessment. Although the key influencing factors of incidents can be identified as small-probability events, incidents are highly volatile, with little significance if used to predict the number of accidents quantitatively, making it difficult to reflect the safety performance level of civil aviation. Given the above problems, this study used the 10,000-hour rate of incidents as the prediction index. Then, a grey Markov prediction model was established to mitigate the influence of data fluctuations, which could help master the development trends of civil aviation safety and provide a reference for safety management and decision-making.
Theory and method
Grey theory
Chinese professor Deng put forward the grey prediction method in 1982, suitable for series prediction with a small data size and insufficient information. The most common grey prediction method is the GM (1. 1) model [25]. The model regards random variables as grey variables and random processes as grey processes, thus not needing a lot of historical data. Basic grey models filter out the possible random quantities in the original sequence through the accumulation of time series data, look for some hidden regularity from the fluctuating time series, and then get new series with weakened randomness and enhanced regularity, thus mining the inherent characteristics of the original sequence. The method of accumulation or subtraction in grey prediction is easy to produce errors, and the fluctuation of the number of civil aviation incidents and flight time data leads to the partially low accuracy of prediction results.
Markov theory
Markov is a random process with discrete time and state, which is characterized by no aftereffect; that is, the state at time
Grey Markov theory
The grey and Markov models can be used to predict time series. The grey prediction curve shows a monotonic increasing or decreasing trend, which can reflect the overall changing trend. The Markov model predicts the future development direction of the system according to the transition probability of the system state, and the transition probability reflects the influence of various random factors, so it is suitable for the prediction of time series with large random fluctuations. The grey Markov combined prediction model combines these two methods, which can learn from each other’s strong points. The development trend of the 10,000-hour rate of civil aviation incidents can be more accurately predicted by taking the respective advantages of the two models; namely, the grey model can reflect the development of trends with small data sizes, while the Markov model can process data fluctuations.
The grey Markov model was constructed using the following method. Firstly, the GM (1, 1) prediction model was established according to historical civil aviation incidents data in the first
Grey Markov model.
The calculation process of the GM (1, 1) model does not involve other related variables of the predicted variables but only needs to test or process the original data series, so the GM (1, 1) model has been widely used. The specific modeling steps are as follows [29, 30, 31].
(1) The original sequence
Where
(2) The original sequence is accumulated once to obtain a new sequence.
Where
(3) A new sequence GM (1, 1) model is constructed to obtain its differential equation:
Where a and b stand for undetermined coefficients.
(4) A data matrix
(5) The undetermined coefficients a and b are solved by the least square method.
(6) The differential equation is solved and the time response equation of the prediction model is determined.
(7) The original sequence reduction model is established.
Finally, the predicted sequence
(8) Residual test
The residual refers to the difference between the observed value and the predicted value, and the residual test is one of the effective methods for data reliability tests. The specific inspection process is as follows:
The residual is recorded as
According to the relative error, the average relative error of the samples can be obtained as follows:
The feasibility of the prediction result can be judged by the result of the average relative error. If the average relative error is
(1) System state analysis
According to the relative change rate between the actual data and the predicted value of the grey model, the system is divided into
(2) Construction of a transition matrix
After the state interval is divided, the state transition probability is calculated by counting the frequency of state transition, and finally the state transition matrix is obtained.
The frequency matrix for the system transition from state
The system state transition probability is the ratio of each element in the transition frequency matrix to the sum of the elements in the row, denoted as
Where
Generally, the matrix
(3) System state prediction
Knowing the initial state
Where
(4) Calculation of the predicted value
The
Where
The test method of grey Markov prediction results is consistent with that of grey prediction, so it will not be hereby repeated.
Taking the statistical data of civil aviation incidents during 2005–2020 released by the official website of the Civil Aviation Administration of China as an example (Table 1) [32] (CAAC, 2005–2020), the civil aviation incidents in 2021 were predicted to verify the effectiveness of the model.
Relevant data of civil aviation incidents in China during 2005–2020
Relevant data of civil aviation incidents in China during 2005–2020
(1) GM (1, 1) prediction model
From the data during 2005–2020 in Table 3, the original time series regarding the 10,000-hour rate of civil aviation incidents can be known as follows:
A one-time accumulated data sequence is generated to obtain:
The following matrix is established:
The following can be solved:
Namely,
Therefore, the time response sequence is acquired as below:
The calculation results are listed in Table 2.
GM (1, 1) predicted value of 10,000-hour rate of civil aviation incidents during 2005–2020
According to Eq. (9), the average relative error of the grey GM (1, 1) model can be calculated as
(2) System state interval division
According to the basic idea of the grey Markov model, after the grey prediction model is completed, the system state should be divided according to the relative change rate of data. According to the above data of relative change rate, the system states are divided as shown in Table 3.
Division of state intervals
(3) Construction of a transition matrix
According to the content in Table 3, the states of each year during 2005–2020 are divided, and a line chart of the system state transition is drawn, as shown in Fig. 4.
Line chart of state transition during 2005–2020.
According to the system state transition diagram, the transition frequency matrix of one-step state transition during 2005–2020 can be obtained as follows:
Further, the system state transition probability matrix can be calculated as follows:
Similarly, the state transition probability matrix of transition for 2, 3, and 4 steps can be calculated as follows:
(4) Results
Because the state was divided into 4 state intervals, the 4 years closest to 2021, namely 2020, 2019, 2018, and 2017, were selected to prepare the prediction table, as shown in Table 4.
Prediction table of 10,000-hour rate of civil aviation incidents
It could be seen from the above table that the system was most probably at state 2 in 2021, i.e., “decline of predicted value”.
Given that the predicted value of GM (1, 1) in 2021 was 0.570, the corrected value of grey Markov is:
By comparatively analyzing the relative errors produced by the two prediction methods (Table 5), it could be seen that the results predicted by the grey Markov model were more accurate and could provide more accurate decision-making reference for civil aviation safety management and accident prevention.
Comparison between GM (1, 1) and grey Markov model in prediction results
This study mainly aims to establish a prediction model for the 10,000-hour rate of civil aviation incidents in China. Based on the statistical data of civil aviation incidents during 2005–2020, a grey GM (1, 1) prediction model was established, and on this basis, a grey Markov prediction model was established, and the two models were combined to predict the 10,000-hour rate of civil aviation incidents in 2021. The results showed that the predicted value of the grey GM (1, 1) model was 0.5700 with a relative error of 4.95% and that of the grey Markov model was 0.6099 with a relative error of 1.70%, so the prediction accuracy of the grey Markov model was higher than that of the GM (1, 1) model, and the former conformed to data change laws more. To sum up, despite the suitability for not high requirements for the data sample size, the grey prediction model reaches relatively low prediction accuracy when used to cope with arrays of strong fluctuations. The Markov model can better solve the prediction problem in case of great fluctuations. If combined, the two methods can enhance the model accuracy and be more helpful for mastering the change laws of civil aviation incidents, providing a reliable basis for aviation safety management and accident prevention.
Footnotes
Acknowledgments
This work was supported by Science and Technology Planning Project of Henan Province (232102320049, 232102320011, 232102240016) Training Program for Young Core Teachers in University of Henan Province (2020GGJS174).
