Abstract
The rolling bearing fault diagnosis is affected by industrial environmental noise and other factors, leading to the existence of some redundant components after signal decomposition. At the same time, the existence of the modal aliasing phenomenon in empirical mode decomposition (EMD) and the relevant improved algorithms also leads to the existence of many invalid features in the components. These phenomena have great influence on the bearing fault diagnosis. So a rolling bearing bidirectional-long short term memory (Bi-LSTM) fault diagnosis method was proposed based on segmented interception auto regressive (SIAR) spectrum analysis and information fusion. The ensemble empirical mode decomposition (EEMD), the complementary ensemble empirical mode decomposition (CEEMD) and the robust EMD (REMD) algorithms decompose the rolling bearing fault signals, and AR spectrum analysis is performed on the obtained components respectively. By comparing the AR spectra of the components corresponding to different fault locations, the effective AR spectral values are intercepted as the eigenvalues of the data, and finally all the eigenvalues are fused to achieve the purpose of screening effective features more efficiently so as to reduce the impact of feature redundancy caused by mode aliasing on neural network training. Then the Bi-LSTM neural network was used as a rolling bearing fault diagnosis classifier, and the simulation experiments were conducted based on the rolling bearing fault signal data from Case Western Reserve University to verify the effectiveness of the proposed feature extraction and fault diagnosis method.
Keywords
Introduction
As an indispensable part of the industrial production equipment, the rolling bearing mainly exists in extremely harsh working conditions, including high speed, heavy load and the strong impact of the environment, temperature, wind, water spray and other external physical and chemical erosion. Any unexpected mistakes will result in failure of mechanical system. In common rotating machinery, rolling bearing faults account for one-third. Therefore, fault diagnosis of bearings can effectively prevent the serious consequences caused by bearing faults, which is of great practical significance [1]. The fault diagnosis methods of rolling bearings can be divided into two categories: establishing normal state model and analyzing vibration data. The establishment of the normal state model requires higher requirements on its working environment. A slightly unstable environment will affect the uniqueness of the normal model, while the working environment of bearings is greatly affected by uncontrollable factors such as noise, so this method is difficult and not operable enough. As the vibration signals of bearings contain a large amount of fault feature information, the current bearing fault diagnosis is mainly based on the processing of vibration signals. So how to extract effective fault features from vibration signals has become a key issue, which is related to the effect of bearing fault diagnosis [2].
The empirical mode decomposition algorithm proposed by Huang et al. is adaptive to decompose non-stationary and nonlinear fault signals into several eigenmode components, and each component has a specific physical meaning. Therefore, this method is widely used in the field of fault diagnosis. However, it also has some limitation. For example, windowing effect of Hilbert transform leads to errors at both ends of the modulation signal, marginal spectrum peak overlaps, and it is difficult to distinguish fault features and other problems. On this basis, many scholars have improved it so as to make up for the shortcomings of EMD algorithm. Ref. [3] analyzes the boundary problems of EMD decomposition through cosine window function, and the conclusion proves that the fault features contained in each modal component have different influences on the diagnosis effect. In Ref. [4], Hilbert-Huang transform was used to analyze the original fault signals to extract fault features, and support vector machine (SVM) was used to verify the classifier effect, thus achieving fault diagnosis. Ref. [5] improved EMD method through multi-objective optimization so as to extract the rolling bearing fault features for fault identification and classification. But the above fault diagnosis methods have different degrees of modal aliasing phenomenon, that is, a component contained in the failure data of the characteristics of multiple scales, or at the same time scale characteristics are broken down into different components. So the modal aliasing phenomenon will increase the difficulty of fault feature extraction, and have great negative impact on bearing fault diagnosis.
Ensemble empirical mode decomposition was proposed on the basis of EMD, which is a signal analysis method assisted by white noise. The mode aliasing in EMD is avoided by adding Gaussian white noise into the original signal. In recent years, many scholars have applied EEMD and its improved algorithm to the field of mechanical fault diagnosis. Ref. [6] integrates EEMD and optimized support vector regression model to diagnose the fault location of piston pump. In literature Ref. [7], EEMD, correlation coefficient and Hilbert-Huang were used to extract fault features of bearings, and the final experiment showed that this method could effectively solve the mode mixing effect of EMD. The CEEMD was proposed by adding a pair of opposite white noises into the signal to be analyzed [8]. Thus, the decomposition error caused by white noises is reduced, the number of noise sets is reduced, and the computational efficiency is improved. Ref. [9] uses CEEMD to decompose original signals, and proposes a new method to eliminate intrinsic mode function (IMF) components with defective information. The signal is then reconstructed from the sum of the relevant IMF, and the frequency-weighted energy operator is adjusted to extract the amplitude and frequency modulation from the selected IMF. In Ref. [10], EMD decomposition and AR spectrum estimation are combined to be used in the gearbox fault diagnosis. AR spectra of the previous six IMF components of signals in different states were compared, and then EMD-AR spectrum energy eigenvalues were extracted, and the eigenvalues were input into the constructed SVM classifier. The results show that the method can be effectively applied to gearbox fault diagnosis of special vehicles, and the diagnostic accuracy reaches 94.5%. Ref. [11] proposed a new method of combined photovariation period analysis based on EMD-AR spectrum. Firstly, the observed data were decomposed by EMD to obtain the various modal components, and the correlation coefficients between them and the original optical variation curve were calculated. Then, the AR spectrum was estimated by summing up the components with high correlation. Finally, the periodic analysis method of the power spectrum was applied to the optical variation data analysis of celestial bodies. Ref. [12] proposed a fault feature signal analysis method based on the combination of variational modal decomposition (VMD) and AR spectrum. The results show that the fault feature extraction method based on VMD-AR spectrum solves the problem of selecting decomposition mode number and avoids its empirical selection. Although the diagnosis methods proposed in the above literature can suppress the mode aliasing phenomenon to a certain extent, single signal decomposition algorithm is easy to cause signal feature loss and have a great influence on the diagnosis results.
In view of the shortcomings of the above methods, in recent years, many scholars have introduced the means of information fusion into mechanical fault feature extraction, and achieved good results. Ref. [13] presents a method for device fault diagnosis based on failure rate and failure symptoms. Firstly, an algorithm for calculating equipment failure rate was proposed based on Weibull distribution model. Then the probability of failure is evaluated according to the failure phenomenon and a fault diagnosis model based on failure rate and failure symptoms was realized/ Finally, its effectiveness was verified through calculation examples. This method comprehensively considers the failure rate, failure mechanism, failure symptoms, and the acquisition difficulty of failure symptoms, and greatly improves the accuracy of fault diagnosis. Ref. [14] combined with the robustness of deep convolutional neural networks and the characteristics of vibration signals, introduced information fusion technology to enhance the feature representation ability and portability of the diagnostic model. Based on the multi-sensor and narrowband decomposition techniques, a fusion cell convolution structure was proposed to extract multi-scale features from different sensors. It was tested on two data sets. Compared with some existing works, the proposed method had higher generalization ability, which proved the effectiveness of the proposed fusion unit for feature extraction on both source and target tasks.
In this paper, based on different EMD varients, combined with the AR spectrum analysis of modal components and the analysis of AR spectra, the effective frequency band of the intercept value of AR spectrum was selected as the fault characteristic parameter so as to reduce the invalid and redundant features and the influence of mode aliasing on neural network training. A method of information fusion and dimension reduction was proposed, that is, 10 AR spectral values are intercepted as feature parameters by EEMD, CEEMD and REMD algorithms and AR spectral analysis respectively. Then a 30-dimensional feature vector was obtained by concatenating them together, and the top 20 feature values with variance values were screened by low variance. Finally, Pearson correlation coefficient was used to select 10 characteristic parameters with large correlation coefficients as the final characteristic values, and the obtained characteristic values were input into BI-LSTM neural network classifier to verify the effectiveness of the proposed fault diagnosis method. The simulation results show that this method has a better bearing fault diagnosis effect.
Basic principle of AR spectrum estimation
AR model is a commonly used time sequence estimation and analysis method, which has the advantages of high resolution and correct positioning, and can effectively overcome the windowing effect of signals [10–12]. The stochastic process can be expressed as:
Rolling bearing fault diagnosis signal analysis based on EEMD-AR
Basic principles of EEMD algorithm
The EEMD decomposition method mainly adds white noise into the signal according to the characteristic so that the mean value of white noise is zero. In essence, it is an improvement of EMD algorithm, whose core is still the EMD decomposition. Average processing is carried out for the decomposition results. The more times of average processing, the less influence noise brings to the decomposition results, thus inhibiting the influence of mode aliasing. The main decomposition steps are described as follows [15]. Add the random white noise n
i
(t) with a certain amplitude to signal x (t) so as to form a new set of signal x
i
(t).
The EMD decomposition of x
i
(t) is performed to obtain the IMF component. The final IMF component is obtained by averaging the obtained IMF components.
In order to obtain the frequency band of effective AR spectrum value after decomposition of EEMD algorithm and AR spectrum analysis of its components, the following simulation experiments are designed for analysis. Two groups of fault data were selected for analysis. The diagnosis objects were the faults of the inner ring, outer ring and rolling body of the rolling bearing. In order to simulate the early bearing faults, fault data with damage diameter of 0.007 mm were selected for verification. Experimental data information is listed in Table 1 [16].
EEMD-AR analysis experimental data information
EEMD-AR analysis experimental data information
AR spectrum analysis was performed on the previous six IMF components with a large energy ratio which were decomposed by EEMD algorithm to obtain the AR spectrum. Figures 1–3 shows the simulation diagram of EEMD-AR analysis experiments. Figure 1(a) shows the AR spectra of the six IMF components of data 1, and Fig. 1(b) shows the AR spectra of the six IMF components of data 2. It can be analyzed from Fig. 1 that the first three IMF components of the two groups of data have extremely same variation trend, and the AR spectrum range in the corresponding frequency band is almost the same. Among them, the AR spectrum values of IMF1 and IMF2 are in the frequency band of 300–350 Hz, the AR spectrum values of each fault part of IMF3 are in the frequency band of 200–250 Hz, and there is an obvious gap between them. On the other hand, the AR spectrum values of these frequency bands are rolling body fault, inner ring fault, outer ring fault and normal data respectively from top to bottom, which are considered as valid features. Compared with other frequency bands, such as IMF1 in frequency band 150 to 200, the AR curves of inner ring fault and rolling body fault are very close, and even overlap. These eigenvalues are considered as invalid features, which will affect the training of neural network and reduce the accuracy of fault identification, so they should be eliminated. In the AR spectrum of the latter three IMF components shown in Fig. 1, almost all of the frequency bands in AR curves are very close between at least two fault parts, so these components are not directly considered. Figure 2 is the sum of AR spectra of the previous six IMF components. It can be seen from Fig. 2 that the AR spectral values of each fault part have the largest difference interval in frequency band 200–250 Hz and 300–350 Hz [17, 18]. Figure 3 shows the time-domain waveform of fault data.

AR spectra of the previous six IMF components decomposed by EEMD.

Sum of AR spectra of the previous six IMF components.

Time-domain waveform of fault data.
The captured specific frequencies of AR spectral analysis are set as the fault signal characteristic value. In that the interception of band is longer, the characteristics of the reference number is larger, so it can not directly be used for neural network training. There will be a great feature vector dimensions caused by excessive training hard, which will ultimately affect the accuracy of the fault recognition [19]. Therefore, dimension reduction must be performed on the intercepted fault characteristic values. In this part, the mean-low variance filtering is used to reduce the dimension of feature vector. Figure 4 is the flowchart of the EEMD algorithm to extract the characteristics and obtain a larger dimension vector. After obtaining a feature vector with a large dimension according to this process, the average value of the feature values is firstly got based on all the feature numbers in the data set according to the fault category, and a data set matrix with 4 rows and N columns is got (N is the number of feature values of a single sample). Then calculate the square difference of each column of elements, filter out the features with lower variance value ranking, and select the top 10 features as the final feature parameters. Figure 7 is the sum of AR spectra of the previous six IMF components. It can be seen from Fig. 7 that the AR spectral values of each fault part have the largest difference interval in frequency band 275–350 Hz. Figure 8 shows the time-domain waveform of fault data.

Flow chart of feature extraction for EEMD-SIAR analysis.

Sum of AR spectra of the previous six IMF components.

Time-domain waveform of fault data.
Basic principles of CEEMD algorithm
CEEMD algorithm improves EEMD algorithm by adding the N opposite noise signs into the signal and carrying out the EMD decomposition so as to finally obtain the final IMF component after 2 N averaging. CEEMD decomposition process is shown in Fig. 5 and the detailed decomposition process is described as follows [20].

CEEMD decomposition flow chart.
A group of white noise signals with negative numbers are included in the original data to form a group of new signals:
EMD decomposition of The final decomposition result IMF is obtained by averaging IMF1 and IMF2.
Similar to the above EEMD-AR analysis, the following experiments are designed to analyze the effective frequency range of fault data. The experimental data information is shown in Table 1. Figures 6–8 shows the simulation diagram of CEEMD-AR analysis experiments. Figure 6 shows the AR spectra of the first three IMF components. Figure 6(a) shows the AR spectra of the six IMF components in data 1, and Fig. 6(b) shows the AR spectra of the six IMF components in data 2. According to the AR spectra of the previous three IMF components in Fig. 6, two groups of data have extremely same changing trend, and the AR spectrum range in the corresponding frequency band is almost the same. Among them, the AR spectrum values of IMF1 and IMF2 in the frequency band 300–350 Hz and 275HZ to 300HZ respectively have an obvious gap between the AR spectrum values of each fault part, and these parts of the spectrum values are considered as valid fault features. Compared with other frequency bands, such as IMF1 in frequency band 150 to 200 and IMF3 in all frequency bands, the AR curves of inner ring faults and outer ring faults are very close, and even overlap. These eigenvalues are considered as invalid features, which will affect the training of neural network and reduce the accuracy rate of fault identification, and should be eliminated. In the AR spectra of the last three IMF components in Fig. 6, AR curves of at least two fault parts are very close to each other in almost all frequency bands, which is close to coincidence. Therefore, these components are not directly considered.

AR spectra of the previous six IMF components based on of CEEMD decomposition.
Its feature extraction process is similar to that of EEMD. The feature extraction process of CEEMD-SIAR analysis is shown in Fig. 9.

Feature extraction flow chart of CEEMD-SIAR analysis.
REMD is an improved empirical pattern decomposition, which is supported by the soft screen stop criteria (SSSC). SSSC is an adaptive screening stop standard used to automatically stop the screening process of EMD [21, 22]. It extracts a set of single-component signals (called intrinsic mode functions) from the mixed signals. It can be used with Hilbert transform or other demodulation techniques for time-frequency analysis. Similar to the above EEMD/CEEMD-AR analysis, the following experiments are designed to analyze the effective frequency range of fault data. The experimental data information is shown in Table 1. Figures 10–12 show the simulation diagram of REMD-AR analysis experiments. Figure 10(a) is the AR spectrum of the six IMF components of data 1, and Fig. 10(b) is the AR spectrum of the six IMF components of data 2. According to the AR spectra of the first three IMF components in Fig. 10, two sets of data have extremely same changing trend, and the AR spectrum range in the corresponding frequency band is almost the same.

AR spectra of the previous six IMF components decomposed by REMD.

Sum of AR spectra of the previous six IMF components.

Time-domain waveform of fault data.
Among them, the AR spectrum values of IMF1, IMF2 and IMF3 in the frequency band of 300–350 Hz, 130HZ to 170HZ and 75–100 Hz respectively have obvious gap between the AR spectrum values of each fault part, and these parts of the spectrum values are considered as valid fault features. Compared with other frequency bands, AR curves of inner and outer ring faults and rolling body faults of IMF1 in frequency band 220 to 270HZ are very close to each other, and even coincide with each other. These eigenvalues are considered as invalid features, which will affect the training of neural network and reduce the accuracy rate of fault identification, which should be eliminated. In Fig. 10, AR spectra of the last three IMF components in almost all frequency bands have AR curves that are very close to or close to overlap between at least two fault parts, so these components are not directly considered. Figure 11 is the sum of AR spectra of the previous six IMF components. It can be seen from Fig. 11 that the AR spectral values of each fault part have the largest difference interval in frequency band 300–350 Hz, 130HZ to 170HZ, and 75 to 100HZ. Figure 12 shows the time domain waveform of fault data, and Fig. 13 shows the REMD feature extraction process.

Feature extraction flow chart of REMD-SIAR analysis.
Rolling bearing fault diagnosis method based on SIAR and information fusion
Finally, as shown in Fig. 14, the 10 characteristic parameters obtained by three algorithms in accordance with the above process are fused to obtain 30-dimensional characteristic parameters. Then, the first 20 characteristic values of variance are screened through low variance. Finally, 10 characteristic parameters with large correlation coefficients are selected as the final characteristic values by Pearson correlation coefficient [23]. The bidirectional LSTM neural network is used as a classifier to verify its validity.

Flow chart of feature extraction based on information fusion and SIAR analysis.
LSTM is an improvement on RNN. RNN is limited by its limited theoretical performance in dealing with the remote dependence problems due to gradient disappearance and gradient explosion. LSTM uses the gate structure and memory module to replace the hidden layer of RNN. The structure of LSTM unit is shown in Fig. 15 [24]. As shown in Fig. 15, LSTM stores and processes information mainly through three structures: input gate (i), forgetting gate (f) and output gate (o). X represents the input data, h represents the output of data, and C is the memory unit of storage location.

LSTM structure diagram.
LSTM was originally designed with only one-way data processing function, but the bi-directional LSTM was proposed after in-depth study. Bi-LSTM can lock the data in the model so that the current input and the data in the sequence before and after have a certain relationship, and the hidden layer stores the bi-directional sequence information, namely historical information and future information. Its structure is mainly composed of two LSTM models and two hidden layers, as shown in Fig. 16. Equation (7) is the update process of LSTM layer [25].

Structure diagram of bi-directional LSTM.
The updating process of LSTM layer from front to back can be expressed by Equation (8).
In this experiment, the rolling bearing fault signal data from Case Western Reserve University in the United States were used. In order to verify the effectiveness of the segmented AR spectral value-information fusion method described in this paper, the early bearing fault signals, namely damage diameter with 0.007 mm, were selected for effect verification. The experimental data and information are shown in Table 2. In the light of the collected experimental data of time sequence, the bi-directional LSTM neural network was selected as classifier. Initialize the network hidden layer as 50, a maximum 100 training cycle and fault category. A total of four data types respectively are the inner ring fault (1), outer ring fault (2) and rolling body (3) and normal data (4). A total of 400 groups of data with damage diameter of 0.007 mm were selected, 100 groups of data for each category, with sampling points of 1000 and sampling frequency of 48KHz. 400 groups of experimental data in Figs. 4, 9 and 13 after processing are to get a set of data from 400 row 10 column matrix, as shown in Table 3–5. Based on the flowchart shown in Fig. 14, the characteristics of the data set is obtained based on the further fusion process, and the fused data set shown in Table 6 is obtained according to the random disturb.
Experimental data information
Experimental data information
EEMD-SIAR feature extraction results
CEEMD-SIAR feature extraction results
REMD-SIAR feature extraction results
Feature extraction results based on SIAR and information fusion
Figures 17 19 are the simulation test results based on SI-AR with EEMD, CEEMD and REMD, where the red cross represents the actual label of the test samples, and the blue circle represents the predicted values. The simulation results show that there is little difference between the three algorithms. The accuracy of EEMD-SIAR, CEEMD-SIAR and REMD-SiAR for fault diagnosis is 96%, 95% and 92% respectively. Figure 20 shows the simulation test results of information fusion on the eigenvalues under three processed algorithms. The results show that the diagnostic accuracy was 100%, thus further diagnostic accuracy of the information integration of the accurate rate is higher than that of single algorithm after processing. On the other hand, the other fault diagnosis models are compared with the proposed fault diagnosis model.

Simulation test results of EEMD-SIAR (96%).

Simulation test results of CEEMD-SIAR (95%).

Simulation test results of REMD-SIAR (92%).

Simulation test results of SIAR-information fusion (100%).
It can be seen from Table 7, various fault diagnosis models proposed by scholars have achieved good results. For example, Wang et al. proposed a new multi-sensor information fusion method to realize fault classification. In this method, the time-domain vibration signals from multiple sensors at different locations were constructed into a rectangular two-dimensional matrix, and then the improved two-dimensional CNN was used to realize signal classification. The results were verified on the Case Western Reserve University data set, the IMS bearing database and the designed bearing failure test bed data set, and the prediction accuracy was 99.92%, 99.68% and 99.25%, respectively. Compared with traditional fault classification methods such as 1D and 2D CNN, this model can use less data and computational complexity to achieve higher fault prediction accuracy. Qi et al. proposed a new fault diagnosis model based on fault rate and fault symptom, In this method, the information of failure rate, failure mechanism and fault symptom was considered comprehensively, and the information of these fault characteristics was fused, with a correct rate of 87.5%. Li et al. proposed a deep stacking least squares support vector machine model for fault diagnosis, with a correct rate of 99.90%. Konar and Chattopadhyay proposed a continuous wavelet transform model, whose accuracy was only 90%. The fault diagnosis model proposed in this paper was based on the traditional signal analysis algorithms and its varients, which analyzed the fault signal and fuses the extracted features. It can be seen from the results that the prediction accuracy rate of the proposed diagnosis model is up to 100%, which is superior to the fault diagnosis ability of most other models and can be effectively used for bearing fault diagnosis.
Comparison results of rolling bearing fault diagnosis models
In order to solve the problem that there are invalid and redundant components in the decomposed components of the fault diagnosis algorithm, and it is easy to cause the loss of feature information when a single algorithm decomposes the signal to extract fault features. So this paper fragments the AR spectrum value as the eigenvalue of the fault data, and adopts EEMD, CEEMD and REMD algorithms to decompose the signal. The income components will be carried out the AR spectral analysis and the effective AR spectrum values are intercepted as the characteristic value of data. Then the characteristics of three kinds of algorithms about value information fusion avoid lost characteristics in order to achieve the purpose of more efficient screening effective characteristics. Finally it is verified by simulation results of fault data in this paper and the proposed method can be effectively used in the fault diagnosis of rolling bearing.
Although the current fault diagnosis method based on empirical mode decomposition and its related varients had unique advantages in the processing of non-stationary nonlinear signals, the algorithm still has the following problems to be solved. (1) Boundary effect. The boundary effect which is easy to appear in EMD decomposition has certain influence on signal analysis. Moreover, in the fitting process of upper and lower envelope during EMD decomposition, the fitting method is still cubic spline interpolation, which will cause serious boundary effect to pollute the original fault signals in practical engineering applications. As a result, the original fault signal loses some authenticity, which affects the diagnosis result and needs to be further improved to enhance its performance. (2) For the EEMD algorithm, although the addition of white noise and the elimination of invalid extreme points can inhibit the generation of mode aliasing phenomenon to a certain extent, it still cannot completely eliminate this phenomenon, and the process of adding white noise will produce some redundant components without practical physical significance, and there is no very effective method to effectively screen these redundant components.
Footnotes
Acknowledgments
This work was supported by the Basic Scientific Research Project of Institution of Higher Learning of Liaoning Province (Grant No. LJKZ0293 and LJKZ0307), and the Postgraduate Education Reform Project of Liaoning Province (Grant No. LNYJG2022137).
Conflict of interest
The authors declare that there is no conflict of interests regarding the publication of this article.
Author contributions
Cheng Zhong participated in the algorithm simulation and draft writing. Jie-sheng Wang participated in the concept, design, interpretation and critical revision of this paper. Yu Liu participated in the data collection and analysis of the paper.
