Abstract
Reliable degradation prognosis of mechanical components is very important for condition-based maintenance to improve the reliability and reduce the cost of maintenance. This paper reports the development of a fuzzy feature fusion and multimodal regression method for the degradation prognosis of mechanical components. Initially, the raw features from the vibration signals of the mechanical components are extracted. A degradation index is subsequently yielded by merging the obtained features through/using the fuzzy fusion technique. The ensemble empirical mode decomposition is then introduced to decompose the fusion index into several multimodal sub-series to acquire more detailed information. Extreme learning machines are established to predict the sub-series in different modes. The predicted results are obtained by integrating the multimodal sub-results. The reported approach was evaluated with real data from a rolling element bearing. Moreover, two peer models were imported to validate the effectiveness of the proposed method. The experimental results indicate that the reported approach is capable of erecting the degradation index reflecting the bearing degradation and that it had better performance in the remaining useful life prediction than the peer methods.
Keywords
Introduction
Prognosis and health management (PHM) technology [1] is very important for the healthy service of mechanical components. Effective PHM helps to decrease the production downtime and maintenance costs of mechanical components. It can also reduce or eliminate machine faults while providing an accurate degradation prognosis [2]. Therefore, a useful method for predicting the degradation of mechanical components has to be sought.
Fault prediction includes feature extraction, degradation prognosis, and remaining useful life (RUL) prediction. Among these ones, degradation prognosis is a vital technology of fault prediction [3], which is related to the veracity of fault prediction and the accuracy of RUL or fault probability prediction. Therefore, much research has considered fault diagnosis and degradation prognosis of mechanical component lifetime [4–6]. There are also different approaches for degradation prognosis, which has been considered. These methods are usually classified into three categories: model-based prognosis models [7], data-driven prognosis models [8], and hybrid prognosis models [9]. Among these ones, data-driven models have been gaining momentum recently and are becoming the mainstream in the degradation prognosis field [10]. Data-driven models collect signals and data from periodic monitoring of the components and predict their future evolution of degradation [2]. Moreover, data-driven models are relatively easier and provide better results, without the assumption of physical parameters.
Mechanical fault diagnosis and degradation prognosis are normally based on running condition indexes, including vibration, temperature, current and voltage, oil, and wear signals [11]. Among these ones indexes, vibration signals can be used to extract information to diagnose abnormal conditions [12], and thus are more effective and suitable than other indexes at reflecting the running condition of mechanical components [11].
For degradation prognosis, feature extraction is the first crucial step therefore a suitable method is utilized for feature fusion. Finally, an effective prognosis model could be selected to prognose the degradation based on the degradation index.
Feature extraction, which effectively reflects the machine running condition using the original vibration signals, is an important prerequisite for degradation prognosis. Typical methods such as time domain, frequency domain, and time-frequency domain analyses have been developed and reported [13]. After extracting the features, it is necessary the fusion of those features.
A variety of feature fusion methods have been adopted in the research. The kernel joint approximate diagonalization of the eigen-matrices approach was applied for feature fusion [14]. The locality preserving projections was used to fuse features [15]. Combining phase space reconstitution (PSR) with between- and within-class scatter evaluation, a feature-weighted fusion method was proposed [9]. A new fusion technique was used based on fuzzy fusion of multiple criteria [16]. This technique was originally used for fusing the time domain and the time–frequency domain of the bearings to establish a degradation index.
Ensemble empirical mode decomposition (EEMD), an improved method of empirical model decomposition (EMD), has been widely adopted for signal decomposition by many researchers. EMD [17], which was proposed by Huang [18], can decompose complicated signals into several intrinsic mode functions (IMFs) self-adaptively. However, there is a mode mixing problem when using EMD. EEMD [18] was thus developed to overcome the mode mixing problem by adding finite white noise into the original signals. The EEMD is used to improve the accuracy of the prognosis result [15–19].
Data-driven models such as the support vector machine [20], artificial neural network (ANN) [21], and extreme learning machine (ELM) [9] have been used to establish prediction models, such as short-term bus passenger flow forecasting [22] and RUL estimation [23]. Among these ones models, ELM, which was developed by Huang et al. [24], is a novel algorithm of single hidden-layer feed-forward neural networks. With its ability to randomly choose hidden nodes and analytically determine output weights, ELM has an extremely fast learning speed and an advantageous generalization capability. In addition, it has exhibited commendable performance in both classification and regression with a simple structure [9]. Compared with ANN, the training speed of ELM is much faster and achieves better generalization. According the literature review, ELM was selected as the regression model in this work. To validate the availability of the proposed approach, a rolling element bearing, as one representation of a mechanical component, is introduced.
The rest of the paper is organized as follows. The theoretical background of the proposed degradation prognosis method is introduced in Section 2. The degradation prognosis procedure is illustrated in Section 3. Experimental results and comparison with peer models are presented in Section 4. Finally, the conclusions drawn from the presented work are presented in Section 5.
Methodologies
Feature extraction in different domains
Feature extraction has an important influence on the degradation prognosis method in terms of its accuracy and efficiency. In practice, fault diagnosis of mechanical components often uses time domain and time–frequency domain parameters from vibration signals [25–28]. The features extracted from the original vibration signals show accurate information about the initially fault in mechanical components.
In all, 16 features were used in this study. Time-frequency domain can be obtained using several tools [29]. In this paper, eight features were extracted from the time–frequency domain using the wavelet transform package. The remaining eight time domain features (standard deviation, root mean square, variance, maximum, peak-to-peak value, absolute mean value, waveform index, and kurtosis) were extracted using the formulas in Table 1. These features represent the degradation condition of mechanical components from a healthy condition to collapse.
According to Table 1, P is the feature, q is the extracted vibration signal, and W is the sample number for μ = 1, 2, …, W.
List of the original extracted feature
List of the original extracted feature
where E ζ is the energy values of sub-frequency bands obtained by a three-layer wavelet packet transform.
In the fuzzy fusion of multiple criteria method, the cost function is composed of a fuzzy nearness data fusion of kurtosis, smoothness factor, and kurtosis coefficient. It improves the robustness that would identify the optimal resonance band. Therefore, the cost function constructed by the multiple indicator fusion results is adopted in this paper. The mathematical expression is as follows [30]:
The realization of the data fusion function F(.) uses the fuzzy nearness, which is a measurement of the proximity degree of fuzzy sets. A collection of initially minimum spectral subsets is X (f) = {C1, C2, …, C
i
, …, C
l
}. The corresponding set of time domain components is {x1 (t), x2 (t) , …, x
i
(t) , …, xl (t)}. Every index sequence is normalized over the interval from 0 to 1.
Then, to calculate the mean value
To calculate the grand mean
In this paper, the triangle fuzzy membership function is chosen. The fuzzy quantity of each indicator is expressed as follows:
So, the blur quantity of the data fusion cost function is as follows:
The nearness of the post of the mth index fuzzy quantity and the fuzzy quantity of data fusion is:
If the fuzzy quantity of the mth index is closer to the fuzzy amount of the data fusion, the nearness D
m
is bigger. This means that the better the reliability and stability, the greater the weight. Based on this principle, Equation (18) can be obtained by using Equations (9) and (10).
A fusion degradation index is finally obtained:
Huang et al. [31] introduced the EMD method, which allows for adaptive decomposition of non-linear and non-stationary signals. This method can decompose the signal into several narrow-banded IMF components. It is a widely used approach. However, it may incorrectly reveal an intermittent signal’s characteristic information as a result of mode mixing, which may cause the decomposition to render the physical meaning of the IMFs unclear. To overcome this problem, Wu and Huang proposed EEMD [32].
EEMD is an improved version of EMD, and it can improve the prediction accuracy [33]. It solves the mode-mixing problem [34] by adding white noise of finite amplitude fairly. It also uses statistical characterization of the Gaussian white noise to improve the distribution of extreme points in the original signal. The process of EEMD is as follows [35].
In this paper, the EEMD method was applied to perform with an ensemble number of 500 and a white noise amplitude that is 0.2 times the standard deviation of the bearing fuzzy fusion degradation index.

Schematic of mechanical components degradation prognostic model.
Huang presented a simple learning algorithm for single hidden-layer feed-forward neural networks called ELM [36]. Its learning speed was thousands of times faster than traditional feed-forward network learning algorithms such as BP algorithms, and it obtained better generalization performance [25]. The structure of ELM can show that it can randomly assign the weights of the input layers. Moreover, it computes and analyzes the weights of the output layers by a simple generalized inverse operation [37].
Suppose there are k samples (c r , d r ), r = 1, 2, …, k, where c r = (cr1, cr2, …, c rv ) T and d r = (dr1, dr2, …, d rw ) are the rth input feature and its target value, respectively. Then, the model of single hidden layer feedforward neural networks with R hidden nodes can be described as follows:
Standard SLFNs with R hidden nodes with activation function g
r
can approximate these k samples between output o
p
and target s
p
with zero error means. It is represented as:
There is an existing optimal value of β
i
such that
By using the following substitutions:
The output weight β can be written as:
As ELM does not need to iteratively adjust input weights and threshold in training, it decreases the training complexity and improves the training speed. However, its accuracy is still related to feature extraction.
The main procedure of the proposed approach is demonstrated in Fig. 1.
Experimental results and analysis
Rolling element bearings are among the most vital and easily destroyed components in rotating machinery because of their delicate structures, heavy loads, and harsh running environment [30], which makes monitoring them essential. In this section, the reported degradation prognosis method is applied to the bearing. In the aforementioned methods, fuzzy fusion has a capacity preferable to that of feature fusion. EEMD focuses on characteristic decomposition. ELM is an efficient learning algorithm with a fast training speed. An approach validation scheme, which is combined with the preceding three methods, was implemented in this work. The schematic procedure of the degradation mode can be seen in Fig. 1 and it will be explain in the following sections.

Bearing experiment rig and sensors.
With the development of information technologies, the test data for experimentally determining the operating status are easy to obtain. In this work, experimental data from the Prognostics Center of Excellence were used to verify the validity of the degradation prognosis proposed in this paper. They were generated by the NSF I/UCR Center for Intelligent Maintenance Systems with support from Rexnord Corp. in Milwaukee, WI [38]. The bearing experiment rig and sensor placement are demonstrated in Fig. 2.
As shown in the Fig. 2, four bearings were installed on a shaft. A radial load of 6000 lbs was applied to the shaft and bearing by a spring mechanism. The vibration data from each bearing were collected every 10 min with the sampling rate set at 20 kHz. A data collection length of 20480 points was collected by an NI DAQCard-6062E. The rotational speed was set at 2000 rpm by using an AC motor associated with the shaft through rub belts [39]. The raw vibration signals are shown in Fig. 3.

Acceleration of bearing ending with failure roller.

The extracted time-domain and time-frequency domain features.
The mean absolute error (MAE), root mean square error (RMSE), and correlation coefficient (CC) are introduced to estimate the performance of the model.
Only the vibration signals of the bearing were used in this study. Features can represent the change of bearing vibration signals. In this work, a total of 16 features including eight time domain features and eight time–frequency domain features were extracted from the raw vibration signals of the bearing, as illustrated in Fig. 4.
The Fig. 4 shows the 16 original features derived from the raw signals. Undoubtedly, feature extraction is an extremely critical procedure [40]. As observed in Fig. 4, the original features were sensitive in either early or severe failure. The fusion result shows that it is much better than many other indexes in performance of degradation prognosis. Thus, it is necessary to adopt a method for feature fusion.
Feature fuzzy fusion
Feature fusion will increase the amount of useful information available for degradation prognosis and remove some of the inevitable redundant information. Redundant features may reduce the accuracy and efficiency of degradation prognosis. To improve the performance of degradation prognosis, feature fusion is necessary.
The above features are not insensitive to identifying the degradation process, but they are not comprehensive enough to describe the degenerative process. Therefore, it is very important to rebuild the features from the original features to estimate the degradation of the bearing. In this study, the fuzzy fusion method was used to execute the fusion task. The result, a degradation index, was obtained using Equation (19). Figure 5 presents the overall degradation trend of the bearing more clearly. To better describe the failure, the degradation process can be divided into three phases. At the beginning, the curve remains stable, which indicates normal working conditions; this is called the normal phase (Phase 1). Over time, the tendency toward degradation increases gradually. This phase, where the fault starts to occur, is regarded as the failure development phase (Phase 2). At the last phase, the curve sharply increases, which is denoted as the severe phase (Phase 3). Relying on the preceding analysis, the conclusion can be drawn that the fusion index is able to demonstrate more detailed information about the failure process. Additionally, it was found that the normal phase contained less information. It can even be said that Phase 1 made little contribution to the degradation. Phase 1 was therefore removed from the experiment. Phases 2 and 3 (700–984) of the degradation indexes were utilized to model the degradation estimation. In this study, about 65% of the sample sets were used for training; the rest were used for testing.

Degradation index for bearing prognosis.
According to the aforementioned procedure, EEMD, deriving more details and it was subsequently used for the decomposition of the character index. was Seven IMFs and one residual component were ultimately gained; they are exhibited in Fig. 6.
As shown in the figure, the seven individual elements of b1–b7 were acquired with EEMD. In particular, one independent component, r, described the overall tendency of the degradation index in the form of a monotonic function. Overall, eight components were derived after decomposition. As seen in these components, they became stationary and visually regular. The obtained sub-series are exhibited from high to low frequency so that more details are presented. The EEMD technique attenuated the coupling and interruption implied in the data to some extent. Subsequently, multimodal ELM was established in each mode.

The decomposition results of the characteristics index.
To compare the performance of the reported approach, mono ELM was introduced to compare the influence of the multimodal prediction. Furthermore, support vector regression (SVR), which is commonly used as an effective tool in many disciplines [41], was also adopted for comparison. The final results of all of the models are displayed in Fig. 7.
Figure 7(a)–(c) show the respective prediction results. As can be seen, the values from the proposed method and ELM are much closer to the observed values as a whole. Their fitting effect exceeds that of SVR, which is obviously worse among these ones models. SVR conforms to the beginning stage, but not to the whole. In particular, the differences in their results are quite distinct in the prediction of extreme values. It is apparent that the proposed method is superior to the others and exhibits outstanding performance. The preceding comparison reveals that the proposed method is more stationary and has a better generalization capacity.

The prediction results for different models: (a) the proposed method; (b) ELM; (c) SVR.

Comparisons of boxplot of absolute errors using different models.
As shown by the median displayed in Fig. 8, the absolute error mean for the proposed method is the lowest, and the stationarity is outstanding. The results yielded through the proposed method and ELM are shorter than those yielded through SVR when comparing the length of every box entity. This demonstrates that the distributions of the absolute error for the proposed method and ELM are relatively concentrated. Furthermore, the position of the entity for the proposed method is lower than that for SVR. Most of the absolute error of the proposed method is thus within a lower range. Compared with the distance between the median and quartiles, the condition of the proposed method is relatively symmetrical and shows basically normal distribution. Furthermore, none of the benchmarks are balanced clearly. None of the benchmarks are clearly balanced, which indicates that the proposed method is better in terms of the distribution of absolute error. All of the models have some outliers reflecting significant error values. However, SVR has the most outliers among these ones models; the other two have relatively few. This shows the advantage of ELM from another aspect. Based on the preceding analyses, one may experience the advantages of the proposed method as compared with other methods.
Table 2 shows the results of MAE, RMSE, and CC. The proposed method has lower values than those of the other two models. This means that, among the three models, the proposed method has better accuracy and performance than the other two in terms of the various performance criteria. These differences also illustrate that multi-feature fuzzy fusion and multimodal decomposition can better extract essential features of the degradation process.
The comparisons of three models
In this paper, a data-driven prognostics method based on fuzzy fusion and multimodal degradation prognosis is proposed. The objective of this work was to develop a method to estimate the degradation process of mechanical components. Initially, the raw features were collected from the time domain and time–frequency domain of vibration signals for the bearings. Then, the acquired original features were fused into a degradation index using the fuzzy fusion technique. Subsequently, EEMD was adopted to decompose the fusion index into multimodal sub-series. A multimodal ELM was established to estimate the sub-series. Finally, the sub-results of each mode were added into the prognosis results. The adopted method is applied to actual bearing data, and the degradation prognosis results of the bearing are shown and discussed. The proposed approach can achieve a prognosis with better accuracy, and it provides a good method for making a prognosis of the RUL of mechanical components.
Footnotes
Acknowledgments
This work is supported in part by the National Key Research & Development Program of China (2016YFE0132200), and the Chongqing Science & Technology Commission (cstc2015zdcy-ztzx70012).
