Abstract
Predicting the remaining useful life (RUL) of rolling element bearings (REBs) has emerged as a vital technique for guaranteeing the safety, availability, and efficiency of rotating machinery systems. An approach using locally linear fusion regression (LLFR) is developed for the RUL prediction of REBs. The original features, derived from the time domain and time– frequency domain of the vibration signal of the REBs, are extracted first. Utilizing locally linear embedding, the extracted features are then fused into a condition indicator reflecting the entire degradation process. The adaptive network-based fuzzy inference system is then introduced for the RUL prediction. The reported approach is investigated with real REB data. Peer models are employed to validate the performance of the proposed method in this work. The derived experimental results indicate that LLFR has superior prediction ability as compared to the peer models in terms of the introduced performance criteria and that it can obtain more reliable and precise prediction results.
Keywords
Introduction
Rolling element bearings (REBs) are one of the most vital components in rotating machinery. Studies indicate that almost 40% of motor faults are caused by bearing failure. The performance status of REBs directly affects the health of mechanical systems [1–3]. Therefore, there is an emerging need to predict the remaining useful life (RUL) of REBs. Accurate RUL prediction can help avoid problems caused by the failure of REBs such as productivity decline, downtime maintenance, and safety accidents, as well as provide a foundation for performance checking schedules and maintenance strategies of mechanical systems [4–6]. For these reasons, many approaches to RUL prediction have been developed.
Methods currently used in the RUL estimation of REBs can be loosely divided into two categories: model-driven approaches and data-driven approaches. Model-driven approaches depict the rising tendency of a failure mode quantitatively using physical rules and evaluate the RUL by calculating deterministic equations obtained from empirical data [7]. Data-driven approaches are designed to establish a relevant model of the degradation process based on the observed data. It is, however, infeasible to build exact physical models for RUL prediction in practice, especially in an intricate system with various failure mechanisms. Without considering the failure modes, data-driven methods are frequently used because they offer a balance between operability, accuracy, and applicability [8–11]. Therefore, a data-driven method is adopted to predict the RUL in this work.
Usually, the RUL prediction of REBs can be regarded as a regression issue in which a related model is created via the relationship between sensitive and reliable features and the corresponding RUL throughout the lifecycle. As a result, this approach is composed of two main steps: feature construction and regression model establishment.
There are many technologies that can provide systematic, scientific, and efficient strategies for feature extraction. Most often, such statistical calculation methods as time domain analysis, frequency domain analysis, and time– frequency domain analysis are widely adopted [12–14]. However, it is challenging to extract the appropriate features from vibration signals because of abundant noise [15, 16]. Reliable, sensitive, and proper features may better reflect the degradation process of REBs and improve the prediction effectiveness [17]. Unfortunately, some features are sensitive to global failure and others are sensitive to local failure. Moreover, an insufficient number of features will result in insufficient information so that the comprehensive and complete degradation process of REBs cannot be reflected. More features may cause overfitting, in which random error is shown instead of the implicit relationship in the model. Thus, a new method is urgently needed for building appropriate indicator fusing multiple original features while retaining sensitivity and variability. In particular, multi-feature fusion methods offer the possibility of addressing this problem [18]. In this approach, the features originating from the vibration signals are fused into a condition indicator that comprehensively reflects the degradation process of the REBs. Wang et al. [19] used the Mahalanobis distance to fuse original features into a new indicator exhibiting the degradation procedure for degradation assessment and RUL prediction. Liu et al. [20] utilized the joints approximate diagonalization of eigen-matrices to merge weighted features into an advanced indicator for RUL prediction. Li et al. [21] adopted the correlation matrix clustering and weight method to establish a new degradation condition indicator. Wei et al. [22] put forward a centralized and distributed information fusion method to obtain a RUL distribution for minimizing prediction uncertainty. However, these approaches have complicated processes and lower fusion efficiencies. They thus may not be able to reduce the evaluation uncertainty for uncertain degradation processes of a system. As an alternative, dimensionality reduction techniques have been introduced as feature fusion approaches. The principal component analysis (PCA) approach, a typical representation of this technique, has been introduced to integrate original features to effectively predict the RUL of bearings [23]. However, it has been found that the PCA approach may be limited by not regarding the nonlinear properties of the original features. Nonlinear dimensionality reduction methods can better address this issue, especially manifold learning, which can derive intrinsic low-dimensional manifold structures implied in high-dimensional space by analyzing topological characteristics of the data distribution and optimizing the fusion strategy. The essential structure can be effectively exhibited from nonlinear data with the manifold learning approach. Such manifolds as locally linear embedding (LLE) [24], isometric feature mapping [25], and diffusion maps (DM) [26] have been developed. In particular, LLE has the advantages of simple calculations, an easily implemented procedure, and intuitive geometric meaning. As a result, in this work LLE is adopted to merge the original features. A preprocessing technique is introduced to improve the prediction effect with consideration of such adverse influences as noise.
In past years, many data-driven approaches, such as auto-regressive models [27], hidden Markov models [28], artificial neural networks (ANN) [29–31], Bayesian networks [32], support vector regression (SVM) [33–37], the fuzzy system model [38], and stochastic processes [40–42] have been applied to various disciplines, including mechanical fault diagnosis and prognosis. Especially, adaptive network-based fuzzy inference systems (ANFISs), in which fuzzy membership functions and fuzzy rules are derived via the learning of abundant historical data instead of via human experience or intuition, is an efficient way of handling the regression problem [43]. Combining the superiority of both ANNs and fuzzy inference systems, ANFIS presents significantly better self-learning ability, robustness, and adaptability. Soualhi et al. [44] developed an approach that incorporates ANFIS for the prognosis and health management of REBs. Ma et al. [45] introduced degradation process prediction into condition monitoring and employed ANFIS to effectively estimate the RUL of bearings. Chen et al. [46] adopted ANFIS and high-order particle filtering to form a prognostic method of machine condition. In view of these advantages, ANFIS is used as the regression model in this study.
Based on the above-mentioned analysis, locally linear fusion regression (LLFR) based on LLE and ANFIS is proposed for the RUL prediction of REBs in this study. Initially, features showing the operating status are extracted from the vibration signals of the REBs. Second, LLE is adopted to fuse the obtained features into a condition indicator. Finally, ANFIS is employed to predict the RUL. Furthermore, two benchmark models with different fusion methods are used to evaluate the performance of the proposed model. RUL prediction is composed of a degradation estimation and time prediction of the RUL.
The rest of the paper proceeds as follows. Section 2 systematically presents the proposed method. The reported approach is investigated using real data and compared with peer models in Section 3. Section 4 presents the conclusions of the paper.
Methodology
In this section, the systematic methodology of the proposed approach is given. The implementation process of the reported approach, including feature extraction, LLE fusion, and ANFIS, is demonstrated in detail. Moreover, the motivation behind using the proposed method is also explained.
Feature extraction
The vibration signals of REBs consist of random noise and periodical fluctuations. The features from the time domain and time– frequency domain, which have clear physical meaning, efficiently denoise the original signals [47]. To better grasp the degradation characteristics, 15 time domain features are first extracted. Furthermore, the energy values of sub-frequency bands obtained by a wavelet packet transform (WPT) are considered as the timeߝfrequency domain features.
The raw signal, which is composed of M segments of signal with length N, is denoted as
The specification of the features
The specification of the features
Some of the obtained original features contain similar properties and some have partial information of the degradation process. To reflect the degradation process accurately, all original features are fused into a condition indicator for condition estimation and RUL prediction. In this work, feature fusion is achieved by the LLE algorithm.
As a representation of manifold learning, LLE is an efficient tool of dimensionality reduction based on unsupervised learning, especially for nonlinear systems [24]. Mapping data points of high-dimensional space into low-dimensional space, LLE achieves a low-dimensional approximation for the raw data. The steps in feature fusion using LLE are outlined as follows:
As a result, the condition indicator
As mentioned above, a nonlinear regression model is required for degradation evaluation and RUL prediction based on the obtained condition indicator. Integrating neural networks and fuzzy logic rules, ANFIS is a multi-layer adaptive system capable of handling the complexity and nonlinearity [43]. The difference between ANFIS and fuzzy inference systems is that ANFIS adopts a back-propagation method to control errors. In a sense, ANFIS is a combination of ANN and fuzzy logic. Compared with ANN, ANFIS has better self-learning capability, stability, and self-adaptability. Furthermore, high efficiency, a fast convergence rate, and good prediction performance are also reflected in ANFIS. Therefore, ANFIS is used as the regression model in this work.
ANFIS can find optimal solutions with the back-propagation method or with a combination of the least squares method and the back-propagation method to estimate the membership function parameters by combining the two systems [48]. Thus, the degradation estimation yi+1 and RUL prediction R
i
can be expressed as
Two inputs, y1 and y2, and one output, Y, in this system are assumed, as illustrated in Fig. 1. Moreover, a common set of fuzzy if– then rules in the Sugeno fuzzy model is depicted as

The structure of ANFIS.
Figure 1 shows that ANFIS consists of five layers. Let Ou,s represent the output at the uth node of the sth layer. The information of each layer is described as follows:
Having separately described the abovementioned constituents, the LLFR process can be implemented, as illustrated in Fig. 2.

The flow chart of the proposed approach.
Experimental data and performance criteria
The reported approach was investigated using real REB data originating from the IEEE PHM 2012 Prognostic Challenge [49]. To depict the degradation process of the tested REB significantly, an experimental platform was used to provide real data throughout the lifecycle [50]. The experimental platform consisted of three modules: rotating module, degradation generation module, and measurement module (Fig. 3). With a sampling frequency 25.6 kHz and operating conditions of 1800 rpm and 4000 N loads, the vibration signal of the REB was acquired every 10 s. The collected vibration signal is demonstrated in Fig. 4.

The structure of the experimental platform [49].

Raw signal of the REB to be tested.
As seen in Fig. 4, the amplitude changed from stationarity to a gradual increase. The raw vibration signal roughly reflects the whole life process, from normal conditions to complete failure.
In this work, the mean absolute error (MAE), mean absolute relative (MARE), root mean square error (RMSE), correlation coefficient (CC), and equal coefficient (EC) are introduced to assess the performance of the reported approach.
According to the above-mentioned procedure, multiple features derived from time domain and timeߝfrequency domain of the vibration signal were extracted for building a condition indicator. Fourteen time domain features were obtained first. Additionally, the energy values of sub-frequency bands derived by the WPT as the timeߝfrequency features were adopted. The decomposition level of three layer was obtained using the empirical formula [51]. Meanwhile, “db4,” one of most widely used and effective mother wavelets, was introduced for decomposing the raw signal. Eight energy values were acquired. Thus, a total of twenty-three features were obtained in this work, which are given in Fig. 5.

The extracted features for raw signal.
As can be seen in Fig. 5, the results reflected by these features have large differences. Some features, e.g., f3, f18, and f19 were sensitive only to severe failure, but they were unresponsive for the beginning and development of failure. On the other hand, some features, e.g., f4 and f5, were sensitive to the occurrence and development of failure but unresponsive at the complete failure stage. From features f6 and f8, a rough degradation trend can be observed. However, the overall information is noisy. Moreover, irregular trends are reflected in the lesser features such as f1 and f13. Of course, there also are accepted features that can better exhibit the failure process, e.g., f2, f9, and f16. To exhibit the overall condition of these obtained features more specifically, a sensitive degree statistics of these features in different stages is shown in Table 2.
Sensitive degree of the features
Note: 0 = insensitive, 1 = mid-sensitive, 2 = sensitive.
Based on the above analysis, it is shown that none of the features could exhibit all of the information on the degradation process. An ideal condition indicator is desired that contains complete failure information of the different features and maintains consistent sensitivity over changing stages of the degradation process. Consequently, LLE was introduced to integrate the original features scientifically and systematically. In this work, LLE, with the target dimension of one, fused the twenty-three features to be a condition indicator, as shown in Fig. 6.

The condition indicator of multi-feature fusion for REB.
As shown in Fig. 6, the obtained condition indicator had much noise and drastic fluctuations, which affected the evaluation effect and prediction performance. Thus, a denoising technique was introduced to address the issue. In this work, the condition indicator was processed using equal interval sampling of 1 min and median filtering with a window size of 10. The derived result is given in Fig. 7.

The denoised condition indicator.
Figure 7 exhibits a multi-feature indicator, which synthetically reflected the degradation process of the REB. An obvious staggered trend is observed. The running state of the REB can be considered as having three periods: the normal stage (Stage 1), the failure growth stage (Stage 2), and the serious stage (Stage 3).
The curve condition of each stage reflects the current situation. In the Stage 1, the relatively stationary volatility of the curve shows that the REB was under regular working conditions. Accompanying the occurrence of early fault, the REB then began to step into the Stage 2. The gradually increased curve indicates that the failure was continuously developing. The failure growth stage occupies a large proportion of the failure process. This is owing to the fact that the failure behavior takes time to progress from an early fault to complete failure. A sharp increase in the curve shows that the REB entered into the Stage 3, the serious stage. It is quite clear that a complete and comprehensive process is exhibited by the condition indicator. What is more, LLE completed the fusion task excellently.
Before establishing the regression model, a sample set, including training and testing, must be created. A failure process composed of the Stage 2 and the Stage 3 was utilized for degradation estimation and RUL prediction owing to the minimal contribution of the Stage 1 to the failure. Around 60% of the sample was utilized for training and 40% for testing.
To highlight the performance of the proposed model, peer models in which LLE was replaced by either DM or PCA were introduced. The comparison models can be called DM-ANFIS and PCA-ANFIS, respectively. The selection of the input structures of the peer models was the same as that for the proposed method. The optimal structure, which is expressed as yi-G = [yi-1, yi-2, …, yi-g] was determined by trial and error. The results of the degradation estimation are displayed in Fig. 8 for the different models.

The estimation results of different models: (a) LLFR, g = 5; (b) DM-ANFIS, g = 5; (c) PCA-ANFIS, g = 5.
As is observed in Fig. 8(a) and (b), the predicted curve of LLFR and DM-ANFIS better coincided with the variation of the observed curve. This reflects the better fitting performance of the manifold learning. More specifically, LLFR exceeded DM-ANFIS in local capture. The worse results for PCA-ANFIS are in sharp contrast to the results for the other models in Fig. 8(c), and suggest shows that manifold learning is superior to PCA in multiple feature integration, and that the effect of LLE was the best. The ability to estimate extreme values is a crucial criterion for measuring model performance. In contrast to Figs. 5(a), 8(b) and (c) show that the predicted extreme values of the reported model more closely approach the observed values. LLFR has the ability to capture the extremum. However, the benchmark models show obviously deviation, especially in the evaluation of the serious stage. To all appearances, the reported approach is superior to the peer models.
To further exhibit the performance of the models in detail, the absolute relative errors yielded by each model are given in Fig. 9. It is clear that the LLFR model has the best stability among these models. The others show obvious drastic fluctuations.

Comparisons of absolute relative error using different models.
Based on the adopted performance criteria, a quantitative analysis is additionally given in Table 3.
Comparison of the estimation performance
In the table, the superiority of the reported model is clearly exhibited in terms of the CC, MARE, and EC. It is also shown that LLFR has a better effect and accuracy compared with the benchmark models. Moreover, observing the CC and EC, the preponderance of manifold learning is displayed once again. Therefore, it is concluded that LLFR can significantly merge inherent characteristics of the data, remove the interference of random elements, and obtain a good result.
After the degradation estimation of the REB, RUL prediction was performed using each model. The results are shown in Fig. 10.

The RUL prediction results of different models: (a) LLFR, g = 5; (b) DM-ANFIS, g = 2; (c) PCA-ANFIS, g = 2.
Figure 10(a-c) demonstrate the results of the RUL prediction using LLFR, the DM-ANFIS model, and the PCA model, respectively. At the beginning of the RUL prediction, the LLFR results coincide with the observed RUL. The comparison models have larger fluctuations from the start. Hence, the reported model has the best representation in the beginning stage, whereas the peers yield large errors. As time goes on, the comparison models gradually deviate from the observed values. In contrast, the LLFR model traces the RUL trend continuously, and the obtained predicted values converge to the observed RUL. Thus, among these models, LLFR furnishes more precise prediction results and exhibits the best performance. Furthermore, the results of DM-ANFIS are slightly better than those of PCA-ANFIS. However, these models fail to provide ideal results in the serious stage. Thus, the performance of the LLFR model is superior to that of the peer models in this stage.
Based on the above analysis, the fusion ability of LLE outperforms the comparison fusion methods, such that random errors of the stochastic process implied in the raw features are weakened greatly. A statistical analysis of absolute error is demonstrated in Fig. 11 using a boxplot.

The boxplot of absolute error using different models.
Overall, the boxes corresponding to the proposed model are in the low range of absolute error, whereas the competing models are in relatively high ranges. This indicates that the reported model has the best comprehensive performance among these models. Comparing the heights of the boxes, which reflect the distributions of the absolute errors, it is shown that LLFR has a relatively concentrated distribution of errors, as shown by its shortest box height. Observing the position of the median lines, the reported model is the best stationary in that the mean values of the absolute errors are the minimum. Thus, LLFR outperforms its comparison models.
Visual results are shown with the above quality analysis. Naturally, a quantitative comparison was performed for reflecting more specific information. The prediction performance comparison of the three competing models is given in Table 4.
Comparison of the prediction performance
Table 4 compares the performance for the different models. Comparing these criteria, the advantages of the LLFR model are displayed based on the RMSE, MAE, and EC. It is found that the reported model is superior to the others in terms of prediction accuracy. The related RMSE, MPE, and EC differences between LLFR and DM-ANFIS are 55.36%, 62.02%, and 12.98%, respectively, and between LLFR and PCA-ANFIS the differences are 60.04%, 66.09%, and 17.10%, respectively. These results also demonstrate the effectiveness of the proposed method.
The condition of REBs is extremely important in rotating machinery. A reliable RUL prediction can effectively improve the implementation of condition-based maintenance for REBs. A locally linear fusion fuzzy regression approach is herein developed to predict the RUL of REBs. In the proposed method, the LLE technique is adopted to merge original features obtained from the vibration signal of the REBs to create a condition indicator. The condition indicator is then used by ANFIS to estimate the RUL. The reported method was applied to experimental data from an REB. The experimental results revealed that the reported approach demonstrated better performance than competing approaches, and can provide more reliable and accurate prediction results. It was able to accurately predict the RUL of the REB. The primary innovation of this work is the proposed predictor following a philosophy that the comprehensive information can be obtained by using multiple feature fusion. In particular, the fusion method can be replaced with other methods in the implementation process. Thus, the approach may be generalized in some extent. Further work to improving the prediction accuracy should be undertaken in future.
Footnotes
Acknowledgments
This work is supported in part by the National Key Research & Development Program of China (2016YFE0132200), the Project of Chongqing Science & Technology Commission (cstc2015zdcy-ztzx70012), Fundamental Research Funds for the Central Universities (106112014CDJZR095501), and Chongqing Research Program of Basic Science & Frontier Technology with Grant (cstc2017jcyjB0305). The valuable comments and suggestions from the editor and the two reviewers are very much appreciated.
