Abstract
Fault diagnosis plays a crucial role to maintain healthy conditions in rotating machinery. In real industrial applications, a Machine Learning based Classifier (ML-C) analyses data from a current machinery condition to detect abnormal behaviours. Usually, this is achieved through a previous training of the ML-C model, under supervised learning; however, for new machinery conditions, the classifier is not able to correctly identify these new condition. This paper proposes a framework to detect new patterns of abnormal conditions in gearboxes, that could be associated to new faults. The framework relies on an algorithm to build evolving models in simultaneous scenarios of classification and clustering. The design is inspired by the main principles of the K-means and the One Nearest Neighbour (1-NN) algorithms. A heuristic metric is defined to analyse the new discovered clusters; as a result, these new clusters can be labelled as new classes corresponding to new faulty patterns. Once a new pattern is identified, the associated data feeds a dedicated supervised classifier which is updated through a new training phase. The proposed framework is tested on data collected from a gearbox test bed under realistic conditions of faults. Experimental results show that the algorithm is able to discover new valuable knowledge than can be identified as new faulty classes.
Keywords
Introduction
Fault diagnosis, in rotating machinery, is classically addressed by data driven classification, under a supervised learning [6]. Several works follow this approach for fault diagnosis in rotating machinery, such as the recent works in [3, 16] where deep learning and deep convolutional neural network are used for feature fusion and extraction, and Support Vector Machine (SVM) is used for classification. A multidimensional hybrid intelligent method for gear fault diagnosis is reported in [24], where several classifiers are trained and the classification result is combined with a genetic algorithm to get a better performance. The proposal in [22] uses multi-class SVM for rolling bearing fault diagnosis. In [14, 27], supervised approaches combining optimization process are developed, to obtain classifiers with high separability and compactness of different classes.
Unfortunately, supervised approaches for data driven fault diagnosis are not fully useful in real industrial applications, because the knowledge of the all faulty modes is hard to characterize for building a good fault classifier. When a classifier analyses an unlabeled sample, this sample is assigned to the nearest known class. This classification is wrong as the real machinery condition was not considered in the classifier training phase. For this real case, a procedure based on knowledge extraction is needed to detect whether a new sample could be a new pattern.
Semi-supervised learning is used to exploit unlabeled data in the field of fault diagnosis. The work in [28] analyses labeled and unlabeled data to build classification models for fault detection based on SVM under a semi-supervised scenario. The approach in [25] uses manifold regularization based on semi-supervised learning in order to detect faults by using labeled and unlabeled data. A recent work also addressing the semi-supervised approach by using dictionary learning in fault diagnosis of rotating machinery is developed in [12]. Semi-supervised fuzzy C-means cluster analysis for fault diagnosis is performed in [4].
On the other hand, approaches for discovering faulty patterns in rotating machinery have been also reported. In [18], a neuro-fuzzy system is used to discover knowledge rules from vibration signals associated with faulty bearings. A rule interpretation process allows detecting new faulty behaviours. The proposals in [1, 26] develop unsupervised approaches for fault detection and diagnosis with cluster analysis of vibration signals. Clustering techniques are mainly studied in these types of problems due to their skills of finding hidden patterns in the data with unknown symptoms [8]. Unsupervised learning algorithms are extended in [13] for machine defects classification, based on some improvements of local and global regressive mapping for machine fault diagnosis.
The knowledge discovery is one of the main focus in semi-supervised and unsupervised approaches for data driven applications. Under unsupervised learning, different approaches aim at discovering new clusters that might be associated with new unknown classes. This type of learning is commonly called ‘partial supervision’ [21]. From the semi-supervised strategy, the most common approaches are oriented to: (i) generate a model with labelled samples, and then evaluate the unlabelled samples over the model [25], and (ii) build an unsupervised model with the unlabelled examples, and the clusters are analysed by a supervised scheme to define the classes [23, 28]. Recent works have also included evolving properties on the clustering and classification algorithms to reconfigure the cluster and classes structures under on-line and data stream scenarios [2, 17]. The main idea is the control of the clusters formation for new knowledge discovery.
This work deals with the problem of new faulty pattern discovery from unlabelled samples that are collected from monitoring process, by proposing a semi-supervised framework for building fault classifiers that are updated in an on-line manner. This is accomplished by using a new Heuristic Algorithm for Evolving Models in scenarios of Classification and Clustering (HHA-EMCC). HHA-EMCC is able to combine the supervised and unsupervised concepts of 1-NN and K-means, under some criteria. HHA-EMCC follows some guidelines to build classes with labelled data and to discover new patterns from the unlabelled data. These new clusters are evaluated by a metric, which identifies whether the clusters are new classes, i.e. new patterns, or they are composed by inconclusive samples or noisy samples. A dedicated supervised classification model runs in parallel to the HHA-EMCC, and it is constantly updated when new clusters are discovered, i.e, new patterns feed the classification model for a new training phase. Inconclusive samples are also treated by the classification models, as it is assumed that all the new incoming unlabelled samples belong to the current operational condition. The proposed framework is applied to detect new faulty conditions in gearboxes. Results show that new clusters associated to new faulty condition are adequately identified.
This paper is organized as follows. Section 2 introduces the main characteristics of HHA-EMCC to build hybrid evolving models. Section 3 details the proposed semi-supervised framework. Section 4 presents the experimental procedure to collect the dataset that will be used to test our proposal. Section 5 evaluates our proposal in different scenarios, and addresses the results analysis. Finally, Section 6 concludes this work and outlines future research directions.
Background
HHA-EMCC is an algorithm inspired by the proposal in [19]. This algorithm aims at combining the principles of 1-NN and K-means approaches to solve: (i) classification tasks with labelled samples, (ii) clustering tasks with unlabelled samples, and (iii) both classification and clustering in the same scenario with labelled and unlabelled samples. These three main features are in the same ML model which in turn will be composed by clusters or classes, or both. In general, clusters and classes will be denoted as group. In this sense, HHA-EMCC can create hybrid models with classes and cluster simultaneously.
Let X ={ (x1, y1) , ⋯ , (x
n
, y
n
) } be the available dataset, with x
i
a sample of m-dimensional features and y
i
defined by the expression in (1).
where l i is a label.
HHA-EMCC design allows the model to reconfigure its structure (if needed) in presence of a new incoming sample. More precisely, the core micro-processes for training and testing the model remain even when the model is already working in an on-line manner. The steps to build and reconfigure the model are: (i) in an off-line manner, the training set will provide some parameters to the model in order to guide the process, (ii) the input data, either the training set or an incoming sample, is evaluated by a sample assignment process which decides whether the sample belongs to an existing group or a new group has to be created, (iii) after a sample is assigned to a group, the next step is to find the group similarity with its neighbour group. This evaluation can activate the merging or the split process of the current groups in the model.
Figure 1 shows the workflow, where the main result of the training phase is a model composed of groups (clusters and/or classes). The result from either the sample assignment, split or merging process modifies the model (these connections are standing up in blue rows).

Workflow of the proposed algorithm.
Definitions and functionalities of the HHA-EMCC were already given by the authors in [19], in which the workflow in Fig. 1 was used to cluster similar features, also called attributes. HHA-EMCC principles remain the same as presented in [19] with the following differences: (i) features are replaced by samples, (ii) the similarity between samples is computed trough the Euclidean distance. The most important characteristics of each component in Fig. 1 are described as follows.
The main K-means and 1-NN basis are combined to find the similarity between an incoming sample and the classes or clusters in the model. Given a sample x
i
, the similarity is computed through three procedures in the following order: The centroid similarity proposed by K-means is used to assign the sample x
i
into a provisional group P
r
. This allows identifying the region where the nearest sample, and neighbour group to x
i
may be located. 1-NN is used to locate the nearest sample Thereafter, the final assignment is performed in the proper group P
j
which could be P
r
or P
v
. P (x
i
) = P
j
if the distance between x
i
and its nearest neighbor The distance, between the groups where the sample was finally assigned and its neighbours, is compared with reference values in order to perform group merging, group split or new group construction.
After applying the previous procedure, the results of the sample assignment are: (i) an unlabelled sample assigned to a class is labelled with the class label, (ii) groups composed by labelled samples represent a class, (iii) groups composed by unlabelled samples represent a cluster, (iv) a labelled sample assigned to a cluster makes the cluster becomes a class, and (v) a labelled or unlabelled sample, that is not assigned to any existing group, creates a new class or cluster, respectively.
Merging process
As proposed in [19], two neighbour groups should be merged, i.e. class-cluster or cluster-cluster, either because a low distance or high density exist between their borders. The previous scenario can be detected through the use of: (i) a distance metric to measure the proximity between groups, and (ii) a density metric to measure the proportion of elements in the overlapped area, if any. The merging between two classes is not allowed by our proposal. When successive assignments of samples with another label are placed into a class, it might imply a data problem commonly associated to unbalanced data or overlapping between classes. This case is not studied by this work.
Split process
As proposed in [19], this scenario appears when a new sample x i does not belong to its provisional group P i , but it is assigned to its neighbour group P j . After verifying some conditions, it can be concluded that some neighbours of x i in P i may be closer to P j than to P i . This is verified by measuring the similarity between the neighbours of x i to P j . At the end of the split process, some samples initially reassigned to P i are now assigned to P j .
Example
An example of the HHA-EMCC reconfiguration capability is presented in Fig. 2. Gaussian mixtures are used to create an artificial dataset whose samples, features, classes and variance are previously defined. The original dataset has four well-defined classes as shown in Fig. 2(a). Labels 1 and 3 were removed from the samples, and the model was trained with only labelled samples in classes 2 and 5; next, the unlabelled samples are evaluated. According to Fig. 2(b), HHA-EMCC was able to create the original classes 2 and 5 and new clusters that represent the original classes 1 and 3 (clusters 10 and 41, respectively). In addition, HHA-EMCC creates other clusters with the samples placed in the border of the clusters 10 and 41 (the clusters 33, 130, 133, 135 and 136). The samples in the mentioned clusters, that originally belong to classes 1 or 3, are not recognized by the algorithm in the corresponding cluster 10 and 41, but they are associated to noise or anomalous behaviours in the case study.

(a) Classes from an artificial dataset created by using Gaussian mixtures. (b) Classes and clusters generated by the proposal: training with labelled samples corresponding to classes 2 and 5 and, testing with unlabelled examples corresponding to classes 1 (cluster 10) and 3 (cluster 41), and other clusters associated with noise or anomalous behaviour.
In this section, the framework for discovering new patterns for fault diagnosis is presented. The main objective is to have an extended diagnoser with new discovered classes. Figure 3 overviews the activities performed by the framework. HHA-EMCC is the core of our proposal, and it allows detecting new knowledge in fault diagnosis through cluster constructions. This diagram covers: (i) the historical data analysis through feature extraction and feature selection, that is essential for applying HHA-EMCC, (ii) the training and re-training phases to build the HHA-EMCC-based hybrid model and classification-based diagnoser, (iii) the iterative process for knowledge discovery, until all the new unlabelled monitoring samples are processed. The short acronym HACC in Fig. 3 states for Heuristic Algorithm for Classification and Clustering.

Activity diagram of the proposed framework.
To apply our framework, two assumptions are stated:
The main components of the framework are detailed as follows. Section 3.1 gives the guidelines to prepare the data for the hybrid process. The training (or re-training) phase is discussed in Section 3.2. In Section 3.3 the hybrid process for discovering new knowledge will be fully detailed.
This phase is composed of the data acquisition process, feature extraction and feature selection. The data acquisition and feature extraction processes depend on the particular application domain. In this work, the data acquisition is oriented to measure vibration signals for rotating machinery, and the feature extraction is devoted to compute statistical parameters from vibration analysis on time domain, frequency domain and time-frequency domain, as presented in Section 4. Next, the RF algorithm is used to perform the feature selection process. RF is a well-known classifier that is also used for feature selection. RF detects the most significant variables, called important or significant variables, through a metric that measures the information degree contributed from each feature to all the classes. If the information degree is high, the feature is more significant [10].
Training phase
Once the labelled dataset L is available after the data analysis phase, a training process is accomplished by using HHA-EMCC and the RF classifier. Two models are built: (i) one hybrid model for discovering new patterns, and (ii) other one for classification (diagnoser). This component must run at the beginning, and it aims at having the initial setting of the classification models, according to
Knowledge discovery phase
The knowledge discovery phase is activated when new unlabelled samples are available from the monitoring process in the working phase. The best features, identified previously in the data analysis phase, are extracted from the new unlabelled samples, and the new samples are stored in the dataset U. The core of the knowledge discovery phase is the HHA-EMCC described in Section 2. This phase performs an iterative process that runs until the entire new dataset U is processed; consequently, it is necessary to read each new behavior i stored in U, at each iteration. The
The heuristic metric H can have the following values for decision-making: (i) H k ≈ 1, then a cluster k is defined as a new class because that means the cluster is well-formed and it has a high density compared to other clusters and classes, (ii) H k ≈ 0, the cluster and its samples are targeted as noisy, and (iii) H k ≈ 0.5, the cluster and its samples are marked as inconclusive data. Finally, after applying the metric H k , the results in the knowledge discovery phase are: (i) diagnosis of those samples that the framework has assigned to a class, (ii) new labelled data belonging to the class newClass i that was identified as new knowledge (dense clusters), (iii) inconclusive data associated to samples, which belong to low-density clusters, and (iv) noise data that is neither dense or low-density cluster.
Once new groups are obtained, the known patterns are processed by the classifier to make the final diagnostic decision. The new labelled samples (newClass
i
), extend the labelled dataset for a re-training phase to include this knowledge in the classifier model. From here, and based on the
Pseudo code of the knowledge discovery phase
This section describes the experimental test bed, and the procedure to build the raw dataset that will be used by our proposal. The feature extraction process transforms signals into condition parameters in order to be used by the ML algorithms. Section 4.1 details the procedure to collect these raw signals from a test bed, and Section 4.2 describes the feature extraction process to obtain the dataset L. More detailed information about the test bed and faults related to this case study are given in [16, 20].
Measurement procedure
Figure 4 shows all the elements that are placed in the experimental test bed. The gearbox structure is depicted in Fig. 5, two spur gears Z1 and Z2 –with number of teeth 53 and 80, respectively, modulus 2.25 and impact angle 20°– were installed on the input and the output shafts of the gearbox. A three phase motor generates the rotation motion, this motor is supplied with 220 V at 60 Hz and nominal speed 1650 rpm. The motor was used with an adjustable-speed drive to generate different constant speeds at 8 Hz, 12 Hz and 15 Hz. Variable speed was also considered over ranges at 5–12 Hz, 12–18 Hz, 8–15 Hz. The measure procedure starts when a motor torque motion is transmitted to the gearbox; as a consequence, a second torque is produced and transmitted to a pulley. The pulley is part of a magnetic brake control for the load regulation according to voltage inputs on 10 V and 30 V. The main objective is collecting several samples of the vibration signals for each faulty condition, including the healthy one (without fault), under different operational conditions of the motor speed and load [20]. This is achieved through the PBC IEPE accelerometer with a sensitivity of 100 mV/g, that is vertically placed in the gearbox case. The data acquisition system was performed with the NI CompactDAQ-9191 of National Instruments and the module NI 9234 which is inserted in the DAQ slot. The data acquisition was developed in our laboratory over a NI LabVIEW environment.

Experimental test bed for feature extraction from vibration signals associated to several fault conditions in gears.

Gearbox structure for fault analysis.
Table 2 describes the different induced faults. These faults correspond to particular physical conditions of the device, such as healthy operation, moderate or severe faults.
Gear fault conditions
In the previous step, 900 raw vibration signals have been collected, 90 samples for each fault condition in Table 2. This phase processes the signals in order to extract the most common features that describe all the condition states. Therefore, 817 features were obtained from three different domains: Time domain parameters In this experimental setting, seven classical statistical features were obtained over all the signal length, for each sample in time domain, such as: Root mean square (RMS), energy operator, crest factor, mean, standard deviation, variance and skewness. Frequency domain parameters The raw vibration signals were transformed into frequency signals using Fast Fourier Transform (FFT), as follows: (i) two signals on decade frequency domain, one with dimensionless magnitude and other one with decibels magnitude, (ii) two signals on octave frequency domain one with dimensionless magnitude and other one with decibels magnitude. These four frequency domain signals were split into bands: 80 bands in case (i) and 15 bands in case (ii). Statistical parameters such as RMS, mean, standard deviation and kurtosis were calculated for the two signals in case (i) and mean, standard deviation and kurtosis were calculated for the two signals in case (ii). As a result, 730 features were obtained with this analysis. Time-frequency domain parameters Wavelet Packet Decomposition (WPD) is applied over the raw vibration signal in time domain, using five mother wavelets: Daubechies (db7), Symlet (sym), Coifier (coif4), Biorthogonal (bior6.8) and Reverse Biorthogonal (rbior6.8). These mother wavelets are used due to the good performance that they have exhibited in machine learning classification [5, 7]. WPD is performed until four levels for each mother wavelet, then 24 coefficients are obtained for each one. Finally, 80 features associated to the energy parameter are extracted.
Results and analysis
This section presents the results after applying the framework for knowledge discovery in fault scenarios of gearboxes. The experimental test shows that our proposal can deal with this scenario.
Data analysis phase
Data acquisition and feature extraction were presented in Section 4 for obtaining the raw dataset. Feature selection is accomplished in two sequential steps: (i) correlation analysis was developed to reduce the redundancy and retain the low correlated features, according to a specified threshold, (ii) a Random Forest (RF) algorithm was applied to measure the relevance of the low correlated features, under complete and partial information. The latter is considered due to a few set of known classes are usually available in real cases, e.g. some common conditions such as the normal operation and few faulty states previously identified from historical analysis.
In the step (i), more than 400 features with a correlation higher than 95% were deleted, and only 330 features from the original dataset were retained. In the step (ii), under complete information scenario, the entropy based information degree was computed by applying the RF algorithm over the complete database with 330 features and 10 fault classes. The feature selection was accomplished considering three thresholds for the entropy values: higher than 40%, 50% and 60%. Then, 29 features were selected with entropy higher than 40%, 14 and 9 features for 50% and 60%, respectively. Under partial information scenario, the samples with labels f1, f5, f6 and f8 have been selected as the known classes (see
In order to evaluate this selection, several classification models have been trained by considering the previous feature selection. Table 3 shows the classification accuracy is slightly decreased with 8 features selected under partial information, regarding the accuracy obtained with 14 features selected under complete information. Then, the 8 features selected under partial information were considered to the next phase.
Accuracy values by using different classifiers for diagnosing 10 faults
Accuracy values by using different classifiers for diagnosing 10 faults
As described is Section 3.2, this phase is executed at first, to set the initial configuration of the classification models, according to
Once the initial setting is accomplished, both classification model are ready to process new unlabelled data in the next phase. HACC based model is able to discover new patterns different from the current known classes, in this case RF based model is trained again to create a new classifier updated with the new discovered pattern.
Knowledge discovery phase
This section details the experiments to test the ability of our approach for discovering new knowledge, by processing the available dataset which has been split in the set L of labelled samples and U of unlabelled samples. The following scenarios are assumed: (a) Two initial classification models are available from the first training with the labelled data in the set L, related to known faults, (b) The remaining faults are considered as unknown classes, and their unlabelled samples are in the set U, see Fig. 3. They are new patterns that could be discovered from the monitoring process, (c) Samples related to certain fault conditions will be analysed as a group of samples, according to
The results after running the framework are presented in the following items. Each item is related to a set of samples i in U
k
(i), k = 1, . . . , 4, where k defines a new working phase, for a new condition of the gearbox, in the data set U (i). All the figures are in the 3-Dimensional (3-D) space after applying PCA over the 8 selected features. Experiment with k = 1: This experiment considers that the gearbox is in the faulty condition f2, and a set of unlabelled samples U1 (i), i = 1, . . . , Tr, related to this condition is available. The sample distribution is shown in Fig. 6(a), with the known dataset (classes) and the unlabelled dataset in L and U1 (i), respectively. Once the framework is executed, the results are the classes and clusters in Fig. 6(b). HHA-EMCC created correctly the classes associated with faults f1, f5, f6 y f8, and new clusters with the new condition were also identified. The heuristic metric in Equation (5) was able to label one of them as newClass0 corresponding to the new discovered condition f2. The clusters 5 and 12 are marked as noisy clusters, and their samples are excluded in the next experiments. The new labelled samples feed the dataset for training the RF classifier with this new knowledge (see Fig. 7(a)). The classification accuracy was around 0.9924. Experiment with k = 2: This experiment considers the gearbox is in the faulty condition f7, and a set of unlabelled samples U2 (i), i = 1, . . . , Tr, related to this condition is available. The result obtained from the previous experiment are included as a new class, and the dataset L is updated. Figure 7(a) shows the known dataset (labelled samples) in L, and the unlabelled dataset in U2 (i). After running, HHA-EMCC efectively detects a new behaviour creating a new cluster with the unlabelled samples, according to the heuristic metric in Equation (5); then, they are labelled as newClass1, (see Fig. 7(b)). There are neither inconclusive data nor noisy data in this experiment. The RF classifier is retrained with this new class, and the classification accuracy was around 0.9875. Experiment with k = 3: This experiment considers the gearbox is in the faulty condition f9, and a set of unlabelled samples U3 (i), i = 1, . . . , Tr, related to this condition is available. Figure 8(a) shows the current known classes in L including those discovered in the previous experiments, and the unlabelled dataset in U3 (i). HHA-EMCC generated the clusters shown in Fig. 8(b). Notice that the unlabelled samples were segmented into 5 clusters (with id 7, 8, 9, 19 and 13) with low density, and one cluster that is considered a new class associated with the unknown condition f9. After evaluating the heuristic metric H
k
in Equation (5), the new class is labelled as newClass2, and the samples assigned to the clusters 7, 8, 9, 19 and 13 are identified as inconclusive data. RF classifier is trained with the extended dataset including the new class, and the inconclusive data is evaluated for possible labelling. L is extended for the next experiments with the results given by the classifier for each sample in the inconclusive data. According to the Assumption 2, these samples are labelled as “newClass2” also, if apply. The classification accuracy, after evaluating all the inconclusive data, was around 0.9718. Particularly, for the inconclusive data in U3, 36 samples were correctly classified, 4 samples were incorrectly classified and the accuracy was around 0.9. Experiment with k = 4: The same procedure was addressed by considering the gearbox in the fault f10, then the dataset U4 (i) is related to samples associated with this faulty condition. HHA-EMCC was able to create a new class labelled as newClass3, and four clusters (with id 8, 12, 13 and 14) have been identified as inconclusive data. The RF classifier was trained with this new class, and the global classification accuracy was around 0.9575. Regarding the inconclusive data in U4, 9 samples were correctly classified, 21 samples were incorrectly classified and the accuracy was around 0.3. We can see from the original data and the results in the Fig. 9(a) and (b), that the low accuracy is because most of samples in U4 (i) are overlapped with the newClass2. On the other hand, they do not met the similarity criterion in HHA-EMCC for being included in newClass2. This case reflects that HHA-EMCC-HM works better than the RF classifier to discriminate two different classes.

(a) Known classes corresponding to faults f1, f5, f6 y f8) and the unlabelled dataset corresponding to fault f2, experiment k = 1. (b) Resulting classes and clusters after applying the proposed framework: new class newClass0 identified from the unlabelled samples corresponding to fault f2, and clusters 5 and 12 associated with noisy data.

(a) Known classes corresponding to faults f1, f5, f6, f8 and newClass0, and the unlabelled dataset corresponding to fault f7, experiment k = 2. (b) Resulting classes and clusters after applying the proposed framework: new class newClass1 identified from the unlabelled samples corresponding to fault f7.

(a) Known classes corresponding to faults f1, f5, f6, f8, newClass0 and newClass1, and unlabelled dataset corresponding to fault f7, experiment k = 3. (b) Resulting classes and clusters after applying the proposed framework: new class newClass2 identified from the unlabelled samples corresponding to fault f9, and clusters 7, 8, 9, 19 and 13 associated with noisy data.

(a) Known classes corresponding to faults f1, f5, f6, f8, newClass0, newClass1 and newClass2, and unlabelled dataset corresponding to fault f10, experiment k = 4. (b) Resulting classes and clusters after applying the proposed framework: new class newClass3 identified from the unlabelled samples corresponding to fault f10, and clusters 8, 12, 13 and 14 associated with noisy data.
This work presents a semi-supervised framework for finding new patterns that could be associated with new faulty conditions in data-driven fault diagnosis applications. The core of the framework is HHA-EMCC, which is an algorithm that creates on-line evolving models to discover new classes or clusters when new incoming samples feed the current structure. Some new clusters can be associated to a new pattern and, in consequence, to a new faulty/abnormal condition. The resulting clusters are analysed with a heuristic metric that identifies either noisy data, inconclusive data or new knowledge patterns.
The approach progressively creates accurate classifiers, as the new classes feed the existing classifier and the inconclusive unlabelled data can be labelled by following a semi-supervised approach. In this sense, two ML models run simultaneously in our framework: one generated by the HHA-EMCC, and the other one is the supervised classifier. The experimental results show that the framework works correctly, and it was able to identify new pattern from cluster constructions. The accuracy in classification results, after including the new classes, shows that is possible the refinement of the supervised diagnoser under our proposed semi-supervised framework.
Most of the works in fault diagnosis are addressed by classification under supervised schemes. This work aims at changing that point of view by facing the problem as a hybrid task discovering new classes and clusters into an evolving model. In future works, we look forward two main aspects: (i) the on-line feature analysis at time when new samples associated with a new condition feed the HHA-EMCC, and (ii) the on-line training of the RF classifier considering only the new discovered class.
