Abstract
Rotating machine faults are unavoidable; thus, early diagnosis is essential to avoid further damage to the machine or other machine attached to it. Various signal analysis based conventional techniques have been developed and used in the industries to identify various defects in the rotating machines. In last two decades, researchers have shifted their focus to automated or intelligent fault diagnosis based on Artificial Intelligence (AI) techniques due to a variety of issues in conventional fault analysis techniques, such as a dependence on machine operating circumstances, human interference, and expert abilities. In AI based techniques, various machine learning (ML) and deep learning (DL) techniques have been successfully applied for fault diagnosis of various rotating machines. From last half decade DL have been gaining popularity due to its attractive characteristic of automated feature learning and solving big data, unbalanced data, big computational burden and over-fitting problems of conventional ML techniques. Advances in DL methodologies have prompted interest in DL based intelligent fault diagnosis in the industry in the last five to 6 years. This review paper summarizes recent research and developments on DL based fault diagnosis in the last five to 6 years for various critical rotating machineries in industry such as electric motors, rotor-bearing systems, gear and gearbox, wind turbines, pumps, and compressors.
Keywords
Introduction
We are in the midst of the fourth industrial revolution, generally known as Industry 4.0. Improved sensor data integration into automated machine decision-making and condition monitoring is one of Industry 4.0’s goals. In Industry 4.0, Artificial Intelligence (AI) has a significant impact on manufacturing, transportation, and other industries. 1 When AI is properly deployed in condition monitoring, observations of the system and its performance lead to the development of settings that boost performance while conserving machine health. Overall cost reductions can be achieved by AI based machine fault diagnosis which suggested near-beginning maintenance, results in fewer maintenance actions, minor monetary losses, reduced inventory and material usage, avoid unplanned breakdowns, improved human-machine relationships, and overall effectiveness. 2
Rotary machines are used in all type of industries including manufacturing, transportation, power generation and etc. The most frequent type of rotary machineries in any industry are motors, gears, bearings, engine, generators, pumps, turbines, compressors, and so on. When one of these machines fails, the entire production line and, in certain cases, the entire plant has to shut down. Consequently, it is critical to timely diagnose and maintain these machines. AI has been fast gaining popularity in the field of machine fault diagnostics.3,4 It is becoming more frequent to employ a combination of traditional monitoring methods and AI as the key component of autonomous fault diagnosis. This combination has created a new inspection lane for rotary machine fault diagnosis and condition monitoring, which is critical for improving rotary machine fault analysis, monitoring, identification, prediction and prognosis. The steps involved in the decision making (fault diagnosis or prognosis) based on AI technique is shown in Figure 1.3,4 Decision making based on artificial intelligence based methodology.
There are two types of AI techniques that have been applied for machine fault diagnosis: i.e. conventional learning, often known as machine learning (ML), and deep learning (DL). ML techniques such as fuzzy logic, support vector machine, artificial neural network, neuro-fuzzy, decision trees, bayesian classifier, random forest, and others have been utilised to diagnose faults in various rotating machines.5,6 In a study, Lei et al. 2020 7 presented a review of machine failure analysis based on ML techniques. In ML based fault diagnosis, various steps are involved such as data acquisition, compilation, fault characteristic extraction, and health condition identification for intelligent fault diagnosis (IFD) (shown in Figure 1). Finally, they addressed the challenges of IFD based on ML techniques. In addition, IFD based on deep learning, and transfer learning were also discussed.
In recent years, DL based fault diagnosis has been gaining popularity in the industry.8–10 Various DL techniques such as Deep neural networks (DNN), Deep belief networks (DBN), Sparse auto-encoders (SAE), Convolution neural networks (CNN), Recurrent neural networks (RNN), restricted Bolzmann Machines (RBMs) and generative adversarial networks (GAN) have been developed.3,11–13 In a work, Tama et al. 2020 14 presented an overview of DL algorithms for fault detection utilising vibration signals. They discussed challenges and open research problems in DNN based machine fault detection. In other work, Liu et al. (2017) 15 carried out a review of DNN techniques and their applications in real-world. This paper discussed a number of frequently used DL algorithms with particular emphasis on applications in computer graphics, pattern identification, and speech recognition. They discussed most frequently used DL approaches such as RBMs, DBNs, autoencoder (AE), and CNNs. These DL methods can be used to work with unlabeled data by utilizing unsupervised learning. 16
In AI based machine fault diagnosis, DL has been preferred over other shallow learning techniques.17–19 The development of DL based fault diagnosis of rotary machineries mainly motors, gears and gearbox, bearings, turbines, pumps, and compressors has been reviewed in this research study. First section discussed the fault conditions that might occurs in different rotating machineries and the traditional techniques of monitoring them. Second section presented a brief introduction to various DL techniques. Next section includes the research and development of DL based fault diagnosis of rotary machines since 2015. The conclusion and future directions of the current research are presented in the end.
Rotaing machinery faults and conventional monitoring methods
Various types of faults and their monitoring methods have been discussed for a number of rotating machines like electric motor, gear, bearing, wind turbine, pump and compressor.
Electric motors
Motors utilize more than half of all electrical power produced worldwide, accounting for nearly 60% of all industrialized electricity. Various types of AC and DC motors have been utilised in industry. Motors have been evolved as modern world’s backbone, as they are essential to the manufacturing, transportation and other industries. Conveyors, compressors, blowers, generators, fans, machine tools, cranes, and electric vehicles are all examples of common motor applications. 20
Types of electric motor faults
Motors fail due to mechanical or electrical issues, therfore the motor faults are classified as mechanical or electrical faults. 20
The Electircal faults include: • Stator winding faults • Phase unbalance and single phasing
The mechanical faults include: • Broken rotor bar • Bearing fault • Unbalanced rotor • Misaligned rotor
Monitoring method for electric motors
Various conventional methods have been developed for diagnosing motor defects and used in the industry. These methods are
20
: • Machine current signature analysis (MCSA) • Vibration monitoring • Torque monitoring • Thermal monitoring • Current monitoring • Magnetic flux monitoring • Acoustic noise measurement • Surge testing • Partial discharge measurement • Chemical analysis • Instantaneous angular speed • Instantaneous power
The most frequent methods for motor condition monitoring, among others, are vibration and current monitoring. Machine current signature analysis is effective for detecting electrical defects, such as stator winding failures and phase unbalance. Vibration detection is effective for detecting mechanical problems in motors, such as bearing and rotor-related faults. 20
Gear and gear box
Gear is a major component in automatic transmission mechanisms in wind turbines, aircraft, automobiles, and a variety of other mechanical machines. The failure of the gear results in unnecessary downtime, production losses, and human casualties. Consequently, diagnosing gear defects early is crucial. 21
Types of gear faults
• Chipped tooth • Missing tooth • Cracked tooth • Tooth breakage
Monitoring method for gear and gear box
• Vibration monitoring • Acoustic monitoring • Torque monitoring • Current monitoring
In gear condition monitoring, vibration testing is the most extensively used monitoring method for identifying the state of a gear/gearbox. 21
Rotor-bearing system
A rotor bearing system might fail either due to rotor or bearing defects. Bearing damage is the most predominant type of damage in a rotor-bearing system or rotating machinery. Depending on the type and size of the machine, the percentage of bearing defects in the total number of machine faults ranges from 40 to 90%. Consequently, diagnostics of the bearing is crucial in rotating machines. 22 In addition to bearing damage, rotor fault is another major fault in any rotating machines.
Types of rotor-bearing system faults
Bearing faults and rotor faults are two types of rotor-bearing system faults. Bearing defects can be classified as distributed or limited. Distributed faults affect the entire bearing area and are difficult to detect. Limited bearing defects usually result in single-point damages, and they can be divided into four categories based on the damaged part
23
: • Bearing cage fault • Bearing element fault (ball or roller) • Outer race fault • Inner race fault.
Type of rotor faults which are generally occurred in rotating machines are: • Rotor unbalance • Rotor misalignment • Bent Rotor • Manufacturing defect
Monitoring methods for rotor-bearing system
Rotor-bearing system defects are monitored using a variety of conventional approaches such as
24
: • Vibration technique. • Acoustic emission technique. • Current signal technique. • Oil and debris-monitoring technique. • Thermography technique.
In recent years, a variety of methods for monitoring rotor or bearing health have been developed, but the vibration monitoring system has been proven to be the most effective. Vibration-based condition monitoring is non-destructive detection analysis of faults. Different signal processing techniques have been used for fault monitoring based on time, frequency, or time-frequency domain techniques.
Wind turbine
Large-scale power systems such as wind turbines are subject to changeable weather and hostile conditions. As a result, wind turbines are prone to frequent breakdowns and malfunctions, resulting in large expenses. The development of condition monitoring systems (CMS), which could monitor wind turbine parameters in real time and detect faults in the near future, has aroused the interest of the wind energy industry. 25
Types of turbine faults
A wind turbine failure may be classified as. • Blade Failure • Generator Failure • Gearbox Failure • Bearing Faults • Shaft Faults • Control system Faults
Monitoring methods for wind turbine
• Vibration analysis • Acoustic emission • Ultrasonic testing (UT) techniques • Oil analysis • Strain measurement • Electrical effects • Shock pulse method (SPM) • Process parameters • Performance monitoring • Radiographic inspection • Thermography
The most recognized technology employed in turbines, mostly for rotary apparatus is vibration analysis. A rapid change of strain force occurs in the turbine components resulting in elastic effects that may be evaluated via acoustic emissions. UT has been traditionally used to locate and assess the plane and the subplane structural faults using organic analysis. Strain gauges can be quite useful for predicting lifetime and guarding against high pressure levels particularly in the blades. To detect separation defects in the wire, a spectrum analysis of the stator current in the generator has been used. 25 The SPM has been employed as a quantitative tool for bearing monitoring. It works by detecting the mechanical shocks that occur when a ball or roller in a bearing makes contact with a defective portion of the raceway or with wreckage. For the detection of problems in wind turbines, radiographic imaging of important structural turbine machinery using X-rays and thermography techniques have also been applied.26,27
Pump
Pumps are used to move fluids and are controlled by an electric motor. They are used to pump water from wells, in filter aquariums and ponds, and provide ventilation. They are also utilised in the power industry to pump oil and typical gases, as well as in cooling towers and other industries for water cooling and fuel injection. Pumps come in a variety of sizes, ranging from microscopic pumps (used in medical applications) to massive industrial pumps. Their failure results in the shutdown of manufacturing processes and lines, thus monitoring and diagnosing them is critical. 28
Types of pump faults
Pumps faults are classified as • Impeller faults • Dry run • Cavitation • Cover plate faults
Monitoring methods of pump
Various monitoring methods are used to diagnose faults in the pump. These are: • Shut-off head method. • Head-flow method. • Thermodynamic method. • Pump vibration method.
One of the most useful pump control techniques is the head-flow technique. It not only detects pump wear and tear, but also changes in the conflict with fluid flow within the pump structure. Another effective method of testing is to analyse high-temperature samples of fluid flow through a centrifugal pump. As a pump’s covering wears down, the temperature inside it rises frequently. Because it allows for head dimensions even when there is no flow, the shut-off head approach is ideally suited for prognostic continuation in pump structures. When working on no flow, this technique is not advised for use in high-power, high-precision pumping systems since it poses a considerable risk of blast. Every rotating device in a pump has a specific vibrating characteristic that changes as the active or stationary elements wear out. Variations in pump vibration samples can be used to detect delicate component damage that is difficult to detect visually or acoustically. Furthermore, a faulty bearing and impeller will cause the pump to vibrate more during operation.29,30
Air compressor
Compressors are similar to pumps in that they increase the force applied on a fluid while also allowing it to be transported through a pipeline. Compressors are vital machinery in a variety of industry applicaton, therefore diagnosing and managing them is essential. 31
Types of compressor faults
The compressor faults are classified as: • Loose pulley • Flywheel Faults • Belt Faults • Belt guard Faults • Cooler Faults • Clamps or Accessories Faults.
Monitoring methods for compressor
Various methods have been used to diagnose faults in the compressors. These are: • Frame Vibration • Crosshead Vibration • Cylinder Vibration • Crank Angle Measurements • Rod Drop • Rod Position
Frame vibration approaches can be beneficial for detecting the energy source by piston rod loading and the instant energy transmit toward the central bearings due to the asymmetrical configuration of the crankshaft. Crosshead vibration techniques can also be used to identify compressor faults. Appropriate plane study is vital for specific vibration techniques. On the road to identifying desired activities related with the compressor, acceleration sensors can be put on top of the cylinder. Sensors that are often employed for this function include eddy current displacement (ECD), inductive (magnetic), and optical transducers.32,33
Introduction to deep learning techniques
Deep learning is a subset of a large family of AI techniques based on ANN which learned through demonstration.34,35 There are 3 types of learning: supervised, semi-supervised, and unsupervised. The term “deep” refers to the number of layers in the neural system that are not visible. Deep networks can have up to one hundred 50 layers, while traditional neural networks only have two to three. Deep learning networks have the advantage of frequently improving as the size of your data grows. Deep learning systems are trained using large sets of labelled datasets and neural system architectures with the aim of studying features directly from the dataset without the need for physical feature mining. Various deep learning structural architecture such as DNN, DBN, CNN, RNN, SAE, and boltzmann machines are examples that are used for computer visualisation, software visualisation, language recognition, ordinary speech processing, and audio identification in the workplace. 36
(i) DNN
As illustrated in Figure 2, the DNN technique is an element of the ANN method and involves many coatings between the input and output coats. Neural systems come in a variety of shapes and sizes, but they all have the same basic components: neurons, weights, synapses, biases, and functions. These machines have intelligence comparable to humans and can learn in the same way as any other machine learning algorithm. Deep neural network.
Deep neural networks are typically feed forward networks in which data is transferred without looping from the input layer to the output layer. The DNN begins by constructing a network of crucial neurons and assigning random mathematical values, or “weights,” to the connections between them. The weights would be controlled by an algorithm if the network could not precisely detect a testing sample. In this instance, the algorithm will develop more substantial assured constraints before establishing an accurate arithmetical treatment to process the dataset completely. Any mathematical treatment functions as a coating, and challenging DNN systems have multiple coatings, thus the term “deep” systems. 37
(ii) CNN
A CNN is a type of DL technique that is often used to analyse scene images. Natural approaches have been applied to encourage CNN into the neuronal communication model, which is similar to the animal illustration cortex’s association. A CNN is a neural network containing one or more convolutional layers used for image processing, classification, segmentation, and other auto-related data, as shown in Figure 3. A convolution is a filter that is dragged over the input.
38
Convolutional neural network.
(iii) DBN
Deep belief networks are used to distinguish, cluster, and produce images, video sequences, and motion-capture datasets. Figure 4 depicts a DBN. A continuous DBN is just an extension of a DBN that recognises a number of decimals before the binary dataset. Geoff Hinton and his students developed this in 2006. Despite the fact that DBN has fallen out of favor and is rarely employed before, nowadays, especially when compared to other unsupervised or generative learning algorithms, they are still well-deservedly recognised for their significant contribution to deep learning application.
39
Deep belief network.
(iv) RNN
Recurrent neural networks are a type of ANN that is widely utilised in language recognition and natural language processing (NLP). A network of RNNs is shown in Figure 5. Recurrent neural networks are used in deep learning to enhance models that imitate the behaviour of neurons in the human brain. Recurrent neural networks, a state-of-the-art technique built for chronological datasets, are used in Apple’s Siri and Google’s voice exploration. It is the first algorithm to remember its effort, making it perfect for inner recall and machine learning problems with chronological datasets.
40
Recurrent neural network.
(v) SAE
A Sparse Autoencoder (SAE) is a sort of autoencoder that use sparsity to create a bottleneck in the flow of information. Activations within a layer are penalised by the loss feature in particular. A network of SAE is shown in Figure 6. One of SAE’s major roles is unsupervised pre-training, coat by coat, while effort is served throughout. The primary coat can be utilised as an input for a second autoencoder once it has been pre-trained. Given autoencoder is a neural system technique for training that attempts to identify a simplified demonstration of an input. Despite the fact that they are trained using supervised learning methods, which are referred to as self-supervised, they are an unsupervised learning methodology.
41
Sparse autoencoder.
(vi) Boltzmann Machine
The Boltzmann learning approach is arithmetical within behaviour and is derived from the science of thermodynamics. It is similar to the error-correction learning method utilised in the supervised training procedure. A network of Boltzmann Machines is shown in Figure 7. A Boltzmann Machine is a symmetrically connected network of neuron-like units that makes stochastic decisions about whether or not to be on. Boltzmann machines use a straightforward learning process to learn appealing features from binary vector datasets.
42
Boltzmann machine.
DL based rotating machines fault diagnosis
Deep learning techniques for automated feature mining do not require domain expertise in signal processing, therefore they are getting popularity in the field of condition monitoring and fault diagnosis. Some DL algorithms, such as the DBN, however, favour processed data over raw data. Deep learning approaches such as CNN, DBN, autoencoders, and other DNN have been developed and employed for fault diagnosis in last half decades. This section includes the research and development of DL based fault diagnosis in last 5 to 6 years for a variety of rotating equipment, such as motors, bearings, gear/gear boxes, wind turbines, pumps, and compressors.
DL based motor fault diagnosis
Deep learning is emerging in the field of motor condition monitoring and fault diagnosis. Different types of DL, such as DNN, CNN, RNN, encoder, and others, have been developed and successfully used to detect various electrical and mechanical faults in motors.
In a work, Sun et al. 2016 43 investigated a sparse auto-encoder with deep neural network (SAE-DNN) system for categorizing induction motor faults. This method can be trained features directly from unprocessed datasets that distinguish various induction motor operational conditions. The results revealed that learning and a thorough investigation validate the SAE-DNN’s efficiency in analyzing induction motor faults. In other study, Xiao et al. 2016 44 presented adaptive function mining with stacked denoising auto-encoders (SDAE) designed for asynchronous motor fault diagnosis. The SDAE approach improved the asynchronous motor fault diagnosis evaluation with stacked auto-encoder (SAE), which solved the problem with the conventional MCSA technique.
Shao et al. 2017 45 presented a DL approach based on DBN for fault analysis of induction motors, where frequency domain data of the measured dataset is used as input data. In this paper, a DBN-based approach is used to model and analyse multiple layers of representation on a high-dimensional vibration dataset. In other study by Sun et al. 2017 46 investigated convolutional discriminative feature learning (CDFL) for induction motor fault diagnosis. The rapid convolutional pooling structural architecture was used in this study to extract discriminative and invariant properties from an unprocessed vibration dataset. The weights of the small filters were learned using a back-propagation neural network (BPNN). Then support vector machine (SVM) is used to effectively categorise the learned features.
Xia et al. 2017 47 conducted an intellectual fault study of motor bearing using unsupervised characteristic learning with stacked denoising autoencoders (SDA). The proposed approach, which is derived from SDA and DNN, can learn representative characteristics from an enormous amount of unlabeled dataset without the need for human interference. Wang et al. 2017 48 presented the motor fault diagnosis based on short-time fourier transform (STFT) and CNN. Proposed deep learning methodology effectively categorizes health states with increased diagnostic accuracy and can extract prominent fault characteristics in a consistent manner. Ghosh et al. 2017 49 used DNN to detect a fault caused by a short circuit in the stator winding or an unstable inverter deliver by predicting magnetic flux. The DNN prognostic harmonics compensated control drive is first simulated and used for dataset analysis and finite element investigation.
Zhang et al. 2017 50 used a 1-D stacked dilated convolutional neural network (1D-SDCNN) to create an intelligent fault study for rolling bearings of induction motor under changeable operational condition. Stacked dilated convolutional allows the device to contain large accessible areas with a limited number of coats, allowing for improved overall features. The results showed that the proposed approach is more practical and powerful for analyzing induction motor faults even when different load conditions are used for training and testing. Lee et al. 2017 51 presented the DNN and GAN to diagnose induction motor faults in industrial maintenance. The effect of oversampling on the fault diagnosis of a motor was demonstrated in this study. By resolving the complexity of dataset instability with GAN and conducting intellectual fault diagnosis and analysis with DNN, the results demonstrated the likelihood of data-driven maintenance solution. However, other problem of data imbalance like under-sampling and ensemble learning need to be consider for the study.
Yao et al. 2018 52 presented a fault diagnosis for a self-sensing motor drive structure using CNN. The motor drive structure was used as a multivariable sensor at the drive-side of a mechanical device in this paper, which measures torque, speed, location, and power in addition to current. Therefore there is no need of extra sensors and data acquisition, in the proposed work, the raw data of healthy and faulty conditions from self-sensing motor drive are first converted into color and polar images for different operating conditions. The result demonstrated that CNN can be used for accurate diagnosis. Cipollini et al. 2019 53 worked on unsupervised deep learning for induction motor bearing’s fault diagnosis. Results showed that the proposed technique has the potential to derive a dense and important representation of the bearing state from the stator current signal. Shao et al. 2019 54 studied the deep convolutional neural network (DCNN)-with multi-signal (vibration and current) for induction motor fault diagnosis. deep convolutional neural network is being developed as a building block for multi-signal representation, demonstrating the ability to study discriminative characteristics from time-frequency distribution (TFD) images without human interference.
Afrasiabi and Afrasiabi. 2019 55 introduced an accelerated deep learning approach to analyse real-time bearing defects in induction motors. In this work, they developed an accelerated CNN (ACNN) by avoiding less significant connections and sharing the weights in order to compress and speed up the conventional CNN. Results showed that the developed ACNN perform better and faster than the conventional CNN, SVM and ANN. Zou et al. 2020 56 used deep learning for bearing fault diagnosis of the traction motor in high speed trains. The discrete wavelet transforms (DWT) and enhanced DBN were used for the diagnosis. The DWT was first used to extract 2D time–frequency map from the healthy and faulty dataset. After that, these maps were used to train the enhanced DBN for correlating the fault features and fault types. In comparison to other conventional approaches like BPNN and SVM, the proposed approach achieved the better accuracy.
Kumar and Hati. 2020 57 introduced an adaptive gradient optimizer with deep convolutional neural network (ADG-dCNN) for successful bearing and rotor defects detection in squirrel cage induction motors. In this work, adaptive gradient optimizer was successfully used for optimizing the parameter for efficient training and deep CNN was successfully used for efficient feature learning. Rodriguez et al., 2020, 58 performed a diagnostic for damaged BRB of various severity levels in induction motors based on a CNN and MCSA. The STFT was initially used to observe left sideband frequency components associated with BRB. Then CNN was used to achieve automatic image categorization of different BRB fault conditions.
Skowron et al. 2020 59 investigated the CNN for stator winding faults of an inverter-fed induction motor using raw transient stator current signals. In this work, they used only 2000 samples of transient stator current for the diagnosis thus avoiding other signal processing techniques like FFT, WPT and HHT. The proposed technique had a high degree of precision in determining the location of the breakdown and the severity levels, as well as the ability to discern even individual shorted turns. Grezmak et al. 2020 60 introduced a multi-stream convolutional neural network (MS-CNN) for induction motor in variable frequency drive. They showed that the proposed technique could learn patterns in the inputs from FFTs of combined vibration and current signals obtained at different line frquency. Results showed that the MS-CNN identifies the harmonics of the motor’s line and rotational frequencies for differnet fault conditions.
Kumar et al. 2020 61 introduced a novel transfer learning (TL) and a DCNN to detect bearing fault and BRB faults (individually and jointly) in squirrel cage induction motors. TL extracts critical information from the already trained model to improve the learning of a new model therefore it is useful for less labelled data cases. Guo et al. 2020, 62 presented the use of DNN to predict the winding temperature of a permanent magnet synchronous motor (PMSM). In this work, temperature (ambient, coolant surface and tooth temperature), voltage, current, speed, and torque were considered for the input and winding temperature was considered for the output. The method has the potential to play a significant role in PMSM high temperature identification to avoid any upcoming faults, as well as provide industrial support for high temperature caution and PMSM protection.
Liang et al. 2020 63 introduced a novel gate-structure dilated convolutional capsule network (GDCCN) with a deep learning methodology for motor fault diagnosis. It is combined by the long short-term memory (LSTM) system’s enter gateway arrangement, extended convolution, and capsule neural network. Result showed that the GDCCN provides better performance than conventional DL methods even in noisy enviroments and different workloads. Espitia et al. 2020 64 & Espitia and Soto. 2020 65 used SAE based deep learning methodology for effective fault diagnosis in electromechanical systems. They considered bearing, demagnetized, and eccentricity-related induction motor faults. GA was used to optimize the hyperparameter of a SAE and LDA was used to maximize distance between different data sets.
Singh et al. 2021 66 performed the fault diagnosis of electrical and mechanical faults in induction motor based on DNN using vibration and current time domain signals. In this paper, critical features such as standard deviation, skewness and kurtosis were extracted to characterize motor faults. They performed the optimization of batch size and number of epochs by trial-and-error method and suggested that increasing both hyper parameters is not a better proposal. Zou et al. 2021 67 used deep learning techniques for fault diagnosis of bearing of traction motor in high-speed trains. In this work, they obtained 2 –D time-frequency map from 1-D vibration signals based on DWT. Then they used improved DBN for performing the bearing fault diagnosis and showed that it is better than SVN and BPNN. Kumar and Hati 2021 68 performed the multiple fault diagnosis of induction motor using transfer learning-based CNN (TL-CNN). In this work, they converted current signals acquired from hall sensors into images which are further used as input to TL-CNN. They showed that the transfer learning reduces the computation time required for building effective CNN model with high depth.
Summary of Deep Learning (DL) based motor fault diagnosis.
DL based rotor-bearing system fault diagnosis
Deep learning have been also gaining interest in the diagnosis of rotor-bearing faults. Various researchers have looked into various types of DL to detect rotor and bearing faults. The work on DL fault diagnosis of rotor-bearing models reported in the last 5 years is discussed here.
Lu et al. (2015) 69 investigated a novel feature extraction approach for rolling bearing fault diagnosis using DNN in a paper. They demonstrated that DNN is currently the most powerful machine learning tool, and that its benefit is suitable for resolving high nonlinearity issues in bearing signal features. The proposed method achieves features that accurately and clearly depict bearing signal details, which are very useful for successful fault diagnosis. Ahmed et al. (2016) 70 employed DNN for bearing fault identification based on sparse autoencoder. This allows to extract fault features in unsupervised manner. They showed that the system structural design and regularization parameters, as well as DNN parameters, have a major impact on bearing fault identification performance. Results showed that identification performance is higher in the re-training phase as compared to the pre-training phase of DNN.
In a work, Xia et al. 2017 71 employed DNN for bearing fault classification based on stacked denoising autoencoder. Critical fault features were extracted from unlabeled data using denosing autoencoder, and then DNN was trained with few labelled data. Results showed that the present approach can extract critical features based on large unlabeled data and correctly classify bearing fault based on few labeled data of a new condition. Xia et al. 2017 47 used a hierarchical DNN to calculate the remaining useful life (RUL) of bearings in rotary machines based on vibration and temperature data. The RUL of bearing was evaluated using a hierarchical DNN with RUL predictor (DNNRULP) technique. The DNNRULP technique consists of three main components: a DNN with Health Stage Classifier (DNNHSC), an ANN with RUL predictor (ANNRULP), and a DNN with an even operative. Result showed that the proposed approach demonstrates efficiency in RUL prediction and achieves higher prediction accuracy than a single ANN device.
Zhang et al. 2017 72 presented a novel method called deep CNN with wide First-layer kernels (WDCNN) learning method for bearing fault diagnosis. The WDCNN was divided into two parts: a large first-layer convolutional kernel and a deep network formed by small convolutional coats. In addition to the wide kernels in the primary convolutional coat was used for extracting characteristic and restraining elevated frequency tone, the proposed method used unprocessed vibration signals as input. Chen et al. 2017 73 presented a deep sparse auto-encoder method with noise (DSAE-N) for rolling bearing fault severity detection. The DSAE-N was built by loading many SAEs into a classifier coat in order to achieve automated attribute extraction and fault diagnosis. The DSAE-N was trained with noise samples to avoid the overfitting issue due to limited training data. Result showed that the DSAE-N performs well in comparison to manual feature extraction technique, SAE and DSAE without noise samples.
Sohaib et al. 2017 74 employed hybrid feature pool in deep-learning-based bearing fault diagnosis. In this study, they combined features from different domains i.e. time, frequency and time-frequency domain. Finally they combined the sparse stacked auto encoder (SSAE) with DNNs and showed that the present method perform very well to analyze a variety of bearing faults with different severity levels. Li et al. 2017 75 investigated a combined deep CNN architecture and enhanced Dempster–Shafer theory (IDSCNN) for bearing fault diagnosis. Result showed that the present method successfully perform the diagnosis by effective fusing signals from multiple vibration sensors.
Chen et al. 2017 37 worked on DNN based rolling bearing fault diagnosis. In this work, performance of DBN, DBM, and stacked SAE were evaluated. Result showed that all three methods perform very well with the combination of time-frequency and time-frequency domain features in comparison with raw data. In addition, SAE perform slightly better than DBN and DBM, and DBN need more training time. Chen and Li (2017) 76 investigated the multi-sensor characteristic information for bearing fault diagnosis based on SAE and a DBN. Time and frequency characteristics obtained from various accelerometers signals were fused in a single stream with SAE. Then, DBN was trained using these combined characteristics and found effective identification of bearing faults. In other work, Chen and Li, 2017 77 investigated DNN with a denoising auto-encoder (DAE) in order to perform fault diagnosis on rotary machines. In this work, they considered critical fault of rotor-bearing system like unbalance, misalginment, rubbing and pedestal loosness faults. This paper used a DNN with a denoising auto-encoder (DAE). After obtaining a DNN via unsupervised DAE training, the dropout approach is used to change the network parameters, reducing the issue of over-fitting. The results showed that the DNN can efficiently learn the frequency domain features in unsupervised mode and adaptively extract the characteristics of different faults in the rotor-bearing system.
Ren et al. 2017 78 used a deep learning method to estimate the RUL of several bearings. From the vibration signals of a rolling bearing, the technique used elevated value of degression samples. Three time domain features and one proposed six-dimensional frequency domain feature (Frequency Spectrum Partition Summation (FSPS)) among the features of the collected bearing vibration signal used here. Results showed the promising performance of deep learning approach for RUL prediction of multi-bearing. Xin et al. 2018 79 considered a combination of time-frequency features obtained from STFT and deep CNN for rotary machines fault diagnosis. Here, sparse auto encode, convoluting, pooling, and softmax classifier were used to build a new DNN called the DCNN scheme. The results showed that the present methodology is capable of recognizing various faults in gear and bearings from time-frequency images efficiently.
Eren et al.2018 80 used a dense adaptive 1D CNN classifier to investigate bearing faults. The result showed that 1D CNNs can extract extremely discriminative characteristics from the unprocessed input sensor dataset and an effective fault diagnostics can be achieved by using a simple-dense CNN arrangement. Yang et al. 2018 81 worked on deep learning based fault diagnosis of bearing. In this work, they divided the whole spectrum into several sections and used for the input of DNN. In addition, they assigned labels in data sets randomly to avoid pre-affixing a specific class label to a sample. Xia et al. 2018 82 presented a two-stage outlook for calculating bearings’ RUL based on deep learning. The degradation method was divided into various physical conditions in the first step. Then a stacked denoising auto encoder was used to initialize a DNN classifier. In the second stage, a shallow neural network system was built and learned in each physical condition of fault in order to perform the middle RUL evaluation. The experiments showed that the industrial technique could achieve adequate RUL calculation with a limited dataset.
Wang et al. 2018 83 and Wang et al. 2018, 84 employed Gaussian radial basis kernel function based autoencoder (KAEs) to enhance the feature extraction technique especially for non-linear component in bearing fault diagnosis of aircraft engine. A DNN was built with one kernel based autoencoder and multiple KAEs and result showed that the present methodology has higher accuracy because of its better feature clustering effect in feature extractions. Nguyen et al. 2018 85 used DNN to identify different bearing fault severities. In this work, they extracted characteristic frequency from envelop power spectrum of acoustic emission signals. Then, Adam optimization-based backpropagation algorithm with Xavier initialization method was used in DNN and results showed that better performance as compared to state of the art approaches. Chunfeng et al. (2018) 86 introduced heterogeneous transfer learning (HTL) based on stack sparse auto-encoder (SSAE). In order to solve the problem of small target domain data, they introduced a concept of distance to the center of the source and target domain to find the similarity of distribution using heterogeneous characteristics representation in HTL. Result showed that the present methodology has better performance than the conventional machine learning approaches especially for limited labeled data.
Saufi et al. 2018 87 performed the bearing fault diagnosis based on wavelet transform based images and flexible SSAE. A differential evolution and a resilient back propagation technique were used to optimize SSAE hyper constraints to prevent overfitting problem. Result showed that, even with limited datasets, the present SSAE based method is capable of accurately diagnosing bearing fault conditions and has better performance than DNN, CNN, SVM and KNN. Appana et al. (2018) 88 used the envelope spectrum (ES) of acoustic signals and CNN to perform a robust fault diagnosis of bearings under real life operating condition of changeable rotating speed. They performed the diagnosis by training CNN with signal obtained at one speed and testing on the reaming signal obtained at other speeds.
Sadoughi and Hu (2018) 89 introduced a physics based CNN (PCNN) for rolling bearing fault diagnosis. The traditional CNN was customized to include useful details from physical experience about bearings and their fault characteristics. Novel physics bases kernels were created based on the bearing fault frequencies and shaft speeds and used in convolutional layers of the PCNN. Espitia et al. 2019 90 used SAE based feature reduction analysis to track the condition of electromechanical systems and compared the performance with other technique like PCA and LDA. In this work, first, statistical time-domain features analysis using the vibration signal is demonstrated. Then SAE, PCA and LDA methods employed for feature reduction. Finally, they showed that the DNN based approach using SAE represents better discriminative capabilities than PCA and LDA in bearing fault diagnosis.
Gao et al. 2019 91 combined 1D convolutional neural system and generative adversarial networks (ASM1D-GAN) to develop an intelligent bearing fault diagnosis technique. In this work, in order to overcome the problem of small data of natural occurring fault, an ASM1D-GAN approach was developed to generate effective novel fault features which are consistent with natural fault features obtained from the raw data. Wang et al. 2019 92 presented bearing fault diagnosis using a multiscale learning neural network based on CNN. Two channel inputs were considered in this study i.e. one-dimensional CNN and a two-dimensional CNN to extract a variety of vibration features. 1D and 2D CNN learn informations from adjacent and nonadjacent intervals, respectively. Result showed that diagnosis with 1D CNN perform better than with 2D CNN. The performance of the diagnosis was significantly increase when both 1D and 2D CNN were used simultaneously.
Zhou et al. 2019 93 used CNN to study real-time fault diagnosis in rolling bearings based on 2D image input. DNN with stacked AutoEncoder do not efficiently get the spatial neighborhood features (SNFs) from 1D vibration signal because this 1D vibration signals show the amplitude only. Therefore, in this study, CNN was used with 2-D image of the vibration waveform as input. Results showed that present approach fully utilized the SNFs of the vibration signals in achieving the accurate bearing diagnosis. Enshaei and Naderkhani, 2019 94 presented role of deep learning in bearings of induction machine. This study focused on a deep bi-directional long short-term memory (BiD-LSTM) method fed by unprocessed signals in order to overcome the shortcomings of traditional machine learning (ML) techniques. The proposed method’s results validated the system’s efficiency with high accuracy and low latency.
Yan et al. 2019 95 used a multi-domain indicator and an optimised stacked denoising auto encoder (MIOSDAE) to recognize the current state of rolling bearings. The grasshopper optimization algorithm (GOA) was used to optimise the significant parameters of the stacked denoising auto encoder (SDAE) model, which could improve error categorization precision. Result showed that the present technique is valuable for identifying bearing current state. Chen et al. 2019 96 presented a new deep learning methodology called RNN scheme with encoder-decoder arrangement for estimating the remaining useful life (RUL). Here, 5 band-pass energy magnitude of frequency spectrum was used as input in RNN. Finally health indicator (HI) is obtained from this encoder-decoder arrangement using long historic data to extract critical degradation facts to calculate the RUL. Ma et al. 2019 97 investigated a lightweight deep CNN for bearing fault diagnosis of rotary machine. In this paper, the WPT was used to extract excellent information from the time-frequency domain, while the secondary layer constructs a relatively lightweight CNN. In terms of noise suppression and advanced training potential, the experiment shows that the present algorithm is superior to other algorithms. Zhou et al. 2019 98 proposed a deep learning fault diagnosis technique for unstable datasets of bearing using a global optimization GAN (GOGAN). It is noted that GOGAN can generate additional qualifying dataset required for the ultimate purpose of fault diagnosis.
Zhao et al. 2020 99 used FFT to perform fault diagnosis of a motor-rotor system using multi-manifold deep extreme learning machine (MDELM). In this work, they performed multichannel vibration data fusion, and random and pure unsupervised feature mapping for identifying the faults by MDELM. Results showed that the present methodology has superior performance in detecting critical faults like unbalance, misalginment, loosness and rubbing in the motor-rotor system. Chen et al. 2020 100 used a deep DRN to recognise the severity as well as the location of the bearing faults at the same time. They used frequency and energy features of vibration obtained in rasonance zone. To visualize inside of the DNN, they devloped a mathematical model by maximizing activation value. The result showed that the devloped methodology has high potential to detect severity as well as location of the bearing fault even under noisy environment.
Zhang et al. 2020 101 summarised application of DL in bearing fault diagnosis. They discussed the advantage of DL over ML such as the capability of automatic feature extraction, unsupervised learning, to handle imbalance data and noisy data, and classifier’s performance. To handle small data or imbalanced data problem in fault diagnosis, techniques like data random sampling, GAN and etc. have been successfully used in order to create additional faulty data in the training. Tang et al. 2020 102 developed a deep CNN approach based on knowledge fusion for analysing bearing defects in a variety of operational scenarios. Via the use of multi-sensors and narrow-band decomposition methods, an information fusion technique was implemented in this learning to increase the characteristic demonstration capability with the transferability of analysis systems. Han et al. 2020 103 used CNN to identify bearing faults based on different input modes like time, STFT, grayscale, CWT and time domain color feature (TDCF) diagram. In this work, they added red colour to TDCF in one input mode of CNN and found enhanced fault feature characteristic of the vibration signals. Results showed that the CNN with TDCF input mode maintained high performance as compared with time, STFT, gray scale and wavelet diagram even under the high noise conditions.
Xin et al. 2020 104 developed an automated fault diagnosis for bearings and gears using vibration signal analysis and multi-object deep convolutional neural networks (MO-DCNN). The time series, frequency dataset, and time–frequency dataset was structured like the multi-object to mine characteristics. The present method could boost overall feature characteristics and diagnostics performance while avoiding the drawbacks of using a single time or frequency field dataset. Zeng et al. 2020 105 investigated a bearing fault study based on denoising autoencoders for small number of labelled trial cases. In this work, they combined a CNN with an auto encoder (AE) and skip connect and finally trained the model with pre-training and fine tuning. Results showed that the present method reduces training time, improves the capability of the traditional fully connected auto encoder to extract features and diagnosis performance.
Xiong et al. 2020 106 combined WPT and CNN and developed a new continuous fault diagnosis approach for rolling bearings. Time-frequency analysis such as WPT was employed to find the richer information from the 2D coefficient matirix obtained from 1D raw signals. The CNN with WPT coefficient showed better and robust performance as compared to CNN with 1D raw signals. Li et al. 2020 107 developed a rolling bearing fault diagnosis with WPT and CNN. In this work, WPT was used in the first step to extract 1-D time-frequency coefficients from vibration signals, which were then converted into 2-D grey images using a careful data-to-image conversion technique. In the next step, a CNN system with three convolutional coats was used to find characteristics features from grey pictures and the present method yielded good results.
Zhao and Jia, 2020 36 presented a novel unsupervised deep learning netwrok (UDLN) method for rotary machine fault diagnosis. In this work, they used vibration spectrum as input to the UDLN where first layer is constructed with sparse filtering for feature extractor and second layer with weighted euclidean affinity propagation for the clusturing extractor. Zhang et al. 2020 108 showed a new unsupervised field adaptation with DNN for bearing and gear fault diagnosis. The key contributions were such as implementing maximum mean discrepancy (MMD) in DNN to reduce the variation of the source and target data distribution, and applying manifold regularizations, while simultaneously improving the delegate knowledge of the unprocessed dataset.
Zhao et al. 2020 109 used a semi-supervised deep sparse auto-encoder (SSDSAE) for rotary machine fault diagnosis using few labeled data. In this work, they used vibration spectrum signals to extract local structural informations (by minimizing interclass compactness) and non-local structural informations (by maximizing interclass separability). In addition, weighted cross entropy methods was adapted in this work for improving generalization accuracy of the SSDSAE model. Li et al. 2020 110 studied the fault diagnosis of bearing fault of rotary machine using deep learning and dataset augmentation. In this work, five dataset augmentation methods were employed such as extra Gaussian noise, masking noise, signal translation, amplitude shifting, and time stretching. In addition, they showed they signal translation and time stretching methods are promising as compared to other data augmentation methods.
Yan et al. (2021) 111 performed fault diagnosis of rotor-bearing system based on a new method called deep regularized variational autoencoder (DRVAE). In this work, the hyper parameter DRVAE was optimally selected based on bird swarm algorithm (BSA). They used frquency domain vibration signals acquired from multiple sensors as input to DRVAE. Che et al. 2021 112 performed the fault diagnosis of rolling element bearing based on hybrid multimodel fusion of DL models like CNN and DBN. In this work, vibration signals are converted into grayscale images and time series data for processing into CNN and DBN, respectively. Then three CNN and three DBN model were combined to obtain multi model fusion. Result showed that the prsent methdology based on multimodel fusion perform better than individual DL models. Xin et al. 2021 113 developed Gaussian convolution DBN (GCDBN) for fault diagnosis of rotor-bearing system based on infrared thermal analysis. As vibration signals sometime diffcult to acquire and affetced by time varying speeds, infrared thermal images are utilized to discretize the rotor bearing faults to avoid the affect of varying speed. Results showed that GCDBN perform very well for fault diagnosis with thermal images.
Summary of DL based rotor-bearing system fault diagnosis.
DL based gear and gearbox fault diagnosis
Various researchers have explored deep learning techniques in Gear fault diagnosis over the last 5 to 6 years. This subsection provides an overview of various DL techniques that have been developed and used to detect various Gear faults.
Lu et al. 2016 114 introduced a deep model based domain adaptation (DAFD) technique for gear well as bearing fault diagnosis. DAFD was used with MMD and weight regularization to learn portable features of vibration spectrum that link cross-domain discrepancies between source and target domain while supporting the minimal identifiable information in novel datasets. Heydarzadeh et al. 2016 115 presented application of DWT and DNN to develop a robust gear fault diagnosis based on vibration, acoustic and torque signals. Results showed that the DWT-DNN approach is not only data-driven and needs little prior knowledge for feature extraction, but it is also non-sensitive to various load conditions and does not require precise assumptions for signal measurements. Jia et al. 2016 116 demonstrated the capability of DNN for fault characteristic extraction and intelligent diagnosis of planetary gearbox as well as rolling element bearing faults with large amounts of data. Finally, they showed that DNN can extract the discriminative features from the vibration spectra with fine tuning and suggested to use hyperbolic tangent function as active function for higher accuracy.
Qi et al. 2017 117 created a deep network based on stacked sparse auto encoders (SAE) for fault diagnosis of rotating machineries like gearbox and bearing. The time field novel signals were preprocessed using ensemble empirical mode decomposition (EEMD) and autoregressive (AR) systems to obtain AR constraint like inputs to the analysis method. These novel signals' low-level characteristics are used in the stacked SAE to obtain extra conceptual and sparse elevated high level characteristics feature. Xia et al. 2017 118 introduced the fault diagnosis of rotating machinery (gear and bearing) based on different sensor signals and CNN. Sensor synthesis was achieved in the dataset stage and then both temporal and spatial information was obtained in this study to improve diagnosis accuracy and quality by integrating the unprocessed signals as the CNN system’s input. Jing et al. 2017 119 developed a CNN-based function learning and fault detection technique for gearbox fault diagnosis. In this work, they used frequency domain vibration signals of planetary gearbox and compared the performance of methods with time domain and time-frequency domain vibration signals. The result showed that the CNN method performed better with frequency domain vibration signals in comparison with time and time-frequency techniques. Also CNN showed better performance than fully connected neural network (FNN), SVM and random forest (RF).
Rao and Zuo, 2018 120 developed a novel fault diagnosis techniques using order tracking DNN (OT-DNN) for gear and bearing fault diagnosis especially for varying speed condition. In this work, first the OT was used to resample the vibration signals to avoid frequency smearing then the spectra obtained from resample signals was used as input to DNN. In addition, the OT-DNN method showed much higher accuracy for fault diagnosis than Fourier transform-DNN (FT-DNN) especially under varying speed conditions. Han et al. 2019 121 used characteristic selection to investigate the intellectual fault diagnosis of rotary machines using deep learning. In this work, they used frequency domain features of original vibration signals, and time and frequency domain features of intrinsic mode function (IMF) component obtained from original signals. It has the advantage of requiring a few labelled samples to be entered in the fine-tuning process to change the DNN’s constraints for effective diagnosis. Yu and Zhou 2019 122 introduced gear fault diagnosis using one-dimension residual convolutional auto-encoder (1-DRCAE) based feature learning. In this work, 1-DRCAE developed a deep 1-D convolutional auto-encoder to suppress noise and extract fault characteristics from noisy vibration signals in order to achieve unsupervised learning. Sun et al. 2019 123 used adaptive separation and deep learning to diagnose compound fault of a planetary gearbox. An adaptive fault separation improved particle swarm optimization variation mode decomposition (IPVMD) method was used to decompose the vibration spectrum signals of combined fault to many single fault signals.
Fu and Wang 2020 124 developed a new deep learning approach for gear and bearing fault diagnosis using vibration signals based on dataset augmentation. This work introduced a hybrid fault diagnosis technique with a GAN and a SDAE to improve the issue of minimal fault data in real engineering scenarios. The GAN technique was used to supplement the limited actual measured dataset, which was mostly in defective states. The SDAE fault diagnosis framework then modified the created dataset. Ye et al. 2020 125 introduced a deep learning model for diagnosis of gear faults based on characteristic synthesis of multi-channel sensory signals. Initially, a deep formation multiple DNN (MDNN) comprised of various auto-encoders was constructed to adaptively extract important characteristics from sensory signals as well as to extract compound connections between indication and error samples. Second, MDNNs were used to combine delegate deep characteristics learned from multi-channel sensory datasets using locality preserving projection (LPP). Finally, the combined deep characteristics in softmax were used to build an intellectual investigation method.
Ha and Youn, 2021 126 performed fault diagnosis of planetary gearbox with the use of CNN and maximum classifier discrepency (MCD). In this study, vibration signals were first converted into image map using time syncronous averaging (TSA) which visualizes the fault toothwise. Then MCD a typical domain adaption method was employed to solve the domain shift issue in the data map. They considerd discrepency scale factor and ensemble appraoch to improve the performance and robustness of the CNN. Mallikarjuna et al. 2021 127 worked on fault diagnosis of aircraft gearbox using a DL technique. In this work, they used time and frequency domain vibration signals in long short term memory (LSTM) and Bi-directional LSTM (BLSTM) models to classify the gearbox condition. BLSTM model is more effective than LSTM with both time and frequency domain signals. In other work, Shi et al. 2022 128 applied Bi-directional convolution LSTM (BiConvLSTM) for fault diagnosis of planetary gearbox using vibration and rotational speed signals. Result showed that the BiConvLSTM model can not only detect the fault types but also the location of the fault even under, and it has high accuracy than other methods like LSTM, ConvLSTM, BiLSTM, CNN and CNN-BiLSTM.
Summary of DL based gear fault diagnosis.
DL based turbine fault diagnosis
A study of DL-based techniques developed for wind turbines in the last 5 years is addressed in this sub-section.
Wang et al. 2016 129 explored the deep auto encoders to detect wind turbine blade breakage. The supervisory control and data acquisition (SCADA) dataset for each wind turbine condition was calculated, and a 1-step contrastive divergence (CD) algorithm was used to train the DA model. The results showed that the proposed method could be used to identify blade breakages in the near future using SCADA data. Wang et al. 2016 130 used DNN to investigate the failure of a wind turbine (WT) gearbox. The proposed method was built in two phases in this study: modelling and tracking the lubricant strain. The first phase focused on the development of a precise lubricant pressure prediction model of WT with healthy gearboxes’ using SCADA data. The prediction model was expanded using a DNN algorithm, which was compared to other models. An exponentially weighted moving average (EWMA) manage graph was used in the next step to obtain guidelines for monitoring lubricant strength. The present DNN method produced more accurate prediction outcomes and a more effective model for monitoring lubricant pressure.
In a study, Cheng et al. 2017 131 used frequency analysis and a DL classifier to create a rotor-current-based fault diagnosis for drive train gearboxes of doubly-fed induction generator (DFIG) wind turbines. In this work, a classifier was projected in favour of gearbox defect categorization by extracting fault characteristics using a deep structural architecture that included a stacked auto encoder (SAE) and a SVM. The tests were carried out on a DFIG wind turbine drive train test rig in the suit, with four different gear defect states. Teng et al. 2018 132 presented a novel approach to fault detection in a direct drive wind turbine (DDWT) based on a DNN model. For a defective wind turbine, the multi-layer learning characteristic of DNN for the input features exhibits brilliant fitting ability between input variables and output variables, allows for the detection of the degree of deviation of the online data from the normal state. The present DNN model effectively detected the fall off of the permanent magnets in the DDWT generator.
Han et al. 2018 133 used a new novel deep adversarial convolutional neural network (DACNN) for diagnosis of mechanical defects in wind turbines. In this study, they considered bearing faults, misalignment, and variance within the airfoil of blades, as well as Yaw faults. The present technique was found to be efficient and advantageous in fault diagnosis, especially when it came to identifying undetected states through system learning. Qian et al. 2019 134 used a long short-term memory (LSTM) neural system to conduct a fresh state analysis of wind turbines. Supervisory control and data acquisition (SCADA) data from a commercial wind farm was collected over a 12-month span to verify the anticipated system’s execution. Gearbox oil and bearing temperature were used as inputs to the LSTM in this case. The experiment shows that LSTM outperforms traditional back propagation neural network algorithms in terms of prediction accuracy.
Afrasiabi et al. 2019 135 used GAN and temporal convolutional neural networks (TCNN) to investigate wind turbine (WT) fault diagnosis. The proposed approach is split into two parts. GAN was used to mine and pick useful characteristics from signals, as well as to inscribe machine fault distinguisher learning using a limited number of samples. Tan et al. 2019 136 provided an analysis of smart condition monitoring with dropouts and cross-validation using deep neural networks (DNN). In this work, the present approach used real datasets of wind turbines to determine the best number of epochs and nodes for the DNN. They showed that the number of nodes in the predicted technique is the determining factor in the neural system’s outcome.
Han et al. 2020 137 combined deep transfer network (DTN) with joint distribution adaptation (JDA) technique to develop a new intelligent fault diagnosis technique for industry. They analysed experimental data from bearings, gearboxes, and wind turbines in this study. Inspired from tranfer learning, JDA was used to analysis disrimination features associated with source domain labelled data and to adjust the conditional distribution of unlabelled data in target domain in order to get effective distribution matching. Prosvirin et al. 2020 138 used an auto encoder with nonlinear function approximation and a DNN to identify blade rub-impact defect in turbine. The deep undercomplete denoising auto encoder (DUDAE) based on the resample vibration signals was used to learn the nonlinear function approximation of the system condition for normal working conditions. Then the remaining signals from unknown fault states were used as inputs to the deep neural system, which was used to create the residual signals for producing discriminative features based on auto encoder. The results showed that the amplitude of residual signals with DNN model is very effective in identifying complex fault like blade rub-impact. In a study, Nie et al. 2020 139 introduced a new autoencoder with an active characteristic enhanced aspect for efficient wind turbine fault diagnosis. A model which incorporates a noise addition approach and denoising stacked feature enhanced autoencoders based on dynamic feature improved aspect (DSFEAE-DF) was proposed to extract extra discriminative features from raw signals. The influence of neurons was also visualized to demonstrate the perspectives of function enhancement. Results showed that the present approach worked very well for turbine fault diagnosis even under noisy environments compared to SAE, k-sparse SAE and others.
Zhang et al. 2021 140 developed a hybrid attention improved residual network (HA-ResNet) based fault diagnosis of a turbines of a wind farm. In this work, they combined the WPT obtained from the raw vibraiton singals and the ResNet an improved CNN model for further improving the diagnosis perforamnce by giving attention to key frequency band of wavelet coefficients. Wang et al. 2021 141 introduced a novel collaborative deep learning framework for fault diagnosis of a wind turbine. In this work, they cosidered a number of fault condtions like misalignment, loosninng, Outer-ring beairng fault, mass unbalance of wind wheel, variation in airfoil of blades, yaw fault and aero-asymmetry of wind wheel. Result showed that the present method of fault diagnosis is better than local learning scheme and also avoids many problems of covnetional learning like big data, unbalanced data, big computational burden and overfitting.
Yu et al. 2021 142 introduced a fast deep graph convolution network (FDGCN) for fault diagnosis of wind turbine fault diagnosis. In this work, they obtained time-frequency graph of vibration signals based on WPT. Finally they employed particular pooling operation for optimizng the training time and graph convolutional kernels for extracting features from the nodes and edges of the graph. Yang et al. 2021 143 performed DL based blade damage detection of wind turbine using image recognition. In this work, they used transfer learning to improve features extraction and ensemble learning using random forest to improve the CNN perforamance. Result showed that the present appraoch has potenial to detect the blade damage based on unmanned aerial vehicle (UAV) images. In other work, Li et al. 2021 144 performed successful fault diagnosis of wind turbine based on transfer learning model and convolution autoencoder (CAE) even with small samples. Guo et al. 2021 145 developed a effective fault diagnosis appraoch in which combined informations from WPT of vibraton signals, domain knowledge and operating conditions of turbine gear box were used as rienforced input to CNN.
Summary of DL based turbine fault diagnosis.
DL based pump and compressor fault diagnosis
This section reviews the literature published in the last 5 years on DL-based pump and compressor fault diagnosis.
Thirukovalluru et al. 2016 146 proposed using a denoising stacked auto-encoder to generate feature datasets for air compressors, drill bit monitoring and steel plate monitoring. In this paper DNN was used to create useful feature from handcrafted features extracted from conventional techniques such as FFT and WPT. Yan et al. 2016 147 considered CNN for diagnosis hydraulic pump faults such as cylinder wear, valve wear, roller wear, loose slipper, sliding boot wear and spring wear under stable and variable pump speeds. They showed that the present CNN based method is simple and effective even for varying speed conditions. In addition, the performance of CNN was found higher than DBN, SVM and BPNN. Wen et al. 2017 38 employed a new LeNet-5 based CNN method for fault diagnosis of motor bearing, centrifugal and axial piston hydraulic pump. They converted 1D vibration signals in to 2-D images and then CNN was used to extract features and perform the diagnosis. Result showed that the present method has higher potential than other methods like deep CNN, DBN, sparse filter and SVM.
Wang et al. 2018 148 used DBN to perform multiple fault diagnosis in axial piston pump. In this work, they used time, frequency and time-frequency domain data for characterizing pump faults and then DBN was used to automatic learn the features using RBM. Results showed that the performance of DBN based methodology is higher than the SVM and ANN for multiple fault diagnosis of pump. Yang et al. 2019 149 performed centrifugal pump fault diagnosis for impeller and bearing faults based on hierarchical symbolic analysis (HSA) and CNN. In this work, HSA was employed to decompose the raw time domain signals into a no. of low and high frequency components. Result showed that the combination of HSA-CNN performed better than SVM, RF, STFT-CNN and CWT-CNN.
Tang et al. 2019 150 presented a review of DL techniques in condition monitoring rotary machines like pumps, bearing and gears. They showed the various advantages of DL such as automatic feature extraction, effective dimensional reduction, imbalanced data solution, handling varying fault information and etc. in condition monitoring over other AI techniques like, SVM, ANN and others. Maurya et al. 2020 151 used DNN to conduct condition-based monitoring of rotating machines under variable system running conditions. They used vibration data to track bearing faults and acoustic data to monitor faults in air compressors using DNN. They showed that the DNN’s computational complexity was reduced through a related enhancement in execution by using low-level features or derived features from time, frequency, and time-frequency domain. Finally result showed that the present low-level feature based DNN technique improves diagnosis accuracy with little computational time.
Hasan et al. 2021 152 successfully used grayscale images from 2D time-frequency graph called scalogram obtained using CWT of vibration signals as input to adaptive deep CNN (ADCNN) for centrifugal pump fault diagnosis. Jiang et al. 2021 153 combined empirical wavelet transform (EWT) and 1D-CNN for fault diagnosis of axial pump using vibration and pressure signals. They fused time and frequency domain features obtained from denoised signals by EWT and further used as input to CNN for effective fault diagnosis. In other work, Bie et al. 2021 154 used singular spectral entropy of intrinsic modal function (IMF components of vibration signals as input to LSTM DNN for performing successful diagnosis of reciprocating pump.
Summary of DL based pump and compressor fault diagnosis.
Future trends and research challenges in DL based fault diagnosis
Deep learning has been successfully developed and used in the field of condition monitoring and fault diagnosis of rotating machines in last 5 years. However, DL based condition monitoring and fault diagnosis has some shortcomings which need to be considered in future research. Here some future trend amd research challenges are discussed. • Notably, the DL based techniques are required big data volume for the effective performance. However, in industry, the big data volume may be limited for the particular fault conditions of a rotating machine. The focus of DL based diagnosis is now shifting on limited data analysis. • In last 5 year, the research with DL based fault diagnosis has focused on individual machines like bearing, Gear, motor, pump, turbine and others. However, in industry, machines are worked in combinations; therefore the fault diagnosis based on DL should focus on the combined machines rather on individual machines. • L based diagnosis have been effectively developed in order to diagnose specific type of faults at particular stage in rotating machines. However, in real situations, there are fair chances of developing multiple faults or degradation of faults in a rotating machine. Therefore, the DL based fault diagnosis may focus on this aspect of monitoring. • The effective DL based diagnosis has been established with the standard data sets available or generated datasets in laboratory for different machines. The focus may shift from the standard data to the actual data from the industry. • The research mainly considered the data which is generated in similar environments and from similar machines, however, it can be predicted that the inn near future the DL based diagnosis will be successfully developed for the actual data obtained from different operating and working conditions and from different capacity machines. • The future research may focus on theory evidence why DL can achieve particular results of the diagnosis, which is lacking in the literature. • The DL is fairly developed for fault diagnosis of various rotating machines. It is no doubt that the DL will be rapidly developed for fault prognostics in near future. • Notably, the DL based fault diagnostics have not been yet put into actual exercise in industry due to some limitations of hardware and other challenges of DL like noisy data and limited knowledge of the fault situations.
Conclusions
The aim of any fault diagnostics is to effectively detect the present condition of the machine so as to choose whether the machine requires going into maintenance. Deep learning based diagnosis shows better performance in fault diagnosis in comparison to other machine learning algorithms likes ANN, FL and SVMs. This review paper presents the research and progress in the field of condition monitoring and fault diagnosis of rotating machines based on DL techniques. This paper first added various fault and their condition monitoring methods of rotating machines and then theory of various types of DL in brief. After that the review of 5-6 years research advancement in DL based fault diagnosis for motor, rotor-bearing model, gear, pump and compressors have been presented. The future trend and research challenges have also been discussed at the end of the paper.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
