Abstract
Bearing fault diagnosis is crucial for ensuring the reliability and safety of rotating machinery in industrial settings. Machine learning-based diagnostic models offer powerful solutions, but their effectiveness is challenged by substantial domain shifts caused by variations in operating conditions, such as changes in motor load. To address this challenge, a novel domain adaptation framework that combines physical domain knowledge with deep learning techniques is proposed. The framework employs envelope spectrum analysis to generate reliable pseudo-labels for unlabeled target domains. By incorporating local maximum mean discrepancy into the training process, the framework aligns feature distributions between source and target domains while preserving class-specific information. This method enhances the adaptability of diagnostic models to real-world industrial conditions, reducing the need for extensive labeled data and improving predictive reliability across different operating scenarios. Experimental results performed on the Case Western Reserve University bearing dataset demonstrate that our method outperforms baseline models, achieving superior classification accuracy under significant domain shifts. By improving fault detection under varying load conditions, this approach contributes to more efficient predictive maintenance, reducing unexpected failures and operational downtime in industrial machinery. This approach highlights the potential of combining physics-based insights with deep learning to enhance fault diagnosis in diverse and complex industrial scenarios.
Introduction
Bearings play a critical role as key components in rotating machinery, responsible for supporting loads and reducing friction between moving parts. 1 The condition of bearings directly influences the efficiency, reliability, and safety of industrial systems. 2 Undetected bearing faults can lead to catastrophic equipment failure, significant financial losses, and safety hazards.3,4 As a result, precise and prompt bearing fault diagnosis has emerged as a crucial focus in both research and industrial practices.
Traditionally, bearing fault diagnosis has relied on signal processing techniques and statistical methods to extract time, frequency, and time-frequency domain characteristics from vibration signals.5–7 These handcrafted features were then analyzed using methods like envelope detection and spectral analysis. However, these methods often face limitations due to their reliance on expert knowledge and their lack of adaptability to complex fault patterns.8,9 Machine learning methods, including support vector machines (SVMs) and random forests (RFs), have been utilized to automate the processes of feature selection and classification, offering improved accuracy and scalability.1,10 For example, Zhou et al. 11 proposed a model, combining the whale gray wolf optimization algorithm with variational mode decomposition (VMD) and SVM, achieving 100% fault diagnosis accuracy by effectively optimizing VMD decomposition and SVM parameters. Wang et al. 12 introduced an SVM-based diagnostic model optimized with particle swarm optimization and used features derived from singular value decomposition of Hilbert spectra and permutation entropy for accurate fault detection. Roy et al. 13 developed an RF classifier using features extracted from autocorrelation of vibration signals and refined through recursive feature elimination, achieving high diagnostic accuracy across varying operating conditions. Wan et al. 14 improved the RF model by removing low-performance and redundant trees, and parallelizing it on the Spark platform to enhance fault diagnosis accuracy and speed for large-scale datasets.
In recent years, deep learning approaches have shown exceptional performance by automatically extracting hierarchical features from raw signals, thereby removing the need for handcrafted feature engineering.15–17 Ruan et al. 18 introduced a physics-guided convolutional neural network (PGCNN) where the input size and convolution kernel were optimized based on bearing acceleration signal characteristics. The PGCNN, validated on two benchmark datasets, demonstrated improved accuracy and efficiency compared to traditional CNNs with standard parameters. Wang et al. 19 proposed a 1D-CNN-based multi-modal approach that fuses vibration and acoustic signals for bearing fault diagnosis. By combining features from multiple sensors, the method achieved higher accuracy and robustness under varying signal-to-noise ratios compared to single-modal methods. Qiao et al. 20 developed a dual-input CNN-LSTM model for end-to-end fault diagnosis using time and frequency domain features. The model, tested under varying noise and load conditions, showed high fault recognition accuracy and adaptability due to its combination of spatial and sequential feature extraction. Khorram et al. 21 designed a convolutional RNN model using raw time-domain vibration data without preprocessing. The method achieved state-of-the-art accuracy in bearing fault detection, emphasizing speed and end-to-end learning for practical application. He et al. 22 proposed RTSMFFDE-HKRR for train-bearing fault diagnosis in noisy environments, improving feature extraction but not addressing domain adaptation. He et al. 23 developed AGFCN for high-speed train bogie bearings, enhancing fault detection under noise but relying on labeled target data. Yao et al. 24 introduced an attention-based neural network for wheelset axle faults, improving efficiency but lacking domain adaptation. Despite these advancements, deep learning models often encounter challenges in real-world applications due to the lack of sufficient real-world training data.25,26
To tackle the challenges posed by domain shift, researchers have developed transfer learning, a method in which models are trained on one dataset and then applied to real-world observational data.
27
However, domain shift remains a major obstacle in transfer learning, as differences between the source and target domains can hinder model performance.
28
These differences often result from variations in operating conditions, such as load, speed, or temperature, between the training data (source domain) and the testing data (target domain).
29
Such discrepancies can significantly affect the ability of features learned from the source domain to generalize to the target domain.30,31 To address these challenges, several domain adaptation techniques have been proposed to align the distributions between the source and target domains.
9
Figure 1 illustrates the domain adaptation problem and its two main solutions: global and local adaptation. For the global approach, Wan et al.
28
introduced a deep convolutional multi-adversarial domain adaptation model that utilizes an enhanced ResNet for feature extraction. This model integrates multi-kernel maximum mean discrepancy (MMD) with domain discriminators to achieve effective alignment between domains, demonstrating superior performance in cross-domain fault diagnosis. Similarly, Liu et al.
32
introduced a simulation-driven domain adaptation approach using a domain adversarial neural network (DANN). By utilizing simulated vibration signals, their model achieved high fault diagnosis accuracy with minimal reliance on real-world data, making it suitable for industrial applications. Furthermore, Chen et al.
33
developed a joint attention adversarial domain adaptation framework that integrates both local and global attention mechanisms with MMD-based feature region weighting. This approach demonstrated superior accuracy across multiple fault datasets, outperforming state-of-the-art models and proving its robustness in addressing domain shifts. Figure 1 illustrates the domain adaptation problem and highlights its two primary solutions: global and local adaptation. Illustration of the domain adaptation problem (left) and solutions (right).
For the local domain adaptation approach, local maximum mean discrepancy (LMMD) has been widely used. 34 The LMMD method reduces distributional discrepancies between the source and target domains in a localized manner by aligning both marginal and conditional distributions. This approach enhances fault classification performance across varying working conditions. 35 While other methods focus on global adaptation, LMMD outperforms them all due to its ability to adapt at the class level, known as subdomain adaptation. This allows LMMD to achieve higher classification accuracy. However, LMMD-based methods have limitations when the domain shift is large. In such cases, the predictions made by the model for the target domain data may be unreliable. Since LMMD relies on these predictions to guide the adaptation process, incorrect predictions can propagate errors and hinder the model’s performance. 36 This highlights the need for a more robust approach to address large domain shifts.
While machine learning and deep learning have significantly advanced bearing fault diagnosis, their effectiveness is often hindered by domain shifts caused by varying operating conditions. Transfer learning techniques have been introduced to mitigate this issue, yet existing methods struggle with effectively balancing global and local adaptation. LMMD-based approaches have improved feature alignment across domains, but their reliance on pseudo-labels can lead to error propagation when domain shifts are large. In such cases, inaccurate predictions in the target domain compromise the adaptation process, ultimately degrading model performance. These challenges highlight the need for a more robust adaptation framework that can enhance reliability under substantial domain shifts while preserving class-specific information.
To overcome this limitation, we propose a knowledge-driven subdomain adaptation framework for bearing fault diagnosis. Our approach leverages prior knowledge of fault frequencies to generate guided labels for the target domain. These guided labels serve as a reliable reference for the LMMD process, enabling more accurate alignment of the source and target domain distributions, even under significant domain shifts. By integrating domain knowledge into the deep learning framework, our method enhances the model’s robustness and diagnostic accuracy across varying working conditions.
The key contributions of this paper are as follows. (i) We identify the limitations of existing LMMD-based domain adaptation methods for bearing fault diagnosis under large domain shifts. (ii) We propose a novel knowledge-driven subdomain adaptation framework that integrates fault frequency information to generate guided labels for LMMD. (iii) We validate the effectiveness of the proposed framework through extensive experiments, achieving improved fault classification performance across diverse working conditions.
Problem formulation
Bearing fault diagnosis under varying operating conditions is a complex problem, often requiring the adaptation of machine learning models to handle the differences in data distributions between training (source) and testing (target) datasets. These variations in operating conditions such as changes in load, speed, or temperature can significantly impact the signals collected from vibration sensors, which are commonly used for fault diagnosis.
Let the source domain D s be represented as D s = {x s ,y s }, where x s are the input features (e.g., vibration signals) and y s are the corresponding fault labels for the source domain data. The source domain data is labeled, and we use this data for training our initial fault diagnosis model. On the other hand, the target domain D t is represented as D t = {x t }, where x t are the input features from the target domain, which are unlabeled. In practice, the target domain data comes from a different operating condition than the source data. The distributions of x s and x t are not identical, which poses a challenge for accurate fault classification in the target domain.
The goal of domain adaptation is to overcome the discrepancy between the source domain distribution P s (x s ) and the target domain distribution P t (x t ), which arises due to differences in working conditions. This distributional shift can lead to poor performance of the trained classifier when applied to the target domain, as the feature space and fault characteristics in the target domain may not directly match those in the source domain.
To tackle this challenge, the goal is to learn a classifier F that can effectively classify faults in the target domain, despite the domain shift between the source and target domains. This can be achieved by minimizing the domain shift through a process known as domain alignment, where we align the feature distributions between the source and target domains. Figure 1 illustrates the domain adaptation problem (left) and solutions (right).
Methodology
The methodology for this study combines domain knowledge of fault frequencies with advanced neural network-based domain adaptation techniques to achieve robust fault diagnosis across varying working conditions. This approach consists of three main components: data preprocessing, pseudo-label generation using fault frequency information, and a neural network architecture enhanced with LMMD. Figure 2 illustrates the proposed methodology. In the first step, data preprocessing involves extracting features from vibration signals in both the source and target domains, which are represented as the envelope spectrum. These extracted features serve as crucial inputs for the subsequent stages. The next step focuses on the pseudo-label generation, where fault frequency features from the envelope spectrum are used to create pseudo-labels. These labels provide fault-related guidance for the unlabeled target domain data. Finally, the neural network is trained using two loss functions: cross-entropy loss and LMMD loss. The cross-entropy loss aids in fault classification by using the envelope spectrum and true labels from the source domain, while the LMMD loss reduces the distribution mismatch between predictions from the source and target domains. By utilizing pseudo-labels, LMMD supports domain adaptation, enabling the model to effectively classify target domain data. Additional details of each component are provided in the following sections. The proposed method for cross-condition bearing fault diagnosis.
Data preprocessing
In this stage, the raw vibration signals are processed to extract meaningful features that are sensitive to bearing faults. A widely adopted approach, envelope spectrum analysis, is employed to highlight fault-specific frequency components buried within complex signals. This process begins with band-pass filtering of the raw vibration signals to isolate the frequency range that typically contains fault-related features. The filtered signals are then demodulated using the Hilbert transform to compute the signal envelope:
Roller bearing fault frequencies.
N b : number of rollers; f s : fundamental frequency of the rotor; D b : roller diameter; D p : pitch diameter; θ: contact angle.
Pseudo-label generation
The fault frequency-guided label generation is a critical step in adapting the model to new domains. Fault frequencies are consistent across varying operating conditions because they are determined by the bearing’s geometry and rotational speed. This property is leveraged to assign pseudo-labels to the unlabeled target domain samples. The process involves calculating theoretical fault frequencies using known machine specifications as shown in Table 1 and comparing them to peaks in the envelope spectrum of the target domain data.
Samples exhibiting prominent peaks at the calculated fault frequencies are pseudo-labeled as belonging to the corresponding fault class. Peaks are identified in the envelope spectrum using signal processing criteria, including peak prominence, height, and distance as in 37. A peak is considered valid if it is within a predefined frequency range associated with the expected fault frequencies (e.g., BPFO, BPFI, and BSF), with tolerances set to account for minor variations in system dynamics.
If a single prominent peak corresponding to a calculated fault frequency or a harmonic is detected, the sample is pseudo-labeled as belonging to the corresponding fault class. In cases where multiple peaks are found, the label is assigned to the fault class whose frequency is closest to the most prominent peak (based on amplitude). If no prominent peaks are detected within the tolerance range of any fault frequency, the sample is pseudo-labeled as “healthy.” This approach ensures that every sample is assigned a single label while leveraging physical phenomena to guide the pseudo-labeling process. By avoiding reliance on uncertain model predictions and directly linking pseudo-labels to fault-related characteristics in the spectrum, this method enhances the reliability of domain adaptation, even under significant domain shifts.
Neural network architecture
The neural network architecture for this study.
The unique aspect of this architecture is its integration with a domain adaptation component, where a shared feature extractor is used for both the source and target domains. The LMMD loss function is applied in the shared feature space to align both the marginal and conditional distributions between the two domains. This design facilitates the learning of domain-invariant features, enabling the model to generalize effectively across different working conditions.
Training process
The training process combines supervised learning on the source domain with unsupervised domain adaptation on the target domain. The total loss function comprises two terms: the classification loss, L
cls
, for the source domain, and the LMMD loss, L
LMMD
, for domain alignment. These are combined as:
The training process is divided into two phases. In the first phase, the model is trained using labeled source domain data to achieve accurate fault classification within the source domain. In the second phase, the model is fine-tuned using both source and target domain data. During this phase, fault frequency-guided pseudo-labels are utilized to compute the LMMD loss, enabling precise alignment of the feature distributions between the source and target domains. The model parameters Φ in each phase are updated using the Adam optimizer with a learning rate η:
Experiments
The proposed methodology is tested through a series of experiments to evaluate its performance across different operating conditions. The experiments utilize the CWRU bearing dataset,
35
which contains vibration signals collected at various motor loads (1HP, 2HP, and 3HP). This dataset includes four fault categories: normal operation, inner race fault (IF), outer race fault (OF), and rolling element fault (RF), with fault diameters of 0.007, 0.014, and 0.021 inches. The vibration signals were captured using accelerometers attached to the motor housing, with a sampling frequency of 48 kHz. These signals are representative of real-world scenarios where domain shifts arise due to variations in machine load or operating conditions. Figure 3 demonstrates the experimental testbed from CWRU bearing dataset. Experimental test setup of the CWRU dataset.
For this study, only data corresponding to the smallest fault diameter of 0.007 inches is used to evaluate the model’s capability for early fault detection. The raw vibration data is preprocessed by segmenting it into windows corresponding to five revolutions of the motor shaft with 75% overlap. This ensures that each segment contains five full cycles of rotational data, allowing the model to focus on fault-specific patterns and improving the granularity of fault diagnosis. Examples of the raw vibration signals from the motor under different working conditions are shown in Figure 4. Raw vibration signals from different working conditions.
Additionally, envelope spectrum analysis was performed to validate the reliability of fault frequency-based pseudo-labeling in the target domain. The spectral analysis in Figure 5 revealed clear peaks around characteristic fault frequencies, confirming the physical interpretability of the guided labels. Although there may be multiple peaks, our selection algorithm ensures that they are appropriately pseudo-labeled. Validation of pseudo-labels created from envelope spectrum analysis.
Three domain adaptation tasks are defined to assess the robustness of the proposed method under varying operating conditions: 1HP → 2HP, 1HP → 3HP, and 2HP → 3HP. These tasks simulate real-world scenarios where the operational conditions of machinery undergo significant changes, requiring the model to generalize effectively across different domains. For each task, 80% of the source data and 80% of the target data are used for training, while 20% of the source data is reserved for model validation during training, and 20% of the target data is used to evaluate the model’s performance. Each task is conducted multiple trials, and the average accuracy is reported to ensure the reliability of the results.
The training process was further analyzed through loss convergence graphs in Figure 6, which demonstrated the effectiveness of the proposed framework in aligning source and target domains. In both two phases, the loss function decreased steadily during training, indicating successful feature alignment, while the validation accuracy converged stably, ensuring reliable performance across tasks. These results emphasize the importance of integrating fault frequency-guided labels into the adaptation process, especially in challenging scenarios with significant domain shifts. Loss function and validation accuracy in the training process.
The classification accuracy performance and comparisons.
The classification accuracy performance under different levels of noise.
The confusion matrices for the three domain adaptation tasks in Figure 7 highlight the effectiveness of the proposed framework compared to baseline methods. For the 1HP → 2HP task, the proposed method significantly reduces misclassifications across all fault classes, particularly resolving ambiguities between inner and roller faults that are prevalent in the NA approach. In the 2HP → 3HP task, the proposed framework demonstrates clear improvements, especially in distinguishing race faults from others, which remain challenging for MMD and LMMD. The largest domain shift, 1HP → 3HP, poses severe challenges for NA and moderate issues for MMD and LMMD, as evidenced by high confusion between normal and faulty classes. However, the proposed method achieves the highest true positive rates across all tasks, effectively addressing inter-class confusion and demonstrating superior robustness under significant domain shifts. These results underscore the reliability and effectiveness of the proposed framework in comparison to traditional and existing domain adaptation techniques. The confusion matrices for the three domain adaptation tasks.
To illustrate feature alignment, t-SNE visualizations were used to project the feature space for the 1HP → 3HP task as shown in Figure 8. Without adaptation, source and target features were poorly aligned, resulting in overlapping class boundaries. MMD-based adaptation partially improved alignment, but some classes remained misaligned. In contrast, the proposed framework demonstrated a clear alignment of source and target features, with well-separated class boundaries, enabling accurate classification. t-SNE visualizations for the 1HP → 3HP task. (a) No adaptation; (b) MMD method; (c) LMMD; and (d) proposed method.
Discussion
Despite the promising outcomes, the current approach has limitations. Firstly, the reliance on pseudo-label generation based on fault frequency may not fully account for complex fault patterns that are not explicitly linked to specific frequency features, limiting its generalization to more diverse fault types. Additionally, the performance of the model is highly dependent on the quality of the source and target domain data. Variations in data quality or sensor reliability could impact the model’s effectiveness. The method also assumes that the fault frequencies are consistent across different working conditions, which may not always be the case in real-world applications where fault characteristics can change dynamically.
Recent advancements in machine learning and signal processing have introduced various approaches for industrial fault diagnosis. Digital twin methodologies have gained traction for vibration-based monitoring and gear wear prediction, enabling real-time assessment and predictive maintenance by creating virtual replicas of physical systems. 41 Digital twin-driven intelligent assessment of gear surface degradation integrates physics-based and data-driven models to track fault progression and optimize maintenance strategies. 42 Furthermore, digital twin-enabled domain adversarial graph networks have been explored to enhance cross-domain fault diagnosis by leveraging structured relationships between different operating conditions. 43 Another emerging direction is the use of neuro-fuzzy system-guided cross-modal zero-sample diagnostic frameworks, which combine heterogeneous sensing data, such as infrared thermography and acoustic signals, to improve diagnostic accuracy when labeled data is limited. 44
Conclusion
This paper presented a knowledge-driven subdomain adaptation framework for bearing fault diagnosis, addressing the challenges posed by domain shifts due to varying operating conditions. By leveraging fault frequency-based pseudo-labels and LMMD, the framework aligns source and target domains effectively while minimizing the reliance on model predictions. The integration of physical domain knowledge ensures reliable adaptation even under significant domain shifts, as demonstrated in experiments on the CWRU bearing dataset. Comparative analysis against baseline approaches showed that the proposed method consistently outperforms in terms of accuracy and robustness. This study underscores the importance of combining domain knowledge with advanced machine learning techniques, paving the way for practical and reliable fault diagnosis solutions in real-world industrial applications.
Future work will focus on several areas to enhance the proposed methodology. First, exploring more sophisticated techniques for fault frequency extraction and incorporating additional domain knowledge could help in handling more complex fault patterns. Furthermore, enhancing the robustness of the model to variations in sensor data quality will be crucial for real-world applicability. Future studies could also investigate multi-source domain adaptation to further improve performance when data from multiple sources or varying conditions are available. Finally, incorporating online learning techniques that allow the model to adapt continuously to new, unseen operating conditions would make the system more flexible and capable of handling evolving fault scenarios in industrial environments.
Footnotes
Author contributions
All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by N.T.H., N.D.T., and H.S.H. The first draft of the manuscript was written by N.T.H. and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is funded by Hanoi University of Science and Technology (HUST) under project number T2023-PC-027.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
