Abstract
To solve the fault diagnosis difficulties in autonomous underwater vehicle (AUV) thrusters, a semi-supervised AUV fault diagnosis method based on dynamic decay learning strategy and hypergraph attention network (HGAN) is proposed. Firstly, an attention mechanism is introduced into hypergraph convolutional networks (HGCN) to construct HGAN. Then, the HGAN and graph convolutional network (GCN) are designed in parallel architecture to capture both the dynamic and static features of the input graph signal simultaneously. Finally, a dynamic decay learning strategy is introduced, which improves the training efficiency of the proposed model. The diagnostic precision of the proposed approach is verified through experiment analysis. Besides, its superiority over the other related methods is also verified through comparison study.
Introduction
As a commonly used tool working underwater, it will cause incalculable economic and safety losses due to the complexity of the underwater environment when failure arises in the propulsion system of AUV. In addition, AUV technology is developing towards greater range, greater depth, higher speed, and higher intelligence with the demand of ocean exploration. Studying effective monitoring and diagnostic methods for AUV thrusters has important safety and economic significance. Fault diagnosis based on vibration signal processing and shallow machine learning are two kinds of traditional methods. Common signal processing methods such as FFT, 1 empirical mode decomposition, 2 wavelet transform, 3 cyclostationary analysis, 4 sparse dictionary learning, 5 variational mode decomposition, 6 Lempel–Ziv complexity,7,8 etc., have been applied widely in the field of fault diagnosis. However, due to the complex environments such as strong interference from underwater currents and noise, it is inevitable increasing the fault diagnosis difficulty based on vibration signal processing. In addition, fault diagnosis methods based on vibration signal processing require strong professional background knowledge, which lack of directness. In the field of fault diagnosis based on shallow machine learning, Yin et al. proposed a fault recognition method based on the fusion of time-frequency features and support vector data description, which addressed the problem of low classification accuracy in AUV thruster fault recognition. 9 Zhang et al. proposed an improved SVDD fault mode classification method to solve the problem of difficulty in obtaining optimal kernel function parameters of traditional SVDD. 10 Wang et al. improved dynamic recurrent neural networks and radial basis function networks, and obtained overall fault information by comparing the output values of the AUV motion state model with the measured values of actual velocity and angle. 11 The literature proposed an equivalent model based on fuzzy neural network structure for system state monitoring, whose output is the state or fault degree of certain parts of the system. 12 Sun et al. proposed an improved dynamic recurrent neural network for AUV thruster fault diagnosis. 13 Tang and Gang proposed an intelligent fusion system by combining feedforward neural network with artificial neural network for fault detection and fault-tolerant control of AUV propulsion system. 14 Shi and Zhang proposed a fault diagnosis software based on Bayesian networks. Furthermore, a task context based irrelevant node segmentation method was proposed based on the characteristics of AUV, which further simplified the Bayesian network, reduced the complexity of consequence calculation, and improved the ability of real-time fault diagnosis effectively. 15 DAS proposed a method by combining genetic algorithms with ensemble learning for fault detection of AUVs. 16 Nascimento evaluated a data-driven fault diagnosis scheme based on recurrent neural networks using empirical data. The nominal dynamics of the thruster were modeled by employing control inputs, voltage, rotational speed, and current signals. 17 Costa proposed a diagnostic method using wavelet coefficient energy and SVDD to address the dependence of Fourier diagnosis on the mother wavelet. 18 However, the performance of fault diagnosis based on shallow machine learning often depends on the effectiveness of feature extraction, which limits its further application in AUV fault diagnosis.
The rapid development of deep learning methods in recent years have been applied widely in fault diagnosis. Unlike shallow machine learning, the deep learning methods do not require the feature extraction step, and could realize end-to-end diagnostic results. 19 Sang et al. 20 used the collected AUV acoustic signals as inputs of the improved convolutional neural network (CNN) to monitor the health status of in-situ thruster, and diagnosed the fault status of thruster correctly. Xia et al. 21 proposed a hierarchical attention multi-source data fusion method (HAMFD), which improved the accuracy of fault assessment through a multi-layer attention mechanism and achieved effective fault identification of Qianlong-2 AUV. Moghaddam et al. used a consciousness attention model based on dynamic core assumptions, and combined it with CNN and transfer learning to improve the fault diagnosis accuracy successfully. 22 SUBHA combined phase space reconstruction with ELM for predicting sensor outputs of AUV. 23 Chaos et al. presented a fault-tolerant control for AUV in the presence of critical failure. 24 Usually, increasing the number of labeled training samples is the most effective way to enhance the generalization ability of the aforementioned artificial intelligence models. However, obtaining a large number of labeled training samples often faces many challenges in practical engineering applications. 25 In order to solve the above-mentioned difficulty in obtaining a large number of labeled training samples, researchers have explored various semi-supervised learning methods aiming at mining the information contained in unlabeled samples by using limited labeled samples. Liao et al. 26 developed a deep semi-supervised domain generalization network (DSDGN) specifically for fault diagnosis of rotating machinery running under variable speed, and further generalized the model on fault diagnosis tasks at unknown speeds successfully. Lin et al. 27 proposed an improved semi-supervised meta learning (ISSML) model, which can be adjusted flexibly according to the different diagnostic scenarios. Tang et al. 28 proposed an innovative multi-scale recursive autoencoder (MRAE) training framework, which can achieve accurate feature extraction in environments with scarce labeled training samples. In addition to the inability to obtain sufficient labeled training samples, there are also noise interference caused by various factors such as wind, rain, ships, ocean currents, marine organisms, and industrial activities in the marine environment. 29 These noise interference includes both the natural sound sources and anthropogenic sound sources. 30 AUVs will be disturbed inevitably by these ocean environmental noises during operation. Faced with the dual challenges of scarce label samples and ocean noise interference, finding effective solutions has become particularly crucial. The introduction of HGCN has overcome the limitations of traditional graph neural networks in handling complex relationships and high-order interactions. 31 In the latest research progress, HGCN has demonstrated excellent denoising ability. Zhang et al. 32 developed an innovative hyperspectral image denoising technique, which utilized hypergraph convolution to capture block level high-order correlations. Lei et al. 33 encoded the sample signal into a hypergraph structure, which captured the high-order correlations in the sample effectively by introducing hyperedges, and then utilized the denoising characteristics of hypergraph convolution to improve signal quality and feature extraction accuracy. Similarly, Zhao et al. 34 designed a model assisted by multi-source fusion hypergraph convolutional neural network (MAMF-HGCN), which aimed at solving the intelligent fault diagnosis problem of electro-hydraulic brakes with a small number of samples.
In summary, a semi-supervised AUV fault diagnosis method based on dynamic decay learning strategy and HGAN is proposed, whose aim is to address the issues of scarce label samples and severe interference noise in the engineering application fault diagnosis of AUV thruster. The specific innovations are as follows:
Attention mechanism is introduced into HGCN to construct HGAN, which reveals the intrinsic correlation between network nodes effectively.
A parallel hybrid network architecture model based on HGAN and HCN is constructed, which captures the dynamic and static features of input graph signals synchronously.
A hybrid network training method based on dynamic decay learning strategy is proposed to achieve the effectiveness of model training.
The remaining chapters of the paper are arranged as follows: Chapter 2 is the basic theory and improvement methods related to the paper. Chapter 3 describes the method and specific diagnostic process. Chapter 4 is about experimental verification and comparative research. Conclusion is presented in Chapter 5.
Related theories
HGAN
GCN
Graph neural network (GNN) is a special type of neural network model, which excels at utilizing the inherent relational information in graph structures for learning and processing graph data. 35 As an advanced form of GNN, GCN integrates distant contextual information by expanding the global receptive field of the model effectively, thereby improving the performance of the model. 36
The standard structure of GCN is shown in Figure 1(a). A typical graph could be represented as
where
Meanwhile, let
Further revising
in which

Graph structures: (a) the structure of GCN and (b) the structure of HGAN.
HGAN
HGCN structures data into hypergraphs, and incorporates hypergraph convolution operations to capture multidimensional and deep level connections in the data more accurately,
37
thereby identifying complex high-order interactions between nodes more effectively. Similarly, suppose that
The hyperedge degree matrix is as following:
Subsequently, the hypergraph convolution is defined as following:
in which
in which
However, the spectral radius of
where
The schematic diagram of HGAN is shown in Figure 1(b), and its main purpose of constructing HGAN is to learn a dynamic correlation matrix by establishing an attention mechanism, thereby obtaining a dynamic transformation matrix. This matrix can reveal the intrinsic relationships between nodes more effectively. The attention scores are obtained by applying
where
where
Dynamic decay learning strategy
Global cyclic dynamic decay learning strategy
This article proposes an innovative global cyclic dynamic decay learning strategy,38,39 which not only improves the traditional periodic learning rate, but also retains the idea of global optimization. Furthermore, the paper combines it with the designed network to improve its training effectiveness. A learning rate decay strategy was integrated into the global cyclic learning strategy. Meanwhile, a dynamically changing exponential function was designed in order to better adapt to network model’ needs. Through the above coordinated strategy, a more flexible learning rate adjustment mechanism is provided for the network, allowing the learning rate to adjust dynamically.
In the design of learning rate, the loop length of the learning rate is set as
where ⌊·⌋ representing the number of learning rate cycles in the network.
The learning rate after its Nth iteration is as following:
in which the length of the cycle is
Based on the set initial value, the learning rate of the model will decay gradually and change during the training process. The dynamic variation of learning rate decay is presented in Figure 2. The benefits of this change are self-evident, as it enables the model to have interference suppression when subjected to strong noise interference, thereby improving the classification accuracy significantly.

Learning rate variation of global cyclic dynamic decay learning strategy.
Improved loss function
The main focus of this paper is dealing with a supervised learning classification problem, so the improved cross entropy loss function is used as the loss function to optimize the model. The cross entropy loss function is mainly used to measure the difference between the probability distribution of the model’ output and the actual label, and then evaluate the prediction accuracy of the model. In classification tasks, the smaller the value of the cross entropy loss function, the higher the classification accuracy of the model. The formula is as following:
where
where
By comparing the performance of these two loss functions (as shown in Figure 3), it can be observed that the cross entropy loss function becomes stable from the 55th round and reaches its optimal effect. Besides, the newly proposed loss function began to stabilize in the 19th round, and showed a more significant improvement in performance. Furthermore, the new loss function is more stable in the first 50 iterations compared to the cross entropy loss function. The comprehensive experimental results show that the proposed method improves the training performance of the model significantly and shortens the convergence period.

Comparison of loss functions.
The proposed hybrid model and flow chart of the proposed method
The design of hybrid model
As shown in Figure 4, HGAN and GCN worked together through a parallel architecture design, which can capture the dynamic and static features of the input graph signal simultaneously. The HGAN–GCN model further refines the expression of node features by mining the features of nodes and their neighborhoods deeply. HGAN utilizes attention mechanism to adjust information flow dynamically, assigning higher weights to nodes and edges that are more critical for improving fault diagnosis accuracy. This not only improves computational efficiency, but also reduces the interference of redundant information effectively. GCN can transmit and integrate node features in the network, enabling the model to learn and embed features from both dynamic and static dimensions simultaneously. The so-called ‘static’ refers to the graph convolutional networks, which mainly rely on the inherent properties of nodes, and propagate these properties through network structure without involving additional dynamic information. In contrast, the ‘dynamic features’ refers to the captured evolving high-order correlations between nodes. These features are continuously updated during the training iterations via the attention mechanism. This allows the model to adaptively adjust the information flow based on the changing relationships among sensor data. A more refined and efficient model has been constructed by fully utilizing the complementary advantages of HGAN and GCN. Therefore, when facing strong noise interference, the combination of HGAN and GCN can extract fault features more effectively and enhance the anti-interference ability of the diagnostic process.

The architecture design of the constructed model.
Besides, in order to handle the frequency offset and amplitude variation characteristics of samples under interference, the paper introduces batch normalization (BN) operation into each HGAN layer and GCN layer. Subsequently, there is a non-linear layer. Therefore, the formula for node update is expressed as following:
Then construct the dynamic hyperedges with node-hyperedge attention. As for the feature
in which
Subsequently, the simultaneous extracted dynamic and static feature outputs by HGAN and GCN are fused adaptively. The specific expression is as following:
In the above formula,
The architectural design of the proposed HGAN–GCN framework is specifically tailored to address the unique vulnerabilities of AUV thrusters operating in harsh marine environments. By simultaneously extracting dynamic and static features, and integrating the consistency regularization, the network is inherently equipped to suppress complex, non-Gaussian marine interferences—such as texture noise caused by biological activities and impulse noise from ocean current disturbances. This targeted interference suppression capability prevents model degradation and ensures highly reliable fault diagnosis even when labeled training data is critically scarce.
Flow chart of the proposed method
The overall framework of the proposed method is shown in Figure 5, and details are as follows:
Collect vibration signals of AUV thrusters under various health conditions, including limited labeled samples and a large number of unlabeled samples. Then, the limited labeled samples are combined with some unlabeled samples as the training set, and the remaining unlabeled samples are used as the testing set.
Design an HGAN based on HGCN, and construct the HGAN-GCN model by parallel connection between HGAN and GCN.
Input the training set into the HGAN module and GCN module simultaneously, and extract both dynamic and static features of the input graph signal. Here, the training process is optimized by the dynamic decay learning strategy. The label set predicted by equation (18) can be obtained from two FC layers. Then calculate the loss using equation (15) and update the model using the Adam algorithm.
Input the unlabeled test set into the trained model to obtain diagnostic results, and compare them with ablation experimental models and other semi-supervised fault diagnosis methods based on graph structure models.

Flow chart of the proposed method.
Experiment
The experimental dataset is collected from the submersible Qianlong-II, and the test rig is shown in Figure 6. The Qianlong-II AUV is a 4500-meter class AUV, measuring 3.5 meters in length, 1.5 meters in height, and 1.5 tons in weight. It is equipped with four main thrusters and one lateral auxiliary thruster on the servo, which allows the AUV to perform exploration tasks flexibly. The proposed method was validated using monitoring data from the South China Sea trials conducted on the submersible AUV Qianlong-II. 40 Five types of faults are conducted in the experiment: Fault 1: interference from the depth altimeter (corresponding to data V4). Fault 2: stall of the left front motor (corresponding to data V10). Fault 3: incorrect feedback of the left bow motor speed (corresponding to data V11). Fault 4: CTD failure (corresponding to data V49). Fault 5: low battery level (corresponding to data V17).

Qianlong-2 AUV.
In the experiment, vibration signals were collected from the side thrusters. By artificially simulating various fault conditions, the DH5902N acquisition system captured these signal data at a sampling frequency of 12.8 kHz and a sampling duration of 10 s. By setting 1024 data points as the length of one sample, 204,800 data points were extracted to construct 200 samples. Among these, 80 samples were used for training and 120 for testing. In the training samples, a 10% labeling ratio was set, meaning 20 labeled samples and 180 unlabeled data points. During the training process, the remaining unlabeled samples strictly serve as unannotated data to compute the unsupervised consistency regularization loss and the hypergraph smoothness loss. It must be explicitly emphasized that while the true labels of the testing samples are utilized post-training to evaluate the final diagnostic accuracy, these labels remain strictly hidden and are entirely inaccessible to the model throughout the entire training and validation phases. This ensures a rigorous and unbiased evaluation of the semi-supervised learning framework.
The time-domain and frequency-domain waveforms corresponding to each fault state are shown in Figure 7, based on which it is difficult to distinguish the five states. The reason for this is that under the influence of strong background noise, conventional signal processing methods are difficult to achieve good diagnostic results.

Time domain waveforms of the five status with their corresponding frequency domain waveforms: (a) time domain waveform of the status 1, (b) frequency domain waveform of the status 1, (c) time domain waveform of the status 2, (d) frequency domain waveform of the status 2, (e) time domain waveform of the status 3, (f) frequency domain waveform of the status 3, (g) time domain waveform of the status 4, (h) frequency domain waveform of the status 4, (i) time domain waveform of the status 5, and (j) frequency domain waveform of the status 5.
The specific configuration of the experimental operating environment is as follows: the operating system is Windows 11, the central processor is an Intel Core i5-12500H, the programming environment is based on the PyTorch framework with Python 3.9, and the runtime is 57.2 s.
The parameter settings of the hybrid model are detailed in Table 1.
The parameter settings of the hybrid model.
Convergence speed analysis
The convergence speed of an intelligent diagnosis model is one of the important criteria for evaluating its quality. Visual comparative analysis is conducted on the accuracy curve (ACC) and loss curve (Loss) under 50 dB signal-to-noise level to verify the superiority of the proposed model. The pertinent experimental findings are capable of being referred to Figures 8 and 9, based on which it could be noticed that the presented approach achieved the utmost accuracy and lowest loss after 60 iterations, and its convergence speed is also better than the other three methods (WDCNN, LSTM, and CNN), which also reaches a steady state firstly after 60 iterations.

Diagnostic accuracy of various methods under 50 dB signal-to-noise level.

Diagnostic loss of various methods at 50 dB signal-to-noise level.
Classification accuracy analysis
As illustrated in Figure 10, a contrastive examination of the diagnostic accuracy of various models under different levels of marine environmental interference is presented. Through comparison, it is observable that the approach put forward in this study has significant benefits in diagnostic accuracy compared to other comparative methods. Specifically, the approach put forward in this paper is still capable of attaining a high level of diagnostic accuracy of 95.668% while facing 2 dB ocean interference. In the ideal state of interference-free (None), its diagnostic precision is as high as 98.662%. Further comparative analysis shows that the proposed method exhibits significant performance improvement under various levels of interference compared with the WDCNN model, LSTM model, and CNN model.

Comparison of the accuracy of each model under different ocean disturbance levels.
Stability performance analysis
The result shown in Figure 11 displays the stability verification results of various methods through multiple experiments at a ratio of signal to noise of 50 dB. It is possible to observe that the method proposed in this study still maintains a low level of fluctuation in its results after multiple experiments, demonstrating good stability. This is particularly prominent compared to the other three compared methods.

The stability of multiple experiments of various methods under 50 dB signal-to-noise level.
Visualization analysis
Figures 12 and 13 demonstrate the t-SNE feature visualization and confusion matrix analysis results of different methods in the 50 dB interference environment. By observing Figures 12 and 13, it is noticeable that the approach put forward in this research performs well in feature point classification, with the minimum number of misclassified feature points and samples. In contrast, other methods have experienced varying degrees of misclassification under the same conditions. These misclassifications not only affect the accuracy of classification, but also expose the limitations of the compared methods in dealing with complex marine disturbance environments. This discovery directly indicates that the approach put forward in this study is capable of addressing the challenges brought by ocean interference effectively, providing a reliable solution for research and application in related fields. Meanwhile, this result indicates indirectly that the comparative method may not have sufficient effectiveness in handling diagnostic tasks under ocean interference conditions. This may be due to the shortcomings of these methods in feature extraction, classification algorithms, or model structures.

Visualization analysis of various methods at 50 dB signal-to-noise level by t-SNE: (a) proposed method, (b) WDCNN, (c) LSTM, and (d) CNN.

Confusion matrices of various methods at 50 dB signal-to-noise level: (a) proposed method, (b) WDCNN, (c) LSTM, and (d) CNN.
Comparison analysis with advanced methods
Finally, the proposed method was compared and analyzed with the most advanced diagnostic methods in noise-free environment. Among them, one compared model named as AM-FD 41 is an automatic feature engineering method based on bidirectional gated recurrent unit networks, which can extract the temporal dynamic characteristics effectively, and achieve fault detection through multi-layer perceptrons. HAMFD 21 is a hierarchical attention multi-source data fusion method, which is designed specifically for fault diagnosis. The comparative results show that the AM-FD method achieves a diagnostic accuracy of over 90% for different types of faults, while the HAMFD method achieves a diagnostic accuracy of over 96% under the same conditions. In comparison, the proposed method has an average diagnostic accuracy of up to 98.662% under the same testing conditions, significantly better than these two existing advanced methods. In addition, we also adopted three important indicators, Precision, Recall, and F1 Score, to further evaluate the effectiveness and performance of the proposed method comprehensively. These indicators not only reflect the accuracy of the model in fault diagnosis, but also reveal its reliability in practical applications. As shown in Figure 14, the proposed method can achieve fault diagnosis effectively under ocean interference.

Precision, Recall, and F1 Score.
Ablation experiment
Conduct ablation experiments on the collected raw vibration signals without interference. The ablation experimental model is shown in Table 2.
The ablation experiment models.
Comparison result of radar accuracy in ablation experiments without interference through number of test is shown in Figure 15, based on which it could be observed that the proposed method demonstrates excellent performance, with diagnostic results not only superior to the compared ablation experimental models, but also surpassing recognized advanced methods in the current field. Furthermore, the proposed method has been tested multiple times, and its diagnostic results have relatively small fluctuations compared to other ablation models, proving that it has both high accuracy and strong stability.

Comparison of radar accuracy in ablation experiments without interference.
Conclusion
To address the collection challenges of training and labeling samples faced by AUV thrusters during operation, as well as overcoming interference challenges in complex marine environments, a Semi-supervised AUV fault diagnosis method based on dynamic decay learning strategy and hypergraph attention network is proposed. This method can explore fault information deeply under the condition of limited labeling and training samples. The designed parallel strategy extracts signal features effectively under complex interference conditions, and can capture both dynamic and static features of the input graph signal simultaneously.
The effectiveness of the proposed method is verified through experiment analysis, and the following conclusion could be obtained:
HGAN–GCN can fully utilize the fault information in limited labeling and training samples.
The designed parallel strategy can extract both dynamic and static features from nodes more effectively in degree of interference environment.
The proposed dynamic decay learning strategy can improve the training performance of the hybrid model effectively and ensure its real-time application.
Besides, the following conclusion could be obtained by comparison study:
Compared to the other two related advanced models, the proposed method demonstrates superior performance in terms of Precision, Recall, and F1 Score.
Ablation experiments demonstrate that each module in the proposed method plays an indispensable role in enhancing the diagnostic accuracy and overall performance of the proposed method.
Though the proposed method has great potential for application in fault diagnosis under the condition of lacking labeled samples, and subsequent research will attempt to apply it on fault diagnosis of rotating machinery such as gears and rolling shafts under the condition of lacking labeled samples. However, there are still some shortcomings in this study: due to the difficulty of data acquisition, the proposed scheme has not been fully validated in actual marine environments. In the future, we plan to apply the methods of this study on actual marine environments, and explore unsupervised learning techniques to better adapt to the working environment and actual interference situations of AUVs.
Footnotes
Handling Editor: Aarthy Esakkiappan
Author contributions
Shuai Zheng is the theoretical researcher and writer of the paper and Hongchao Wang is the program programmer in the paper.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The paper is supported by Henan Province’s New Key Discipline-Machinery (grant no. 0203240011).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability statement
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
