Abstract
EMG signal acquisition is mostly used in medical research. However, it has not been applied in athletes’ sports state recognition and body state detection, and there are few related studies at present. In order to promote the application of EMG signal acquisition in sports, this study combined with the actual needs of athletes to construct an EMG signal acquisition system that can collect athletes’ motion status. At the same time, in order to improve the effect of EMG signal acquisition, a wavelet packet principal component analysis model is proposed. In addition, in order to ensure the recognition efficiency of athletes’ motion state, this paper uses linear discriminant analysis method as the motion recognition assistant algorithm. Finally, this paper judges the performance of this research model by setting up comparative experiments. The research shows that the wavelet packet principal component analysis model performance is significantly better than the traditional algorithm, and the recognition rate for some subtle motions is also high. In addition, this study provides a theoretical reference for the application of EMG signals in the sports industry.
Introduction
With the continuous advancement of science and technology, the use of surface electromyography (SEMG) for the recognition of athletes’ movements and training control has become a hot topic. The use of SEMG can not only achieve precise control of the prosthesis, but also enable it to flexibly and freely perform the command actions given by the brain, and it is also beneficial for the athletes to perform daily self-training after the injury [1].
Human hand movements can follow the brain’s instructions for movement, and brain command transmission is transmitted through bioelectrical signals generated by neurons. When the nerve tip receives the transmitted electrical signal, it will issue a command to contract the muscle group associated with the action, thereby affecting the corresponding bone to complete the hand movement [2] sEMG is a bioelectrical signal collected on the surface of the human body, which is closely related to the action of the corresponding muscle group. Therefore, it has a good theoretical basis for recognition by using sEMG opponent’s action, and sEMG-based gesture recognition has great application value for precise control of prosthesis [3].
From the current situation, the application status of SEMG is not good. Different people’s physiological state and skin condition have different effects on SEMG signals. Therefore, it is difficult to establish a standard as a basis for diagnosis. During the process of transmission from the muscle to the surface of the skin, the waveform of the SEMG signal is somewhat distorted due to the filtering effect caused by the volume conductor formed on the surface of the skin. At the same time, volumetric conduction also causes multiple MUAPs of adjacent muscle tissue to overlap each other to generate crosstalk, which is difficult to distinguish. In addition, the external power frequency of 50 Hz will cause a large degree of interference to the measurement of the SEMG signal, and cause serious distortion of the SEMG signal, which brings great difficulty in obtaining an accurate and reliable SEMG signal [4]. Skipping rope is a kind of systemic exercise. It has many patterns, can be simple and complex, easy to learn, and is a kind of aerobic exercise with less time and energy consumption. Skipping rope can play a very good role in enhancing the function of cardiovascular, respiratory and nervous system. It can make blood get more oxygen, keep cardiovascular system strong and healthy. It can accelerate gastrointestinal peristalsis and blood circulation, and promote the metabolism of the whole body.
In summary, the surface EMG signal has important practical application significance for athletes’ muscle rehabilitation engineering and muscle training project application.
Related work
The origin of myoelectricity dates back to the mid-1970 s. In 1773, Walsh discovered that the muscle activity of the squid produced certain electrical changes [5]; In 1849, DuBois-Reymond of France discovered that the body’s muscles contracted and also produced electrical activity; In 1922, Gasse and Neweomer used a cathode ray oscilloscope to display electrical signals of muscle activity [6]. Electromyography was initially used only for biofeedback therapy and was later widely used in other fields such as clinical medicine, rehabilitation medicine, sports medicine, and sports science research.
With the maturity of testing techniques and research techniques, surface electromyography plays an extremely important role in the field of sports science research. Moreover, the analysis of surface EMG signals by scholars and experts mainly focuses on time domain and frequency domain analysis. The time domain is the characteristic of myoelectric changes in the time dimension. When performing time domain analysis, the original EMG signal should be digitally processed according to the research needs, but the number of treatments should not be too much, otherwise the signal distortion will be caused. Moreover, common indicators for time domain analysis include: average amplitude, maximum amplitude value, integral myoelectric (iEMG), input percentage (input%), etc. [7]. The frequency domain is a characteristic change of the myoelectric signal in terms of frequency. The data used in the frequency domain analysis must be raw data, and usually the spectrum analysis is performed using Fast Fourier Transform, which is mainly used for the study of muscle fatigue. Common indicators for frequency domain analysis include: surface EMG power spectrum, average power frequency (MPF), median frequency (MF), peak frequency, slope, etc. [8].
In the field of sports research, after a long period of experimental research, scholars have combined the motion characteristics of different sports to carry out the application research of technical movements. For example: jumping sports, football kicking inside the ball, archery technique, Tai Chi fisting, tail jump action, basketball emergency stop jumper, five-line boxing action, swimming freestyle action, taekwondo stretching training, the trampoline jumps on the net, the difficult movements of competitive aerobics, the advancement of the rumba dance, the rowing technique of the rowing boat, the blasting of the ball in the backcourt of the badminton, the different teeing techniques of the tennis ball, the final shot of the hammer, etc. [9].
At present, Chinese scholars have made great achievements in the field of sports research through the research technology of surface electromyography. Liu Yaorong [10] et al. used the electromyography test to study the SEMG changes of the knee flexors and extensors of jumping athletes, and realized the scientific basis of athletes’ strength training; Duan Yifeng [11] studied the characteristics of the oscillating leg muscle electrical signals playing in the football player’s instep, improving the strength and accuracy of passing and shooting; Zhang Xiuli [12] analyzed the muscle strength characteristics of the national archery team athletes and made important contributions to the improvement of the level of archery technology in China; Luo Wei [13] analyzed the relationship between the angle change of the knee joint of Taijiquan and the electromyogram of the lower extremity, and reduced the sports injury of the athlete; Xin Chuming [14] studied the use of electromyography and the changes of knee joint injury in college basketball technical movements, and reduced knee injury by strengthening muscle strength; Wang Song [15] studied the surface electromyography characteristics of the typical action of the five elements of boxing and found the best boxing technique. Fan Nianchun [16] studied the characteristics of freestyle exercise techniques using surface electromyography techniques and understood the characteristics and patterns of muscle activity during freestyle; Huang Yan [17] and others performed PNF stretching training on male Taekwondo athletes, and compared the EMG parameters of the athletes after training. It was found that PNF tensile training can greatly improve the athlete’s ability to lower limb muscle strength. Song Yawei [18] compared the characteristics of athletes’ different muscle groups, found out their advantages and disadvantages, and proposed training methods to enable athletes to better coordinate sports and improve muscle work efficiency; Zeng Yuanyuan [19] studied the surface electromyography characteristics and regularity of the right angle support, the leg support and the Vincent action static control in the difficulty movement of group B of aerobics competition. Its purpose is to provide scientific basis and theoretical guidance for athletes to perform these actions with high quality, so as to effectively promote training and teaching; Gao Chengcheng [20] used the surface electromyography test method to study the basic steps of the Rumba dance, to learn about the characteristics of his muscle exertion, to standardize the movement techniques, to improve the performance of the competition, and to shorten the gap with the world level. Tang Qiao [21] studied the work of the muscles of the male athletes under two kinds of pulling conditions (water full force 2KM and dynamometer full force 2KM). Moreover, the research results can accurately monitor and evaluate the rowing technology and provide good scientific research services for the training practice of rowing sports; Yang Wanli [22] studied the surface electromyography characteristics of badminton players in the back-field evacuation technique, proposed a targeted training method, and improved the rationality of the technology; Guo Quanqing [23] introduced the electromyography technique into the study of tennis movements, improved the program and plan of tennis athletes’ muscle strength training, and conducted different strength training for different batting actions; Yue Guanghua [24] used surface technology to analyze the surface EMG of the main muscle groups of athletes in the final stage of exercise. It can judge the difference between athletes’ muscle strength and sports technology characteristics, provide feedback to the coach in time, and understand the problems that athletes should pay attention to in future training; In order to find the muscle movement law of the knee joint when the athlete climbs, Yang Jianwei [25] used surface electromyography to evaluate the muscle function of the rocker’s knee flexion and extension at different angles, which provided an effective reference value for muscle strength training. In addition, related research has been conducted on many other sports [26, 27].
Theoretical basis of the algorithm
Time domain analysis and its characteristic parameters
Time domain analysis is the most simple and effective method, and the time domain features have the advantages of simple calculation and effective representation of signal characteristics. However, the time domain feature also has certain unstable characteristics and is susceptible to external interference.
By performing mathematical operations and statistics on the time domain signal data, the obtained SEMG features mainly have the average absolute value, the root mean square, the standard deviation, and the mean square error.
(1) Average absolute value
The Mean Absolute Value (MAV) is a typical characteristic parameter in the time domain analysis of the SEMG signal. It can set the threshold by the average absolute value of the surface EMG signal to determine whether the muscle is moving. The definition of the average absolute value is:
Among them, I is the number of data segments, a is the data of the i-th segment, and N i is the number of sampling points of the segment, which is set as 100.
(2) Root Mean Square
Root Mean Square (RMS) is a typical characteristic parameter in SEMG signal time domain analysis, which is used to measure the size of SEMG signal. The calculation formula for the root mean square is:
Among them, I is the number of data segments, a is the data of the i-th segment, and N i is the number of sampling points of the segment, which is set as 100.
(3) Standard deviation and mean square error
The standard deviation and mean square error can better reflect the dynamic degree of surface EMG signal. Equations (3) and (4) represent the standard deviation and the mean square error formula, respectively.
Among them, S
i
is the standard deviation,
There are certain difficulties in using the traditional time domain analysis method to extract the surface EMG signals. The main performances are as follows:
(1) Because it is difficult to grasp the muscle contraction during the experiment, long-term experiments may cause muscle fatigue and cause the myoelectric signal to be unstable. Therefore, the subject must perform standard actions in the experiment, and the experiment time should not be too long. Moreover, each action is controlled between 1 minute and 2 minutes, and the subject is given 3 to 5 minutes for rest during different actions.
(2) In the process of time domain analysis, due to the existence of external interference signals, extremely weak surface EMG signals are easily submerged by noise in various frequency bands, and whether the surface EMG signal can be effectively extracted becomes an important prerequisite for time domain analysis. Therefore, when collecting surface electromyogram signals, it is indispensable to shape and filter the collected surface EMG signals and remove noise interference.
Frequency domain analysis refers to the analysis of the frequency characteristics of a signal. In the frequency domain analysis, the SEMG signal is analyzed by Fast Fourier Transform to obtain the SEMG signal spectrum. The analysis method is a method of decomposing the SEMG signal according to frequency and observes the variation of the SEMG signal at different frequencies by the spectrum of the SEMG signal. The characteristic parameters of the frequency domain analysis mainly include two kinds of median frequency and average power frequency. The median frequency is defined as shown in equation (6):
Among them, f mf is the median frequency to be sought and p (f) is the power spectrum of the SEMG signal.
The average power frequency is defined as shown in Equation (7):
Among them, MPF is the average power frequency, and p (f) is the power spectrum of the SEMG signal.
Analysis of SEMG signals using frequency analysis method can be used to study the rate of EMG signaling and muscle fatigue. However, the frequency domain analysis method can only be used to process stationary signals, and it still has limitations in the research and application of SEMG signals.
The time-frequency domain analysis method is an analysis method for extracting signal features by combining time domain and frequency domain analysis methods. Wavelet Transform (WT) is a typical application method in time-frequency domain analysis. Wavelet transform can extract the local characteristics of the signal, which makes up for the shortcomings in time domain and frequency domain analysis. Moreover, the wavelet transform method combines the advantages of time domain and frequency domain method and has achieved good application effect in bioelectric signal processing. However, domestic and foreign researchers use wavelet transform to analyze and process SEMG signals, which is not mature enough, and is in the research and development stage. The principle of wavelet transform is as follows.
∀f (t) ∈ L2 (R), and the continuous wavelet transform of f (t) is:
Or inner product form:
In the formula,
To make the inverse transformation of Equations (8) and (9) true, Ψ (t) must satisfy the condition:’
In the formula,
At this time, the inverse transformation is
The constant C
Ψ
limits the class of the function Ψ that belongs to L2 (R) as a “base wavelet”. In particular, if Ψ is required to be a window function, then Ψ must also belong to L1 (R). Therefore,
Equation (12) shows that the wavelet function must be oscillating.
In order to reconstruct the signal, the variables a, b need to be discretized, usually taking
The wavelet packet principal component analysis is a method for constructing the principal component analysis model for feature extraction based on wavelet packet analysis. The process of extracting two SEMG signal feature matrices using wavelet packet principal component analysis is shown in Fig. 1.

Wavelet packet master analysis model flow.
Step 1: After the acquired two-way SEMG signals X1, X2 are taken by the L-layer wavelet packet transform (WPT), the wavelet packet coefficient matrix S1, S2 is obtained. The wavelet packet coefficient matrix
Among them, j = 0, 1, . . . , 2
L
- 1,
Step 2: After y1 and y2 are analyzed by the principal component, the SEMG signal characteristic matrices Y1, Y2 are obtained. Step 3 Y1 and Y2 are merged into a motion feature matrix Y. After the motion feature matrix Y is obtained, subsequent motion recognition is performed.
Wavelet packet transform is a more detailed signal analysis method developed on the principle of wavelet analysis. The wavelet packet decomposition algorithm is:
We set the scale subspace to V
j
, the wavelet subspace to W
j
, the wavelet packet scale factor to j, and the coefficient sequence to {h (k)}, {g (k)}. Then, a new space
We define the subspace U
j
to be the closure space of the function w
n
(t), and
In the formula, g (k) = (- 1)
k
h (1 - k), that is, the two coefficients also have an orthogonal relationship [57]. Its equivalent is expressed as:
The sequence {w n } n∈Z constructed by equations (15) and (16) is called a wavelet packet determined by the basis function w0 (t) = φ (t) and φ (t) is a scaling function. The 3-layer wavelet packet decomposition of the SEMG signal is shown in Fig. 2.

Wavelet packet three-layer decomposition.
In Fig. 2, (0,0) represents the SEMG raw signal data, denoted as X
i
, and i = 1, 2 represents the SEMG data collected by the i-th path. (3,0), (3,1), (3,2), (3,3), (3,4), (3,5), (3,6), (3,7) n represents the data of the 8 scales of the SEMG signal data after three-layer decomposition of the wavelet packet, which are respectively recorded as the vector
These eight wavelet coefficient vectors are constructed as wavelet coefficient matrices for single-channel SEMG signals, which are recorded as S
i
,

SEMG waveform of the brachioadialis.

SEMG waveform of radiocarpus.
It can be seen from Figs. 3–10 that each of the original SEMG signals of each path has 8 scale coefficients after the 3-layer wavelet packet transform, which are coefficient 0, coefficient 1, coefficient 2, coefficient 3, coefficient 4, coefficient 5, coefficient 6, coefficient 7, and each coefficient has 10,000 sampling points; Comparing the eight coefficients of an action category under a single muscle, it can be seen that the wavelet packet coefficient 0 and the coefficient 1 amplitude of each action are relatively large, while the other coefficients 2 to 7 are relatively small; Comparing the four movements of the radial flexor and iliac muscles, it can be seen that the wavelet-scale coefficient of the fist movement in the diaphragm and the radial wrist muscles is relatively large at the same wavelet packet scale, while the wavelet-scale coefficient of the boxing muscle in the diaphragm and the radial wrist is relatively small. When the wrist is turned inward, the amplitude of the SMG wavelet packet coefficient of the radial wrist muscle is larger than that of the diaphragm. However, when the wrist is everted, the magnitude of the SEMG wavelet packet coefficient of the radial wrist muscle is smaller than that of the diaphragm. Therefore, there are significant differences in the coefficients of the wavelet packet obtained by different muscles in different motions after wavelet packet transform, which lays a good theoretical and experimental basis for the subsequent motion recognition. We assume that after the acquired SEMG signals X1, X2 pass WPT, the wavelet packet coefficients matrix S1, S2 are obtained. Wavelet packet number matrix

SEMG waveform of the brachioradial during the exhibition boxing.

SEMG waveform of the radiocarpus during the exhibition boxing.

SEMG waveform of the brachioradial during wrist inversion.

SEMG waveform of the radiocarpus during wrist inversion.

SEMG waveform of the brachioradial during wrist valgus.

SEMG waveform of the radiocarpus during wrist valgus.
Principal component analysis is an analytical method that reduces the number of dimensions by eliminating the strict linear correlation or strong correlation of independent variable information. Moreover, principal component analysis is a commonly used data compression method in the field of statistics. By processing the data information by principal component analysis, it is possible to characterize the data samples with fewer feature quantities, thereby reducing the sample data dimension. In addition, the main data in the original data can be retained by the principal component analysis method, so that the data after the principal component analysis is more convenient to process.
The total number of features of the SEMG wavelet coefficient feature matrix yi is 2
L
, L = 3, and the number of sampling points is N, N = 400. Sample space is
The principal component analysis algorithm is as follows:
Step 1: The empirical mean of each dimension SEMG wavelet packet coefficient input data
Step 2: The difference vector B of the SEMG wavelet coefficient feature matrix and the empirical mean vector is obtained.
Among them, h is an 1 × N dimensional row vector with an element of 1.
Step 3: The covariance matrix C is calculated.
Step 4: The eigenvectors and eigenvalues of the covariance matrix are calculated, and the eigenvectors are arranged according to the eigenvalues from large to small. In the 2
L
dimensional sample space, there are 2
L
number values. Then, the cumulative contribution rate Z
i
of each eigenvalue λ
j
is calculated according to equation (19). Then, according to the threshold Z
i
> 97 %, the first P
i
(P
i
< 2
L
) principal elements are selected to constitute a feature vector D.
Step 5: The SEMG signal characteristic matrix Y i after orthogonal transformation is calculated.
The two-packet SEMG packet coefficient feature matrices y1 and y2 are transformed by PCA to obtain the SEMG signal characteristic matrix Y1, Y2.
Contribution rate of two SEMG signal eigenvalues after WPPCA processing (%)
It can be seen from Table 1 and Fig. 11 that in the temporal matrix of the wrist and the iliac muscles obtained by the WPGCA after the WPGCA, t the cumulative contribution rates obtained by adding the contribution rates of the first two features 1, 2 are 98.42% and 99.26%, respectively, which satisfy the threshold conditions Z
i
≥ 97%. However, the cumulative contribution rate obtained after the addition of the last six feature contribution rates is less than 2%. Therefore, according to the principal component analysis method, the last six features can be ignored. Then, the 16-dimensional wavelet coefficient matrix is reduced to a 4-dimensional principal to obtain a motion mode feature matrix

Statistical chart of contribution rate.
After extracting features from the EMG signal, which classifier is selected to discriminate the motion pattern to which the signal belongs is one of the key issues in the EMG signal to discriminate upper limb motion. Considering the problem of computation time and classification accuracy, this paper chooses a simple and practical linear discriminant analysis method as the classifier for forearm motion recognition.
The differential SEMG signal acquisition electrode is placed on the ulnar wrist flexor and the diaphragm of the human body, and the skin surface at the wrist joint is selected as a reference point to eliminate the common mode signal.
Accurate classification of SEMG signals with different actions is a key issue to improve athletes’ training efficiency and reliability. At present, there are many methods for classifying SEMG waveforms in action mode, but there are fewer reliable and practical methods. The reason is that the surface EMG signal has a low signal-to-noise ratio, and the actual data is interfered by many factors, such as spatial electromagnetic waves, power frequency interference, electrode position sliding, muscle fatigue and skin surface state. These factors will have different degrees of impact on the accuracy of motion recognition. Therefore, the reliability of preprocessing, feature extraction, and classification methods have an important impact on the accuracy of motion recognition classification results. This topic is based on a large number of literatures on the research of EMG signal processing, wavelet packet theory analysis, principal component analysis and SEMG signal pattern recognition, and a large number of theoretical analysis and experiments.
In this paper, a new method for extracting the characteristics of surface electromyography signals from wavelet packet principal component analysis is proposed. The wavelet packet principal component analysis method can overcome the problem of high dimensionality in the wavelet packet transform process, and the characteristic parameters extracted from the original SEMG data can effectively characterize the SEMG signal. In addition, this paper uses wavelet packet transform to analyze the SEMG signal and uses the sub-band arms of each frequency band as the feature vector of each action mode and establishes the principal component analysis (PCA) model. Wavelet packet principal component analysis (WPPCA) has good feature expression ability when using time domain analysis, wavelet packet analysis and wavelet packet principal to analyze four different SEMG signal feature extraction methods. Experiments show that in the case of fixed linear discriminant classifier, the signal features extracted by WPCCA method are better than other two feature extraction methods for motion recognition, and the classification accuracy is higher.
The linear discriminant analysis (LDA) method was used to discriminate the four motion categories of the upper limb forearm of the SEMG signal motion pattern feature matrix. After that, the LDA classifier is used to classify and identify the moving feature matrix of the obtained SEMG signal after wavelet packet principal component analysis, and compare it with BP network, RBF network and support vector machine. It can be seen that the LDA classifier has the highest recognition rate, and the algorithm is simple, fast, and robust.
Conclusion
The surface EMG signal has important practical application significance for athletes’ muscle rehabilitation engineering and muscle training project application. Therefore, this study analyzes the surface EMG signals of athletes. By performing mathematical operations and statistics on the time domain signal data, the SEMG features that can be obtained mainly include the mean absolute value, the root mean square, the standard deviation, and the mean square error. Moreover, analysis of SEMG signals using frequency analysis can be used to study the rate of EMG signaling and muscle fatigue. However, the frequency domain analysis method can only be used to process stationary signals, and it still has limitations in the research and application of SEMG signals. Wavelet packet principal component analysis is a method for feature extraction of principal component analysis model based on wavelet packet analysis. It uses the wavelet packet principal component analysis to extract the two SEMG signal feature matrices and obtains the motion feature matrix for subsequent motion recognition. Moreover, wavelet packet transform is a more detailed signal analysis method developed on the principle of wavelet analysis. In addition, experimental research shows that the principal component analysis method can be used to reduce the dimension of the wavelet coefficient matrix to remove data redundancy, reduce the computational complexity of the classifier, and improve the signal recognition efficiency.
