Abstract
Aiming at the problems of large error and poor robustness of traditional image classification methods, a three-dimensional martial arts image classification algorithm based on symmetry theory is proposed. According to the preprocessed 3D martial arts image, the classification algorithm based on symmetric neural network is used to realize the classification of 3D martial arts image. The experimental results show that the minimum error rate of this algorithm is 6.1%, which is far lower than the traditional algorithm, it is shows that the improved algorithm has higher definition, better robustness, and the test results of different topological nodes on the test solution show that the average error rate of the algorithm is lower. Compared with the same type of algorithm, the application value of the proposed algorithm is significant.
Introduction
When people recognize an object or image, the first or deepest impression is their shape and outline, so they can be quickly judged whether they are symmetrical or not. Of course, there is no very accurate symmetry in nature, so this kind of judgment is not a strict mathematical judgment. It’s just that people’s initial intuitive understanding of an object, even because each person’s observation angle is different, the conclusion will be different [7, 23]. However, through preliminary analysis, we can get the approximate symmetry axis of this object or image, so as to facilitate the later work of image recognition and positioning [16, 17].
With the advent of the information age, the visual tasks of machines in various fields also need to deal with a large number of image information. When processing and analyzing these information, in order to save time and storage space, we need to acquire and locate image feature points as quickly and accurately as possible [14]. According to the different characteristics of different types of images, we can design symmetry feature detection operators that are suitable for this kind of image, and preprocess the image with translation, scaling, filtering, denoising and so on. According to the symmetry of the object, the image compression and coding technology is used to simplify the feature information of the image, realize the fast storage of the image, record the overall features of the target image to be studied, recover some important information lost in the image preprocessing stage, and obtain the orientation and attitude of the object in the 3D space, so as to facilitate the later detection and recognition development of 3D model work [3, 4]. In order to make more achievements in this field and get everyone’s recognition, many companies, universities and scientific research institutions at home and abroad have carried out relevant researches and committed to this emerging field, and even the government has set up exclusive departments, invested a lot of human and material resources and financial resources to study the digital information of 3D model and plane image. After unremitting efforts, many valuable results have been achieved. Under the guidance of this trend, more and more people begin to have a great interest in image symmetry and have carried out a lot of theoretical research and experiments, so the research and application of various symmetry detection algorithms for image has become a hot research direction of computer vision [19]. In reference [10], an algorithm of fog image classification based on SVM and hybrid features is proposed to meet the requirement of automatic recognition of fog concentration in adaptive defogging system. Combined with the characteristics of fog images, the mixed feature vector is composed of dark channel features, wavelet features and mean value normalization features, which is used to describe the characteristics of different fog concentrations. The SVM algorithm is used to supervise and learn the mixed eigenvectors, and finally realize the automatic recognition and classification of fog images. Experimental results show that the algorithm can effectively identify and distinguish non fog, light fog and dense fog images, which provides a good classification reference for the defog system to adaptively select defog parameters according to the fog concentration, but the algorithm has a high error rate. In reference [13], an image classification algorithm based on data mining is proposed to solve the classification accuracy problem of traditional image methods. Firstly, the channel model of image retrieval in multimedia database is constructed, and vector quantization coding is carried out, then Harris corner detection algorithm is used to extract image feature points, and fuzzy c-means clustering algorithm is used to realize image classification [20, 21]. The experimental results show that the algorithm can accurately express the image content information, improve the accuracy of image classification, and has strong robustness. However, the algorithm does not enhance the image, resulting in low definition and poor robustness.
Based on the above background, in order to solve the problems of traditional methods, this paper proposes a 3D martial arts image classification algorithm based on symmetry theory, and classifies 3D martial arts image according to the symmetry of 3D martial arts image.
Definition of the algorithm
Preprocessing method of 3D martial arts image
Denoising
In real life, when collecting 3D martial arts images, generally because of one or several factors, the images formed by 3D martial arts image acquisition equipment contain certain noise, which affects the quality of images, increases the area of insignificant areas, and reduces the image quality. Therefore, it is necessary to denoise the 3D martial arts image. In the image space domain, the mean filtering method is used to denoise.
Supposing that x and y are objects in d-dimensional feature space F
d
, and the influence function of data object y on x is a function
Two dimensional windowed Fourier transform is used to extract the spectrum information W of each part of infrared image. The definition of transform can be expressed as follows:
Assuming that the image is I (x, y), it is composed of 3D martial arts image and reflection image, which are represented by L (x, y) and R (x, y) respectively. The output image is obtained by single-scale Retinex algorithm, and the expression is as follows:
The average filtering method is used to filter the 3D martial arts image. The average filtering formula is:
After filtering, the infrared image spectrum needs to be inverse Fourier transformed to obtain the spatial data of infrared image, further reducing the impact of noise on the 3D martial arts image [9]. If f′ (x, y) is the infrared image after inverse transformation, the inverse transformation is expressed as follows:
In the application of image enhancement, bilateral filter is a kind of nonlinear filter, which is mainly used to smooth the image [11, 15]. Due to the more comprehensive factors considered in bilateral filtering, bilateral filtering can remove the residual noise of the image while retaining the edge information of the image. The expression is as follows:
In the formula, R s is the filtering result; Ω is all pixels of 3D martial arts image; f and g are Gaussian functions; I p is the pixel value of p; k (s) is the standardization factor. s and p represent the left and right sides of 3D martial arts image; I s represents the pixel value of s.
The gradient vector which can reflect the 3D martial arts image is obtained by gradient operator, and the expression is as follows:
Since minimizing the overall variation of 3D martial arts image can suppress noise, in order to enhance the details of insignificant area of 3D martial arts image and suppress noise, TV model is used to enhance the insignificant area of 3D martial arts image, and the following energy pan function is constructed:
In the formula, u is the enhanced 3D martial arts image; parameters χ and δ are used to balance the enhancement and denoising of 3D martial arts image.
For any object x and distance R in 3D martial arts image space, with x as the center and R as the circular area, the gradient image expression is:
In the formula, E is the balanced gradient field; u0 is the original 3D martial arts image; E
g
[| ∇ u0|] is the equalization of 3D martial arts image;
To sum up, on the basis of filtering 3D martial arts image with mean filtering method, the 3D martial arts image is sharpened with differential operator in the spatial domain of the image [8].
Based on the 3D martial arts image sharpened by the above summary, the enhancement image processing effect method based on gray level uniformity is adopted to enhance it, and the steps are as follows:
Calculate the nonuniformity ψ
ij
of 3D martial arts image’s feature information, whose expression is:
Calculate the average nonuniformity value of special diagnosis information of 3D martial arts image for w ij , the formula is as follows:
Calculate the contrast C
ij
of the feature information pixel p (i, j) of 3D martial arts image and convert it to obtain:
In the formula, ξ ij represents the contrast magnification factor of 3D martial arts image’s feature information; δ ij represents the brightness similarity of 3D martial arts image’s feature information; g ij represents the gray value of 3D Martial arts image’s feature information pixel.
Change the gray value of the feature information pixel of 3D martial arts image, its expression is:
Repeat the above steps for all pixels of 3D martial arts image’s feature information.
Assuming that the gray level of the given 3D martial arts image’s feature information is from gmin to gmax, the determination process for the minimum magnification factor of the contrast of the 3D martial arts image’s feature information is as follows:
Step 1: calculate the histogram H (g) of 3D martial arts image’s feature information, and find all peaks of the 3D martial arts image’s feature information histogram as follows:
Step 2: calculate the average value
Step 3: find out the peak value of the feature information histogram of 3D martial arts image which is greater than
Step 4: calculate the first and last peaks of the feature information histogram of 3D martial arts image obtained in step 3, and their corresponding gray values g1 and gn′;
Step 5: calculate the minimum contrast magnification factor ξmin of 3D martial arts image’s feature information according to the following formula:
According to the calculation results of the above formula, the contrast magnification factor ξ
ij
of 3D martial arts image’s feature information is calculated by the following formula:
According to the above calculation process, the enhancement processing of 3D martial arts image is completed [18].
Based on the 3D martial arts image processed in Section 2.1, the image classification algorithm based on symmetrical neural network is used to classify it.
In this paper, the method of network combination is applied to deep belief network (DBN) and convolutional neural network (CNN), respectively integrating symmetric deep belief network (SDBN) and symmetric convolutional neural network (SCNN) [5, 16].
Structure of SDBN network
– Pre-training
Network training makes the initial weight of the network closer to the global optimum. When training the SDBN network of 3D martial arts image after preprocessing, i.e. training the RBM in the left and right subnetworks, since the RBM hidden layer of the restricted Boltzmann machine is used to reconstruct the visible layer of 3D martial arts image after preprocessing, the approximate calculation method of contrast divergence (CD) is used, which is one of the sources of differences between different networks [12].
In the left and right sub-network of SDBN of 3D martial arts image, any two adjacent layers, except the category layer, constitute an RBM, which is defined as visible layer and hidden layer. There is full connection between the hidden layer and visible layer of 3D martial arts image, and there is no connection between nodes in visible layer and nodes in hidden layer of 3D martial arts image. The node in visible layer of 3D martial arts image is v∈ { 0, 1 }, and the node in hidden layer of 3D martial arts image is h ∈ { 0, 1 } p . In 1982, Hopfield et al. proposed that the joint configuration {v, h} between hidden layer and visible layer nodes has joint energy.
The principle of conditional probability is similar on the left and right sides of SDBN network. It should be noted that SDBN is designed for classification. The logistic regression network composed of joint layer and category layer is not in the process of pre training, and its processing mode is just random initialization. Traditionally, the parameter value of RBM is solved by maximizing the likelihood function of the gradient rising process. The logarithm of the gradient likelihood function based on the energy model is:
– Fine adjustment of SDBN
After the DBN is pre trained, the joint layer of 3D martial arts image is combined with the two networks. Finally, the joint layer of 3D martial arts image is fully connected with the category layer, and the better network of 3D martial arts image is obtained by fine adjustment the network connection weight through the back propagation algorithm.
After the combination of subnetworks, the SDBN network difference is captured by measuring the angle between the corresponding output vectors of the joint layer subnetwork, and then the difference value is taken as the penalty term of the loss function in the fine adjustment stage. The loss function of our model is:
In the formula,
In SDBN network, the state values of all nodes in 3D martial arts image network are calculated by forward propagation, including the assumed loss function Γ
w
(p), in which the activation function of RBM is sigmoid or ReLU function. Ghe category layer is linear node, and their activation function is soft-max function. The purpose of fine adjustment is to use gradient descent algorithm to make formula (24) reach the minimum value for model parameter w, then the weight updating formula is:
Based on the symmetry feature extracted from the upper section, a method combining feature with deep neural network is proposed. CNN can be used as the left and right subnetworks of symmetrical deep neural network of 3D martial arts image to form SCNN, and the structure is shown in Fig. 1:

The method of combining feature with depth neural network.
The left and right sides of SCNN are composed of CNN with the same dimension of output 3D martial arts image, and the output of SCNN is the category layer of 3D martial arts image. Different from SDBN training mode, SCNN randomly initializes the weights, alternately performs convolution and sampling operations, and then fine adjusts the network connection weights through supervised learning. Its loss function of back propagation is as follows:
Experimental data source
Two datasets, MNIST and CIFAR-10 are selected to test the performance of SDBN and SCNN. Among them, the MNIST digital set contains 60000 3D martial arts test images, 10000 test images from 0 to 9 in 10 categories of digital handwriting data set. Cifar-10 data set contains 60000 3D martial arts images, which are divided into 50000 training samples and 10000 test samples. The training batch is a random division of the original data set, and data processing is carried out through the data processing software MOA (an experimental tool for massive online analysis).
Enhancement effect test
In order to verify the effectiveness and feasibility of the algorithm in 3D martial arts image enhancement, a comparative experiment is designed and analyzed. The algorithm in this paper, image classification algorithm based on SVM and image classification algorithm based on data mining are compared and analyzed. The results are shown in Fig. 2:

Comparison of three algorithms for 3D martial arts image enhancement.
According to Fig. 2, when the algorithm in this paper is used to enhance the 3D martial arts image, the processing effect of the adaptive enhancement is better than that of the other two algorithms, and the 3D martial arts image enhanced by the other two algorithms is distorted. The 3D martial arts image enhanced by the algorithm in this paper is more natural, the details are clearer, and has more advantages.
Because the 3D martial arts image has certain noise, the experiment takes the noise level as the benchmark to test. Under different noise levels, the experiment compares the algorithm in this paper, the image classification algorithm based on SVM and the image classification algorithm based on data mining [1, 6]. The comparison indexes of classification performance take PSNR and MSSIM as examples, and the comparison curves of the three algorithms are shown in Figs. 3 and 4.

PSNR index of image enhancement processing under different noise levels.

MSSIM index of image enhancement processing under different noise levels.
It can be seen from Figs. 3 and 4 that, compared with the image classification algorithm based on SVM and the image classification algorithm based on data mining, the algorithm in this paper has higher PSNR and MSSIM values at any noise level, and with the increase of noise level, the advantages are more obvious, and the PSNR index remains above 34 dB. Compared with the best of the other comparison methods, the algorithm in this paper has an average increase of 5% in PSNR index and 4% in MSSIM index. This shows that the algorithm has better robustness.
In order to verify the effectiveness of the algorithm, MNIST and CIFAR-10 are used to test the performance of SDBN and SCNN.
MNIST digital set: it contains 600003D martial arts images for test, 10000 test images are 10 types of digital handwriting data sets from 0 to 9.
CIFAR-10 data set: it contains 60000 3D martial arts images. The data set is divided into 50000 training samples and 10000 test samples. Training batch is a random division of the original data set.
Experiment and verification of MNIST data set
In the experiment, the DBNs on both sides are trained, different topological network structures are designed to verify the performance of SDBN in this algorithm, and the relationship between the classification performance of the proposed algorithm and the number of network nodes is verified. The 10 nodes in category layer of 3D martial arts image represent 10 categories from 0 to 9, and the joint layer is fully connected with the category layer. Finally, all the nodes except that in the class layer of 3D martial arts image are of logistic type. At the same time, different CNN are designed to test the performance of SCNN network. In order to verify the relationship between the performance improvement of SDBN network and the increase of network nodes, a group of comparative experiments are designed. The number in the table is the number of network nodes. The experimental results are shown in Table 1:
Mnist dataset classification accuracy table
Mnist dataset classification accuracy table
As shown in Table 1, the topological structure of DBN is 501-501-20001-11, and the topological structure of SDBN of the algorithm in this paper is 501-501-851 for the left sub network node, 501-501-851 for the right sub network node, and 11 nodes for the classification layer. The number of experimental nodes is consistent. It can be seen from the results that the SDBN performance of the algorithm in this paper is slightly better than that of Hinton’s DBN and SVM performance when classifying 3D martial arts images. In the second group of experiments, Hinton’s DBN structure is 501-501-851-11, and SDBN contains 11 categories. The left sub network topology is 201-201-851, the right sub network topology is 201-201-851, and the left and right sub network nodes are less than half of DBN. The experimental results show that the SDBN of the algorithm in this paper is still better than DBN. Two corresponding experiments exclude that the increase of the number of nodes dominates the SDBN performance of the algorithm in this paper. Table 1 shows the test results of different topological nodes in the test solution. The performance of the algorithm in this paper is better than other deep network algorithms in the set of test, which shows the effectiveness of the algorithm in MNIST.
CIFAR-10 dataset is used to test the performance of the algorithm in this paper. The data set is divided into 5 training batches and 1 test batch, each batch contains 10000 3D martial arts images. The test batch contains 1000 3D martial arts images randomly selected from each category. The training batch contains the random arrangement of the remaining images, but some of the training batch images need more than other categories, and the training data of each batch contains 5000 images of each category. The topology and results of the network are shown in Table 2: where, the DBN node excitation function is sigmoid function, while CNN excitation function is ReLUc. The SDBN of the algorithm in this paper takes the corresponding deep structure as the sub network.
Table 2 shows that the minimum error rate of this algorithm is 6.1%, compared with CNN and DNB, this algorithm has the minimum error rate.
Cifar-10 data set classification accuracy table
Cifar-10 data set classification accuracy table
Inspired by the human vision system, many image processing models are designed as multi-layer structure, and the way to establish multi-layer structure features for target detection is deeply rooted in the computer vision art. In 2006, Hinton et al proposed a greedy layer by layer processing method for training the deep belief network (DBN). Simulating the human visual cortex, each layer of the network was trained from the bottom to the top by an approximate algorithm. Each upper and lower structure was regarded as a restricted Boltzmann machine (RBM). This model creatively proposed a pre training processing method. Krizhevsky A expanded the structure of the convolutional neural network (CNN) in 2012, and improved the network structure. The biggest feature of Krizhevsky A was sparse connection and weight sharing, and the model achieved good test results in MNIST data set. In this paper, this algorithm is based on the deep theory, the essential difference is to introduce symmetry structure. The sub networks on both sides of SDNN are made up of DNN, feature difference is introduced, and sample classification is decided jointly, so the classification result of the algorithm in this paper is more accurate.
Conclusions
It is difficult for objects in nature to satisfy the strict symmetry, so it is far from enough to rely solely on the precise mathematical definition of symmetry to detect symmetry. In addition, due to the transformation of the object angle, the noise caused by the figure digitization and other reasons, the processed figure loses the original symmetrical features, resulting in the inaccurate detection results. Symmetry detection plays an important role not only in locating and recognizing plane objects, but also in 3D object reconstruction. With the wide application of intelligent robots, symmetry plays an important role in texture detection and guiding machines to grasp objects. So it is very important for pattern recognition and image classification to extract symmetry from complex images. In this paper, a 3D martial arts image classification algorithm based on symmetry theory is proposed, and its credibility is verified in the experiment. The average error rate of this algorithm is 6%. Compared with the same algorithm, the classification accuracy is the highest, which can be used in the research of 3D martial arts image classification. However, this paper does not take into account the large amount of data and the lack of classification efficiency. Next, we will take classification efficiency as the main goal for in-depth study.
