Abstract
In order to make the feature data more linearly separable for support vector machine(SVM) classifier, a set of different scale and direction parameter was proposed for improving the recognition effect of cement slope damage in high fill channels. The Gabor wavelet was used to extract the multi-scale and multi-directional features of the high-fill channel’s abrupt features. Then SVM algorithm was utilized to perform damage classification and level recognition. To compare the recognition effect of the Gabor-SVM method, histogram-SVM, grayscale symbiotic matrix -SVM, canny-SVM algorithm were adopted to identify the damage degree of cement surface in the same environment, and these damage recognition rates are compared with Gabor-SVM’s. The experimental results show that the damage recognition model, based on Gabor-SVM, tends to better stable value when the wavelet takes 6 scale and 12 directions. The recognition rate of the normal slope is 0.98, while the recognition rate of the crack, hole, and broken slope are 0.63, 0.88 and 0.90, respectively. Overall, the damage recognition model, based on Gabor-SVM, has better recognition effect, and it will provide technical support for finding potential leakage hazards in the high fill channel of South-to-North Water Diversion Project.
Keywords
Introduction
High fill channels, which are generally considered to be more than 6 m above the ground, are widely distributed in the middle route of the South-to-North Water Transfer Project in China [1–3]. Due to the high filling height, wide distribution range and complicated engineering geological conditions, the lining panel is cracked or the slope of the channel slope is damaged [4]. The high fill channel once leakage for a long time with bursts, which will directly affect the life property safety of the residents along the road. Therefore, high fill channel section of the seepage monitoring technology is one of the key technology in long distance water transfer project safety monitoring.
However, leakage information is difficult to be effectively perceived and detected. There are many methods for high fill channel leakage detection, including geological radar detection method, resistivity method, ultrasonic detection method and image processing detection method [5–7]. The algorithm for recognition and control of nonlinear systems includes model of seepage flow, multi-source data fusion, multi-objective inverse modeling, adaptive fuzzy control, and so on [8–10].
When the high fill channel is damaged, structural deformation or physical diagenesis will occur on the slope of the high fill channel, which will bring about the slope texture and corner changes [11]. Therefore, the digital image processing of machine vision can detect the slope texture and corner changes of the high fill channel section. It provides feature extraction technique support for finding structural damage detection and helps the leak detection of the high fill channel section [12].
The extraction of image features is one of the key links in the identification of cement surface damage in high fill channels. Image feature extraction methods usually include binarization, canny edge detection, and gray level co-occurrence matrix [13–15]. Although the general image detection method can have advantages in large-area detection, it is difficult to effectively identify the characteristics of the sudden change of the leakage region.
Multi-scale feature vector reflects not only the global features of the whole image, but also the sub-bands of different characteristics such as edges and texture [16]. Therefore it has more identifying information. Gabor wavelet transform method is sensitive to the edge features of images, which can provide good direction and scale selection characteristics, and has good adaptability to illumination changes [17]. The support vector machine(SVM) classifier can obtain discriminate functions based on a limited set of data samples, and still obtain small errors for independent test sets, and has strong generalization ability [18, 19]. The SVM has been used in pipeline leak and leak detection [20].
Therefore, the study of detection model for cement surface damage, based on Gabor-SVM in high fill channel, was proposed. The Gabor wavelet was used to extract the multi-scale and multi-directional features of the high-fill channel’s abrupt features, to obtain the good features of the high-fill channel local space and frequency domain information. Then SVM algorithm was utilized to perform damage classification and level judgment, in order to obtain higher leakage level and recognition effect.
Materials and methods
Materials
The high fill channel model, based on the South-North Water Transfer (Open Channel), was built in the laboratory. The slope image is acquired with a floating bin loaded with a high-definition and movable camera. Due to the limitations of experimental conditions, the 520 images data were collected in the experiment, which was derived from the image of the cement slope above the water surface.
The damage level of the cement slope in the high fill channel is defined as crack, hole, broken and normal, as shown in Fig. 1. The definition of these four types of damage level is based on the features of structure or physics, which will bring about the average variance and average mean characteristics of the slope texture and corner changes.

Cracks, holes, broken, normal cement surface.
The crack in the rock is a planar discontinuity formed by structural deformation. The crack has the characteristics of rapid change, and the variance and mean of the amplitude have lower characteristics. The brightness of the crack is darker than the brightness of the background of the image, and has certain linear characteristics, such as strips and sawtooth strips.
The hole refers to the damage of the hole in the cement panel. The diameter of the hole is generally 2.5mm∼10 mm, and the depth is 1cm∼5 cm. The variance and mean of the hole are higher, and the shape is blocky, round, elliptical, etc.
The broken is composed of cracks with multiple linear features. The width of the broken joint is wider than that of the crack, and the crack is divided into three or more pieces. And its variance and mean of the amplitude are high. If the cement slope has no above three kinds of damage, it is called normal cement surface (referred to as normal).
In general, the image is affected by various noises. Therefore, it needs to be preprocessed before extracting its features. These processing methods include image enhancement, median filtering, and grayscale processing. The Gabor wavelet has good characteristics in extracting the local space and frequency domain information of the target, and the basis function of the two-dimensional Gabor wavelet can well describe the sensory characteristics of most simple visual neurons in the mammalian primary vision system [21, 22]. The expression of the two-dimensional Gabor wavelet function is as follows [23]. Then, Gabor wavelet is used to extract the texture features of the image, as well as the image convolution, commonly used amplitude, and phase to represent the texture features.
Where W is the frequency bandwidth of the filter, which is the complex modulation frequency of the Gaussian function. σ
x
and σ
y
represent the standard deviation of the Gaussian function along two different coordinate axes. The Fourier transform of Equation (1) is the formula G (u, v).
Studies by neurophysics have shown that W = π/2(Hz) conforms to the human visual system. However, the value of W in digital image processing needs to be re-adjusted to accommodate optimal machine vision processing. Taking g (x, y) as the mother wavelet, a proper self-similar set of filters, called Gabor wavelet, can be obtained by performing appropriate scale expansion and rotation transformation on g (x, y).
In Equation (4):
Where m, n is the scale and direction of the wavelet, m = 0, 1, . . . S–1, n = 0, 1, . . . K–1. S is the total number of scales, k is the total number of directions, a-m is the scale parameter, θ = nπ/K is the direction parameter.
SVM has many advantages in solving small sample, nonlinear and high-dimensional pattern recognition, and can be extended and applied to other machine learning problems such as function fitting [24, 25]. In addition, the SVM can find the global optimal solution to the convex quadratic programming problem. The SVM can solve the problem of linear indivisibility of data through slack variables and kernel function techniques. Suppose the collection of two types of data samples is: {(x
i
, y
i
) , i = 1, 2, . . . , n}, x
i
∈ R
d
, y
i
∈ {-1, 1}, the general representation of linear discriminate functions is f (x) = wx + b, then the classification equation corresponding to f (x) = wx + b is:
The classification interval here is:
The Lagrange function is defined as follows:
Find the minimum value of Equation (7):
Solving the above optimal classification surface problem is transformed into a dual problem for solving convex quadratic programming optimization under the constraints of Equations (7) and (8):
In the above formula, the sample with a not zero is the support vector. Therefore, the linear combination of support vectors is the weight coefficient vector of the optimal classification plane. To solve the classification domain value c under the constraint of b, then the optimal classification surface function for the above problem is:
The Gabor-SVM fusion architecture is shown in Fig. 2. First, a convolution operation is performed on IM()m*m, which is the training sample image. And the Gabor wavelet filter G F (x, y) is designed with parameters. Second, the convolved matrix is partitioned and squared. The multiple variances and the mean are combined into a one-dimensional array and normalized to obtain the characteristics of the magnitude. Finally, after the SVM is trained, the sample decision function Model() is obtained. Before the pattern recognition, the picture to be tested also needs to be convolved, divided, squared and averaged with the Gabor filter, then merged and normalized. The extracted variance and mean features are classified in the decision function of the SVM classifier to obtain the classification result.

Gabor-SVM image processing architecture.
In Fig. 2, IM()m*m represents the training image and im()m*m represents the test image. R()m*m represents the feature set after the training image and gabor wavelet convolution operation. Correspondingly, r()m*m represents the feature set after the test image and the gabor wavelet convolution operation.MM(j) i *i represents the set after block processing of the R()m*m feature set. And mm(j) i *i represents the set after block processing of the r()m*m feature set.M() j*j represents the set of merged for each MM(j) i *i. While m() j*j represents the set of merged for each MM(j) i *i.M’() j*j represents the set of normalized for each M() j*j. Correspondingly, m’() j*j represents the set of normalized for each m() j*j. Model () is the decision function of SVM.
Analyze the digital features
According to the processing framework of Fig. 2, Gabor filtering is performed on of slope images of the crack, hole, broken and normal, and the average variance and average mean characteristics of each scale and each direction are obtained. Table 1 shows the mean and variance of various damage level when the frequency bandwidth W is π/2. It can be seen that the mean and variance of the holes are larger, and the mean and variance of the broken are larger than the crack and normal, and the mean and variance of the normal cement face are generally the smallest. The more obvious numerical relationship is, the more favorable it is to find the linear relationship of various damage conditions. It is also seen from Table 1 that when W = π/2, the variance and mean of the four cases are not very obvious, and further adjustment is needed.
Multi-scale and multi-directional amplitude characteristics when the frequency bandwidth is π/2
Multi-scale and multi-directional amplitude characteristics when the frequency bandwidth is π/2
Table 2 shows the variance of various damage cases when the frequency bandwidths of W = π/4 and W = π/8. It can be seen from Table 2 that when the frequency bandwidth is W = π/4, the mean and variance of the holes are larger on the 6th and 7th scales, the mean and variance of the broken are larger than the crack and normal, and the average and variance of the normal cement surface are generally The smallest. Similarly, when the frequency bandwidth W = π/8, the 4th and 5th scales also have such numerical characteristics. From the numerical value, the amplitude of the variance is the same as the amplitude of W = π/4, but the selection of the scale is different. Therefore, a moderate frequency bandwidth of W = π/4 is taken here as feature extraction. In the 6-scale and 7-scale of the frequency bandwidth of W = π/4, the 6-scale has a linear relationship in multiple directions and the difference spacing of each case is 0.88, and the linear relationship is more obvious. Therefore, the research is carried out in multiple directions at 6 scales to find the most obvious scale and direction parameter groups. This parameter group is used for classification and identification to verify the reliability of feature extraction analysis.
In Table 2(b), the amplitude variance of the 6 scale in the 5∼13 direction is shown in detail on the number axis. Among these scale and direction parameter groups, 6/12 is easier to distinguish between various types of damage in terms of eigenvalue stability and value range. Therefore, the 6-scale 12 direction is taken here as the experimental parameter set.
Multi-scale and multi-directional amplitude characteristics
Combined with the above analysis, a total of 520 images were collected, 360 of which (90 for each type of damage) were applied to Gabor-SVM algorithm training.160 (40 for each type of damage) images were used for testing. Here, the parameters of 1∼7 scale and 1∼13 direction are selected for Gabor wavelet feature extraction. The recognition rate is shown in Table 3.
Multi-directional and multi-scale recognition rates when the frequency bandwidth is π/4
Multi-directional and multi-scale recognition rates when the frequency bandwidth is π/4
It can be seen from Table 3 that the normal slope image recognition rate is generally high, and most of them are distributed between 0.8 and 1.0. The image recognition rate of the slope image of the crack grows steadily from low to high, but the overall recognition rate is low, and most of it is distributed between 0.50 and 0.65. The image recognition rate of the slope image of the hole fluctuates greatly, which is obviously related to the change of the scale. The recognition rate of broken slope image has a certain relationship with the size of the scale. from low to high to low, the fluctuation is large, and the recognition rate is mostly between 0.80 and 0.95.
By comparing the recognition rate, it is found that the scale and direction with a mean of 0.85 are 4/9, 5/11, 6/12, 6/13 (scale/direction) and other parameter groups, 6 scale / 12 direction (6/12). The extraction effect of the feature is in the optimal range, which corresponds to the analysis conclusion of the feature of Section 4.2. Therefore, the 6-scale and 12-direction are the best identified parameter sets. In the 6 scale / 12 direction, the recognition rate of hole, broken, crack, and normal is 0.88, 0.90, 0.63, and 0.98, respectively.
In the same environment, the experiments of histogram-SVM, gray level co-occurrence matrix-SVM, canny algorithm-SVM, and Gabor-Euclidean distance algorithm are also carried out, and the error analysis of Gabor-SVM recognition effect is carried out, such as Table 4 shows. From the recognition rate and discriminant effect of Table 4, the histogram-SVM, grayscale co-occurrence matrix-SVM, canny algorithm-SVM, and Gabor-Euclidean distance algorithm have higher overall recognition rate of slope image. The slope image recognition rate of cracks and holes is low. Compared with other methods, the Gabor-SVM method has better overall crack identification.
Identification rate of multiple algorithm processing and recognition rate of not less than 0.7
Identification rate of multiple algorithm processing and recognition rate of not less than 0.7
In Table 4, from the two methods of Gabor-SVM and Gabor-Euclidean distance, the normal and broken pictures processed by Gabor-European distance algorithm have better classification, but the crack and hole slope image processed by Gabor-European distance algorithm The recognition rate is low, reflecting the instability of its identification and classification. At the same time, it can be seen that the Gabor-SVM model has a higher recognition rate for the hole, normality and broken of the cement slope, but the recognition effect on the crack is not satisfactory. The reason is that the slit width of the crack is low in contrast with the background image, and after multi-directional and multi-scale division, the stripe of the crack is refined, and the probability of recognition error increases.
High-fill channel cement slope damage monitoring model, based on Gabor-SVM, was designed in this research. The characteristics of damage levels are obvious in the 6th scales and the 12th directions of Gabor wavelet filter, and its recognition effect is the best. The model of Gabor-SVM algorithm has the highest recognition effect on the normal condition of the canal slope, but the recognition effect on the crack condition is not ideal. The algorithm model of Gabor-SVM also needs retrain and test In the actual work.
Overall, the image detection method based on Gabor-SVM, realizes the monitoring of high fill slope damage. In addition, the image data in this paper is derived above the water surface, and the underwater image has not been acquired. The next step of this work will get the underwater slope image information of the high fill channel, to improve the recognition rate and reliability of the Gabor-SVM method.
Footnotes
Acknowledgments
This research work is supported by the Open Research Foundation of Key Laboratory of Sediments in Chinese Ministry of Water Resources (2017001), Scientific Research Fund of Henan Provincial Education Department (15A510003), Henan Province science and technology research program (172102210050). The authors would like to thank the editor and anonymous referees for their comments.
