Abstract
Aiming at the problems of low accuracy and long time consumption of motion attitude information collection in the existing methods, a new automatic attitude capture method based on sensor information fusion is proposed. The line builds a motion attitude information acquisition architecture through five functional modules, including data acquisition and processing, laser sensor, data storage, voltage conversion and serial communication, to obtain motion attitude data. Then, the sensor image information collected by median filtering is processed, and the target image is obtained by fusing the binary image obtained by optical flow segmentation with the original image. Finally, the Mean-Shfit algorithm and particle filter are used to track the human moving target, and the self-similar matrix is used to automatically capture the motion posture. Experiments show that the method has high recall and accuracy for the collection results of motion attitude information, the accuracy rate of automatic motion attitude capture is as high as 95.5%, the maximum capture time is 79 ms, the time consumption is short, and the practical application effect is good.
Keywords
Introduction
Body posture can be divided into two types: motion posture and static posture. The study of its motion posture refers to the use of various methods to detect and track the motion information of the human body, so as to obtain various data motion information, so as to promote our country’s research on human medicine. develop. With the gradual deepening of people’s research on sports posture, a new discipline has been formed [1]. Human motion biomechanics involves a wide range of disciplines, including new technologies such as metrology, human anatomy, surgery, kinematics, robotic mechanisms, testing techniques, and computers [2]. It is based on the basic laws of human anatomy, establishes the whole or part of the human body model, uses the test method to obtain the human body movement speed, angle, acceleration and EMG signal, and further analyzes it, and finally obtains the instantaneous posture and change law of the human body movement. It is very valuable to track and detect the movement state of the human body [3], and can be applied to the daily sports training of athletes, military training and rehabilitation of children with cerebral palsy, so as to provide more scientific rehabilitation training and treatment plans [4].
At present, some progress has been made in the research of sports posture. For example, Han [5] proposed a key posture detection method of sports video based on BP neural network. The method to athletes and coaches can more efficiently training, face lifting motion video, USES hierarchical block background estimation method for human target detection, according to the characters of edge feature extraction to accomplish goals, using the improved BP neural network to weightlifting, which can identify common four kind of body posture to related gesture recognition results are obtained. Although this method can recognize related gestures, its recognition of overall motion is not in place and cannot be applied on a large scale. Zheng [6] proposed a motion training pose intelligent capture method based on deep learning. Based on Arduino embedded development board and equipped with multiple IMU sensors, the method established a system to collect accurate human movement data such as speed and acceleration by using stepper motors, and obtained accurate human movement data. In order to accurately identify human motion models, a deep learning model based on symmetric coding, time scale coding and structure coding was built to capture motion posture. This method has the problem of poor accuracy in capturing moving objects, and its detection time is long. Chen [7] proposed a design method of real-time motion posture detection system based on inertial sensor. The system consists of hardware and software. The hardware part realizes the specific design of circuit through inertial sensor, and uses nodes to represent joint positions to detect the skeletal posture of human body during sports. The software part is composed of three modules, which are equipment communication module, data processing module and attitude discrimination module. The LAN communication mode can enhance efficiency and accuracy of transmitting information and data, and at the same time can realize continuous detection of sports posture in a real time manner. Although this method can perform real-time detection of human motion posture, its computational complexity is large and its recognition accuracy is not high. Yang et al. [8] proposed a human pose recognition method based on Android sensor. The method by collecting Android devices embedded acceleration sensor produced by the acceleration in the process of human motion data, data extract different motion features, and recognition of walk, run, go upstairs and downstairs four basic motion, and through continuous criterion and the subsequent attitude criterion to identify the transformation between different postures, gesture to accurately identify sports. This method is prone to loss when capturing data of moving objects, so its detection efficiency is not high. An automatic motion posture recognition method based on computer video processing technology is proposed by Wu and Qin [9]. The recognition criteria of motion pose are set and motion feature vectors of different motion pose are defined respectively. The video data of motion posture can be captured by the installed camera equipment. By using computer video processing technology, the initial video data is preprocessed through the steps of motion video image extraction, video transformation and motion residual compensation. On this basis, the moving target is detected, and the motion attitude features are extracted from the contour, motion period and other aspects, and the fusion feature vector is obtained. By comparing with the set recognition standard, the final automatic recognition result of motion posture is obtained. Although this method improves the accuracy of motion gesture recognition to a certain extent, it still has the problem that recognition takes a long time.
Based on this, this paper proposes an automatic motion pose capture method based on sensor information fusion. It first builds a motion pose information collection architecture and collects motion pose data. Then, the collected sensor image information is processed, and the target image is obtained by fusing the binary image obtained by optical flow segmentation with the original image. Finally, the Mean-Shfit algorithm and particle filter are used to track the human moving target, and the self-similar matrix is used to automatically capture the motion pose. It is hoped that through the research in this paper, the collection accuracy of motion attitude information can be improved, and the time-consuming recognition can be shortened.
Design of automatic capture method for sports posture
Sports posture information collection architecture
The sports posture information acquisition architecture is mainly composed of five functional modules: data acquisition and processing, sensors, data storage, voltage conversion and serial communication. The data acquisition and processing module is mainly composed of the main controller and its peripheral circuit. The main task is to collect the digital measurement signal output by the sensor in real time and fuse the original attitude data to calculate the human body attitude Angle information. Sensors mainly include MEMS sensor, laser sensor and thermal imager, and their main function is to collect real-time sports posture information [10, 11]. The data storage module stores the constant parameters of the data fusion algorithm and the initial parameters of each sensor chip. After the system is powered on, the initialization Settings of each module can be quickly completed. Voltage conversion module for the system is set to provide the required or voltage. Finally, the serial communication module sends the attitude information to the host computer for real-time display.
Data acquisition and processing module
In this paper, the STM32F405RG microcontroller is used as the main controller of the data collection and processing module in the attitude measurement system. It is a digital signal controller based on the Kernel and bits of Cortex-M4 and core frequency as high as ST company launched. Schematic diagram of main controller circuit is shown in Fig. 1.
Schematic diagram of STM32F405RG circuit.
STM32F405RG, as a digital signal controller and a powerful microcontroller (MCU) control peripheral, has fast computing power of digital signal processing (DSP) technology. STM32F405RG has a clock frequency of 168 MHZ and 32-bit operation accuracy, and provides rich controller peripheral interfaces and floating point operation units, so it meets the requirements of data collection and processing module for synchronous data collection of multiple sensors and for the speed of data fusion algorithm [12].
The laser sensor based on the working principle of radar usually uses invisible light (such as infrared light) with the unique advantages of short wavelength and narrow beam, so it has very high Angle, distance and speed resolution. It can quickly obtain the three-dimensional information of a certain section of the space object, and can quickly obtain the whole three-dimensional space information of the object through the necessary rotation and translation device, so as to realize the fast capture of sports posture. The laser sensor circuit is shown in Fig. 2.
Laser sensor circuit.
Thermal imager circuit.
In order to capture more sports posture data, a thermal imager is introduced. The circuit of the thermal imager is shown in Fig. 3.
The MEMS sensor consists of a 6-DOF MEMS inertial measurement unit (IMU) MPU600 and a digital compass (HMC5883L), in which a three-axis gyroscope and a three-axis accelerometer are integrated in the MPU600. Since the attitude sensor is responsible for providing the original motion information, the accuracy of its output data directly affects the accuracy of the attitude measurement results. Therefore, it needs to further understand their working principle when designing the software and hardware of the module, and at the same time, it is necessary to carry out sensor correction before the original data fusion.
The module is mainly used to store the initialization parameters of each sensor and some constants needed in the attitude calculation process to ensure that each module can work normally after the system is powered on. In this paper, MACRONIX’s 32Mbit serial flash memory mx25l3206em2i-12g is selected as the data memory. It provides a standard communication interface for reading and writing operations, and the clock frequency is 86 MHz. Figure 4 shows the circuit principle of the data storage module.
Circuit principle of data storage module.
Since the data acquisition and processing module, MEMS sensor module and data storage module in the system all use
Circuit principle of LP2985.
In this paper, a high integration half-duplex micro-power wireless data transmission module is used to realize the communication between the attitude measurement system and the host computer. By adopting efficient cyclic interlacing error detection coding, serial communication module has improved anti-burst interference and sensitivity, which can correct continuous burst errors at the maximum. Therefore, the module is especially suitable for use in the industrial field and other harsh environment with strong interference.
Serial communication module.
Median filtering has a suppression effect on pulse interference and image scanning noise. In addition to removing noise, the image edge details can be well preserved. The basic idea of median filtering is to sort the gray values of pixels in the neighborhood of pixels and select the intermediate value as the pixel value of the point. For the median filter of
where,
Optical flow technology, proposed by Gibson in 1950, is a key subject in the research of three-dimensional motion analysis and computer vision. The study of optical flow is to determine the “motion” of pixel position by the correlation and temporal variation of pixel intensity data.
It is assumed that the gray value of point
According to Taylor series expansion:
Ignoring higher-order terms, we can get:
In the formula,
In the formula,
Horn-schunck proposed the smoothness constraint of optical flow, and the optical flow calculation equation is:
Further collated by corresponding Euler equation:
In the formula,
In the study and application of images, only some specific areas of the image are often interested, these parts are called the target or the foreground, and the rest are the background. It is usually necessary to extract a certain area of interest from the image before further processing and utilization of the image [14]. Image segmentation and object extraction transform the image into a more abstract and compact form, which is more conducive to image analysis and understanding. Image segmentation is a basic technology of computer vision and a key step for further image analysis. Because the target moves in a static background scene, it may be complicated to segment the image directly for scenes with complex texture. The optical flow method can extract the moving object from the image after processing, but its performance is unstable and the extraction accuracy is not high. In this study, the optical flow method is used to extract the moving region of the image from the feature of the larger optical flow value in the optical flow field. Then the moving target region is further processed to extract the moving target.
Considering the imaging principle of images, different gray values in images represent different temperature values through thermal radiation imaging of objects. Usually need to detect the temperature of the target (such as a person or animal) relative to the ambient temperature is higher (most of the environment is so), target brightness than the environment brightness in the image, and the large difference of environmental background, so the threshold segmentation technology for extracting targets from the image is a very effective method [15, 16].
Assuming that
The entropy of the histogram is:
When entropy
After the above threshold extraction, the image is only a collection of scattered points, and the target object cannot be completely extracted. Due to subtle changes in the image background, background area exists in the point close to the target pixel, as well as the selection of threshold value, threshold segmentation may not be a complete target area, so the image needs further processing. Mathematical morphology can measure and extract the corresponding shape of an image by using structural elements, and then identify the image. It is capable of simplifying image data, maintaining the basic shape of the image, and removing irrelevant structures [17]. Mathematical morphology consists of four operations including expansion, corrosion, open operation and closed operation.
Expansion:
where
The bloat operation merges the background pixels into the target pixels, and if multiple target objects are close to each other, they are usually connected together.
Corrosion:
where
Corrosion operation can remove target edge pixels, objects smaller than the operation structure, and corrosion can remove small connections between objects.
Open operation and closed operation: based on the basic operation of expansion and corrosion, it is possible to compound these two operations and set (union, intersection and complement) operation combination operation, among which the two important combination operations are Opening and Closing. The open operation is to use structural element
The closed operation uses structural element
Opening operation and closing operation makes the object’s outline is smooth, can be used to filter out noise or fill the void, but open operation for decomposition of image, disconnect the narrow gap and eliminate small protrusions, and closed operation, by contrast, are commonly used to eliminate the narrow gap or bridge connecting adjacent image pixels.
The binary image obtained by optical flow segmentation is fused with the original image to obtain the target image.
The automatic capture process of sports posture based on human body posture self-similarity matrix is as follows.
Human target detection
The human body shape template was established on the basis of human body image set. For each shape template, a point set was obtained by sampling, and then the context descriptor of each point in the point set was calculated, which was used as the human body shape feature. For each image to be detected, a pyramid image is built according to the image size, and the boundary of each sub-image is obtained by Canny boundary detection algorithm. The set
In the formula,
In order to reduce the impact of noise, the matching cost function of Eq. (17) is modified. In Eq. (17), if
In the formula,
Each pair of points of
Human target detection flow chart.
In the human movement video, the horizontal and vertical coordinates of the moving target center point
The maneuver of the target is considered as a kind of random disturbance, and its disturbance size is expressed by the covariance of the process noise by
After predicting the state of the system through the motion model, the estimated value
In the formula,
The transfer probability density function
After the motion model and observation model of particle filter algorithm tracking are described, the algorithm of Mean-Shfit and particle filter to realize human moving target tracking is presented. The specific process of the algorithm is as follows:
According to the tracking target position and speed selected in the initial frame, According to the motion model of Eq. (20), sample Calculate the Euclidean distance The importance weight 5 is estimated by Eq. (21) to obtain the total weight, which is calculated by the following formula:
For each particle, normalize its weight:
Calculate effective sampling scale
The obtained particles represent the posterior distribution of the target state at
The target state value is estimated according to particle
The region cost reflects the penalty for each pixel being missegmented, such as
Relationship cost
A self-similar matrix is a graph that can reflect the cyclic properties of a system. State cycling exists in many dynamic systems. To express the cyclic characteristics of such a dynamic system in the form of graphs, Eckmann proposed a Recurrence Plot of signals that could be used to display the recursion of states in the phase space. The Recurrence Plot was defined as follows:
where
The self-similar matrix reflects the correlation of sequential images. Suppose there is a video image sequence,
Element
Simulation is carried out to validate the feasibility of the proposed automatic posture capture method.
A total of 1,500 healthy volunteers, 900 men and 600 women, aged 18 to 35, were recruited for the exercise experiment. In the process of the experiment, the subjects need to complete a number of sports experimental tasks. During the experiment, the data of the sensor terminal is transmitted to the monitoring software of the upper computer by wireless way for real-time display and storage. Then, the data of each task was selected from the experimental data of each tester, and the motion state under the corresponding posture was determined at each time point to verify the effectiveness of different methods.
Experimental environment
Experimental environment
The proposed method, the method in Reference [5] and method in Reference [6] method are comparatively analyzed in terms of the recall and precision of sports posture information collection.
The recall comparison results of sports posture information collection are shown in Fig. 8.
Comparison of recall.
The recall rate of method in Reference [5] reaches the maximum of 85% in 70 experiments and that of method in Reference [6] reaches the maximum of 86% in 13 experiments. Compared with these two methods, the recall rate of this method reaches the maximum of 97% in 47 experiments, indicating that the recall rate of sports posture information collection in this method is higher than these two methods. The results of information collection are more comprehensive.
Comparison of accuracy rate (unit: %)
Comparison of precision.
Comparison of time-consuming for automatic capture of sports posture.
The comparison results of the accuracy rate of sports posture information collection are shown in Fig. 9.
The precision rate of the method in Reference [5] reaches the maximum value of 81% in 70 experiments, and the precision rate of the method in Reference [6] reaches the maximum value of 80% in 13 experiments. In contrast, the proposed method reaches the maximum value of 96% in 60 experiments, indicating higher precision rate of sports posture information collection.
The accuracy of sports posture automatic capture of the proposed method, the method in Reference [5] and the method in Reference [6] are compared.
The average accuracy rate of sports posture automatic capture of the method in Reference [5] is 85.1%, and that of method in Reference [6] is 78.5%. In contrast, the accuracy rate of sports posture automatic capture of the proposed method is 95.5%, which is the highest.
The time-consuming of the three method in sports posture capturing is also compared.
The maximum and minimum time-consuming of sports posture automatic capture by the method in Reference [5] method are 271 ms and 169 ms, respectively; that by the method in Reference [6] are 286 ms and 182 ms, respectively. In contrast, the maximum time-consuming of sports posture automatic capture of the proposed method is 79 ms and the minimum is 62 ms, indicating the proposed method takes less time in sports posture automatic capturing and has higher efficiency.
With the rapid development of the Internet, more and more people are under pressure. More and more people are busy with work and study and have no time and energy to exercise. Jiang’s death often occurs in many industries, so people have begun to pay attention to their physical condition and hope to achieve the effect of fitness through exercise. People want to quantitatively understand their exercise effects, such as exercise volume, exercise intensity and other information, and skillfully turn to some sports aids. Motion posture capture technology is an important means to obtain human body posture information, and has been widely used in sports training, gait recognition, action recognition, medical rehabilitation and other fields. Therefore, this paper proposes an automatic motion capture method based on sensor information fusion. Experiments show that after the method in this paper is applied, the recall rate of motion pose information collection reaches 97%, the capture accuracy rate is 95.5%, and the maximum time-consuming to capture motion pose is 79 ms, which has good application performance and can be used for motion poses. The further development of capture theory lays a solid theoretical basis for computing. Although the method in this paper has certain application effects, it does not consider the interference of the surrounding environment in the operation process, which is also the future research direction.
