Abstract
Sparse Representation Classification has led to state-of-the-art results in pattern classification tasks. However, as Sparse Representation Classification has significantly high lower complexity, and vehicle recognition is a typical small-sample-size problem and trained dictionary is under-complete, all these give rise to big representation errors and unstable recognition results. In this paper, we develop a new Collaborative Representation based vehicle recognition framework, using acoustic sensor networks to reduce the time complexity in the training and testing phases, and to improve the classification accuracy in complex scenes. In the recognition, the acoustic signals of vehicles are extracted from the acoustic information to get linearly separable samples by Fast Fourier Transform, and then we encode a testing sample through linear combination of all the training samples with regularized least square and classify the testing sample into the class with the minimum representation error. As demonstrated by experimental results, the proposed method has the following two unique and important characteristics: (1) it achieves a superior performance under the circumstance of complex data sets (2) It also shows highly competitive recognition accuracy while has low computational complexity and memory requirements, compared to k-Nearest Neighbor, Support Vector Machines and Sparse Representation Classification algorithm.
Introduction
Target recognition of moving objects by using acoustic sensor networks is an important task with many envisioned applications, and the vehicle classification in acoustic sensor networks is a typical example of the pattern classification theory [1]. Eom in paper [2] claimed that many features of vehicles can be inferred from the sounds they generate, so it is feasible to classify the type of the moving vehicle based on the acoustic signals in complex scenes. Duarte et al. in paper [3] compiled a data set which contained time series data of the target vehicle classes, including Assault Amphibian Vehicle (AAV) and Dragon Wagon (DW) at a running rate of 4960 Hz and implemented vehicle type classification in Wireless Sensor Networks (WSNs) environment. He also detailed the data collection procedure, the feature extraction and the pre-processing steps, and accomplished the task of classifying the type of moving vehicles in distributed networks by using the maximum likelihood classifier based on the multi-dimensional frequency spectrum features of acoustic signals.
In recent years, various classification methods have been proposed in pattern recognition field to adapt to different situations and improve the recognition rate, such as k-Nearest Neighbor (k-NN) [12, 15], Support Vector Machines (SVM) [5, 13, 14] and Sparse Representation Classification (SRC) [6, 7, 8].
Among all of these methods, sparse representation theory which assumes that a data sample can be represented as a sparse linear combination of some basic elements in a dictionary is receiving more and more attention in solving regression and classification problems. Wright et al. [6]found that SRC performed well on face recognition and validated the robustness of SRC in noise and occlusion situations. Mei and Ling [7] proposed a robust visual tracking and vehicle classification approach using sparse representation and demonstrated its effectiveness on a vehicle tracking and classification task using outdoor infrared video sequences. Wang et al. [9] proposed a novel vehicle recognition framework in acoustic sensor networks via Sparse Representation (SR) with Mel-frequency Cepstral Coefficients (MFCC). Rahim et al. [16] proposed a homogeneous multi-classifier system for moving vehicle noise which improve the classification accuracy compared to single classifier.
Existing Sparse Representation based (SR-based) methods for vehicle recognition mentioned above suffer from high computational complexity and memory requirements. Zhang et al. [10] analyzed the working mechanism of SRC and pointed out that it was the collaborative representation but not the
In this paper, the research focuses on the effect of vehicle recognition with different classification methods in acoustic sensor networks. The data set originates from the sensor data collected during a real world WSNs experiment carried out at Twenty-nine Palms, CA in November 2001 [3]. In the recognition, the acoustic of vehicles are extracted from the acoustic information to get linearly separable samples by Fast Fourier Transform (FFT), and then we code a testing sample by linear combination of all the training samples with regularized least square. The proposed method will classify the testing sample into the class with the minimum representation error.
The rest of the paper is organized as follows. In Section 2, we present classification models based on sparse representation and collaborative representation. In Section 3, we explain our vehicle recognition method based on collaborative representation. Experimental results are presented in Section 4. In Section 5, a conclusion is made.
Related works
In this section, we will briefly review SRC for vehicle recognition and then discuss the collaborative representation.
Sparse representation classification
Sparse representation is a signal processing method to represent the main information of the signal using non-zero coefficients as little as possible. To find the sparse representation of a signal, a formula can be established as follows [6].
where
Sparse representation has been widely used in practice such as denoising and classification for its remarkable ability in simplifying signals. In practice, SRC has been proven to be an excellent classification algorithm in the fields of image processing, face recognition [6] and vehicle recognition [7], but we observe that it suffers from high computational complexity and memory requirements.
The procedure of the SRC algorithm can be described as follows.
Through the dictionary
where the former part is the residual and
From Algorithm 1, there are two key points in SRC. The first key point is that the coding vector of query sample
In our discussion in Subsection 2.1, we assumed that there are enough trained samples for each class so that the dictionary
One obvious solution to solving this problem is to use more samples of
The framework of vehicle recognition.
After the CR with all classes, the classifier classifies
It was claimed in [8] that when we judge if
Vehicle recognition framework
We have developed a new vehicle recognition method based on collaborative representation which has low computational complexity and memory requirements. Figure 1 shows the block diagram of the proposed method. After receiving the raw acoustic signals from the sensors distributed in the monitoring zone, pre-processing is essential in order to pick up the useful events own the noise and some other uncertain conditions. Besides, FFT features will be extracted from the original signals. The proposed method will be used to recognize the vehicles according to the compressed features: FFT.
We will get the residuals by the collaborative representation. Finally, the recognition results can be obtained by the minimum residuals of each class.
In the test samples, multiple observations are captured by multiple heterogeneous sensors. In order to improve classification accuracy, we can exploit correlation between different sources. It is clear that voting scheme does not exploit the relationship between different sensors, so here we use a joint model from different sources via fusion in order to make a joint classification decision. Generally, an over-complete dictionary is learned to represent the test signals which we consider containing the whole information of the original signals. Therefore, our joint classification decision can be easily made by a learned over-complete dictionary.
Feature extraction
Acoustic signals on time domain always change quickly and appear to be unsteady. So many methods are proposed on frequency domain for feature extracting. An appropriate feature extraction method is important for classification. Herein, we use classical FFT algorithm to extract the acoustic features.
Collaborative Representation based Classification (CRC)
In this sub-section, we introduce collaborative representation algorithm to solve the vehicle recognition problem. Zhang et al. [10] pointed that it was collaborative representation, but not the
According to [6], to regularize the solution and obtain a competitive performance with less computation complexity, we can use
In order to collaboratively represent the query sample using
where
It can be seen in Eq. (3) that the collaborative represent part is least square (
let
The proposed method for vehicle recognition based on collaborative representation is summarized in the following table:
We evaluate the performance of the proposed method on the data set which is extracted based on the sensor data collected during a real word Wireless Distributed Sensor Networks (WDSN) experiment carried out at Twenty-nine Palms, CA in November 2001 [3]. This data set is available at website on
Sensor field layout.
We implement the proposed algorithm in last Section with MATLAB2013a and run it on an Intel Core (TM) i5-2401 M 2.30 GHzPC with 4-GB RAM.
The results of classification through different method
Computational cost of classification through different methods
Recognition accuracy by different methods.
In this experiment, we just consider the acoustic data recorded at a rate of 4960 Hz. Firstly, we choose the data collected from the third to eleventh runs (AAV3
Time costing of vehicle recognition by CRC, SVM and K-NN.
After feature extraction, the features are sent to the proposed classification method for solving the vehicle recognition problem. There are 90 sample sets for each vehicle totally. Specially, the files used for the test are different from the files used for the training, so we choose feature vectors as training samples randomly and the rest as testing samples.
At the same time, some other methods: K-NN, SVM and SRC are also worked as references to the proposed method. Among them, since kin K-NN equals to 5 (
Computational cost of vehicle recognition by SRC.
To achieve more reliable vehicle recognition results, we have repeated the test for 100 times of each method in the same condition. For the numerous simulations, we are confident to believe that the results are concluded from the reality. First, we study the recognition accuracy of CRC with different size of training samples. Table 1 and Fig. 2 illustrate the vehicle recognition rates for various sizes of training samples with different classification methods: SRC, CRC, SVM and K-NN.
Table 1 and Fig. 3 show that the proposed method lead to an improved performance of vehicle recognition in acoustic filed. Because of the low complexity of the proposed method, there is a big gap of the classification results between CRC and SRC. Obviously, the recognition rates of the proposed method with different size of training samples are better than SRC. Moreover, the proposed method outperformed SRC in terms of CPU running time.
Table 2, Figs 4 and 5 show the computational cost of different methods for vehicle recognition. By observing and analyzing Tables 1 and 2, we discover that, although the recognition effects of different training sizes have large differences, the proposed method yields about 1–5 percentage improvements of the recognition rates and shorter running time, compared to SRC. Generally speaking, the performances of the proposed method in recognition for acoustic signals are not outstanding. However, the shorter running time is the greatest strength over SRC.
It can be obviously seen from Fig. 4 that the computational cost of SRC algorithm enlarges rapidly when the training sizes increase. Fortunately, the proposed method takes very short time for recognition no matter how large the training sizes are. Benefiting from the low complexity of the proposed method, a real-time practical and effective system for vehicle recognition will be established easier.
Judging from the computational cost and the improved performance of vehicle recognition, we can conclude that the proposed method performs better than other three methods. In the practical application, we can achieve comparable recognition quality to some traditional methods.
In this work, we have developed a vehicle recognition method based on collaborative representation, and the proposed method solves the vehicle recognition problem in acoustic sensor networks. Results of experiments demonstrate that the proposed method has higher recognition rates with different size of testing samples when compared to other recognition method. Furthermore, the proposed method has lower computational complexity, and computational cost. In future, we will further study the kernel collaborative representation classification which maps the features into a higher dimensional space to solve the non-linear classification problems in acoustic field. Meanwhile, we will also develop some feature selection methods to enhance the efficiency of vehicle recognition in acoustic sensor networks.
Footnotes
Acknowledgments
This research was supported by Zhejiang Provincial Natural Science Foundation of China under Grant No. LY14F030007, and National Natural Science Foundation of China (NSFC) under Grant No. 61301027, 61771299.
