Abstract
In the last ten years, many variants of the principal component analysis were suggested to fight against the curse of dimensionality. Recently, A. Sharma et al. have proposed a stable numerical algorithm based on Householder QR decomposition (HQR) called QR PCA. This approach improves the performance of the PCA algorithm via a singular value decomposition (SVD) in terms of computation complexity. In this paper, we propose a new algorithm called RRQR PCA in order to enhance the QR PCA performance by exploiting the Rank-Revealing QR Factorization (RRQR). We have also improved the recognition rate of RRQR PCA by developing a nonlinear extension of RRQR PCA. In addition, a new robust RBF
Keywords
Introduction
Nowadays, insecurity has become a major issue in various sectors, including the computer resources to be implemented to resolve this problem. Verification and identification of individuals are one of the means to ensure this security. The human being uses his visual system in a daily basis to identify people automatically, although the process involved is complex. Today there are means of verification of identity, which are related to either what a person has such as an identity card, or knows such as a password or PIN. Nevertheless, these elements can be forgotten, stolen or falsified. To circumvent these limitations, another means of security has been carried out which makes it possible to use the information intrinsic to a person. This new way of identifying individuals is biometrics [1]. The advantage of biometric characteristics is to be universal that is to say present in all the people to be identified. On the other hand, they are measurable and unique; two people can not have exactly the same characteristic. They are also permanent which means that they do not vary over time. The main interest of biometrics is therefore to automatically recognize and identify the identities of individuals using their physiological or behavioral characteristics. The physiological characteristics include for example the face [2], the iris [3], and the fingerprint [4]. Behavioral characteristics can include for instance the voice [5], the signature [6], and the keystroke dynamics [7].
This study is interested in facial biometrics which has many advantages such as the ease-of-use, user acceptance and its low cost. Nevertheless, facial images constitute a large scale data set. Wich usually contain redundant and noisy features that most of the time lead to biased recognition accuracy. To overcome that problem, several conventional dimensionality reduction algorithms have been proposed like Principal Component Analysis (PCA) [8], Kernel PCA (KPCA) [9], Linear Discriminant Analysis (LDA) [10], Relevance Weigthed LDA (RW-LDA) [11, 12], Kernel RW-LDA (KRWDA) [13], Regularized Discriminant Analysis (RDA) [14], Singular Value Decomposition (SVD) [15], Locality Preserving Projections (LPP) [16], and Independent Component Analysis (ICA) [17, 18]. Also many hybrid technique combines several feature extraction algorithms have been successfully employed. For example, the paper [19] have proposed a new score fusion of two robust dimensionality reduction algorithms in discrete wavelet transform (DWT) domain. These algorithms are relevance weigthed LDA with QR factorization (RW-LDA/QR) [11, 12] and SVD using the left and right singular vectors. In [22], the authors suggested a simple and new discriminative post-processing framework to make the dimensionality reduction methods robust to outliers. In the same context, Hanli Qiao proposed a new algorithm called Discriminative PCA [23]. The purpose of the proposed algorithm is to retain both the strong points of PCA and LDA by to find a feature subspace contains discriminant principal components.
Generally speaking, PCA remains the most adapted technique to extract the expressive features in any recognition application, but with a very expensive computation cost. As a recent solution to this weakness, A. Sharma et al. [24] have proposed to exploit the QR decomposition to speed up the process of PCA computation. In this paper, we focus on the application of RRQR factorization in PCA algorithm to accelerate the computation of principal components (PCs). Then, we develop a new non-linear extension of PCA using kernel trick called KPCA/RRQR. In addition, a new robust
The rest of this paper is structured as follows. In Section 2, we review briefly some related PCA algorithms. The proposed algorithm is explained in Section 3. Experimental results and discussion are presented and discussed in Section 4. Finally in Section 5, we give a concluding relevant to the paper.
Related work
Principal Component Analysis (PCA) is a linear dimensionality reduction method very used and adapted in several fields of applications such as pattern recognition, artificial intelligence, data mining, and computer vision. Its main purpose is to reduce the number of variable of the input data
Where
We assume that
[H] SVD PCA algorithm[1] Data matrix
The computational complexity of SVD PCA is
Principal component analysis using RRQR factorization
The rank revealing-
Where
A comparison of CPU time between Householder QR and economic RRQR factorizations.
According to the low computational complexity of economic RRQR factorization, we propose to modify the QR PCA algorithm by the use of economic RRQR factorization. Hence, the proposed RRQR PCA method is presented in Algorithm 1.
[H] RRQR PCA algorithm[1] Data matrix
In the next experiment, we have compared our approach with QR PCA [24] in term of the execution time. For that, we have used 3000 samples with a variable size from 6000 to 11000. Figure 2 depicts a comparison betwen the CPU time for both factorizations. It is clear that our approach is more faster than QR PCA variant.
Numerical comparison for PCA based on QR and RRQR factorizations.
Among the weaknesses of PCA is that cannot extract the nonlinear structure of the raw data [30]. To overcome this limitation, the researchers have proposed an effective extension of PCA called Kernel-PCA (K-PCA) by exploiting a nonlinear kernel function [9, 31]. This nonlinear version implements the same principle as PCA except for the mathematical transformation of the original dataset
Where,
Such that
With
The diagonalization of the covariance matrix
It’s clear to see that each eigenvector
By replacing Eq. (9) in Eq. (6), we obtain a new eigenvalues problem as:
Where
The Gram
Thanks to this matrix property
Where
It can be noted that, the most used kernel functions in conventional KPCA algorithms are:
Polynomial kernel: Gaussian RBF kernel:
In general, Gaussian RBF kernel gives good recognition rates when the optimal parameter is adapted [34]. However, classical RBF kernel is based on
In the rest of this paper, we use the
Robust kernel-PCA using RRQR factorization.[1] Data matrix
Hereafter the proposed algorithm is called
In the test step of our proposed algorithm, we get the test features
Where
In this section, we evaluate the recognition performance of our proposed algorithm using two face databases, such as ORL: Olivetti Research Laboratory [20] and Face Recognition Technology (FERET) [25]. The our RKPCA approach was compared to related existing algorithms including PCA [40], 2DPCA [41], PCA-L1 [42], WTPCA-L1 [43], LDA [10], and 2DLDA [44]. In our experiments, we have used a 2.00 GHZ i7 Processor, 8Go of RAM and the MATLAB as development environment.
Some face images from the ORL database.
The ORL face database contains 40 individuals. Each individual is represented by 10 different images. So the base contains 400 grayscale faces with a fixed size of 112
In the first experiment, we select the first five face-images of each individual for training-set and then we take the last five as testing face images. The total number of testing and training face-images is both 200. Table 1 shows the recognition rates of RRQR PCA and our two proposed algorithms. The first KPCA/RRQR algorithm uses the classic Gaussian RBF kernel and the second RKPCA method uses a new RBF
Recognition rates of RRQR PCA and our proposed algorithms on the ORL databases
Recognition rates of RRQR PCA and our proposed algorithms on the ORL databases
Average recognition rates on ORL database with noise free and noisy conditions with Salt and Pepper noise
In the next experiment, we have also examined the robustness of our approach to noises. To do that, Salt & Pepper and Gaussian noises are added to the facial images in ORL database. The noise density inserted varies with the values
Impact of Gaussian-noise on recognition accuracy for ORL database.
The Facial Recognition Technology (FERET) is a facial database containing over 10,000 facial images. These images are taken in different situations with a size of 80
Some face images from the FERET database.
Table 3 lists the recognition rates of RRQR PCA and RKPCA algorithms on the FERET database by varying the number of features from 10 to 120. Figure 6 shows some facial-images from the FERET database with and without noises. The second row and the third row are noisy with Salt & Pepper noise and Gaussian noise respectively. From the results displayed in Table 3, it can be observed that our KPCA/RRQR algorithm gives better recognition rates than RRQR PCA. Otherwise, RKPCA algorithm performs better when using RBF
Recognition rates of RRQR PCA and our algorithms on the FERET database
Average recognition rates on FERET database in noise free and noisy conditions with Salt and Pepper noise
Some facial-images of the FERET database with and without noises. The second row and the third row are noisy with Salt and Pepper noises and Gaussian noises respectively. 
We have also examined the robustness of our approach to noises. Salt & Pepper and Gaussian noises are added on the facial-images in FERET database. The noise density inserted varies with the values
Impact of Gaussian-noise on recognition accuracy of our approach on FERET database. 
Based on a novel RBF
Footnotes
Acknowledgments
The authors wish to thank the anonymous reviewers, whose comments were a great help to raise the level of technical and formal correctness of this paper.
