Convolution Based Face Recognition Using DWT and HOG: Jyothi Ravikumar, Ramachandra A C, K B Raja and Venugopal K R
Abstract: The physiological face biometric trait is used to identify a person in many real-time applications. A convolution-based feature extraction technique for face identification using the Discrete Wavelet Transform (DWT) and Histogram of Oriented Gradients (HOG) is proposed to recognize human beings effectively. Four standard face databases with different image sizes are considered, and the images are resized to 128×128 to obtain a uniform size. The Two-Dimensional Discrete Wavelet Transform (2D-DWT) is applied to the resized face images and only the LL sub-band is retained. HOG is applied to the LL sub-band to obtain HOG coefficients. A 2D convolution of the LL sub-band and the HOG matrix gives the final features. The resized face image is thus compressed using DWT and HOG. The Euclidean distance (ED) is used to compare the features of database face images with those of test images to compute the performance parameters. The performance of the proposed method is better than that of the existing methods.

Keywords: Discrete Wavelet Transform (DWT); Histogram of Oriented Gradients (HOG); Biometrics; Face Identification.

I. INTRODUCTION

A biometric system recognizes a person based on physiological and behavioral characteristics. The need for higher security in identification and verification has become significant. Biometric-based solutions are capable of providing authentic and confidential transactions. Biometric applications are found in military, government and commercial settings. Biometrics has become suitable for personal authentication and offers significantly higher accuracy compared with traditional methods.

Face recognition (FR) is a popular biometric technique that is widely used in the field of human-computer interaction. Much research has been carried out in FR on feature extraction, in order to achieve higher accuracy and reduce processing time. One well-known feature extraction algorithm is the Histogram of Oriented Gradients (HOG), which is used as a face descriptor.

Contribution: In this paper, convolution-based face recognition using DWT and HOG is proposed. The DWT is used to obtain a compressed set of transform coefficients. HOG is applied to the LL band to further compress the LL coefficients into oriented gradients. The final features are obtained by convolving the LL and HOG coefficients.

The paper is organised as follows. Section II contains the literature survey. Section III describes the proposed model. Section IV discusses performance parameters such as MTSR, OTSR and EER and tabulates a comparison of the proposed method with existing methods. The conclusion is given in Section V.

II. LITERATURE SURVEY

This section reviews existing techniques for recognizing human beings from face images, covering the preprocessing, feature extraction and matching techniques presented by various researchers. Bilel Ameur et al. [1] proposed the GLBSIF algorithm, which combines Local Binary Patterns (LBP), Gabor wavelets and Binarized Statistical Image Features (BSIF) for feature extraction. They then used PCA for dimensionality reduction and KNN-SRC for classification. The results showed that GLBSIF reduces computation time and gives better accuracy than traditional methods. Dong Wang et al. [2] proposed an improved 2D-DLDA method. The method weakens the effect that the edge classes have on the selection of the projection direction by redefining the class scatter matrix. The 2D-VLDA algorithm can achieve a higher recognition rate than the existing algorithm. Michel F. Valstar et al. [3] proposed a method that includes Action Unit (AU) event detection and AU intensity estimation. Common binary classifiers applied to the problem include Artificial Neural Networks (ANN), boosting techniques, Support Vector Machines (SVM), AdaBoost and GentleBoost. The BP4D database is used to evaluate the performance parameters of the proposed model, and the results indicated higher recognition accuracy. Nawaf Yousef Almudhahka et al. [4] proposed an algorithm for face recognition that retrieves a suspect's image from a database using face descriptors. The method explores automated face recognition on the LFW database, and the results showed significant retrieval accuracy.

III. PROPOSED MODEL
ICIIBMS 2018, Your Track number here, Bangkok, Thailand
In this section, the proposed model using DWT, HOG and convolution is discussed. The block diagram of the proposed model is shown in Fig. 1.

[...] details in the horizontal direction lead to the detection of horizontal edges, details in the vertical direction lead to the detection of vertical edges, and details in both the horizontal and vertical directions lead to the detection of diagonal edges.

The analysis of 2-D signals uses the following two-dimensional filter functions, obtained by multiplying separable scaling and wavelet functions over n1 samples in the horizontal direction and n2 samples in the vertical direction, as defined by the following equations.

For the approximate signal,

φ(n1, n2) = φ(n1) φ(n2)    (1)
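The separable construction in Eq. 1 can be illustrated with a minimal sketch of a single-level 2D DWT. The text does not say which wavelet is used, so the Haar wavelet is assumed here purely for illustration, and the function name `haar_dwt2` and the sub-band variable names are hypothetical. Filtering is applied first along the horizontal direction and then along the vertical direction, which yields the LL (approximation) sub-band and three detail sub-bands.

```python
import numpy as np

def haar_dwt2(image):
    """Single-level 2D Haar DWT using separable row/column filtering.

    Assumption: Haar wavelet (the paper does not name the wavelet).
    Returns (ll, horiz_detail, vert_detail, diag_detail), each half the
    input size in both dimensions.
    """
    x = np.asarray(image, dtype=float)
    if x.shape[0] % 2 or x.shape[1] % 2:
        raise ValueError("image height and width must be even")
    s = np.sqrt(2.0)
    # Horizontal direction: low-pass (sum) and high-pass (difference)
    # of neighbouring column pairs.
    lo = (x[:, 0::2] + x[:, 1::2]) / s
    hi = (x[:, 0::2] - x[:, 1::2]) / s
    # Vertical direction: repeat the filtering on neighbouring row pairs.
    ll = (lo[0::2, :] + lo[1::2, :]) / s            # approximation (LL)
    horiz_detail = (lo[0::2, :] - lo[1::2, :]) / s  # responds to horizontal edges
    vert_detail = (hi[0::2, :] + hi[1::2, :]) / s   # responds to vertical edges
    diag_detail = (hi[0::2, :] - hi[1::2, :]) / s   # responds to diagonal edges
    return ll, horiz_detail, vert_detail, diag_detail

# Usage: a 128x128 image gives a 64x64 LL sub-band.
face = np.ones((128, 128))
ll, _, _, _ = haar_dwt2(face)
print(ll.shape)
```

Under this assumption, a 128×128 resized face image yields a 64×64 LL sub-band, which is the only sub-band the proposed method keeps.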
[...] reason for dividing the whole image into 8×8 cells is that it provides a compact representation. The representation is not only more compact, but calculating a histogram over a patch also makes it more robust to noise: individual gradients may be noisy, but a histogram over an 8×8 patch is much less sensitive to noise. The angles lie between 0 and 180 degrees instead of 0 to 360 degrees. These are called unsigned gradients because a gradient and its negative are represented by the same numbers. Fig. 3 illustrates the computation of HOG from the magnitude and direction of the gradient for an 8×8 patch. The bin is selected based on the direction, and the value of the vote is determined by the magnitude.

[...] features. The steps are the same as those mentioned in the enrolment section.

G. Matching

The final features of the test images are compared with the features of the database images using the ED formula given in Eq. 8.

ED = \sqrt{\sum_{i=1}^{M} (p_i - q_i)^2}    (8)

where M is the dimension of the feature vector, p_i is the database feature vector and q_i is the test feature vector.

IV. PERFORMANCE ANALYSIS

In this section, the definitions of the performance parameters and the performance analysis using various face databases are given. The values of FAR, FRR, EER, MTSR and OTSR are computed for different combinations of Person Inside Database (PID) and Person Outside Database (POD).
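The Euclidean-distance matching on which these parameters depend can be sketched as follows. This is only an illustration under stated assumptions: the function names `euclidean_distance` and `match` are hypothetical, and the accept/reject rule (accept the nearest database entry if its distance does not exceed a threshold) is an assumed decision rule, since the text gives only the distance formula.

```python
import numpy as np

def euclidean_distance(p, q):
    """Euclidean distance between a database feature vector p and a test
    feature vector q, both flattened to length M."""
    p = np.asarray(p, dtype=float).ravel()
    q = np.asarray(q, dtype=float).ravel()
    if p.shape != q.shape:
        raise ValueError("feature vectors must have the same dimension")
    return float(np.sqrt(np.sum((p - q) ** 2)))

def match(test_feature, database_features, threshold):
    """Return (index, distance) of the nearest database entry.

    Assumed rule: the index is None when the smallest distance exceeds
    the threshold, i.e. the test image is rejected.
    """
    distances = [euclidean_distance(f, test_feature) for f in database_features]
    best = int(np.argmin(distances))
    if distances[best] <= threshold:
        return best, distances[best]
    return None, distances[best]

# Usage with toy 2-D feature vectors.
database = [[0.0, 0.0], [3.0, 4.0]]
print(match([3.0, 4.0], database, threshold=0.5))
print(match([0.0, 1.0], database, threshold=0.5))
```

Sweeping the threshold in such a rule is what produces the threshold-dependent rates discussed in this section.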
(9)
2. False Rejection Rate (FRR): It is the probability that genuine persons inside the database are rejected, and it can be calculated using Eq. 10.
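A minimal sketch of how such rates could be computed from matching distances is given below. It rests on assumptions that are not taken from the text: FRR is taken as the percentage of PID distances above the threshold, FAR as the percentage of POD distances at or below the threshold (the usual definition of false acceptance), and the EER is approximated at the tested threshold where FAR and FRR are closest. The function names and the toy distances are hypothetical.

```python
import numpy as np

def far_frr(genuine_dist, impostor_dist, threshold):
    """Percentage FAR and FRR at one threshold.

    genuine_dist: distances for persons inside the database (PID).
    impostor_dist: distances for persons outside the database (POD).
    Assumption: a test image is accepted when distance <= threshold.
    """
    genuine = np.asarray(genuine_dist, dtype=float)
    impostor = np.asarray(impostor_dist, dtype=float)
    frr = 100.0 * float(np.mean(genuine > threshold))    # PID wrongly rejected
    far = 100.0 * float(np.mean(impostor <= threshold))  # POD wrongly accepted
    return far, frr

def equal_error_rate(genuine_dist, impostor_dist, thresholds):
    """Approximate EER: the tested threshold where |FAR - FRR| is smallest.
    Returns (threshold, mean of FAR and FRR at that threshold)."""
    rates = [far_frr(genuine_dist, impostor_dist, t) for t in thresholds]
    gaps = [abs(far - frr) for far, frr in rates]
    i = int(np.argmin(gaps))
    far, frr = rates[i]
    return thresholds[i], (far + frr) / 2.0

# Usage with toy distances (illustrative values only, not results from the paper).
genuine = [0.1, 0.2, 0.3, 0.9]
impostor = [0.25, 0.8, 1.0, 1.2]
for t in [0.0, 0.5, 1.5]:
    print(t, far_frr(genuine, impostor, t))
print(equal_error_rate(genuine, impostor, [0.0, 0.5, 1.5]))
```

With this rule, raising the threshold lowers FRR and raises FAR, which matches the trend reported for the databases in this section.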
Fig 4. Variations of performance parameters for PID and POD of 10:10 combination.
Fig 5. Variations of performance parameters for PID and POD of 20:10 combination.
Fig 7. Variations of performance parameters for PID and POD of 10:10 combination.
Fig 8. Variations of performance parameters for PID and POD of 10:20 combination.
The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 7, 8 and 9 for PID and POD combinations of 10:10, 10:20 and 10:30 respectively, with PID held constant.

C. Analysis Using JAFFE Database

The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 10, 11 and 12 for PID and POD combinations of 4:4, 5:4 and 6:4 respectively. It is observed that the value of FRR decreases with increase in threshold value, whereas the values of FAR and TSR increase with threshold value. The percentage EER values are decreasing for PID and POD combinations of 4:4, 5:4 and 6:4 respectively.

The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 13, 14 and 15 for PID and POD combinations of 4:4, 4:5 and 4:6 respectively.
Fig 10. Variations of performance parameters for PID and POD of 4:4 combination.
Fig 11. Variations of performance parameters for PID and POD of 5:4 combination.
Fig 12. Variations of performance parameters for PID and POD of 6:4 combination.
Fig 13. Variations of performance parameters for PID and POD of 4:4 combination.
Fig 14. Variations of performance parameters for PID and POD of 4:5 combination.
Fig 15. Variations of performance parameters for PID and POD of 4:6 combination.
The percentage variations of EER, OTSR and MTSR for PID and POD combinations of 4:4, 5:4 and 6:4 are tabulated in Table 3. It is observed that as PID increases, the OTSR values increase and the MTSR values remain constant.

The percentage variations of EER, OTSR and MTSR for PID and POD combinations of 4:4, 4:5 and 4:6 are tabulated in Table 4. It is observed that when PID is kept constant and POD increases, the EER decreases, whereas OTSR and MTSR remain constant.
TABLE 3
VARIATION OF PERFORMANCE PARAMETERS WITH PID FOR JAFFE
PID   POD   %EER   %OTSR   %MTSR
4     4     12     75      100
5     4     15     80      100
6     4     12     84      100

TABLE 4
VARIATION OF PERFORMANCE PARAMETERS WITH POD FOR JAFFE
PID   POD   %EER   %OTSR   %MTSR
4     4     12     75      100
4     5     15     75      100
4     6     12     75      100
D. Analysis Using YALE Database

The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 16, 17 and 18 for PID and POD combinations of 8:5, 9:5 and 10:5 respectively. It is observed that the value of FRR decreases with increase in threshold value, whereas the values of FAR and TSR increase with threshold value. The percentage EER values decrease with increase in PID.

Fig 16. Variations of performance parameters for PID and POD of 8:5 combination.
Fig 17. Variations of performance parameters for PID and POD of 9:5 combination.
Fig 18. Variations of performance parameters for PID and POD of 10:5 combination.

The percentage variations of EER, OTSR and MTSR for PID and POD combinations of 8:5, 9:5 and 10:5 are tabulated in Table 5. It is observed that as PID increases, the EER decreases, OTSR increases and MTSR remains constant at 100.

TABLE 5
VARIATION OF PERFORMANCE PARAMETERS WITH PID FOR YALE
PID   POD   %EER   %OTSR   %MTSR
8     5     12     88      100
9     5     11     89      100
10    5     10     90      100

The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 19, 20 and 21 for PID and POD combinations of 8:5, 8:6 and 8:7 respectively.

Fig 19. Variations of performance parameters for PID and POD of 8:5 combination.
Fig 20. Variations of performance parameters for PID and POD of 8:6 combination.
Fig 21. Variations of performance parameters for PID and POD of 8:7 combination.

The percentage variations of EER, OTSR and MTSR for PID and POD combinations of 8:5, 8:6 and 8:7 are tabulated in Table 6. It is observed that when PID is kept constant and POD increases, the EER, OTSR and MTSR all remain constant.
TABLE 6
VARIATION OF PERFORMANCE PARAMETERS WITH POD FOR YALE
PID   POD   %EER   %OTSR   %MTSR
8     5     12     88      100
8     6     12     88      100
8     7     12     88      100

E. Analysis with Combined L-SPACEK Database

The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 22, 23 and 24 for PID and POD combinations of 40:60, 50:60 and 60:59 respectively. It is observed that the value of FRR decreases with increase in threshold value, whereas the values of FAR and TSR increase with threshold value. The percentage EER values are zero for PID and POD combinations of 40:60, 50:60 and 60:59.

Fig 23. Variations of performance parameters for PID and POD of 50:60 combination.
Fig 24. Variations of performance parameters for PID and POD of 60:59 combination.

The percentage variations of EER, OTSR and MTSR for PID and POD combinations of 40:60, 50:60 and 60:59 are tabulated in Table 7. It is observed that as PID increases, the EER remains zero and OTSR and MTSR remain constant.

TABLE 7
VARIATION OF PERFORMANCE PARAMETERS WITH PID FOR L-SPACEK
PID   POD   %EER   %OTSR   %MTSR
40    60    0      98      100
50    60    0      99      100
60    59    0      99      100

The performance parameters, namely percentage FRR, FAR and TSR, are noted for variations in threshold values, and the graphs obtained are shown in Figs. 25, 26 and 27 for PID and POD combinations of 60:40, 60:50 and 60:60 respectively.

Fig 26. Variations of performance parameters for PID and POD of 60:50 combination.
Fig 27. Variations of performance parameters for PID and POD of 60:60 combination.

The percentage variations of EER, OTSR and MTSR for PID and POD combinations of 60:40, 60:50 and 60:60 are tabulated in Table 8. It is observed that when PID is kept constant and POD increases, the EER remains constant, as does MTSR.

TABLE 8
VARIATION OF PERFORMANCE PARAMETERS WITH POD FOR L-SPACEK
PID   POD   %EER   %OTSR   %MTSR
40    60    0      98      100
50    60    0      99      100
60    59    0      99      100

F. Comparison of Recognition Rate of Proposed Method with Existing Methods

The recognition rate of the proposed method on the ORL and YALE face databases is compared in Table 9 with the existing methods presented by Erhu Zhang et al. [9] and by Shailaja and Anuradha [10]. It is observed that the recognition rate of the proposed method is higher than that of the existing methods.

TABLE 9
COMPARISON OF %MTSR VALUES OF PROPOSED METHOD WITH EXISTING METHODS
Methods                            %MTSR (ORL database)   %MTSR (YALE database)
Erhu Zhang et al. [9]              74.06                  80.77
K. Shailaja and B. Anuradha [10]   87                     92.8
Proposed method                    91                     100

The recognition rate of the proposed method on the JAFFE face database is compared in Table 10 with the existing method presented by Yubo Wang et al. [11]. It is observed that the recognition rate of the proposed method is higher than that of the existing method.

TABLE 10
COMPARISON OF %MTSR VALUES OF PROPOSED METHOD WITH EXISTING METHODS
Methods                  %MTSR (ORL database)
Yubo Wang et al. [11]    92.4
Proposed method          100

V. CONCLUSION

Face recognition is used for several applications in day-to-day activities. In this paper, a convolution-based feature extraction technique for face identification using DWT and HOG is proposed. Standard face databases are used to test the proposed method for various performance parameters. The face images of different sizes are converted to a uniform size of 128×128. The DWT is applied to the face images and only the compressed LL sub-band is retained. HOG is applied to the LL sub-band to obtain further compressed initial features. The final features are obtained using 2D convolution of the LL sub-band and the HOG matrix. It is observed that the performance of the proposed algorithm is improved compared with the existing algorithms. In future, the proposed method can be implemented on an embedded system processor or FPGA for real-time applications.

REFERENCES

[1] Bilel Ameur, Mebarka Belahcene, Sabeur Masmoudi, Amira Guidara Derbel and Ahmed Ben Hamida, “A New GLBSIF Descriptor for Face Recognition in the Uncontrolled Environments”, International Conference on Advanced Technologies for Signal and Image Processing, pp. 22-24, 2017.
[2] Dong Wang and Shunfang Wang, “A New Method of Two Dimensional Direct LDA and Its Applications in Face Recognition”, International Conference on Digital Home, pp. 58-63, 2016.
[3] Michel F. Valstar, Enrique Sanchez-Lozano, Jeffrey F. Cohn, Laszlo A. Jeni, Jeffrey M. Girard, Zheng Zhang, Lijun Yin and Maja Pantic, “Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge”, IEEE 12th International Conference on Automatic Face and Gesture Recognition, pp. 839-847, 2017.
[4] Nawaf Yousef Almudhahka, Mark S. Nixon and Jonathon S. Hare, “Automatic Semantic Face Recognition”, IEEE 12th International Conference on Automatic Face and Gesture Recognition, pp. 180-185, 2017.
[5] JAFFE database, http://www.kasrl.org/jaffe_download.html.
[6] L-Spacek database, http://cswww.essex.ac.uk/mv/allfaces.
[7] Yale University, http://cvc.cs.yale.edu/cvc/projects/yalefaces/yalefaces.html.
[8] ORL database, http://www.camrol.co.uk
[9] Erhu Zhang, Yongchao Li and Faming Zhang, “A Single Training Sample Face Recognition Algorithm Based on Sample Extension”, 2013 Sixth International Conference on Advanced Computational Intelligence, pp. 19-21, 2013.
[10] K. Shailaja and B. Anuradha, “Effective Face Recognition using Deep Learning Based Linear Discriminant Classification”, IEEE International Conference on Computational Intelligence and Computing Research, pp. 1-6, 2016.
[11] Yubo Wang, Haizhou Ai, Bo Wu and Chang Huang, “Real Time Facial Expression Recognition with Adaboost”, Proceedings of the International Conference on Pattern Recognition, IEEE, pp. 926-929, 2004.