Mini Project
Mini Project
BACHELOR OF TECHNOLOGY
in
Guided by
INTRODUCTION
Face detection is an essential process in computer vision and image processing, widely
used in applications like security, user identification, and facial analysis. This project
implements face detection for both static images and live video streams using
OpenCV's deep neural network (DNN) module with a pre-trained Caffe model. It leverages
the Single Shot Detector (SSD) architecture, specifically designed for high efficiency in real-
time object detection.
LITERATURE SURVEY
The field of face detection has evolved significantly, driven by advances in computer
vision, machine learning, and deep learning. The literature on face detection methods
spans various techniques, from early image processing approaches to complex deep
neural networks. This literature survey provides a detailed overview of face detection
methodologies, focusing on traditional methods, machine learning-based approaches,
and modern deep learning frameworks.
Early face detection relied heavily on classical image processing and statistical
techniques, which extracted features from images without deep learning. Some
foundational methods included:
• Viola-Jones Algorithm (2001): Viola and Jones developed a framework based on
Haar-like features and AdaBoost, which became the first robust real-time face
detection algorithm. This method uses a cascade of weak classifiers to identify
faces by scanning the image at multiple scales, making it efficient but sensitive to
lighting and pose variations. Viola-Jones
laid the groundwork for real-time detection applications but lacks the
adaptability to complex backgrounds and is less effective in low-light
conditions.
• Histogram of Oriented Gradients (HOG) with SVM (2005): Dalal and Triggs
introduced HOG features combined with Support Vector Machines (SVM) for
object detection, particularly for pedestrians. This approach was extended to
face detection, leveraging gradient orientation information to capture facial
structures. Although HOG-SVM is computationally intensive, it marked an
improvement in feature extraction for faces but struggled with changes in pose and
facial expression.
These traditional methods are fast and computationally efficient but have
limitations in terms of robustness to variations in lighting, pose, and
background.
• Single Shot Multibox Detector (SSD) and YOLO (You Only Look Once):
O SSD (2016): Liu et al. introduced the SSD framework, which combines
object detection and classification in a single step, significantly improving
speed and enabling real-time applications. SSD operates by creating a
fixed grid of bounding boxes and applying convolutions to detect
objects. SSD has become widely used in face detection, particularly in
the Caffe deep learning framework. It is known for being fast, lightweight,
and suitable for mobile and embedded applications, but can be less
accurate with small faces.
O YOLO (2016): YOLO, developed by Redmon et al., also performs detection
and classification in one pass and is known for its speed, achieving near
real-time performance even on limited hardware. Although effective for
face detection, YOLO’s accuracy may be slightly lower than SSD’s in
detecting smaller or occluded faces.
• Multi-task Cascaded Convolutional Networks (MTCNN): MTCNN
combines face detection with landmark localization, improving accuracy by
refining the detection bounding boxes through hierarchical stages. MTCNN has
proven effective in detecting faces across various scales and has been widely used
in applications requiring face alignment.
• Face Detection in Caffe: The Caffe deep learning framework provides an SSD-
based model (e.g., res10_300x300_ssd_iter_140000) for real-time face
detection. This model is pre-trained on a large dataset and fine-tuned for face
detection, achieving high accuracy while being computationally efficient. It is
capable of detecting faces with reasonable confidence in real-time and has
become popular for applications needing a balance between accuracy and
processing speed.
PROBLEM STATEMENT
RESEARCH GAP
Although significant progress has been made in face detection, several research gaps
still exist that limit the effectiveness of current systems:
4. Integration with Real-Time Systems: While models like SSD and YOLO offer
faster detection rates, integrating them into real-time systems with low latency
and high frame rates remains a challenge, particularly in dynamic environments.
There is a research gap in optimizing detection pipelines for smooth integration into
real-time applications without compromising accuracy.
CONCLUSION:
Facial recognition using OpenCV has proven highly effective, with benchmark studies
showing that OpenCV’s Haar Cascade classifiers can reach accuracy rates as high as 98%
for frontal face detection in controlled environments.
When using the deep learning-based DNN module in OpenCV, accuracy improves even
further, with precision rates exceeding 99% on datasets like the LFW (Labeled Faces in the
Wild) under optimal conditions. Speed benchmarks indicate that OpenCV can process
frames at rates of up to 60 frames per second (fps) on high-performance GPUs, and 15–30
fps on standard CPUs.
Additionally, using SSD models paired with ResNet, OpenCV can maintain detection times
as low as 0.1 seconds per frame, making it viable for real-time applications. In terms of
computational efficiency, OpenCV's algorithms are optimized to use approximately 30–
50% less memory than some other facial recognition libraries, making it particularly
suitable for edge devices with limited processing power, such as Raspberry Pi setups,
which can still achieve 12–15 fps with basic CNN models. This combination of high
accuracy, fast processing times, and efficient resource use has positioned OpenCV as a
powerful tool for facial recognition across industries, from surveillance to customer
analytics.
REFERENCES
1. https://www.sciencedirect.com/
2. https://www.ijert.org/research/skin-disease-detection-using-machine- learning-
IJERTCONV9IS03016.pdf
3. https://ieeexpIore.ieee.org/document/9256314