0% found this document useful (0 votes)
14 views6 pages

Mini Project Synopsis

Uploaded by

Shatakshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views6 pages

Mini Project Synopsis

Uploaded by

Shatakshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

MINI PROJECT SYNOPSIS

ON
OBJECT DETECTION AND IDENTIFICATION
MINI PROJECT LAB(KCS-354)

Submitted by
Shatakshi(2100970130103)
Shubhanshi Misra(2100970130110)
Umesh Pandey(2100970130118)

Under the supervision of


Dr. Javed Miya

Department of Information Technology


Galgotias College of Engineering and Technology
Greater Noida
Session 2022-23
INTRODUCTION

Object detection is a computer vision technique for locating instances of objects in images
or videos. Humans can easily detect and identify objects present in an image. Object
recognition is a computer vision technique for identifying objects in images or videos.
Object recognition is a key output of deep learning and machine learning algorithms. When
humans look at a photograph or watch a video, we can readily spot people, objects, scenes,
and visual details. The goal is to teach a computer to do what comes naturally to humans: to
gain a level of understanding of what an image contains.

Object detection is the process of finding instances of objects in images. In the case of deep
learning, object detection is a subset of object recognition, where the object is not only
identified but also located in an image. This allows for multiple objects to be identified and
locate.

Though there are many machine learning and deep learning algorithms for object detection
and recognition, such as Support vector machine (SVM), Convolutional Neural Networks
(CNNs), Regional Convolutional Neural Networks (R-CNNs), You Only Look Once
(YOLO) model etc., it is important to choose the right algorithm for autonomous driving as
it requires real-time object detection and recognition. Since machines cannot detect the
objects in an image instantly like humans, it is really necessary for the algorithms to be fast
and accurate and to detect the objects in real-time, so that the vehicle controllers solve
optimization problems at least at a frequency of one per second.

PROPOSAL/PLAN

This section explains the approach/methodology which will be followed during the project.

This project will aim to develop a prototype Android application which will implement
real-time object detection with semantic segmentation using a trained and quantized custom
deep learning algorithm on the live streaming image frames receiving from the device
camera.

The application will identify objects based on the model trained on a suitable database (such
as, ImageNet) using supervised learning. It will identify the objects in frames and perform
semantic segmentation in real-time to apply the approach for complete scene understanding,
in short:

Perform object detection to draw bounding boxes around each instance of a class

Perform semantic segmentation on each of the bounding boxes and identify the object class.

METHODOLOGY

R-CNN

To bypass the problem of selecting a huge number of regions, Ross Girshick et al. proposed

method where we use selective search to extract just 2000 regions from the image and

called them region proposals. Therefore, now, instead of trying to classify a huge number of
regions, you can just work with 2000 regions. These 2000 region proposals are generated
using the selective search algorithm which is written below.
Selective Search:
1. Generate initial sub-segmentation, we generate many candidate regions.
2. Use greedy algorithm to recursively combine similar regions into larger ones.
3. Use the generated regions to produce the final candidate region proposals.

Fast R-CNN

The same author of the previous paper(R-CNN) solved some of the drawbacks of R-CNN to
build a faster object detection algorithm and it was called Fast R-CNN. The approach is
similar to the R-CNN algorithm. But, instead of feeding the region proposals to the CNN,
we feed the input image to the CNN to generate a convolutional feature map. From the
convolutional feature map, we identify the region of proposals and warp them into squares
and by using a RoI pooling layer we reshape them into a fixed size so that it can be fed into a
fully connected layer.

Faster R-CNN

Both of the above algorithms(R-CNN & Fast R-CNN) uses selective search to find out the
region proposals. Selective search is a slow and time-consuming process affecting the
performance of the network. Therefore, Shaoqing Ren et al. came up with an object
detection algorithm that eliminates the selective search algorithm and lets the network learn
the region proposals.

Similar to Fast R-CNN, the image is provided as an input to a convolutional network which
provides a convolutional feature map. Instead of using selective search algorithm on the
feature map to identify the region proposals, a separate network is used to predict the region
proposals. The predicted region proposals are then reshaped using a RoI pooling layer which
is then used to classify the image within the proposed region and predict the offset values for
the bounding boxes.

CURRENT METHOLOGY

YOLO — You Only Look Once

All of the previous object detection algorithms use regions to localize the object within the
image. The network does not look at the complete image. Instead, parts of the image which
have high probabilities of containing the object. YOLO or You Only Look Once is an object
detection algorithm much different from the region-based algorithms seen above. In YOLO
a single convolutional network predicts the bounding boxes and the class probabilities for
these boxes.
TIME FLOWCHART

Module Work done Duration


Module 1 Group Formation, Topic 12 Sep-19 Sep
Finalization
Module 2 Literature survey 26 Sep-3 Oct
Module 3 Assessment 3 Oct-8 Oct
Module 4 Implementation 8 Oct-10 Oct
Module 5 Designing 10 Oct-17 Oct
Module 6 Programming 17 Oct- 20 Oct
Module 7 Creating database 1 Nov-7 Nov
Module 8 PPT formation 14 Nov-21 Nov
Module 9 Making suitable changes 21 Nov-30 Nov
Module 10 Report writing 30 Nov- 10 Dec
preparation
Module 11 Final evaluation by 12 Dec
external committee

UTILIZATION/FUTURE SCOPE
 The future of object detection technology is in the process of proving itself, and much
like the original Industrial Revolution, it has the potential to free people from menial
jobs that can be done more efficiently and effectively by machines.
 It will also open up new avenues of research and operations that will reap additional
benefits in the future.

REFERENCES
 https://www.researchgate.net/publication/
337464355_OBJECT_DETECTION_AND_IDENTIFICATION_A_Project_Report
 https://towardsdatascience.com/r-cnn-fast-r-cnn-faster-r-cnn-yolo-object-detection-
algorithms-36d53571365e
 https://blog.rebellionresearch.com/
 www.youtube.com
 https://www.hackerearth.com/blog/developers/introduction-to-object-detection/
 www.geeksforgeeks.com

You might also like