Object Detection with YOLO

Chad Wakamiya
Spring 2020
Agenda
● Object Detection: defining the object detection problem and a naive solution.
● YOLO Algorithm
○ YOLO algorithm steps
○ Bounding boxes
○ Measuring performance (IoU)
○ Non-max suppression
● YOLO Implementations
○ Pretrained models with the COCO dataset
○ Custom trained models
Object Detection
Classification vs. Object Detection
Object Detection is the problem of locating and classifying objects in an image.

Classification
● Each image has one object.
● The model predicts one label.

Object Detection
● Each image may contain multiple objects.
● The model classifies objects and identifies their locations.

[Figure: example images labeled Cat, Car, and Dog; in the detection examples each object is outlined by a bounding box.]
Naive Approach
1. Scan the image with a sliding window.
2. Feed each window into a classifier model (CNN) to predict a label for that region (Dog? Person? Nothing?).

● This approach is slow because it checks many windows that don't contain anything, so it is not suitable for real-time use.
● The Region-based Convolutional Neural Net (R-CNN) is an improved version that strategically selects regions that are likely to contain an object to run through the CNN.
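To get a feel for why the sliding-window approach is slow, the following minimal sketch enumerates the windows a naive detector would have to classify. The image size, window size, and stride are illustrative assumptions, not values from the slides, and a real detector would also repeat the scan at several window sizes.

```python
def sliding_windows(image_w, image_h, win_w, win_h, stride):
    """Yield (x, y, w, h) for every window position that fits inside the image."""
    for y in range(0, image_h - win_h + 1, stride):
        for x in range(0, image_w - win_w + 1, stride):
            yield (x, y, win_w, win_h)


# Assumed example: a 640x480 image scanned with a 64x64 window and a stride of 16 pixels.
windows = list(sliding_windows(640, 480, 64, 64, 16))
num_windows = len(windows)

# Each window would need its own pass through the classifier.
print(f"{num_windows} classifier calls for a single window size")
```

Even at one window size this comes to roughly a thousand classifier calls per image, most of them on regions that contain nothing of interest.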
YOLO Algorithm
YOLO "You Only Look Once"
● Instead of making predictions on many regions of an image, YOLO passes the entire image at once into a CNN that predicts the labels, bounding boxes, and confidence probabilities for objects in the image.
● YOLO runs much faster than region-based algorithms because it requires only a single pass through a CNN.

[Figure: Input image → Convolutional Neural Net → Output image with a bounding box, label, and confidence probability (Car: 0.93).]
YOLO Steps
1. Divide the image into cells with an S x S grid (S = 3 in the illustration).
2. Each cell predicts B bounding boxes (B = 2 in the illustration, shown as 2 blue dots per cell). A cell is responsible for detecting an object if the center of the object's bounding box falls within the cell.
3. Return the bounding boxes above the confidence threshold (here, Car: 0.93). All other bounding boxes have a confidence probability less than the threshold (say 0.90), so they are suppressed.

In practice, we would use larger values (S = 19 and B = 5) to identify more objects.
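Steps 1 and 3 can be sketched in a few lines. The helper names, the dictionary layout of a detection, and the 300x300 image are assumptions made for illustration; the sketch treats a cell as responsible when the center of the box falls inside it.

```python
def responsible_cell(cx, cy, image_w, image_h, S):
    """Return (row, col) of the grid cell that contains the box center (cx, cy)."""
    col = min(int(cx / image_w * S), S - 1)  # clamp so a center on the right edge stays in the grid
    row = min(int(cy / image_h * S), S - 1)
    return row, col


def filter_by_confidence(detections, threshold):
    """Keep only the detections whose confidence reaches the threshold."""
    return [d for d in detections if d["confidence"] >= threshold]


# Assumed example: 300x300 image, S = 3, so each cell is 100x100 pixels.
cell = responsible_cell(160, 220, 300, 300, S=3)

# Hypothetical raw predictions; only the first clears the 0.90 threshold.
detections = [
    {"label": "car", "confidence": 0.93},
    {"label": "car", "confidence": 0.41},
    {"label": "dog", "confidence": 0.12},
]
kept = filter_by_confidence(detections, threshold=0.90)
print(cell, kept)
```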
How are bounding boxes encoded?
Let's use a simple example where there are 3x3 cells (S=3), each cell predicts 1 bounding box (B=1), and objects are either dog = 1 or human = 2. For each cell, the CNN predicts a vector y:

y = [pc, bx, by, bh, bw, c1, c2]

● pc: Probability the bounding box contains an object.
● bx, by: Coordinates of the bounding box's center.
● bh, bw: Height and width of the bounding box as a percent of the cell's height and width.
● c1, c2: Probability the cell contains an object that belongs to class 1 (or 2), given the cell contains an object.*

Example: y = [1, bx, by, bh, bw, 0, 1] (the cell contains an object, and that object is a human).

*There's a probability for each class, so if there are 80 classes we would have c1, …, c80.
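As a rough illustration of this encoding, the sketch below builds y for one labeled box in the S=3, B=1, two-class setting. The function name and the pixel values are hypothetical, and measuring the center relative to the top-left corner of its cell (in units of cell size) is an assumption; the slides only state that height and width are expressed relative to the cell.

```python
def encode_box(cx, cy, w, h, class_id, image_w, image_h, S, C):
    """Return ((row, col), y) where y = [pc, bx, by, bh, bw, c1, ..., cC]."""
    cell_w, cell_h = image_w / S, image_h / S
    col = min(int(cx // cell_w), S - 1)
    row = min(int(cy // cell_h), S - 1)
    bx = cx / cell_w - col   # center offset inside the cell (assumed convention)
    by = cy / cell_h - row
    bh = h / cell_h          # height as a fraction of the cell height
    bw = w / cell_w          # width as a fraction of the cell width
    classes = [0.0] * C
    classes[class_id - 1] = 1.0  # class ids start at 1 (dog = 1, human = 2)
    return (row, col), [1.0, bx, by, bh, bw] + classes


# Assumed example: a human (class 2) centered at (150, 250) in a 300x300 image,
# with a box 120 pixels wide and 80 pixels tall.
cell, y = encode_box(150, 250, 120, 80, class_id=2,
                     image_w=300, image_h=300, S=3, C=2)
print(cell, y)
```

Note that bw can exceed 1 when a box is wider than a single cell, as it is here.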
Encoding Multiple Bounding Boxes
What happens if we predict multiple bounding boxes per cell (B>1)? We simply augment y. For B=2:

y = [pc, bx, by, bh, bw, pc, bx, by, bh, bw, c1, c2]

The first five entries describe the first bounding box, the next five describe the second, and c1, c2 are the class probabilities. Notice that y has 5B+C elements (C is the number of classes).

The CNN will predict a y for each cell, so the size of the output tensor (multidimensional "matrix") should be:

S×S×(5B+C)
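The size formula is easy to check numerically. This small sketch (the function name is an assumption) evaluates S×S×(5B+C) for the toy example and for the larger S = 19, B = 5 setting mentioned earlier, using 80 classes.

```python
def yolo_output_shape(S, B, C):
    """Shape of the output tensor: one vector of 5B + C values per grid cell."""
    return (S, S, 5 * B + C)


toy_shape = yolo_output_shape(S=3, B=1, C=2)       # the 3x3, one-box, two-class example
large_shape = yolo_output_shape(S=19, B=5, C=80)   # larger grid with 80 classes

total_values = large_shape[0] * large_shape[1] * large_shape[2]
print(toy_shape, large_shape, total_values)
```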
YOLO Overview
● Input: W×H×3
○ W: Width of image in pixels
○ H: Height of image in pixels
○ 3: Number of color channels in RGB
● Convolutional Neural Net: Series of convolutional and pooling layers.
● Output: S×S×(5B+C), a tensor that specifies the bounding box locations and class probabilities (for example, Car: 0.93).
Measuring Performance with IoU
● Intersection over Union (IoU) measures the overlap between two bounding boxes.
● During training, we calculate the IoU between a predicted bounding box and the ground truth (the prelabeled bounding box we aim to match).

Intersection over Union = Area of Intersection / Area of Union

[Figure: a predicted bounding box overlapping the ground truth box, with examples of poor, good, and excellent overlap.]

Source: https://www.pyimagesearch.com/2016/11/07/intersection-over-union-iou-for-object-detection/
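The overlap ratio (area of intersection divided by area of union) can be computed directly from box corners. This is a minimal sketch that assumes boxes are given as (x1, y1, x2, y2) corner coordinates, which is a representation chosen here for convenience and differs from the center/width/height encoding used in y.

```python
def iou(box_a, box_b):
    """Area of intersection divided by area of union for two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlapping rectangle (zero if the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - intersection

    return intersection / union if union > 0 else 0.0


ground_truth = (0, 0, 10, 10)
predicted = (5, 5, 15, 15)
score = iou(ground_truth, predicted)  # intersection 25, union 175
print(round(score, 3))
```

Identical boxes score 1.0 and boxes that do not touch score 0.0, so a higher value means a better match to the ground truth.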
Double Counting Objects (Non-Max Suppression)
● When predicting multiple bounding boxes per cell, sometimes the same object will be detected multiple times (overlapping boxes with the same label).
● Non-max suppression solves multiple counting by removing the box with the lower confidence probability when the IoU between 2 boxes with the same label is above some threshold.

Non-Max Suppression
1. Identify the box with the highest confidence.
2. Calculate the IoU between the highest confidence box and each of the other boxes.
3. Suppress boxes with IoU above a selected threshold (usually 0.3).

[Figure: three overlapping boxes labeled Dog: 0.95, Dog: 0.82, and Dog: 0.41. The two lower-confidence boxes have IoU values of 0.62 and 0.47 with the Dog: 0.95 box, so they are suppressed.]
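The three steps can be sketched as a short greedy loop. The detection dictionaries, box coordinates, and function names below are hypothetical, boxes are assumed to be (x1, y1, x2, y2) corners, and the loop repeats the steps on whatever boxes remain, which is the usual way the procedure is applied when an image contains more than one object.

```python
def iou(box_a, box_b):
    """Area of intersection divided by area of union for two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    intersection = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


def non_max_suppression(detections, iou_threshold=0.3):
    """Greedy non-max suppression applied per label."""
    remaining = sorted(detections, key=lambda d: d["confidence"], reverse=True)
    kept = []
    while remaining:
        best = remaining.pop(0)  # step 1: highest-confidence box still in play
        kept.append(best)
        # Steps 2 and 3: drop same-label boxes that overlap the best box too much.
        remaining = [
            d for d in remaining
            if d["label"] != best["label"]
            or iou(d["box"], best["box"]) <= iou_threshold
        ]
    return kept


# Hypothetical detections: three overlapping dog boxes and one separate cat box.
detections = [
    {"label": "dog", "confidence": 0.82, "box": (10, 10, 110, 110)},
    {"label": "dog", "confidence": 0.95, "box": (0, 0, 100, 100)},
    {"label": "dog", "confidence": 0.41, "box": (20, 0, 120, 100)},
    {"label": "cat", "confidence": 0.90, "box": (300, 300, 400, 400)},
]
kept = non_max_suppression(detections, iou_threshold=0.3)
print([(d["label"], d["confidence"]) for d in kept])
```

Comparing only boxes with the same label keeps two different objects that happen to overlap from suppressing each other.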
Implementing YOLO
Pretrained Models
● Training a YOLO model requires images labeled with bounding boxes. These datasets may take
time to label, so readily available prelabeled images are often used to train models.
● A common dataset for image classification/detection/segmentation is COCO (Common Objects in Context), a database of images with 80 labeled classes.
● Popular pretrained YOLO models with COCO:
○ ImageAI (easy-to-use, lightweight YOLO implementation)
○ Darknet (trained by the author of YOLO)

[Figure: an image passed through a YOLO implementation (CNN) using a model pretrained with COCO. Pineapples and cantaloupes are not in COCO, so they are not recognized.]

COCO Pretrained Labels
Applications built with COCO-trained models will only be able to identify these objects!

person, bicycle, car, motorbike, aeroplane, bus, train, truck, boat, traffic light,
fire hydrant, stop sign, parking meter, bench, bird, cat, dog, horse, sheep, cow,
elephant, bear, zebra, giraffe, backpack, umbrella, handbag, tie, suitcase, frisbee,
skis, snowboard, sports ball, kite, baseball bat, baseball glove, skateboard, surfboard, tennis racket, bottle,
wine glass, cup, fork, knife, spoon, bowl, banana, apple, sandwich, orange,
broccoli, carrot, hot dog, pizza, donut, cake, chair, sofa, pottedplant, bed,
diningtable, toilet, tvmonitor, laptop, mouse, remote, keyboard, cell phone, microwave, oven,
toaster, sink, refrigerator, book, clock, vase, scissors, teddy bear, hair drier, toothbrush

Custom Models
● If your use case only involves objects in COCO, you can use a pretrained model.
● Otherwise, you will need to train your own YOLO model. This requires three steps:
1. Find images of the objects to recognize.
2. Label the bounding boxes.
3. Train your YOLO model. There are 2 options:
a. Implement your own model using OpenCV, TensorFlow/Keras.
b. Use ImageAI's custom training methods.
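One quick way to decide between a pretrained and a custom model is to compare the objects your application needs against the COCO label list. The sketch below is only an illustration: the function name is hypothetical, and the label spellings follow the list on the pretrained-models slide.

```python
COCO_LABELS = {
    "person", "bicycle", "car", "motorbike", "aeroplane", "bus", "train", "truck", "boat", "traffic light",
    "fire hydrant", "stop sign", "parking meter", "bench", "bird", "cat", "dog", "horse", "sheep", "cow",
    "elephant", "bear", "zebra", "giraffe", "backpack", "umbrella", "handbag", "tie", "suitcase", "frisbee",
    "skis", "snowboard", "sports ball", "kite", "baseball bat", "baseball glove", "skateboard", "surfboard",
    "tennis racket", "bottle",
    "wine glass", "cup", "fork", "knife", "spoon", "bowl", "banana", "apple", "sandwich", "orange",
    "broccoli", "carrot", "hot dog", "pizza", "donut", "cake", "chair", "sofa", "pottedplant", "bed",
    "diningtable", "toilet", "tvmonitor", "laptop", "mouse", "remote", "keyboard", "cell phone",
    "microwave", "oven",
    "toaster", "sink", "refrigerator", "book", "clock", "vase", "scissors", "teddy bear", "hair drier",
    "toothbrush",
}


def missing_labels(required):
    """Return the required labels that a COCO-pretrained model cannot identify."""
    return sorted(set(required) - COCO_LABELS)


# Hypothetical use case: a fruit stand that also wants to detect customers.
missing = missing_labels(["person", "banana", "pineapple", "cantaloupe"])
if missing:
    print("Custom training needed for:", missing)
else:
    print("A COCO-pretrained model covers this use case.")
```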
References/Further Reading
● YOLO
○ https://towardsdatascience.com/you-only-look-once-yolo-implementing-yolo-in-less-than-30-lines-of-python-code-97fb9835bfd2
● R-CNN
○ https://towardsdatascience.com/r-cnn-fast-r-cnn-faster-r-cnn-yolo-object-detection-algorithms-36d53571365e
● CNN
○ https://www.coursera.org/lecture/convolutional-neural-networks/optional-region-proposals-aCYZv
● YOLO
○ https://hackernoon.com/understanding-yolo-f5a74bbc7967
○ https://www.analyticsvidhya.com/blog/2018/12/practical-guide-object-detection-yolo-framewor-python/
● Intersection Over Union
○ https://www.pyimagesearch.com/2016/11/07/intersection-over-union-iou-for-object-detection/
