CV - YOLO v1

The document discusses YOLO v1, a one-stage object detection algorithm. YOLO v1 frames object detection as a regression problem, predicting bounding boxes and class probabilities directly from the full image in a single evaluation. It divides each image into a grid and predicts two bounding boxes per grid cell, each with a confidence score that reflects how likely the box is to contain an object and how accurate the box is. The model is trained end-to-end with a sum-squared error loss. Detection metrics such as average precision are used to evaluate YOLO v1 on datasets such as PASCAL VOC.

Uploaded by

TẤN TRÌNH


Computer Vision

YOLO v1

Pham Viet Cuong

Dept. Control Engineering & Automation, FEEE
Ho Chi Minh City University of Technology
Face Detection: Viola - Jones Algorithm
✓ Object detection problem
❖ Single object detection
❖ Multiple object detection
❖ Bounding box(es)
❖ Class(es) of object(s)

Pham Viet Cuong - Dept. Control Eng. & Automation, FEEE, HCMUT 2
✓ Haar-like features
❖ Window size: 24x24
❖ Type
❖ Position
❖ Size
❖ ~ 160K features
✓ Features usefulness?
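
As a rough check on the "~160K" figure, the sketch below enumerates every position and size of five commonly used Haar-like feature shapes in a 24x24 window. The shape list and the name `count_haar_features` are assumptions for illustration; the slide does not say which feature types are counted.

```python
def count_haar_features(window=24, shapes=((2, 1), (1, 2), (3, 1), (1, 3), (2, 2))):
    """Count all placements of Haar-like features inside a square window.

    Each shape is a (width, height) base pattern in pixels. It may be scaled
    by integer factors independently in x and y, then placed at any position
    that keeps it fully inside the window.
    """
    total = 0
    for base_w, base_h in shapes:
        for w in range(base_w, window + 1, base_w):      # scaled widths
            for h in range(base_h, window + 1, base_h):  # scaled heights
                total += (window - w + 1) * (window - h + 1)
    return total


print(count_haar_features())  # 162336 with the five shapes assumed above
```

With so many candidate features per window, most cannot be evaluated at run time, which is what motivates the usefulness question above.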

✓ Feature selection
❖ Positive & negative sets: 10K examples/set
❖ Weak classifiers: one Haar-like feature evaluated on the 24x24 window, then thresholded
❖ Objective: minimize the number of misclassified examples
❖ ~ 6K features selected
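
A weak classifier of this kind can be sketched as a decision stump on one feature value. The example below is a simplified illustration: it counts unweighted mistakes, whereas boosting reweights the examples at each round. The function name `best_stump` and the toy data are hypothetical.

```python
def best_stump(values, labels):
    """Pick threshold and polarity for one feature to minimize mistakes.

    values: feature value per example; labels: 1 (face) or 0 (non-face).
    The stump predicts 1 when polarity * value < polarity * threshold.
    Returns (errors, threshold, polarity).
    """
    best = None
    # Candidate thresholds: each observed value, plus one above the maximum.
    candidates = sorted(set(values)) + [max(values) + 1]
    for threshold in candidates:
        for polarity in (1, -1):
            errors = sum(
                int(polarity * v < polarity * threshold) != y
                for v, y in zip(values, labels)
            )
            if best is None or errors < best[0]:
                best = (errors, threshold, polarity)
    return best


# Toy data (made up): faces tend to have small feature values here.
values = [0.1, 0.3, 0.4, 0.8, 0.9, 1.2]
labels = [1, 1, 1, 0, 0, 0]
print(best_stump(values, labels))  # (0, 0.8, 1): threshold 0.8 separates the toy data
```

Feature selection then amounts to running this search for every candidate feature and keeping the ones with the lowest error.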
✓ Trade-off
❖ More features (higher detection rate, lower false positive rate)
❖ More computational complexity
✓ Cascade structure
❖ 6061 features
❖ 38 stages
❖ First 5 layers: 1, 10, 25, 25, 50 features
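❖ Average: 10 feature evaluations per window
❖ Reported as roughly 15 times faster than the Rowley-Baluja-Kanade detector and about 600 times faster than the Schneiderman-Kanade detector
❖ Negative examples for later stages: false positives of the earlier stages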
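
The early-rejection idea behind the cascade can be sketched as follows. This is a simplified illustration, not the trained detector: each stage here is a weighted sum of raw feature values compared with a stage threshold, and the names `run_cascade`, `stages`, and the feature ids are hypothetical.

```python
def run_cascade(window_features, stages):
    """Evaluate one window against cascade stages with early rejection.

    window_features: dict mapping feature id -> value for this window.
    stages: list of (feature_weights, stage_threshold), where feature_weights
            maps feature id -> weight. A stage passes when the weighted sum
            of its features reaches the threshold.
    Returns (is_face, number_of_features_evaluated).
    """
    evaluated = 0
    for feature_weights, stage_threshold in stages:
        score = 0.0
        for feature_id, weight in feature_weights.items():
            score += weight * window_features[feature_id]
            evaluated += 1
        if score < stage_threshold:
            return False, evaluated  # rejected early; later stages are skipped
    return True, evaluated


# Two toy stages with 1 and 2 features (made-up weights and thresholds).
stages = [({"f0": 1.0}, 0.5), ({"f1": 1.0, "f2": 1.0}, 1.0)]
background = {"f0": 0.1, "f1": 0.9, "f2": 0.9}
face_like = {"f0": 0.8, "f1": 0.7, "f2": 0.6}
print(run_cascade(background, stages))  # (False, 1): rejected after one feature
print(run_cascade(face_like, stages))   # (True, 3): passed every stage
```

Because most windows are background and fail in the first stages, the average number of feature evaluations per window stays far below the total feature count.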
✓ How to detect face(s) in an image?
❖ Sliding window: 24x24 (384x288 image)
❖ Binary classifier: Face / Non-face
❖ Window scaling

❖ AlexNet?
▪ Binary classifier → AlexNet
▪ Multiple object detection
❖ More efficient?
▪ Region proposal
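
To see why a more efficient scheme is attractive, the sketch below counts how many 24x24 sliding windows a 384x288 image produces. The stride of 1 pixel and the scale factor of 1.25 are assumptions for illustration, as are the function names.

```python
def windows_at_size(img_w, img_h, size, step=1):
    """Number of size x size windows that fit in the image at a given stride."""
    if size > img_w or size > img_h:
        return 0
    return ((img_w - size) // step + 1) * ((img_h - size) // step + 1)


def count_all_windows(img_w=384, img_h=288, base=24, scale=1.25, step=1):
    """Sum window counts over a geometric series of window sizes."""
    total, size = 0, float(base)
    while round(size) <= min(img_w, img_h):
        total += windows_at_size(img_w, img_h, round(size), step)
        size *= scale
    return total


print(windows_at_size(384, 288, 24))  # 95665 windows at the base scale alone
print(count_all_windows())            # larger still once scaled windows are added
```

Running a heavy classifier such as a CNN on every one of these windows would be costly, which is the motivation for proposing a smaller set of candidate regions first.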

YOLO v1
✓ Two-stage object detection (R-CNN, Fast R-CNN, Faster R-CNN)
Image → Region Proposal → Object Classification
✓ One-stage object detection
Image → CNN
❖ YOLO – You Only Look Once

✓ Structure

✓ Grid: S × S cells, S = 7
✓ Bounding box: x, y, w, h
✓ Confidence score

✓ # outputs:
❖ $(5B + C)\,S^2$, with B = 2, C = 20, S = 7
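
Plugging the slide's values into the formula gives the size of the output layer. A minimal sketch (the function name `yolo_output_size` is hypothetical):

```python
def yolo_output_size(S=7, B=2, C=20):
    """Total outputs: each of the S*S cells predicts B boxes with
    5 numbers each (x, y, w, h, confidence) plus C class probabilities."""
    per_cell = 5 * B + C
    return per_cell * S * S


print(5 * 2 + 20)          # 30 values per grid cell
print(yolo_output_size())  # 1470, i.e. a 7 x 7 x 30 output tensor
```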

✓ Linear regression problem

✓ Confidence score
❖ How likely the bounding box contains an object?
❖ How accurate is the bounding box (location and size)?
Confidence score: $C = \Pr(\text{Object}) \cdot \text{IoU}$
IoU: Intersection over Union between the predicted box and the ground-truth box
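
IoU can be computed as in the sketch below. It assumes boxes given by their corners (x_min, y_min, x_max, y_max) rather than the (x, y, w, h) form used for the network outputs; the function name `iou` is an illustrative choice.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x_min, y_min, x_max, y_max)."""
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])
    # Clamp at zero so non-overlapping boxes give an empty intersection.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


predicted = (0, 0, 2, 2)
ground_truth = (1, 1, 3, 3)
pr_object = 1.0  # training target when an object is present in the cell
print(pr_object * iou(predicted, ground_truth))  # 1/7: overlap 1, union 7
```

A box that matches the ground truth exactly scores 1, and a box with no overlap scores 0, so the confidence target rewards both detecting the object and localizing it well.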

✓ Conditional class probability
$p_i(c) = \Pr(\text{Class}_i \mid \text{Object})$
✓ Test:

Class-specific confidence scores for each box: probability of that class appearing
in the box and how well the predicted box fits the object.
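
In the original YOLO paper the class-specific score is the product of the conditional class probability and the box confidence, Pr(Class_i | Object) · Pr(Object) · IoU. The sketch below computes it for one grid cell; the function name and the numbers are made up for illustration.

```python
def class_specific_scores(class_probs, box_confidences):
    """Return scores[b][c] = Pr(Class_c | Object) * confidence of box b.

    class_probs: conditional class probabilities shared by the grid cell.
    box_confidences: one confidence (Pr(Object) * IoU) per predicted box.
    """
    return [[p * conf for p in class_probs] for conf in box_confidences]


class_probs = [0.7, 0.2, 0.1]   # made-up Pr(Class | Object) for one cell
box_confidences = [0.9, 0.3]    # made-up confidences for its B = 2 boxes
scores = class_specific_scores(class_probs, box_confidences)
for box_index, row in enumerate(scores):
    best_class = max(range(len(row)), key=lambda c: row[c])
    print(box_index, best_class, round(row[best_class], 3))
```

Both boxes of a cell share the same class probabilities, so they differ only through their confidence.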
✓ Activation function
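
For reference, the original YOLO paper describes a leaky rectified linear activation with slope 0.1 for negative inputs on all layers except the final one, which is linear. A minimal sketch, with the function name `leaky_relu` chosen for illustration:

```python
def leaky_relu(x, negative_slope=0.1):
    """Leaky ReLU: identity for positive inputs, small slope otherwise."""
    return x if x > 0 else negative_slope * x


print(leaky_relu(2.0))   # 2.0
print(leaky_relu(-2.0))  # -0.2
```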

✓ Loss function
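
For reference, the multi-part sum-squared error loss from the original YOLO paper can be written as below. The symbol $\mathcal{L}$ is introduced here for convenience; the remaining notation follows the paper, with $i$ indexing grid cells and $j$ indexing the boxes of a cell, and it may differ from the notation on the slide.

```latex
\begin{aligned}
\mathcal{L} ={}& \lambda_{\text{coord}} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}}
  \left[ (x_i - \hat{x}_i)^2 + (y_i - \hat{y}_i)^2 \right] \\
&+ \lambda_{\text{coord}} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}}
  \left[ \left(\sqrt{w_i} - \sqrt{\hat{w}_i}\right)^2 + \left(\sqrt{h_i} - \sqrt{\hat{h}_i}\right)^2 \right] \\
&+ \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}} \left( C_i - \hat{C}_i \right)^2 \\
&+ \lambda_{\text{noobj}} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{noobj}} \left( C_i - \hat{C}_i \right)^2 \\
&+ \sum_{i=0}^{S^2} \mathbb{1}_{i}^{\text{obj}} \sum_{c \in \text{classes}} \left( p_i(c) - \hat{p}_i(c) \right)^2
\end{aligned}
```

Here $\mathbb{1}_{ij}^{\text{obj}}$ is 1 when box $j$ of cell $i$ is responsible for an object, $\mathbb{1}_{i}^{\text{obj}}$ is 1 when an object appears in cell $i$, and the paper sets $\lambda_{\text{coord}} = 5$ and $\lambda_{\text{noobj}} = 0.5$. The square roots on width and height make the same absolute error count for more in small boxes than in large ones.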

✓ Training

✓ Limitations
❖ Spatial constraints
▪ Two bounding boxes, one class per grid cell
▪ Struggles with small objects in groups, e.g. flocks of birds
❖ Relatively coarse features due to multiple downsampling layers
❖ Main error: incorrect localization

✓ Results – PASCAL VOC 2007

✓ Results – PASCAL VOC 2012

✓ Comparison

✓ Evaluation
❖ Classification problem?
▪ Top-1 error rate
▪ Top-5 error rate
❖ Object detection problem?

❖ Confusion matrix
❖ Recall (detection rate, true positive rate, sensitivity)
$$\text{Recall} = DR = \frac{TP}{TP + FN}$$
❖ Precision
$$\text{Precision} = \frac{TP}{TP + FP}$$
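
The two ratios can be computed directly from confusion-matrix counts. A minimal sketch, with a hypothetical function name and made-up counts:

```python
def precision_recall(tp, fp, fn):
    """Precision and recall from true positive, false positive,
    and false negative counts (0.0 when a denominator is zero)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Made-up example: 8 correct detections, 2 false alarms, 2 missed objects.
print(precision_recall(tp=8, fp=2, fn=2))  # (0.8, 0.8)
```

For detection, a prediction usually counts as a true positive only when its IoU with a ground-truth box exceeds a chosen threshold.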

✓ Evaluation
❖ Precision - Recall curve
❖ Interpolated Precision - Recall curve
❖ AP
❖ AP50, AP75
❖ mAP
✓ Dataset
❖ PASCAL VOC
❖ COCO
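
The interpolated precision-recall curve, AP, and mAP listed above can be sketched as follows. This version assumes all-point interpolation (area under the interpolated curve); the PASCAL VOC 2007 benchmark used an 11-point variant instead. The function names and the toy detections are hypothetical, and `num_ground_truth` is assumed to be positive.

```python
def average_precision(scores, is_true_positive, num_ground_truth):
    """AP as the area under the interpolated precision-recall curve.

    scores: confidence of each detection for one class.
    is_true_positive: True where the detection matched a ground-truth box
        at the chosen IoU threshold (0.5 for AP50, 0.75 for AP75).
    num_ground_truth: number of ground-truth boxes of that class (> 0).
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    tp = fp = 0
    recalls, precisions = [], []
    for i in order:
        if is_true_positive[i]:
            tp += 1
        else:
            fp += 1
        recalls.append(tp / num_ground_truth)
        precisions.append(tp / (tp + fp))
    # Interpolation: precision at recall r is the max precision at recall >= r.
    for k in range(len(precisions) - 2, -1, -1):
        precisions[k] = max(precisions[k], precisions[k + 1])
    # Sum rectangle areas wherever recall increases.
    ap, prev_recall = 0.0, 0.0
    for r, p in zip(recalls, precisions):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def mean_average_precision(ap_per_class):
    """mAP: the mean of the per-class AP values."""
    return sum(ap_per_class) / len(ap_per_class)


# Toy example: 4 detections, 4 ground-truth objects, one false positive.
ap = average_precision([0.9, 0.8, 0.7, 0.6], [True, False, True, True], 4)
print(ap)                                  # 0.625
print(mean_average_precision([ap, 0.75]))  # 0.6875
```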
