0% found this document useful (0 votes)

7 views

RO47002 - Lecture 2A - Case Study Visual Object Detection

This document discusses visual object detection through machine learning. It describes taking an image as input and detecting objects like pedestrians through outputting bounding boxes. The task is formulated as a supervised classification problem over region proposals to determine if each proposal contains the target object or not.

Uploaded by

Haia Al Sharif

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

RO47002 - Lecture 2A - Case Study Visual Object Detection

Uploaded by

Haia Al Sharif

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

1

Case study:
Visual object detection
Course: RO47002
Lecturer: Julian Kooij
2

Case study: Visual Object Detection

• Input: a camera image
• Task: detect all instances of a
particular “object”,
e.g. “Pedestrian”.
• Output: bounding boxes around
all detected instances
• Measures of success:
– Don’t miss any pedestrians
– Don’t report any pedestrians where there are none
– The bounding box should tightly fit around the pedestrian
3

Case study: Visual Object Detection

Q: Should we address this
with machine learning?

A: Yes, uncontrolled outdoor conditions, large variance in object

appearance, size, shape, colour, background
4

Case study: Visual Object Detection

Q: What category of machine
learning should we use?
• Supervised?
• Unsupervised?
• Reinforcement learning?

A: We want to capture a particular type of concept, “Pedestrian”. Best done

by supervised learning, i.e. giving labelled examples.
Not unsupervised (no clustering). Not RL (not learning in a control loop).
5

Case study: Visual Object Detection

Q: What type of machine
learning task is this?
• Classification?
• Regression?

• Measures of success:
– Don’t miss any pedestrians
Classification
– Don’t report any pedestrians where there are none
– The bounding box should tightly fit around the pedestrian Regression
6

Case study: Visual Object Detection

• We will here only focus on
classification problem
• Fixed size bounding boxes

• How to formulate this task

concretely as a function,
with fixed size domain, and
fixed size output?
7

Visual Object Detection through Classification

Classify region proposals:

Is this a pedestrian?
Yes / No
8

Visual Object Detection through Classification

Output the region proposals that
were classified positively

Is this a pedestrian?
Yes / No
9

Visual Object Detection through Classification

☺
True positive:
Pedestrian found


False positive:
Incorrectly classified
as pedestrian


False negative:
Incorrectly classified
as not pedestrian
10
x1 93

Images as High-Dimensional Vectors x2

x3
65
64
x4 100
x5 135
• I is a gray-scale image patch of 160 x 80 pixels x6 176
x7 •
we consider I is a vector in 12800-D space !
•
x8
•
x9
92
• Putting this in perspective I = x10 = 63
160 x11 61
256 (12800) ≈ 1×10 30720 possible images x12
pixels 108
“number of atoms in the universe” ≈ 4×1080 x13 148
x14 178

• •
• But, almost all of the possible images look like •
•
‘noise’, only relatively small amount can occur • •

in our data! 184

x12800
80 pixels
11

Visualization – „Real“ Pedestrian Dataset

• „Real“ Pedestrian Dataset
collected at Daimler from a
moving vehicle

• Visualization by projecting
images to a 3D dimensional
space using Principal
Component Analysis (PCA)

• Note: natural images are

concentrated in regions of the
input space, not uniformly
scattered
12

Image Features x1
Function x = f(I) extracts of features on image I x2
and represents its content by feature vector x. …
xM
x1 x1
x2 x2 Feature
vector
…

captures the

…
image content

xM xM
dense features sparse features
13

Classifying Objects
1. Feature extraction: turn image region into a D-dimensional feature vector x
2. Classification: apply classifier h, test if h(x) is above a threshold 𝜏

ℎ x ≥𝜏
classify as
“Pedestrian”
x2

ℎ x <𝜏
classify as
“Not Pedestrian”

x1
14

Classifying Objects
How to obtain useful classifier h(x) ? Training data
• Use representative training data to optimize its decision boundary
positive class:
“Pedestrian”

? negative class:
“Not pedestrian”
x2

Test data
sample to be
classified
decision boundary

x1
15

Evaluating a Linear classifier

• Classify x : is the label y = -1 or y = +1 ?
+1 • On what side of decision boundry is x ?
x
w -1
• 𝒘⊤ x is positive if α < 90°
α
𝐚⊤ ⋅ 𝐛 = 𝐚 𝐛 cos 𝛼
16

Evaluating a Linear classifier

• Classify x : is the label y = -1 or y = +1 ?
+1 • On what side of decision boundry is x ?

w -1
• 𝒘⊤ x is positive if α < 90°
α x • 𝒘⊤ x is negative if α > 90°

𝐚⊤ ⋅ 𝐛 = 𝐚 𝐛 cos 𝛼

• Classification rule: 𝑦ො = sign(𝒘⊤ x)

Evaluating a Linear classifier

• Classify x : is the label y = -1 or y = +1 ?
+1 • On what side of decision boundry is x ?

-1
• 𝒘⊤ x is positive if α < 90°
w
x • 𝒘⊤ x is negative if α > 90°

𝐚⊤ ⋅ 𝐛 = 𝐚 𝐛 cos 𝛼
b'
• Classification rule: 𝑦ො = sign(𝒘⊤ x + b)
• Easy to compute, only dot product!
• Parameter wi weights contribution of xi
18

Problem 1: dimensionality
• Amount of training data needed grows exponentially with dimensionality
• 1D case, 2 samples: few decision boundries possible

2D case, 2 samples: many decision boundries possible

Even worse in spaces with thousands of dimensions!

Problem 2: classes not seperable

?
x2

x1
20

Solution 1: Different classifiers

More complex classification boundary

Problems
• More complicated to train
• More parameters to optimize
• More expensive to evaluate
x2

? Remember:
many proposals to evaluate!

Latency could become a problem

x1
21

Solution 2: Use a different representation

Extract better features

x’ = f’(I)
x22

Apply transformation f
x‘

x’ = f(x)
?

x11
x'
22

What are good features?

What are good features to use?

Intuition
• Captures relevant object information: shape, color distribution, …
• Not affected by irrelevant photometric and geometric changes, noise
• Encodes image content locally (robust to partial occlusion)
• Is efficient: feature vector dimensionality << number of image pixels

What is “relevant” depends on application 22

What are good features?

Example

When distinguishing 60 and 80 speed

limit signs, it makes sense to consider
grayscale and intensity-normalized

This is because color and intensity do

not facilitate differentiation, but
expand the feature space significantly.
24

Summary
• We have seen how ML can be applied to a task in
Computer Vision
– Classify image patches, not images
– 1 Image does not equal 1 Classification problem
• Not all CV tasks require ML necessarily, but often
required for challenging real-world conditions
• Example of linear classification
• Role of image feature extraction

Calculus - Student Solutions Manual (8th Edition) PDF
No ratings yet
Calculus - Student Solutions Manual (8th Edition) PDF
1,079 pages
4
No ratings yet
4
31 pages
IT5409 Ch7 Part1 Object Detection v2
No ratings yet
IT5409 Ch7 Part1 Object Detection v2
97 pages
Lecture4 GAN b
No ratings yet
Lecture4 GAN b
38 pages
Pedestrian Detection - Kristina Pickl
No ratings yet
Pedestrian Detection - Kristina Pickl
45 pages
Face Recognition Using Facenet
No ratings yet
Face Recognition Using Facenet
46 pages
04 - Polygon Rasterization
No ratings yet
04 - Polygon Rasterization
35 pages
computer_vision_2_feature_extraction_2_students
No ratings yet
computer_vision_2_feature_extraction_2_students
70 pages
Pattern Recognition
No ratings yet
Pattern Recognition
52 pages
What Is Computer Vision?
No ratings yet
What Is Computer Vision?
125 pages
Unit 3-Non CNN approaches to object recognition
No ratings yet
Unit 3-Non CNN approaches to object recognition
26 pages
Features
No ratings yet
Features
60 pages
Learning in Artificial Intelligence
No ratings yet
Learning in Artificial Intelligence
6 pages
Feature Matching: "What Stuff in The Left Image Matches With Stuff On The Right?"
No ratings yet
Feature Matching: "What Stuff in The Left Image Matches With Stuff On The Right?"
62 pages
1 Introduction
No ratings yet
1 Introduction
27 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
Image Features and Categorization: Computer Vision Jia-Bin Huang, Virginia Tech
No ratings yet
Image Features and Categorization: Computer Vision Jia-Bin Huang, Virginia Tech
70 pages
Lect3 PDF
No ratings yet
Lect3 PDF
47 pages
1 ObjectDetection
No ratings yet
1 ObjectDetection
46 pages
Bag of Feature
No ratings yet
Bag of Feature
75 pages
Unit II - Chapter 4 - Feature Detection
No ratings yet
Unit II - Chapter 4 - Feature Detection
56 pages
L02 ImagingPixelProc
No ratings yet
L02 ImagingPixelProc
53 pages
FALLSEM2024-25 BCSE334L TH VL2024250101768 2024-10-04 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE334L TH VL2024250101768 2024-10-04 Reference-Material-I
69 pages
Week5_Computer_Vision
No ratings yet
Week5_Computer_Vision
58 pages
Few-shot, Zero-shot learning - CONTENT BEYOND SYLLABUS
No ratings yet
Few-shot, Zero-shot learning - CONTENT BEYOND SYLLABUS
36 pages
unit 1
No ratings yet
unit 1
179 pages
Introduction To Object Recognition: Slides Adapted From Fei-Fei Li, Rob Fergus, Antonio Torralba, and Others
No ratings yet
Introduction To Object Recognition: Slides Adapted From Fei-Fei Li, Rob Fergus, Antonio Torralba, and Others
60 pages
What Is Computer Vision?
No ratings yet
What Is Computer Vision?
120 pages
Hidden Surface Removal - Computer Graphics
No ratings yet
Hidden Surface Removal - Computer Graphics
10 pages
RPCN Imaging 3 - depth accuracy
No ratings yet
RPCN Imaging 3 - depth accuracy
14 pages
Computer_vision_part1
No ratings yet
Computer_vision_part1
96 pages
3-Binary image analysis I
No ratings yet
3-Binary image analysis I
19 pages
05 - Spatial Filtering
No ratings yet
05 - Spatial Filtering
70 pages
Forming the Image and Understanding Lense
No ratings yet
Forming the Image and Understanding Lense
52 pages
CSE4261 Lecture-12
No ratings yet
CSE4261 Lecture-12
24 pages
13 PracticalMachineLearning
100% (1)
13 PracticalMachineLearning
84 pages
CVlecture 4
No ratings yet
CVlecture 4
62 pages
1.1. Introduction To DIP
No ratings yet
1.1. Introduction To DIP
61 pages
Object Detection With Deformable Part-Based Models: Many Slides Based On
No ratings yet
Object Detection With Deformable Part-Based Models: Many Slides Based On
32 pages
Images, Neural Networks, CNNs
No ratings yet
Images, Neural Networks, CNNs
26 pages
Lecture 03 Calibration
No ratings yet
Lecture 03 Calibration
44 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
56 pages
09object Detection I
No ratings yet
09object Detection I
49 pages
Chapter 1
No ratings yet
Chapter 1
58 pages
002-Supervised Learning Setup 01 W2L2
No ratings yet
002-Supervised Learning Setup 01 W2L2
21 pages
Lec 02 Cam Models
No ratings yet
Lec 02 Cam Models
44 pages
lecture5-1
No ratings yet
lecture5-1
40 pages
Main V1 Habiba Seminar
No ratings yet
Main V1 Habiba Seminar
41 pages
Vazquez ImageProcessFundamentals
No ratings yet
Vazquez ImageProcessFundamentals
83 pages
06 Features
No ratings yet
06 Features
94 pages
2024-SCU-ML-2-1-SVM
No ratings yet
2024-SCU-ML-2-1-SVM
36 pages
NN 09
No ratings yet
NN 09
34 pages
Exploring Augmented Reality With Python
No ratings yet
Exploring Augmented Reality With Python
23 pages
Object Recognition
No ratings yet
Object Recognition
30 pages
CSE 473 Pattern Recognition: Instructor: Dr. Md. Monirul Islam
100% (1)
CSE 473 Pattern Recognition: Instructor: Dr. Md. Monirul Islam
57 pages
4. Image Sampling and Quantization
No ratings yet
4. Image Sampling and Quantization
16 pages
Pattern Image Rec
No ratings yet
Pattern Image Rec
45 pages
Chapter03 Tracking
No ratings yet
Chapter03 Tracking
44 pages
Ch3 Filters v3 Part1
No ratings yet
Ch3 Filters v3 Part1
60 pages
Subtraction
From Everand
Subtraction
Sally Fisk
No ratings yet
Casting Shadows: Creating Visual Dimension in Your Quilts
From Everand
Casting Shadows: Creating Visual Dimension in Your Quilts
Colleen Wise
3.5/5 (5)
RO47002 - Course Introduction
No ratings yet
RO47002 - Course Introduction
48 pages
RO47002 - Lecture 2B - ML Formalized - Part2
No ratings yet
RO47002 - Lecture 2B - ML Formalized - Part2
8 pages
RO47002 - Lecture 2C - Hyperparameters and Cross-Validation
No ratings yet
RO47002 - Lecture 2C - Hyperparameters and Cross-Validation
10 pages
RL Project - Deep Q-Network Agent Presentation
No ratings yet
RL Project - Deep Q-Network Agent Presentation
15 pages
A Survey of Spiking Neural Network Accelerator
No ratings yet
A Survey of Spiking Neural Network Accelerator
15 pages
ICT5357 Assessment Brief T3 2024
No ratings yet
ICT5357 Assessment Brief T3 2024
12 pages
What Is Artificial Intelligence
No ratings yet
What Is Artificial Intelligence
2 pages
Computational Intelligence
No ratings yet
Computational Intelligence
3 pages
Capstone Report Final 08
No ratings yet
Capstone Report Final 08
44 pages
2-Candidate Elimination Algorithm
No ratings yet
2-Candidate Elimination Algorithm
6 pages
PGP in Data Science and AI With Fellowship
No ratings yet
PGP in Data Science and AI With Fellowship
14 pages
YOLO-based Threat Object Detection in X-Ray
No ratings yet
YOLO-based Threat Object Detection in X-Ray
5 pages
Neural Networks
No ratings yet
Neural Networks
5 pages
Octave MLP Neural Networks
No ratings yet
Octave MLP Neural Networks
25 pages
2024_Generalizing VT for Face Anti-Spoofing
No ratings yet
2024_Generalizing VT for Face Anti-Spoofing
14 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
Benny GOEDBLOED - AI in A Prison environment-CDPPS2019 PDF
No ratings yet
Benny GOEDBLOED - AI in A Prison environment-CDPPS2019 PDF
19 pages
Dhanush - Diabetes Report
No ratings yet
Dhanush - Diabetes Report
4 pages
CNN Course V1.3
No ratings yet
CNN Course V1.3
19 pages
Class Note
No ratings yet
Class Note
3 pages
Machine Learning Roadmap
No ratings yet
Machine Learning Roadmap
31 pages
2018-Gait Challenges
No ratings yet
2018-Gait Challenges
11 pages
Artificial Neural Network - Hopfield Networks - Tutorialspoint
No ratings yet
Artificial Neural Network - Hopfield Networks - Tutorialspoint
3 pages
Aiml Online Brochure
No ratings yet
Aiml Online Brochure
16 pages
Artificial Intelligence With Lab: Report: Machine Learning
No ratings yet
Artificial Intelligence With Lab: Report: Machine Learning
6 pages
Vasu Gupta, Sharan Srinivasan, Sneha Kudli, Prediction and Classification of Cardiac Arrhythmia
No ratings yet
Vasu Gupta, Sharan Srinivasan, Sneha Kudli, Prediction and Classification of Cardiac Arrhythmia
5 pages
CLIQUE Algorithm Grid-Based Subspace Clustering
No ratings yet
CLIQUE Algorithm Grid-Based Subspace Clustering
10 pages
Introduction To DL With TensorFlow
No ratings yet
Introduction To DL With TensorFlow
55 pages
20 - Efficient - Pneumonia - Detection - Using - Vision - Transfo
No ratings yet
20 - Efficient - Pneumonia - Detection - Using - Vision - Transfo
18 pages
Machine Learning Classification Techniques For Heart Disease Prediction: A Review
No ratings yet
Machine Learning Classification Techniques For Heart Disease Prediction: A Review
8 pages
Shape-Scale Co-Awareness Network for 3D Brain Tumor Segmentation有道翻译
No ratings yet
Shape-Scale Co-Awareness Network for 3D Brain Tumor Segmentation有道翻译
14 pages
Arti Cial Intelligence & Deep Learning: Model Institute of Engineering & Technology (Autonomous)
No ratings yet
Arti Cial Intelligence & Deep Learning: Model Institute of Engineering & Technology (Autonomous)
4 pages
Linear Regression With One Variable
No ratings yet
Linear Regression With One Variable
49 pages

RO47002 - Lecture 2A - Case Study Visual Object Detection

Uploaded by

RO47002 - Lecture 2A - Case Study Visual Object Detection

Uploaded by

1

Case study: Visual Object Detection

Case study: Visual Object Detection

A: Yes, uncontrolled outdoor conditions, large variance in object

Case study: Visual Object Detection

A: We want to capture a particular type of concept, “Pedestrian”. Best done

Case study: Visual Object Detection

Case study: Visual Object Detection

• How to formulate this task

Visual Object Detection through Classification

Visual Object Detection through Classification

Visual Object Detection through Classification

Images as High-Dimensional Vectors x2

in our data! 184

Visualization – „Real“ Pedestrian Dataset

• Note: natural images are

Evaluating a Linear classifier

Evaluating a Linear classifier

• Classification rule: 𝑦ො = sign(𝒘⊤ x)

Evaluating a Linear classifier

2D case, 2 samples: many decision boundries possible

Even worse in spaces with thousands of dimensions!

Problem 2: classes not seperable

Solution 1: Different classifiers

More complex classification boundary

Latency could become a problem

Solution 2: Use a different representation

Extract better features

What are good features?

What is “relevant” depends on application 22

What are good features?

When distinguishing 60 and 80 speed

This is because color and intensity do

You might also like