0% found this document useful (0 votes)

2 views

Chapter+1+Introduction+Part+1

The document outlines the course ECE467 on Image Processing and Robot Vision, taught by Dr. Saqer S. Alja’afreh, emphasizing the fundamentals of computer vision and image processing. Key topics include image formation, edge detection, object detection, and the differences between computer vision and human vision. The course requires programming knowledge and emphasizes self-discipline, with strict policies on attendance, participation, and late submissions.

Uploaded by

nasser.shraifi

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Chapter+1+Introduction+Part+1

Uploaded by

nasser.shraifi

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 72

Introduction

ECE467
Image Processing & Robot Vision

Dr. Saqer S. Alja’afreh

Textbook Information

• Textbook:
Rick Szeliski, Computer Vision: Algorithms and Applications online at:
http://szeliski.org/Book/
Course Information Chapter 0

Course Information
• This course is an introduction to those areas of Artificial Intelligence that
deal with fundamental issues and techniques of computer vision and
image processing.

• The emphasis is on physical, mathematical, and information-

processing aspects of the vision.

• Topics to be covered include image formation, edge detection and

segmentation, convolution, image enhancement techniques, extraction
of features such as color, texture, and shape, object detection, 3-D vision,
and computer vision system architectures and applications.

• The material is based on undergraduate-level texts augmented with

research papers, as appropriate.
Course Information Chapter 0

• The course will move fast

• Self-discipline is important
• The emphasis of the course is to develop practical skills for solving
Computer Vision and Image Processing problems
• Fair evaluations: undergraduate and graduate students will be
scored separately
• Academic Integrity (AI) will be taken into consideration, please
refer to the course syllabus. (Work on homework and projects
independently)
Course Information Chapter 0

Preferable Skills

• Introduction to Programming Languages

• Knowledge of Linear Algebra
• Programming in Python Preferred
Course Information Chapter 0

Course Requirements
• Class attendance and participation is expected
• You are responsible for ALL materials presented in class and
assigned to read
• Quizzes will be given during class time only.
• Regular deliverables on the project will be graded during the
course
Course Information Chapter 0

Late Submission Policy

• Completed homework and project deliverables are to be
submitted by their deadline (11:59pm).
• For homework and projects, every day in delay will result in a
10% of deduction of its score.
Introduction Chapter 1

What is Computer Vision

• Computer vision is a field of computer science

• works on enabling computers to see,
• identify and process images in the same way that
human vision does, and
• then provide appropriate output.
• It is like imparting human intelligence and instincts
about vision to a computer.

23
Introduction Chapter 1

Every image tells a story

ThTS
• Goal of computer vision: perceive
the “story” behind the picture
definition
IS
a 8 - a

&
of
• Compute properties of the world the "
actual
• 3D shape definitiOh
• Names of people or objects of
Comp
• What happened? Vestor
.
Introduction Chapter 1

What is Computer Vision

• Computer vision is a field of computer science

23
Introduction Chapter 1

What is Computer Vision

• Automatic understanding of images and video

1. Computing properties of the 3D world from
visual data (measurement)
2. Algorithms and representations to allow a
machine to recognize objects, people, scenes,
and activities. (perception and interpretation)

23
Introduction Chapter 1
Visual Perception
• Definition: Process of acquiring knowledge about
environmental objects and events by extracting information
from the light they emit or reflect [Palmer, 2012].

Cognitive Acquisition of
Vision
Activity knowledge

Perception is analogous to taking a picture!

(credit: Palmer, 2012)
23
Introduction Chapter 1

Computer Vision vs Human Vision

24
Introduction Chapter 1
Computer Vision vs Human Vision
Interpreting
Sensing device device
Interpretations
Picture
Man
Thrash
Bulb
Light
…

24
Introduction Chapter 1
Can computers match human perception?
• Yes and no (mainly no)
• computers can be better at “easy”
things
• humans are better at “hard” things

• But huge progress

• Accelerating in the last five years due
to deep learning
• What is considered “hard” keeps
changing
Introduction Chapter 1
Human perception has its shortcomings

What can you see

in this picture?
Introduction Chapter 1
Human perception has its shortcomings

What can you see

in this picture?

Credit: Thompson, Basic Vision, Oxford Press, 2012.

Introduction Chapter 1
Human perception has its shortcomings

Copyright A.Kitaoka 2003

Introduction Chapter 1
But humans can tell a lot about a scene from a little
information…

Source: “80 million tiny images” by Torralba, et al.

Introduction Chapter 1

Related disciplines
Artificial
intelligence Machine
Graphics learning
Computer
Image vision Cognitive
processing science
Algorithms

25
Introduction Chapter 1
Chapter 0 Week 1

-
Al tool applied
structured (labeled
S

for
Jata .
Introduction Chapter 1

Bothfools
Suitable for

unstructured Jata
Introduction Chapter 1
Image Segmentation extraction
Manual Feature =

& L for
Structured
equires ma data

Features distinguishing
properties

annotation
one
byimage

Image Processing Important steps

computer vision
initial step : Image processing
Introduction Chapter 1
fre
high precision J useful
high accuracy structures
Jata

humans only
needed for usedfor unstructured data:
preprocessing
& implementation cunsupervised
learning]

neural O
network
Introduction Chapter 1
Introduction Chapter 1
Introduction Chapter 1
Image Processing vs. Computer Vision
• Image Processing
• Research area within electrical engineering/signal processing
• Focus on syntax, low level features

image image
(Denoising/inpainting)
• Computer Vision Coutcome of computer vision is story
a

• Research area within computer science/artificial intelligence

• Focus on semantics, symbolic or geometric descriptions
31
Faces, People
Chairs, etc.
(Recognition/Detection)
image
Introduction Chapter 1
21 Spatial signals
What is a (digital) Image? An image is an array of
numbers (pixels).

What humans see What Computer see

Introduction Chapter 1
What is a (digital) Image?
• Definition: A digital image is defined by integrating and sampling
continuous (analog) data in a spatial domain [Klette, 2014].
Left Hand

O
32

coordinate system Left hand coordinate

system
Introduction Chapter 1
What Digital Image Processing?

• To Bridge the Gap

between Pixels and
Meaning

32
Introduction Chapter 1
perinch
-
tridot
Image Types: (Gray)Scalar and Binary resizing an
image

-y changes
the Spr

• A scalar image has integer simplest images

a=8 a=3 a=2
values Not complex
• a: level (bit)
• Ex. If 8 bit (a=8)
• image spans from 0 to
255
• 0 black and 255 white
• Ex. If 1 bit (a=1) 32

• it is binary image
# of colors =
2"
• 0 and 1 only
Grayscale
112
Image ot colors 2 = =
246

# of pixels/image 512x236 131072

1266
= =

1048576 bimage
z
#
bit/image i Pixe
/*8
131 077hd
every single pixel # of
bytes 131072 bytes
=
= .

8 Gets
is represented
by
8 lors
2 Co
4 biES
#of
-
bit
bytes
Introduction Chapter 1
Image Type: RGB (red, green, blue)
• Each channel spans a-bit values. Human Cone-cells (normalized)
RGB3 image responsivity spectra

seutet

Goverlanges
into when
31
, Gina
32
Wavelength (nm)
• Some people might have 4 cone-types!
• Some might have just 2!
Each has a different array
Introduction Chapter 1
Color
• Color vision has evolved over millions of years.
visible
light Normalized
curve edium
hort ong

cone
comes
Come

C fX=

X =
f= ; ↑f + xy

wavelength
32

Shortest Longest
M M
Introduction Chapter 1
Color
• If there is no light, there is no color!
• Human vision can only discriminate a few dozens of grey levels on a
screen, but hundreds of thousands of different colors.
• RED -> ~625 to 780 nm [long wavelength]
• ORANGE -> ~ 590 to 625 nm [long wavelength]
• YELLOW -> ~565 to 590 nm [middle range wavelength]
• GREEN -> ~ 500 to 565 nm [middle range wavelength] [
• CYAN -> ~485 to 500 nm middle range wavelength]
• BLUE -> ~440 to 485 nm [short wavelength]
• VIOLET -> ~330 to 440 nm [very short wavelength] 32
Introduction Chapter 1
Retina of Human Eye

There are three different types of color-

sensitive cones corresponding to (roughly)
• RED (64% of the cones)
• GREEN (about 32%), and
↓ Photoreceptor
Gone Trols
• BLUE (about 2%).

6-7 million cones ·

responsible -Vision

120 million rods Station Jurongt

during day
Some may have only 2 cones 32

Credit: Klette, 2012.

eyes
Photoreceptors

Efcones Cross
Img Annotation =
Inglabeling
/
Bound Box 17
Polygon polyline
Answer this
In what I
question
cases
are we
:

going
to use
enchecase 3
adv & disadv ?
Introduction Chapter 1

Why Computer Vision?

32
Introduction Chapter 1
The goal of computer vision
• Compute the 3D shape of the world

ZED 2i
Camera

32
Introduction Chapter 1
The goal of computer vision
e
• Recognize objects and people
color

32
Introduction Chapter 1
The goal of computer vision
• “Enhance” images

32
Introduction Chapter 1
The goal of computer vision

32
Introduction Chapter 1
The goal of computer vision
• Forensics

Source: Nayar and Nishino, “Eyes for Relighting”

Introduction Chapter 1
The goal of computer vision

32
Introduction Chapter 1
The goal of computer vision

Source: Nayar and Nishino, “Eyes for Relighting”

Introduction Chapter 1
The goal of computer vision
• Billions of images/videos captured per day

• Huge number of potential applications 32

• The next slides show the current state of the art

Introduction Chapter 1
The goal of computer vision
• Optical character recognition (OCR)
• If you have a scanner, it probably came with OCR software

Automatic check processing

Digit recognition, AT&T labs (1990’s) License plate readers

http://yann.lecun.com/exdb/lenet/ http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

32
Introduction Chapter 1
The goal of computer vision
• Face detection

• Nearly all cameras detect faces in real time

• (Why?)
Introduction Chapter 1
The goal of computer vision

• Face analysis and

recognition
Introduction Chapter 1
The goal of computer vision
• Login without a password (Face ID)

Fingerprint scanners on Face unlock on Apple iPhone X

many new smartphones and See also http://www.sensiblevision.com/
other devices
Introduction Chapter 1
The goal of computer vision
• Image synthesis

Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
Introduction Chapter 1
The goal of computer vision

• Sports and
Advertising
Introduction Chapter 1
The goal of computer vision

• Smart cars

• Mobileye
• Tesla Autopilot
• Safety features in many cars
Introduction Chapter 1
The goal of computer vision

• Self-driving cars

Waymo
Introduction Chapter 1
The goal of computer vision

• Robotics

Eig E NASA’s Mars Curiosity Rover Amazon Picking Challenge

http://www.robocup2016.org/en/events
https://en.wikipedia.org/wiki/Curiosity_(rover)
/amazon-picking-challenge/

computer
vision Amazon Prime Air Amazon Scout
Introduction Chapter 1
The goal of computer vision
• Medical imaging

3D imaging
(MRI, CT) Skin cancer classification with deep learning
https://cs.stanford.edu/people/esteva/nature/
Introduction Chapter 1
The goal of computer vision
• Virtual & Augmented
Reality

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

Introduction Chapter 1
Why computer vision is difficult?
Stroblems
to
fall
Ch 3
①
.

Viewpoint variation

Credit: Flickr user michaelpaul

② Illumination ⑤ Scale
Introduction Chapter 1
Why computer vision is difficult? more problems

Motion (Source: S. Lazebnik)

Blurry Zig
Intra-class variation
Different features
with the same object :
-Bearded vS unbearded
.

Nigab VS
.
Nigabless

Background clutter Occlusion (Hidden)

Introduction Chapter 1
Why computer vision is difficult?
Challenges: local
ambiguity

slide credit: Fei-Fei, Fergus & Torralba

Introduction Chapter 1
Can we manage these difficulties?
Yes, there are lots of
visual cues we can
use…
-10

• We often must use

prior knowledge
about the world’s
structure
Introduction Chapter 1
Can we manage these difficulties?

What do we lose
geometrically?

• Angles
• Distances
• and therefore Area
Introduction Chapter 1
Can we manage these difficulties?

• Vanishing points and lines

Parallel lines in the world

intersect in the image at
a “vanishing point”
Introduction Chapter 1
Can we manage these difficulties? Vertical
vanishing
Example: point
(at infinity)

Vanishi
Vanishi
ng
ng
point
point
Slide from Efros, Photo from Criminisi

1
Introduction Chapter 1
Can we manage these difficulties?
• Any two lines, parallel in
3D will meet at a unique
vanishing point in image
plane.
• All pairs of parallel lines on
the same plane in 3D will
have vanishing points on a
unique vanishing line.

1
Introduction Chapter 1
Course Overview (Tentative)

1. Low-level vision
• image processing, edge detection,
feature detection, cameras, image
formation
2. Geometry and algorithms
• projective geometry, stereo,
structure from motion,
optimization
3. Recognition
• face detection / recognition,
category recognition,
segmentation
Introduction Chapter 1
Course Overview (Tentative)
1. Low-level vision
• Basic image processing and image formation

* =
Filtering, edge detection

Image formation
Feature extraction
Introduction Chapter 1
Course Overview (Tentative)
2. Geometry

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion

Introduction Chapter 1
Course Overview (Tentative)
2. Recognition
“dog”

Image classification

Object detection

Convolutional Neural Networks

Exploring Humans - Philosophy of Science For The Social Sciences
100% (1)
Exploring Humans - Philosophy of Science For The Social Sciences
213 pages
Formal, Non-Formal and Informal Education: Concepts/Applicability
No ratings yet
Formal, Non-Formal and Informal Education: Concepts/Applicability
3 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
Unit 1 Chapter 1
No ratings yet
Unit 1 Chapter 1
27 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
1 Vision Lec 1
No ratings yet
1 Vision Lec 1
49 pages
Chapter 1 Introduction to Computer Vision and Image Processing
No ratings yet
Chapter 1 Introduction to Computer Vision and Image Processing
42 pages
Unit 5 Introduction Robot Vision
No ratings yet
Unit 5 Introduction Robot Vision
60 pages
Lec 01 CompVision N DIP Intro
No ratings yet
Lec 01 CompVision N DIP Intro
91 pages
CV - Lec01 - Introduction
No ratings yet
CV - Lec01 - Introduction
50 pages
Lec00 Intro For Web Highlighted
No ratings yet
Lec00 Intro For Web Highlighted
72 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
Lec 00
No ratings yet
Lec 00
76 pages
T2310 TDS3651 L01 Introduction
No ratings yet
T2310 TDS3651 L01 Introduction
73 pages
Lec 1
No ratings yet
Lec 1
51 pages
AI-Computer Vision
No ratings yet
AI-Computer Vision
16 pages
Ch-1-Intro To DIP
No ratings yet
Ch-1-Intro To DIP
87 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
CV Lecture 1
No ratings yet
CV Lecture 1
65 pages
Ch01_Introduction_to_computer_vision_and_image_processing_1 (1)
No ratings yet
Ch01_Introduction_to_computer_vision_and_image_processing_1 (1)
29 pages
Lecture 1
No ratings yet
Lecture 1
52 pages
Computer Vision and Image Processing (updated) (2)
No ratings yet
Computer Vision and Image Processing (updated) (2)
165 pages
Lect 1 Computervision Student PPT 16-9-2017
No ratings yet
Lect 1 Computervision Student PPT 16-9-2017
143 pages
ECE885 Computer Vision: Prof. Bhupinder Verma
No ratings yet
ECE885 Computer Vision: Prof. Bhupinder Verma
59 pages
Computer Vision
No ratings yet
Computer Vision
14 pages
Topic 5 Computer Vision
No ratings yet
Topic 5 Computer Vision
65 pages
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
No ratings yet
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
48 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
Machine - Learning (Computer Vision)
No ratings yet
Machine - Learning (Computer Vision)
56 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
Introduction To CVIP
No ratings yet
Introduction To CVIP
33 pages
Introduction To Image File Formats
No ratings yet
Introduction To Image File Formats
87 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
CV (Unit1&2ans)
No ratings yet
CV (Unit1&2ans)
32 pages
cv
No ratings yet
cv
4 pages
01 IAU CV Kul-Introd-2023-Public 3
No ratings yet
01 IAU CV Kul-Introd-2023-Public 3
81 pages
Administrivia: CMPSCI 370: Introduction To Computer Vision
No ratings yet
Administrivia: CMPSCI 370: Introduction To Computer Vision
12 pages
C280 Computer Vision C280, Computer Vision: Prof. Trevor Darrell
No ratings yet
C280 Computer Vision C280, Computer Vision: Prof. Trevor Darrell
68 pages
2023 - 12 - 06 7 - 57 PM Office Lens
No ratings yet
2023 - 12 - 06 7 - 57 PM Office Lens
11 pages
intro
No ratings yet
intro
66 pages
Computer Vision
No ratings yet
Computer Vision
35 pages
CS 474 Lec 01 Introduction
No ratings yet
CS 474 Lec 01 Introduction
69 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
CV_Lecture_1-DD-Don
No ratings yet
CV_Lecture_1-DD-Don
38 pages
Lecture 1
No ratings yet
Lecture 1
21 pages
1 Introduction
No ratings yet
1 Introduction
67 pages
Image and Video Analytics Unit 1
No ratings yet
Image and Video Analytics Unit 1
110 pages
Cv Unit 1 Overview of Computer Vison and Application
No ratings yet
Cv Unit 1 Overview of Computer Vison and Application
51 pages
Computer Vision
No ratings yet
Computer Vision
15 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
Lecture 1-Introduction Fundamentals
No ratings yet
Lecture 1-Introduction Fundamentals
42 pages
Digital Image Processing: Vipin V Asst. Professor, ECE SJCET, Palai
No ratings yet
Digital Image Processing: Vipin V Asst. Professor, ECE SJCET, Palai
156 pages
Chapter 1 [CV & IP]
No ratings yet
Chapter 1 [CV & IP]
41 pages
C7-L4-AI-VISUAL DATA
No ratings yet
C7-L4-AI-VISUAL DATA
8 pages
CV-1.1
No ratings yet
CV-1.1
18 pages
CV
No ratings yet
CV
9 pages
computer-vision-al-701
No ratings yet
computer-vision-al-701
50 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Percept: Fundamentals and Applications
From Everand
Percept: Fundamentals and Applications
Fouad Sabry
No ratings yet
SDLC - Agile Model - Quitoriano
No ratings yet
SDLC - Agile Model - Quitoriano
5 pages
Daily Protocol
No ratings yet
Daily Protocol
15 pages
Cft. 23.05.24
No ratings yet
Cft. 23.05.24
1 page
Merak Peep Ps PDF
No ratings yet
Merak Peep Ps PDF
2 pages
Unit 10: Have Some Tea, Please!
No ratings yet
Unit 10: Have Some Tea, Please!
8 pages
The Mediating Effect of Intellectual Capital, Management Accounting Information Systems, Internal Process Performance, and Customer Performance
No ratings yet
The Mediating Effect of Intellectual Capital, Management Accounting Information Systems, Internal Process Performance, and Customer Performance
22 pages
Design of Passive Harmonic Filters
100% (1)
Design of Passive Harmonic Filters
8 pages
MAR002-6 Brand Mgt. & Research: Global Branding Strategy
No ratings yet
MAR002-6 Brand Mgt. & Research: Global Branding Strategy
35 pages
3TNV70-XHB YANMAR MOTOR DIESEL
No ratings yet
3TNV70-XHB YANMAR MOTOR DIESEL
22 pages
Chapter 3-Balancing
No ratings yet
Chapter 3-Balancing
30 pages
Journal Impact Factor
No ratings yet
Journal Impact Factor
376 pages
Nerf Slam
No ratings yet
Nerf Slam
10 pages
NVH Director 021412 Web
No ratings yet
NVH Director 021412 Web
2 pages
GST Amendments - CS - CMA - CA - CA Saumil Manglani V1 2024
No ratings yet
GST Amendments - CS - CMA - CA - CA Saumil Manglani V1 2024
24 pages
Job Description Community Coordinator
No ratings yet
Job Description Community Coordinator
4 pages
2D and 3D Seismic
100% (6)
2D and 3D Seismic
14 pages
Quiz # 07 - WPE (Answer Key) - 1001PJA106219240052
No ratings yet
Quiz # 07 - WPE (Answer Key) - 1001PJA106219240052
1 page
SKD 30 Bridge Rectifier
No ratings yet
SKD 30 Bridge Rectifier
3 pages
Internship Report Giki
0% (1)
Internship Report Giki
67 pages
Work Immersion Training Plan Orlan
75% (4)
Work Immersion Training Plan Orlan
2 pages
Modulador Am-Dsb-Fc
No ratings yet
Modulador Am-Dsb-Fc
3 pages
Direct and Indirect Strategies
No ratings yet
Direct and Indirect Strategies
28 pages
Hungarian Method Calculator
No ratings yet
Hungarian Method Calculator
6 pages
GDE Avio 500 ICP OES Preparing Your Lab 013390 01
No ratings yet
GDE Avio 500 ICP OES Preparing Your Lab 013390 01
7 pages
Ece Syllabus s7 Kannur University
No ratings yet
Ece Syllabus s7 Kannur University
30 pages
Types of Balances and Its Accuracy Class
No ratings yet
Types of Balances and Its Accuracy Class
3 pages
970803B Meter Factor Linearization
No ratings yet
970803B Meter Factor Linearization
6 pages
The Theory and Practice of Psychoanalytic Therapy Listening for the Subtext 1st Edition Gullestad download pdf
100% (5)
The Theory and Practice of Psychoanalytic Therapy Listening for the Subtext 1st Edition Gullestad download pdf
82 pages