0% found this document useful (0 votes)
2 views

Chapter+1+Introduction+Part+1

The document outlines the course ECE467 on Image Processing and Robot Vision, taught by Dr. Saqer S. Alja’afreh, emphasizing the fundamentals of computer vision and image processing. Key topics include image formation, edge detection, object detection, and the differences between computer vision and human vision. The course requires programming knowledge and emphasizes self-discipline, with strict policies on attendance, participation, and late submissions.

Uploaded by

nasser.shraifi
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

Chapter+1+Introduction+Part+1

The document outlines the course ECE467 on Image Processing and Robot Vision, taught by Dr. Saqer S. Alja’afreh, emphasizing the fundamentals of computer vision and image processing. Key topics include image formation, edge detection, object detection, and the differences between computer vision and human vision. The course requires programming knowledge and emphasizes self-discipline, with strict policies on attendance, participation, and late submissions.

Uploaded by

nasser.shraifi
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 72

Introduction

ECE467
Image Processing & Robot Vision

Dr. Saqer S. Alja’afreh


Textbook Information

• Textbook:
Rick Szeliski, Computer Vision: Algorithms and Applications online at:
http://szeliski.org/Book/
Course Information Chapter 0

Course Information
• This course is an introduction to those areas of Artificial Intelligence that
deal with fundamental issues and techniques of computer vision and
image processing.

• The emphasis is on physical, mathematical, and information-


processing aspects of the vision.

• Topics to be covered include image formation, edge detection and


segmentation, convolution, image enhancement techniques, extraction
of features such as color, texture, and shape, object detection, 3-D vision,
and computer vision system architectures and applications.

• The material is based on undergraduate-level texts augmented with


research papers, as appropriate.
Course Information Chapter 0

• The course will move fast


• Self-discipline is important
• The emphasis of the course is to develop practical skills for solving
Computer Vision and Image Processing problems
• Fair evaluations: undergraduate and graduate students will be
scored separately
• Academic Integrity (AI) will be taken into consideration, please
refer to the course syllabus. (Work on homework and projects
independently)
Course Information Chapter 0

Preferable Skills

• Introduction to Programming Languages


• Knowledge of Linear Algebra
• Programming in Python Preferred
Course Information Chapter 0

Course Requirements
• Class attendance and participation is expected
• You are responsible for ALL materials presented in class and
assigned to read
• Quizzes will be given during class time only.
• Regular deliverables on the project will be graded during the
course
Course Information Chapter 0

Late Submission Policy


• Completed homework and project deliverables are to be
submitted by their deadline (11:59pm).
• For homework and projects, every day in delay will result in a
10% of deduction of its score.
Introduction Chapter 1

What is Computer Vision

• Computer vision is a field of computer science


• works on enabling computers to see,
• identify and process images in the same way that
human vision does, and
• then provide appropriate output.
• It is like imparting human intelligence and instincts
about vision to a computer.

23
Introduction Chapter 1

Every image tells a story


ThTS
• Goal of computer vision: perceive
the “story” behind the picture
definition
IS
a 8 - a

&
of
• Compute properties of the world the "
actual
• 3D shape definitiOh
• Names of people or objects of
Comp
• What happened? Vestor
.
Introduction Chapter 1

What is Computer Vision

• Computer vision is a field of computer science


• works on enabling computers to see,
• identify and process images in the same way that
human vision does, and
• then provide appropriate output.
• It is like imparting human intelligence and instincts
about vision to a computer.

23
Introduction Chapter 1

What is Computer Vision

• Automatic understanding of images and video


1. Computing properties of the 3D world from
visual data (measurement)
2. Algorithms and representations to allow a
machine to recognize objects, people, scenes,
and activities. (perception and interpretation)

23
Introduction Chapter 1
Visual Perception
• Definition: Process of acquiring knowledge about
environmental objects and events by extracting information
from the light they emit or reflect [Palmer, 2012].

Cognitive Acquisition of
Vision
Activity knowledge

Perception is analogous to taking a picture!


(credit: Palmer, 2012)
23
Introduction Chapter 1

Computer Vision vs Human Vision

24
Introduction Chapter 1
Computer Vision vs Human Vision
Interpreting
Sensing device device
Interpretations
Picture
Man
Thrash
Bulb
Light

24
Introduction Chapter 1
Can computers match human perception?
• Yes and no (mainly no)
• computers can be better at “easy”
things
• humans are better at “hard” things

• But huge progress


• Accelerating in the last five years due
to deep learning
• What is considered “hard” keeps
changing
Introduction Chapter 1
Human perception has its shortcomings

What can you see


in this picture?
Introduction Chapter 1
Human perception has its shortcomings

What can you see


in this picture?

Credit: Thompson, Basic Vision, Oxford Press, 2012.


Introduction Chapter 1
Human perception has its shortcomings

Copyright A.Kitaoka 2003


Introduction Chapter 1
But humans can tell a lot about a scene from a little
information…

Source: “80 million tiny images” by Torralba, et al.


Introduction Chapter 1

Related disciplines
Artificial
intelligence Machine
Graphics learning
Computer
Image vision Cognitive
processing science
Algorithms

25
Introduction Chapter 1
Chapter 0 Week 1

-
Al tool applied
structured (labeled
S

for
Jata .
Introduction Chapter 1

Bothfools
Suitable for

unstructured Jata
Introduction Chapter 1
Image Segmentation extraction
Manual Feature =

& L for
Structured
equires ma data

Features distinguishing
properties

annotation
one
byimage

Image Processing Important steps

computer vision
initial step : Image processing
Introduction Chapter 1
fre
high precision J useful
high accuracy structures
Jata

humans only
needed for usedfor unstructured data:
preprocessing
& implementation cunsupervised
learning]

00

neural O
network
Introduction Chapter 1
Introduction Chapter 1
Introduction Chapter 1
Image Processing vs. Computer Vision
• Image Processing
• Research area within electrical engineering/signal processing
• Focus on syntax, low level features

image image
(Denoising/inpainting)
• Computer Vision Coutcome of computer vision is story
a

• Research area within computer science/artificial intelligence


• Focus on semantics, symbolic or geometric descriptions
31
Faces, People
Chairs, etc.
(Recognition/Detection)
image
Introduction Chapter 1
21 Spatial signals
What is a (digital) Image? An image is an array of
numbers (pixels).

32

What humans see What Computer see


Introduction Chapter 1
What is a (digital) Image?
• Definition: A digital image is defined by integrating and sampling
continuous (analog) data in a spatial domain [Klette, 2014].
Left Hand

O
32

coordinate system Left hand coordinate


system
Introduction Chapter 1
What Digital Image Processing?

• To Bridge the Gap


between Pixels and
Meaning

32
Introduction Chapter 1
perinch
-
tridot
Image Types: (Gray)Scalar and Binary resizing an
image

-y changes
the Spr

• A scalar image has integer simplest images


a=8 a=3 a=2
values Not complex
• a: level (bit)
• Ex. If 8 bit (a=8)
• image spans from 0 to
255
• 0 black and 255 white
• Ex. If 1 bit (a=1) 32

• it is binary image
# of colors =
2"
• 0 and 1 only
Grayscale
112
Image ot colors 2 = =
246

# of pixels/image 512x236 131072


1266
= =

1048576 bimage
z
#
bit/image i Pixe
/*8
131 077hd
every single pixel # of
bytes 131072 bytes
=
= .

8 Gets
is represented
by
8 lors
2 Co
4 biES
#of
-
bit
bytes
Introduction Chapter 1
Image Type: RGB (red, green, blue)
• Each channel spans a-bit values. Human Cone-cells (normalized)
RGB3 image responsivity spectra

seutet

Goverlanges
into when
31
, Gina
32
Wavelength (nm)
• Some people might have 4 cone-types!
• Some might have just 2!
Each has a different array
Introduction Chapter 1
Color
• Color vision has evolved over millions of years.
visible
light Normalized
curve edium
hort ong

cone
comes
Come

C fX=

X =
f= ; ↑f + xy

wavelength
32

Shortest Longest
M M
Introduction Chapter 1
Color
• If there is no light, there is no color!
• Human vision can only discriminate a few dozens of grey levels on a
screen, but hundreds of thousands of different colors.
• RED -> ~625 to 780 nm [long wavelength]
• ORANGE -> ~ 590 to 625 nm [long wavelength]
• YELLOW -> ~565 to 590 nm [middle range wavelength]
• GREEN -> ~ 500 to 565 nm [middle range wavelength] [
• CYAN -> ~485 to 500 nm middle range wavelength]
• BLUE -> ~440 to 485 nm [short wavelength]
• VIOLET -> ~330 to 440 nm [very short wavelength] 32
Introduction Chapter 1
Retina of Human Eye

There are three different types of color-


sensitive cones corresponding to (roughly)
• RED (64% of the cones)
• GREEN (about 32%), and
↓ Photoreceptor
Gone Trols
• BLUE (about 2%).

6-7 million cones ·


responsible -Vision

120 million rods Station Jurongt


during day
Some may have only 2 cones 32

Credit: Klette, 2012.


eyes
Photoreceptors

Efcones Cross
Img Annotation =
Inglabeling
/
Bound Box 17
Polygon polyline
Answer this
In what I
question
cases
are we
:

going
to use
enchecase 3
adv & disadv ?
Introduction Chapter 1

Why Computer Vision?

32
Introduction Chapter 1
The goal of computer vision
• Compute the 3D shape of the world

ZED 2i
Camera

32
Introduction Chapter 1
The goal of computer vision
e
• Recognize objects and people
color

32
Introduction Chapter 1
The goal of computer vision
• “Enhance” images

32
Introduction Chapter 1
The goal of computer vision

32
Introduction Chapter 1
The goal of computer vision
• Forensics

32

Source: Nayar and Nishino, “Eyes for Relighting”


Introduction Chapter 1
The goal of computer vision

32
Introduction Chapter 1
The goal of computer vision

32
Introduction Chapter 1
The goal of computer vision

32

Source: Nayar and Nishino, “Eyes for Relighting”


Introduction Chapter 1
The goal of computer vision
• Billions of images/videos captured per day

• Huge number of potential applications 32

• The next slides show the current state of the art


Introduction Chapter 1
The goal of computer vision
• Optical character recognition (OCR)
• If you have a scanner, it probably came with OCR software

Automatic check processing

Digit recognition, AT&T labs (1990’s) License plate readers


http://yann.lecun.com/exdb/lenet/ http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

32
Introduction Chapter 1
The goal of computer vision
• Face detection

• Nearly all cameras detect faces in real time


• (Why?)
Introduction Chapter 1
The goal of computer vision

• Face analysis and


recognition
Introduction Chapter 1
The goal of computer vision
• Login without a password (Face ID)

Fingerprint scanners on Face unlock on Apple iPhone X


many new smartphones and See also http://www.sensiblevision.com/
other devices
Introduction Chapter 1
The goal of computer vision
• Image synthesis

Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
Introduction Chapter 1
The goal of computer vision

• Sports and
Advertising
Introduction Chapter 1
The goal of computer vision

• Smart cars

• Mobileye
• Tesla Autopilot
• Safety features in many cars
Introduction Chapter 1
The goal of computer vision

• Self-driving cars

Waymo
Introduction Chapter 1
The goal of computer vision

• Robotics

Eig E NASA’s Mars Curiosity Rover Amazon Picking Challenge


http://www.robocup2016.org/en/events
https://en.wikipedia.org/wiki/Curiosity_(rover)
/amazon-picking-challenge/

computer
vision Amazon Prime Air Amazon Scout
Introduction Chapter 1
The goal of computer vision
• Medical imaging

3D imaging
(MRI, CT) Skin cancer classification with deep learning
https://cs.stanford.edu/people/esteva/nature/
Introduction Chapter 1
The goal of computer vision
• Virtual & Augmented
Reality

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture


Introduction Chapter 1
Why computer vision is difficult?
Stroblems
to
fall
Ch 3

.

Viewpoint variation

Credit: Flickr user michaelpaul

Credit: Flickr user michaelpaul

② Illumination ⑤ Scale
Introduction Chapter 1
Why computer vision is difficult? more problems

Motion (Source: S. Lazebnik)


Blurry Zig
Intra-class variation
Different features
with the same object :
-Bearded vS unbearded
.

Nigab VS
.
Nigabless

Background clutter Occlusion (Hidden)


Introduction Chapter 1
Why computer vision is difficult?
Challenges: local
ambiguity

slide credit: Fei-Fei, Fergus & Torralba


Introduction Chapter 1
Can we manage these difficulties?
Yes, there are lots of
visual cues we can
use…
-10

• We often must use


prior knowledge
about the world’s
structure
Introduction Chapter 1
Can we manage these difficulties?

What do we lose
geometrically?

• Angles
• Distances
• and therefore Area
Introduction Chapter 1
Can we manage these difficulties?

• Vanishing points and lines

Parallel lines in the world


intersect in the image at
a “vanishing point”
Introduction Chapter 1
Can we manage these difficulties? Vertical
vanishing
Example: point
(at infinity)

Vanishi
Vanishi
ng
ng
point
point
Slide from Efros, Photo from Criminisi

1
Introduction Chapter 1
Can we manage these difficulties?
• Any two lines, parallel in
3D will meet at a unique
vanishing point in image
plane.
• All pairs of parallel lines on
the same plane in 3D will
have vanishing points on a
unique vanishing line.

1
Introduction Chapter 1
Course Overview (Tentative)

1. Low-level vision
• image processing, edge detection,
feature detection, cameras, image
formation
2. Geometry and algorithms
• projective geometry, stereo,
structure from motion,
optimization
3. Recognition
• face detection / recognition,
category recognition,
segmentation
Introduction Chapter 1
Course Overview (Tentative)
1. Low-level vision
• Basic image processing and image formation

* =
Filtering, edge detection

Image formation
Feature extraction
Introduction Chapter 1
Course Overview (Tentative)
2. Geometry

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion


Introduction Chapter 1
Course Overview (Tentative)
2. Recognition
“dog”

Image classification

Object detection

Convolutional Neural Networks

You might also like