Unit1_Biometrics(Part-2) (1)
Unit1_Biometrics(Part-2) (1)
Unit – 1 (Part – 2)
Cloud Computing 1
Session – 5 & 6
• Image Processing Basics
– What is an Image?
– Image Acquition
– Type, Point Operations
– Geometric Transformations
– First and Second order derivatives
• Steps in Edge Detection, Smoothing,
Enhancement, Thresholding & Localization
Image Processing - Basics
Image
• An image may be defined as a two-dimensional function, f(x, y), where x and y
are spatial (plane) coordinates, and the amplitude of f at any pair of
coordinates (x, y) is called the intensity or gray level of the image at that point.
• When x, y, and the amplitude values of f are all finite, discrete quantities, we
call the image a digital image.
• These elements are referred to as picture elements, image elements, pels, and
pixels. An image can be defined by a two-dimensional array arranged in rows
and columns.
Types of Images
Binary Images
• It is the simplest type of image. Also called as
silhouette image
• It takes only two values i.e, Black and White or 0
and 1.
• The binary image consists of a 1-bit image and it
takes only 1 binary digit to represent a pixel.
• Binary images are mostly used for general shape or
outline.
• For Example: Optical Character Recognition (OCR).
Pros & Cons of Binary Images
Advantages
• Easy to acquire: simple digital cameras can be used or low-cost
scanners, or thresholding may be applied to grey-level images.
• Low storage: no more than 1 bit/pixel
• Simple processing: the algorithms are much simpler than those
applied to grey-level images.
Disadvantages
• Limited application: Application is restricted to tasks where
internal detail is not required as a distinguishing characteristic.
• Does not extend to 3D:
• Specialised lighting is required: it is difficult to obtain reliable
binary images without restricting the environment. The simplest
example is an overhead projector or light box.
Thresholding
• Robert's method,
• Sobel's method,
• Perwitts
Roberts Edge Detection
Advantages:
Limitations:
• Laplacian of Gaussian,
• Zero crossing
Laplacian of Gaussian
• The Laplacian is a 2-D isotropic measure of
the 2nd spatial derivative of an image.
• The Laplacian of an image highlights
regions of rapid intensity change and is
often used for edge detection.
• The Laplacian is often applied to an image
that has first been smoothed with
something approximating a Gaussian
smoothing filter in order to reduce its
sensitivity to noise, and hence the two
variants will be described together here
• The operator normally takes a single grey-
level image as input and produces another
Second Order Derivatives
•
Low Level Feature Extraction
•
Low Level Feature Extraction
• Difference Image
Approaches to Detect Motions
• There are many approaches for detecting the motion in
an image
• Area Based Approach
• Differential Approach
Area Based Approach
Area Based Approach
Area Based Approach
Differential Approach
Differential Approach
Differential Approach
Difference between AB and DB
Approaches
Difference between AB and DB
Approaches
High Level Features
Template Matching
Template Matching
Template Matching
Hough Transform
• The Hough transform is a technique which can be used
to isolate features of a particular shape within an image.
• Because it requires that the desired features be specified
in some parametric form, the classical Hough transform
is most commonly used for the detection of regular
curves such as lines, circles, ellipses, etc.
• The idea behind the Hough technique for line detection
is that each input measurement (e.g. coordinate point)
indicates its contribution to a globally consistent
solution (e.g. the physical line which gave rise to that
image point).
Hough Transform
• As an example, consider the common problem of fitting
a set of line segments to a set of discrete image points
(e.g. pixel locations output from an edge detector).
• Figure shows some possible solutions to this problem.
• Here the lack of a priori knowledge about the number
of desired line segments render this problem under-
constrained.
Hough Transform
• The Hough Space is a 2D plane that has a horizontal
axis representing the slope and the vertical axis
representing the intercept of a line on the edge image.
• A line on an edge image is represented in the form of
y = ax + b
• One line on the edge image produces a point on the
Hough Space since a line is characterized by its
slope a and intercept b.
• On the other hand, an edge point (xᵢ, yᵢ) on the edge
image can have an infinite number of lines pass
through it.
• Therefore, an edge point produces a line in the Hough
Space in the form of b = axᵢ + yᵢ
Hough Transform
• In the Hough Transform algorithm, the Hough Space is
used to determine whether a line exists in the edge
image.
• There is one flaw with representing lines in the form
of y = ax + b and the Hough Space with the slope and
intercept.
• In this form, the algorithm won’t be able to detect
vertical lines because the slope a is undefined/ infinity
for vertical lines
Hough Transform
• This means that a computer would need an
infinite amount of memory to represent all
possible values of a.
• To avoid this issue, a straight line is instead
represented by a line called the normal line that
passes through the origin and perpendicular to
that straight line.
• The form of the normal line is ρ = x cos(θ) + y
sin(θ) where ρ is the length of the normal line
and θ is the angle between the normal line and
the x axis.
Hough Transform
• So, a convenient equation for describing a set of lines uses
parametric or normal notion: ρ = x cos(θ) + y sin(θ)
• where ρ is the length of a normal from the origin to this line
and theta (θ) is the orientation of ρ with respect to the X-
axis.
• For any point (x,y ) on this line, ρ and θ are constant.
• Instead of representing the Hough Space with the
slope a and intercept b, it is now represented with ‘ρ’ and
‘θ’ where the horizontal axis is for the θ values and the
vertical axis are for the ρ values.
• The mapping of edge points onto the Hough Space works in
a similar manner except that an edge point (xᵢ, yᵢ) now
generates a cosine curve in the Hough Space instead of a
straight line
Hough Transform
Hough Transform
• As mentioned, an edge point produces a cosine curve
in the Hough Space.
• From this, if we were to map all the edge points from
an edge image onto the Hough Space, it will generate a
lot of cosine curves.
• If two edge points lay on the same line, their
corresponding cosine curves will intersect each other
on a specific (ρ, θ) pair.
• Thus, the Hough Transform algorithm detects lines by
finding the (ρ, θ) pairs that have a number of
intersections larger than a certain threshold.
Hough Transform – Algorithm
1. Determine the range of ρ and θ. Typically, the range of
θ is [0, 180] degrees and ρ is [-d, d], where d is the
diagonal length of the edge. It is important to quantize
the range of ρ and θ, which means there should only
be a finite number of possible values.