UNITII
UNITII
The science of
? facilitating
human-like
capability of
seeing and
understanding
the environment
to a machine or
computer.
1
http://www.bmva.org/visionoverview
Computing structure from
binocular vision
Courtesy: www.cs.gmu.edu/~kosecka/lect4.ppt
4
2
1
1 2 3 4
P p : Projection
Center of projection
O
p
Projection plane
Projective Geometry
Real Space and Projective
Space (2D)
p(kx,ky,k)
p(x,y) p(x,y,1)
y
x
o
y
z
x
o
Homogeneous
p(x,y) p(kx,ky,k) Coordinate system
Homogeneous Representation
p(x,y) l(a/c,b/c,1)
y
x
o
A point in P2 A line in P2 b
c
a
o
Point containment in P2
Points and lines in P2
l:ax+by+c=0 l1 l2
x1(x1,y1) p(x,y)
y y
x2(x2,y2)
x x
o o
ax+by+c=0
p(x,y) l(a/c,b/c,1)
y b
z c
o x
a
o
x Tl 0 lT x 0
x l l' l x x'
In projective geometry, a scaling factor refers to a value that
scales or resizes a point, line, or other geometric object while
preserving its projective properties.
l1:ax+by+c1=0
y X(x,y,1)
x
o l2:ax+by+c2=0
(b, -a, 0)
y
c
x
o
Ideal Point
In projective geometry, points at infinity cannot be
assigned Cartesian coordinates like regular points in
Euclidean geometry. Instead, ideal points are
introduced to handle these cases.
Intersection point
(b, -a, 0)
A direction !
Ideal points
Line at infinity
l(a/c,b/c,1)
=0
A plane passing
l(a/c,b/c,1)
through the origin.
b
A straight line passing
through the origin. c
a
From Hartley and Zisserman, “Multiple view geometry in computer vision”, Cambridge Univ. Press.
(2000)
Projection of parallel lines
from any arbitrary plane
Canonical projection plane
(CPP)
Vanishing Point
(b, -a, 0)
y
z
x
o
Ideal Plane
Vanishing points
Vanishing Points
corresponding to parallel
lines of a plane lie on a
line, called vanishing line.
Vanishing
Line
Vanishing Points
A real life example
Vanishing points
A journey toward infinity ….
xTl=0, and lTx=0
x=l x l’, and l=x x x’
Summary
A point in a 2-D projective space represents a ray
passing through origin of an implicit 3D space.
Requires additional dimension for representation.
O
p
Pinhole camera
Z Principal axis
Y
World coordinate
Focal
P (X,Y,Z)
length f o
p(x,y) Image coordinate
C
Image plane
Camera Center
Principal plane X
(XY-plane)
Pinhole Camera:
Mapping from P P
3 2
The matrix is not square (it’s 3x4), so it does not have a determinant and
therefore cannot have an inverse in the traditional sense.
O Principal
point
X
Projection Matrix under the
offset
Y
P
Z p
C
R,t
Xc
O X
CCD Camera
A charge-coupled device (CCD) is a
light-sensitive integrated circuit that
captures images by converting photons
to electrons.
No. of pixels
per unit length
General Projective Camera
3
3
11 d.o.f
5
Extrinsic parameters: R, t
Intrinsic parameters: K
Properties of projective
camera matrix P=[M | p4]
Rank of P: 3; # of extrinsic params: 6
Size: 3x4; # of intrinsic params: 5
d.o.f.=11;
For n correspondences
Principal point
Y
P
Z p
C
R,t Principal plane
O X Xc
Properties of projective
camera matrix P=[M | p4]
Properties of projective
camera matrix P=[M | p4]
Properties of projective
camera matrix P=[M | p4]
The projective camera matrix is a 3x4 matrix that describes the
mapping of a pinhole camera from 3D points in the world to 2D points
in an image. It has the following properties:
The first three columns of the matrix define the camera's intrinsic
parameters, which are the focal length, the principal point, and the
skew.
The last column of the matrix defines the camera's extrinsic parameters,
which are the camera's position and orientation in the world.
The image of a point at infinity is a line that passes through the camera
center.
Use Cases: Cameras at infinity are particularly useful for scenes where
depth is not critical, and you want to simplify calculations. They are
often used in architectural rendering, certain types of technical
drawings, and other scenarios where maintaining accurate perspective
isn't essential
It's important to note that "cameras at infinity" is a
mathematical construct and not a physical reality.
For n points
Summary
o Pinhole camera model provides the projection
matrix which maps a 3D point to an image point.
o Projection matrix:
o 3x4
o Dof: 11
o 5 intrinsic parameters and 6 extrinsic parameters.
o Minimum 6 point correspondences required for
estimation
o Affine projection matrix
o Last row [0 0 0 1]T
o Dof:8
o Minimum 4 point correspondences required to estimate.
Summary (contd.)
o Geometry encoded in a projection matrix
o P=[ M | p4] or P=[p1 p2 p3 p4] or P=[r1T ; r2T ; r3T]
o Camera Center: -M-1p4
o For affine projection matrix: Right zero of M (A direction).
o Vanishing points
o X-axis: p1
o Y-axis: p2
o Z-axis: p3
o Image of world origin: p4
o Special planes passing through the camera center
o Principal plane: r3TX=0
o Direction of optical axis: <r31, r32, r33>
o Principal point: M [r31 r32 r33]T
o Plane formed with x-axis of image coordinate system: r1TX=0
o Plane formed with y-axis of image coordinate system: r2TX=0
Summary (contd.)
o Geometric derivatives from Projection Matrix:
P= [M|p4]
o Projection ray formed at image point x.
o Direction ratio: M-1x
o A point on the ray:
o Camera center (-M-1p4)