An Introduction To Computer Vision
An Introduction To Computer Vision
Contents
1 What Motivates Us? 2
1
1 What Motivates Us?
An interesting question we always ask is what the next generation of computers is going
to be like. To answer this question, let’s recall our first touch of computer. At least, my
experience was that I waved my hands and said “how are you” to a machinery. Obviously,
no answer at all.
It was a dream that computers would be able to see and think, which has been driving
us to explore various research issues to make this dream come true. Although computers
become faster and faster, they are still quite dull, since they can neither see nor even perform
simple reasonings. Obviously, we are not satisfied to just use our computers as a calculator,
a word processer, a CD player, or a game station; instead, we expect computers to do more
intelligent things like our human beings. For example,
• ···
2
Biomechanics Psychology
Human-Computer Virtual
Multimedia Robotics Enviornments
Interaction
Computer Computer
Vision Graphics
Speech Image
Processing Processing
Machine Learning
A.I.
Pattern Recognition
3
2.1.3 Multimedia
Multimedia is a vague term. Different people have different emphasis. We are particularly
interested in the analysis of the content of multimedia. An interesting question we ask is
what is inside this picture or what this video means, which involves a quite challenging task
of image/video understanding. Many appealing applications have been proposed, but yet to
be accomplished. When given just a photo of Sophie Marceau, without knowing her name,
computers could search the Internet and get tones of her photos and movies. When you
get tied of watching a long movie, computers could automatically summarize the movie in
maybe five minutes.
4
model from examples?” such that models could be learned implicitly, instead of constructed
explicitly.
Image Video
Understanding Understanding
Object
Recognition
3D Motion
IBR Reconstruction Capturing
Geometric Visual
Modeling Tracking
Multiview Segmentation
Geometry Stereo SfM
Image Formation
5
a word, computer vision is an inverse processing of the forward process of image formation
and graphics. In this sense, as many people agree, vision is a much more challenging problem
than computer graphics, because it is full of uncertainties.
6
objects and scenes, such that 3D reconstruction and image-based rendering could be made
possible.
Another task of middle-level vision is to answer the question “how the object moves”.
Firstly, we should know which areas in the images belong to the object, which is the task
of image segmentation. Image segmentation has been a challenging fundamental problem
in computer vision for decades. Segmentation could be based on spatial similarities and
continuities. However, uncertainty can not be overcome for static image. When considering
motion continuities, we hope the uncertainty of segmentation could be alleviated. On top
of that is visual tracking and visual motion capturing, which estimate 2D and 3D motions,
including deformable motions and articulated motions.