ECE885 Computer Vision: Prof. Bhupinder Verma
ECE885 Computer Vision: Prof. Bhupinder Verma
ECE885-Computer Vision
COURSE OBJECTIVES: The course addresses various dimensions of image analysis, image understanding, automated visual inspections of industrial processes, medical imaging, intelligent robotics etc.
After learning this course the student should be able to visualise software & hardware issues related to industrial applications of image processing.
05 20 25 50
Of these five, vision is undoubtedly the one that we have come to depend upon above all others, and indeed the one that provides most of the data we receive.
Not only do the input pathways from the eyes provide megabits of information at each glance, but the data rate for continuous viewing probably exceed 10 megabits per second. However, much of this information is redundant and is compressed by the various layers of the visual cortex, so that the higher centers of the brain have to interpret abstractly only a small fraction of the data.
Human Vision
Types of Images
Types of Images
resolution (7x7)
binary image gray-scale (or gray-tone) image color image multi-spectral image range image labeled image
Cardiac Cycle
The visulaised flow pattern in the right attrium of a normal subject at different phases
Effect of changing the dpi resolution while keeping number of pixels constant a) a 450x450 image at 200 dpi(2.25x2.25 in) b) The same 450x450 image at 300dpi (1.5x1.5 in)
Original image
Image flipped vertically Cropped image Sub-sampled image
Original image
Its histogram
Original image courtesy Dr. Roger's Research School of Biological Sciences at Australian National University Canberra
Its histogram
Original Image coutesy NASA
Examples of quality classes of pine wood lumber outward faces (Nordic Timber 1994). Printed with permission from the publisher. The original is a color picture.
IR images of Vegetables
1
2 3 4 5 6 7
Visible Blue
Visible green Visible red Near Infrared Middle Infrared Thermal Infrared Middle Infrared
0.45-0.52
0.53-0.60 0.63-0.69 0.76-0.90 1.55-1.75 10.4-12.5 2.08-2.35
A Research Presentation
Vision allows humans to perceive & understand the world surrounding them, while computer vision aims to duplicate the effect of human vision by electronically perceiving and understanding an image. Vision = Geometry + Measurement + Interpretation
Image capture
Early processing
Segmentation
Model fitting Motion prediction Qualitative / Quantitative conclusions
Image understanding by a machine can be seen as an attempt to find a relation between input image(s) and previously established models of the observed world. Transition from input images to model reduces the information contained in the image to relevant information for application domain. Image representation can be divided according to data organization at four levels (on next slide). This hierarchy of image representation and related algorithms is often categorized to simply three levels: Low Level Processing, Intermediate & High level image understanding
Low-Level
sharpening
blurring
42
Image segmentation is the key step at this level, in which the computer vision tries to separate objects from image background and from each other.
Low-Level
Canny
original image
edge image
Mid-Level
ORT
Mid-level
45
High level computer vision tries to imitate human cognition and ability to make decisions according to information contained in the images
Hi-level vision begins with some form of formal model of the world, and then the 'reality' perceived in the form of digitized images is compared to the model. A match is attempted, and when differences emerge, partial matches (or sub-goals) are sought that overcome the mis-matches. The computer switches to Low-level image processing to find information needed to update the model.
This process is then repeated iteratively and 'understanding' an image thereby becomes a cooperation between top-down and bottom-up processes.
A computer vision system recovers useful information about a scene from its 2D projections. Computer Vision is a field within artificial intelligence. goal is to develop methods for image understanding: recover the three-dimensional shape of objects in the scene or understand the environment with some purpose, like autonomous navigation
Pattern recognition(PR):
Pattern Recognition classifies numerical and symbolic data. Many statistical and syntactical techniques have been developed for classification of patterns.
Techniques from pattern recognition play an important role in machine vision for recognising objects.
AI is used to analyse scenes by computing a symbolic representation of the scene contents after the images have been processed to obtain features.
Psychophysics:
Psychophysics alongwith cognitive science, has studied human vision for a long time.
Many researchers in Computer Vision are more interested in preparing computational models of human vision than in designing machine vision systems.
Machine vision produces measurements or abstractions from geometrical properties. It may be useful to remember the equation; Vision = Geometry + Measurement + Interpretation
IP algorithms are used in early stages of a computer vision/ machine vision systems.
The pinhole model of imaging geometry does not distinguish size of objects.
Interpretation:
Interpretation of image(s) constitutes the principal tool of computer vision to approach problems which humans solve unwittingly. When a human tries to understand an image then previous knowledge & experience is brought to the current observation. Artificial intelligence has invested several decades in attempts to endow computers with the capability to understand observations. Interpretation of images can be seen as a mapping: interpretation: image datamodel There may be several interpretations of the same image(s).
Noise:
Noise is inherently present in each measurement in the real world which needs mathematical tools to cope up with it e.g. probability theory.