INT345 COMPUTER VISION
INT345 COMPUTER VISION
CO2 :: elaborate various camera models such as pinhole camera model, camera calibration and
lens distortion correction.
CO3 :: derive the concept of epipolar geometry including the epipolar constraint and fundamental
matrix.
CO4 :: apply appropriate feature descriptors to find relevant features from images.
CO6 :: analyze various machine learning algorithms for image classification and clustering.
Unit I
Fundamentals of Image Processing : Introduction to image processing, key differences between
image processing and computer vision, applications of image processing and computer vision, image
file formats
Noise removal and filtering : image noise and its types, spatial domain filtering techniques,
frequency domain filtering
Image Enhancement : histogram equalization and specification, contrast enhancement
Unit II
Camera Geometry : Pinhole camera model, cameras with lenses, CCD cameras, general projective
cameras, affine cameras, camera calibration
2-D Projective Geometry : planar geometry, projective spaces, Representation in projective
coordinates, Homography and its properties, Applications: image stitching, perspective correction and
rectification
Unit III
Stereo Geometry : epipolar geometry, fundamental matrix, normalized 8-point algorithm, algebric
minimization algorithm, geometric distance computation
Camera motion and 3D Reconstruction : camera motion, motion models, optical flow, Linear
triangulation method
Unit IV
Feature Detection and Description : Overview of feature detection, Harris corner detector, scale
invariant feature transform, speeded up robust features, binary feature detectors, FAST, BRIEF, ORB,
HOG, applications of descriptors
Feature Matching and model Fitting : Overview of feature matching, similarity measures, brute
force matching, K-D tree, Locality-Sensitive Hashing (LSH), RANSAC for robust matching
Unit V
Color Processing : Importance of color in computer vision, color models, conversion between color
spaces, color augmentation techniques, color constancy
Range Image Processing : introduction to range images, difference between 2D intensity images
and 3D range images, active range sensors, preprocessing of range data, applications of range data
Unit VI
Image Segmentation : Overview of image segmentation, thresholding based segmentation, edge-
based segmentation, region based segmentation, clustering based segmentation
Advanced topics in computer vision : Generative Models, multimodal vision models, Vision
transformers, large vision models, Vision-Language Models
Practicals
• Implementing basic image processing operations, such as image filtering, edge detection, and image
enhancement..
• Performing geometric transformations on images, including rotation, scaling, and translation.
• Estimating optical flow between consecutive video frames to understand motion in videos.
Text Books:
1. COMPUTER VISION: A MODERN APPROACH by DAVID A. FORSYTH, Pearson Education
India
References:
1. PROGRAMMING COMPUTER VISION WITH PYTHON: TOOLS AND ALGORITHMS FOR
ANALYSING IMAGES by JAN ERIK SOLEM, O'REILLY