AD8703 BCV Unit I 2023
AD8703 BCV Unit I 2023
This document is confidential and intended solely for the educational purpose of
RMK Group of Educational Institutions. If you have received this document
through email in error, please notify the system manager. This document
contains proprietary information and is intended only to the respective group /
learning community as intended. If you are not the addressee you should not
disseminate, distribute or copy through e-mail. Please notify the sender
immediately by e-mail if you have received this document by mistake and delete
this document from your system. If you are not the intended recipient you are
notified that disclosing, copying, distributing or taking any action in reliance on
the contents of this information is strictly prohibited.
AD8703
BASICS OF COMPUTER
VISION
UNIT I
Department: AI&DS
Date : 26.07.2023
Table of Contents
S
CONTENTS PAGE NO
NO
1 Contents 1
2 Course Objectives 6
8
3 Pre Requisites (Course Names with Code)
5 Course Outcomes 12
7 Lecture Plan 16
9 Lecture Notes 20
Lecture Slides 37
Lecture Videos 39
10 Assignments 41
11 Part A (Q & A) 43
12 Part B Qs 47
16 Assessment Schedule 55
COURSE OBJECTIVES
To review image processing techniques for computer vision.
To understand various features and recognition techniques
To learn about histogram and binary vision
Apply three-dimensional image analysis techniques
Study real world applications of computer vision algorithms
Prerequisite
CS 404 OPERATING SYSTEM
PREREQUISITE
Syllabus
AD8703 -BASICS OF COMPUTER VISION
SYLLABUS 3003
UNIT I INTRODUCTION
Feature Extraction -Edges - Canny, LOG, DOG; Line detectors (Hough Transform),
Corners -Harris and Hessian Affine, Orientation Histogram, SIFT, SURF, HOG, GLOH,
Scale-Space Analysis- Image Pyramids and Gaussian derivative filters, Gabor Filters
and DWT. Image Segmentation -Region Growing, Edge Based approaches to
segmentation, Graph-Cut, Mean-Shift, MRFs, Texture Segmentation.
UNIT V APPLICATIONS
CO1: Recognise and describe how mathematical and scientific concepts are
applied in computer vision.
PO’s/PSO’s
COs
PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO PSO
1 2 3 4 5 6 7 8 9 10 11 12 1 2 3
CO1
3 2 2 - - - - - - - - - 2 - -
CO2
3 3 2 - - - - - - - - - 2 - -
CO3
2 2 1 - - - - - - - - - 2 - -
CO4
3 3 1 - - - - - - - - - 2 - -
CO5
3 3 1 - - - - - - - - - 3 1 -
No of Taxo Mode
Actual
S No Topics period Pertain nom of
Proposed date Lecture
s ing CO y delivery
Date
level
Introduction to
CO1
1 Image Processing, 1 7.08.2023 K1 Lecture
Computer Vision
- Low-level, Mid-
level, High-level CO1
2 1 9.08.2023 K2 Lecture
Fundamentals of
Image Formation
Transformation:
3 Orthogonal, 1 10.8.2023 CO1 K2 Lecture
Euclidean
Affine, Projective 10.08.202
4 1 CO1 K2 Lecture
3
16.08.202 Lectur
5 Fourier Transform 1 CO1 K2
3 e
Restoration, Lectur
8 Histogram 1 17.08.202 CO1 K2 e
Processing. 3
Revision – Quiz 23.08.202 Lectur
9 Activity 1 3 CO1 K2 e
Activity Based Learning
ACTIVITY BASED LEARNING
21
Image Enhancement: Image enhancement techniques aim to
improve the visual quality of an image, making it easier for humans
and machines to perceive relevant details.
23
4. Object Detection and Recognition: Locating and identifying specific
objects or patterns within an image or video.
2. Scale and Complexity: The sheer amount of visual data and the
complexity of real-world scenes require powerful and efficient algorithms.
4. Semantic Gap: Bridging the gap between low-level pixel data and
high-level semantic understanding.
2
5
a. Image Enhancement:
b. Image Segmentation:
c. Feature Extraction:
d. Image Registration:
2. Transformation
Affine transformations can change the position, orientation, shape, and size
of objects but preserve parallel lines.
3. FOURIER TRANSFORM
where F(ω) represents the frequency domain representation of the function, f[n]
is the discrete function, ω is the angular frequency, and the summation is
performed over all discrete time instances.
The Fast Fourier Transform (FFT) is an efficient algorithm for computing the
discrete Fourier transform (DFT) or its inverse. It allows the Fourier
transform to be computed quickly, especially for large datasets.
The Fourier transform has revolutionized many fields by enabling the analysis,
manipulation, and synthesis of signals in the frequency domain. It is an essential
tool in understanding and processing various types of signals and has
widespread applications in science, technology, and everyday life.
3
4
4. Convolution and Filtering
4.1 Convolution:
4.2 FILTERING
There are various types of filters used in signal processing, including low-
pass filters, high-pass filters, band-pass filters, and notch filters. Each filter
has its own characteristics and is designed to address specific requirements
in signal processing applications.
There are various methods and algorithms for image enhancement, and the
choice of technique depends on the specific requirements and characteristics
of the image. Here are some commonly used image enhancement
techniques:
5.4. Noise Removal: Image noise can degrade the quality of an image
and reduce its clarity. Various noise reduction techniques, such as median
filtering, Gaussian filtering, and Wiener filtering, can be employed to
suppress noise and enhance image details.
5.5. Image Fusion: Image fusion combines multiple images of the same
scene captured under different conditions or using different sensors to
create a single enhanced image. Fusion methods aim to extract the most
salient information from each input image and generate an output image
with improved quality and more comprehensive information.
6.1 Noise Reduction: Image denoising techniques are used to reduce the
effect of noise in an image. These methods aim to preserve image details
while suppressing the unwanted noise. Common denoising techniques
include median filtering, Gaussian filtering, wavelet denoising, and total
variation denoising.
6.3. Super-Resolution:
Lecture Slides
Lecture Videos
Lecture Videos
Lecture Videos
Assignment
Assignment
Feature Extraction and Matching Vision and Image Processing
Describe two types of filters commonly used in image processing and their
respective applications.
12. What is the Fourier Transform used for in image processing? Answer: The Fourier
Transform is used to analyze the frequency components of an image, aiding tasks
like filtering and compression.
Part B Q
5
2
PART -B
Course
S No Course title Link
provider
https://www.udemy.co
Computer vision applies m/topic/computer-
machine learning.
1 Udemy vision/
https://www.udacity.co
https://www.coursera.o
Advanced Computer Vision rg/learn/advanced-
3 Coursera with TensorFlow
computer-vision-with-
tensorflow
Computer Vision and https://www.edx.org/lear
Image Processing n/computer-
Fundamentals programming/ibm-
edX
4 computer-vision-and-
image-processing-
fundamentals?webview=
false&campaign=Comput
er+Vision+and+Image+
Processing+Fundamental
s&source=edx&product_
category=course&placem
ent_url=https%3A%2F%
2Fwww.edx.org%2Flearn
%2Fcomputer-vision
54
Real life Applications in
day to day life and to
Industry
5
6
AND TO INDUSTRY
https://www.youtube.com/watch?v=2w8XIskzdFw
Assessment Schedule
ASSESSMENT SCHEDULE
FIAT
Proposed date :24.08.2023
Prescribed Text books &
Reference books
PRESCRIBED TEXT BOOKS AND REFERENCE BOOKS
TEXT BOOKS
D. A. Forsyth, J. Ponce, “Computer Vision: A Modern Approach”,
Pearson Education,
2003.
2. Richard Szeliski, “Computer Vision: Algorithms and Applications”,
Springer Verlag London Limited,2011.
REFERENCE BOOKS
B. K. P. Horn -Robot Vision, McGraw-Hill.
Simon J. D. Prince, Computer Vision: Models, Learning, and
Inference, Cambridge University Press, 2012.
Mark Nixon and Alberto S. Aquado, Feature Extraction & Image
Processing for Computer Vision, Third Edition, Academic Press,
2012.
E. R. Davies, (2012), “Computer & Machine Vision”, Fourth Edition,
Academic Press.
Concise Computer Vision: An Introduction into Theory and
Algorithms, by Reinhard Klette,2014
Mini Project
Suggestions
MINI PROJECT SUGGESTIONS
Disclaimer:
This document is confidential and intended solely for the educational purpose of RMK Group of
Educational Institutions. If you have received this document through email in error, please notify the
system manager. This document contains proprietary information and is intended only to the
respective group / learning community as intended. If you are not the addressee you should not
disseminate, distribute or copy through e-mail. Please notify the sender immediately by e-mail if you
have received this document by mistake and delete this document from your system. If you are not
the intended recipient you are notified that disclosing, copying, distributing or taking any action in
reliance on the contents of this information is strictly prohibited.