0% found this document useful (0 votes)

9 views

Computer

The document provides an introduction to computer vision, covering its definition, history, and key concepts such as geometric primitives, transformations, and convolutional neural networks (CNNs). It outlines the processes involved in image formation, the role of digital cameras, and various applications of computer vision technology. Additionally, it discusses point operators used in image processing for enhancing or modifying image characteristics.

Uploaded by

Balaji

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Computer

Uploaded by

Balaji

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

NANDHA ENGINEERING COLLEGE

COMPUTER
VISION 1 7 A I X 0 7

UNIT - 1 INT RODUC T ION TO IM AGE

FOR M AT I ON A N D P ROC E S S I N G

J a n u a r y | 2 0 2 4
CONTENT
1 Computer Vision
2 Geometric Primitives and
Transformation
3 Photometric Image Formation
4 Digital Camera
5 Point Operators
Computer Vision
What is CV?
 One of the most powe ul and compelling types of AI is
Computer Vision which you have almost surely
experienced in any number of ways without even
knowing.
 Computer Vision is the field of Computer Science that
focuses on replicating pa s of the complexity of the
human vision system and enabling computers to
identify and process object in image and videos in the
same way that human do.
 The Ultimate Goal is for computers to emulate the
striking perceptual capability of human eyes and brains
or even to surpass assist the humen in ce ain way.
History of Computer Vision
• The origins of computer vision can be traced • In the 1970s and 1980s, advancements were
back to the 1950s and 1960s when made in edge detection, shape recognition,
researchers began exploring pa ern and early a empts at three-dimensional (3D)
recognition and image analysis. Early effo s vision. However, limited computational power
focused on simple tasks like character hindered progress.
recognition.

• The 1990s witnessed an increased focus on • The 2000s saw a surge in computer vision applications,
robust algorithms for image understanding, driven by improved hardware capabilities and machine
feature extraction, and object recognition. learning techniques. Notable breakthroughs include the
Applications extended to medical imaging, development of Convolutional Neural Networks (CNNs)
surveillance, and industrial automation. for image classification and object detection.
Concept of Computer
Vision
• Computer vision involves teaching machines to interpret and make decisions
based on visual data. The concept encompasses various tasks, including:

 Image acquisition  Segmentation

 Preprocessing  3D Reconstruction

 Feature extraction  Motion analysis

 Image recognition  Machine learning Integration

 Object detection  Applications

Convolutional Neural Networks
(CNNs)
Definition
• Convolutional Neural Networks (CNNs or ConvNets) are a class of deep
neural networks designed for processing and analyzing visual data, making
them especially effective for tasks such as image recognition and
computer vision. CNNs are characterized by their unique architecture,
which includes convolutional layers, pooling layers, and fully connected
layers.
Key Concept for CNNs

Convolutional Layers Activation Functions

Convolutional layers apply filters (also Activation functions, such as ReLU
known as kernels) to input data, (Rectified Linear Unit), introduce non-
enabling the network to learn linearity to the network, allowing it to
hierarchical features such as edges, learn complex relationships and
textures, and pa erns. pa erns.
Pooling Layers Fully Connected Layers
Pooling layers downsample the spatial Fully connected layers connect every
dimensions of the input data, reducing neuron from one layer to every neuron
computational complexity and retaining in the next layer, consolidating learned
impo ant features. Common pooling features for classification or regression
operations include max pooling and tasks.
average pooling.
Key Concept for CNNs

Convolutional Filters Weight Sharing

Filters capture local pa erns in the CNNs leverage weight sharing, where the
input data, effectively learning features same set of parameters (weights and biases)
like edges, corners, and textures. These is used across different regions of the input,
filters are adapted during training to facilitating the detection of similar features in
recognize higher-level features. different pa s of an image.
Striding Padding
Striding controls the step size of the Padding involves adding extra pixels to
filter as it moves across the input data, the input data, preventing information
influencing the spatial dimensions of loss at the borders during convolution.
the output. Striding helps reduce the It ensures that the spatial dimensions
spatial dimensions and computational of the input and output match
appropriately.
load.
Dropout
Dropout is a regularization technique used to
prevent ove i ing by randomly se ing a fraction of
input units to zero during training, reducing reliance
on specific neurons.
Applications
CNNs are widely used in various applications
• Image classification
• Object detection
• Facial recognition
• Image segmentation
• Medical image analysis
• Autonomous vehicles
• Natural language processing (when
combined with recurrent networks)
Geometric Primitives and
Transformation
Definition
 Geometric primitives in computer vision refer to basic shapes and
structures used to represent objects in an image. These primitives serve
as foundational elements for various computer vision tasks, including
object recognition, image analysis, and scene understanding. Common
geometric primitives include points, lines, circles, rectangles, and polygons.
 Geometric transformations involve altering the positions, orientations,
and sizes of geometric primitives within an image. These transformations
play a crucial role in tasks such as image registration, object tracking, and
image manipulation.
Key Concept

Points Circles
Lorem ipsum dolor sit amet, consectetuer Defined by a center point and a radius, circles are
adipiscing elit. Aenean commodo ligula eget used to model rounded objects or features in
dolor. Represented by coordinates (x, y), points images.
serve as the fundamental building blocks for
constructing more complex geometric shapes.

Lines Rectangles
Lorem ipsum dolor sit amet, consectetuer Composed of four points or defined by a
adipiscing elit. Aenean commodo ligula eget center, width, and height, rectangles are
dolor. Defined by two points or a point and a used to represent objects with rectangular
direction vector, lines are essential for shapes.
representing edges and contours in images.
Polygons
Composed of a sequence of connected points, polygons are
versatile geometric primitives used to represent complex
shapes with multiple sides.
Key Concept

Translation Scaling
Shi s the position of geometric primitives Enlarges or shrinks geometric primitives based on
horizontally and ve ically. a scaling factor.

Rotation Affine Transformation

Rotates geometric primitives around a Combines translation, rotation, scaling, and
specified point or axis. shearing to pe orm a more generalized
transformation.

Shearing
Disto s geometric primitives by shi ing one axis relative to the
other.
Photometric Image Formation

Definition
Photometric image formation in computer vision refers to the process by which a digital
image is created based on the interaction of light with a scene and the subsequent
capture of this light by an imaging device, such as a camera. This process involves various
factors related to illumination, reflection, and the characteristics of the imaging system.
Key Concept

Illumination Reflection Su ace Prope ies Shading Models

Illumination represents the Reflection describes how The material prope ies of Shading models are
incident light on a scene, and su aces in a scene interact su aces, such as diffuse and mathematical representations
it plays a crucial role in image with incident light. Different specular reflectance, affect that simulate how light and
formation. The intensity, materials exhibit varying how light interacts with them. shadows interact with su aces.
direction, and color of light reflectance prope ies, Diffuse reflection is Common models include
impact how objects in the influencing how much light responsible for the sca ered Lambe ian reflectance for diffuse
scene are captured. they absorb or reflect. appearance, while specular su aces and Phong or Blinn-
reflection creates highlights. Phong models for specular
su aces.
Key Concept

Ambient, Diffuse & Light Source Camera Characteristics

Specular Components The position, type, and
characteristics of light
The characteristics of the
imaging device, such as the
In computer graphics and sources in a scene impact camera, influence image
vision, the interaction of light how objects are illuminated formation. This includes
is o en decomposed into and, consequently, how they factors like exposure time,
ambient, diffuse, and appear in the captured ape ure size, and sensor
specular components. This image. sensitivity.
decomposition aids in
simulating realistic lighting
conditions.
Digital Camera

Definition
A digital camera in computer vision refers to an
electronic imaging device that captures visual
information in the form of digital images. It plays a
fundamental role in computer vision applications by
providing a means to acquire, process, and analyze
visual data for various tasks such as image recognition,
object detection, and scene understanding.
Key Concept
The image sensor
conve s light into
electrical signals, forming
the basis for digital image
creation. Common types
Resolution The lens system focuses
include CMOS light onto the image
(Complementary Metal- The resolution of a digital sensor. The choice of
Oxide-Semiconductor) and camera is the number of lenses influences factors
CCD (Charge-Coupled pixels it can capture. like focal length, ape ure,
Device) sensors. Higher resolution and depth of field.
contributes to finer
details in images.
Image censor Lens System
Key Concept

Shu er Speed
Digital cameras capture
color information using & Exposure White balance
RGB (Red, Green, Blue) adjustments ensure
channels. Color Shu er speed and accurate color
representation is crucial exposure se ings reproduction under
for various computer vision determine how long the different lighting
tasks. camera's shu er remains conditions.
open. They impact the
amount of light reaching
Colour the sensor and influence White Balance
Representation image quality.
Key Concept

In-built image processing

capabilities enhance
Frame Rate Autofocus systems and
images and may include focus modes contribute
features like noise The frame rate indicates to capturing sharp and
reduction, sharpness how many images per clear images by adjusting
adjustment, and color second the camera can the focus based on the
correction. capture. It is crucial for scene.
applications like video
analysis.
Image Auto Focus
Processing
Point Operators

Definition
• Point operators in computer vision refer to a class of image processing
operations where each pixel in an image is independently transformed
based on a predefined mathematical function. These operations are
applied individually to every pixel, o en without considering the
surrounding pixels, and are fundamental for enhancing or modifying image
characteristics.
Key Concept

Pixel Transformation Gamma Correction

Point operators involve Gamma correction is a point
transforming the intensity or color operation used to adjust the
of each pixel independently based overall brightness of an image,
on a specific mathematical rule or pa icularly in cases where the
function display device's characteristics
need to be taken into account.
Brightness & Contrast Thresholding
Adjustment Thresholding is a point operation
Common point operators include that conve s an image into a
operations for adjusting the binary form by se ing pixels above
brightness and contrast of an or below a ce ain threshold to
image. These operations scale or specific values.
shi pixel values to achieve the
desired visual effect.
Key Concept
Logarithmic & Exponential
Histogram Equalization Transformation
Histogram equalization is a point Logarithmic and exponential
operation that enhances the transformations are point
contrast of an image by operations used for enhancing
redistributing pixel values to cover details in ce ain intensity ranges.
the entire intensity range.
Negative Transformation Bit-Plane Slicing
A simple point operation involves
obtaining the negative of an Bit-plane slicing involves extracting
image, where pixel values are specific bits from the binary
inve ed. representation of pixel values,
allowing for detailed analysis or
modification.

Computer Graphics Quantum-1
No ratings yet
Computer Graphics Quantum-1
246 pages
Mnist Handwritten Digit Classification
No ratings yet
Mnist Handwritten Digit Classification
26 pages
Tamil Handwritten Character Recognition: BY V.Meenalochini K.Dharshini
No ratings yet
Tamil Handwritten Character Recognition: BY V.Meenalochini K.Dharshini
31 pages
Quarter 2 - Module 1: Technology-Based Arts (Computer/Digital Arts)
82% (17)
Quarter 2 - Module 1: Technology-Based Arts (Computer/Digital Arts)
28 pages
Black Etc. Color Derives From The Spectrum of Light (Distribution of Light Energy Versus
No ratings yet
Black Etc. Color Derives From The Spectrum of Light (Distribution of Light Energy Versus
11 pages
Computer Vision
No ratings yet
Computer Vision
22 pages
Final
No ratings yet
Final
30 pages
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES (WWW - Jntumaterials.co - In)
No ratings yet
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES (WWW - Jntumaterials.co - In)
26 pages
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
No ratings yet
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
26 pages
visualProcessing
No ratings yet
visualProcessing
25 pages
Unit 2 DLT
No ratings yet
Unit 2 DLT
8 pages
DL Unit-4
No ratings yet
DL Unit-4
26 pages
UNIT_IV_DL
No ratings yet
UNIT_IV_DL
26 pages
DIGI-Net: A Deep Convolutional Neural Network For Multi-Format Digit Recognition
No ratings yet
DIGI-Net: A Deep Convolutional Neural Network For Multi-Format Digit Recognition
11 pages
Evaluation of Distance Measures For Feature Based Image Registration Using Alexnet
No ratings yet
Evaluation of Distance Measures For Feature Based Image Registration Using Alexnet
7 pages
Computer Vision Mid
No ratings yet
Computer Vision Mid
2 pages
UNIT -4 DL
No ratings yet
UNIT -4 DL
19 pages
Syllabus
No ratings yet
Syllabus
15 pages
4.convolutional Neural Networks (CNNS)
No ratings yet
4.convolutional Neural Networks (CNNS)
1 page
Different Ann Algorithms
No ratings yet
Different Ann Algorithms
9 pages
IA 3 Must Study Merged
No ratings yet
IA 3 Must Study Merged
69 pages
Deep Learning Module-04 Search Creators
No ratings yet
Deep Learning Module-04 Search Creators
17 pages
Research on Learning Representations in Computer Vision
No ratings yet
Research on Learning Representations in Computer Vision
52 pages
Recognition and Detection of Language On Inscriptions: Dr. C Parthasarathy, R.Sarvanan, M Sathish, U.Sai Sri Teja
No ratings yet
Recognition and Detection of Language On Inscriptions: Dr. C Parthasarathy, R.Sarvanan, M Sathish, U.Sai Sri Teja
3 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
[IJCST-V13I1P1]:Puneet Kaur, Taqdir, Sahezpreet Singh
No ratings yet
[IJCST-V13I1P1]:Puneet Kaur, Taqdir, Sahezpreet Singh
5 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
Aiml Neural Net
No ratings yet
Aiml Neural Net
19 pages
Modelling and Designing of CNN For Feature Abstraction.: Presented By:-Krusha Sandip Joshi
No ratings yet
Modelling and Designing of CNN For Feature Abstraction.: Presented By:-Krusha Sandip Joshi
13 pages
Computer Vision Experiential Learning Report
No ratings yet
Computer Vision Experiential Learning Report
20 pages
Obstacle Detection For Visually Impaired
No ratings yet
Obstacle Detection For Visually Impaired
4 pages
unit-3-CNN-2024
No ratings yet
unit-3-CNN-2024
58 pages
A Review On Image Feature Detection and Description
No ratings yet
A Review On Image Feature Detection and Description
4 pages
Deep Learning
No ratings yet
Deep Learning
9 pages
PEC CS 802C Deep Learning
No ratings yet
PEC CS 802C Deep Learning
13 pages
Convolution in Machine Learning
No ratings yet
Convolution in Machine Learning
2 pages
Name Reel Abdelsamad Hassan
No ratings yet
Name Reel Abdelsamad Hassan
2 pages
Image Classification Using Small Convolutional Neural Network
No ratings yet
Image Classification Using Small Convolutional Neural Network
5 pages
(IJCST-V10I5P12) :mrs J Sarada, P Priya Bharathi
No ratings yet
(IJCST-V10I5P12) :mrs J Sarada, P Priya Bharathi
6 pages
Project Detecto!: A Real-Time Object Detection Model
No ratings yet
Project Detecto!: A Real-Time Object Detection Model
3 pages
Automatic Facial Expression Recognition Based On A Deep Convolutional-Neural-Network Structure
No ratings yet
Automatic Facial Expression Recognition Based On A Deep Convolutional-Neural-Network Structure
6 pages
Vehicle_Detection_and_Tracking_Techniques_Based_on
No ratings yet
Vehicle_Detection_and_Tracking_Techniques_Based_on
5 pages
sunum
No ratings yet
sunum
35 pages
DLT Unit - 4
No ratings yet
DLT Unit - 4
36 pages
Department of Information Science and Engineering Technical Seminar (18Css84) Convolutional Neural Networks
No ratings yet
Department of Information Science and Engineering Technical Seminar (18Css84) Convolutional Neural Networks
15 pages
Reviewer - Convolutional Neural Networks (CNNs) - Muqaddas Bin Tahir
No ratings yet
Reviewer - Convolutional Neural Networks (CNNs) - Muqaddas Bin Tahir
8 pages
Research Acv
No ratings yet
Research Acv
6 pages
UNIT - 2
No ratings yet
UNIT - 2
31 pages
DL_EXP-4_16010422230
No ratings yet
DL_EXP-4_16010422230
4 pages
Image Processing
No ratings yet
Image Processing
6 pages
NCRTTC P154 PDF
No ratings yet
NCRTTC P154 PDF
14 pages
Oct2022 CSC649 SupervisedDL - CNN
No ratings yet
Oct2022 CSC649 SupervisedDL - CNN
79 pages
Research Paper Hand Gesture Recognition
No ratings yet
Research Paper Hand Gesture Recognition
4 pages
Semantic_Segmentation_With_Attention_Mechanism_for
No ratings yet
Semantic_Segmentation_With_Attention_Mechanism_for
13 pages
3- Feature extraction and images classification - Part 3
No ratings yet
3- Feature extraction and images classification - Part 3
29 pages
Chapter 1 [CV & IP]
No ratings yet
Chapter 1 [CV & IP]
41 pages
CV 4
No ratings yet
CV 4
8 pages
Deep Learning Module-04
No ratings yet
Deep Learning Module-04
17 pages
249-254Tesma601IJEAST
No ratings yet
249-254Tesma601IJEAST
7 pages
Unleashing-the-Power-of-Convolutional-Neural-Networks
No ratings yet
Unleashing-the-Power-of-Convolutional-Neural-Networks
13 pages
Pyramid Image Processing: Exploring the Depths of Visual Analysis
From Everand
Pyramid Image Processing: Exploring the Depths of Visual Analysis
Fouad Sabry
No ratings yet
Image Segmentation: Unlocking Insights through Pixel Precision
From Everand
Image Segmentation: Unlocking Insights through Pixel Precision
Fouad Sabry
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
TFT Proto Manual v200
No ratings yet
TFT Proto Manual v200
4 pages
Open Ended Problem: Computer Aided Design (CAD)
No ratings yet
Open Ended Problem: Computer Aided Design (CAD)
5 pages
Microsoft PPT On Bilateral Filtering
100% (1)
Microsoft PPT On Bilateral Filtering
30 pages
I Plugins
No ratings yet
I Plugins
6 pages
Doctor's Details-Indian Society of Vascular & Interventional Radiology (ISVIR)
No ratings yet
Doctor's Details-Indian Society of Vascular & Interventional Radiology (ISVIR)
22 pages
Advanced Editing Module1-7
No ratings yet
Advanced Editing Module1-7
31 pages
NVIDIA 191.07 - WinXP - Desktop - Release - Notes
No ratings yet
NVIDIA 191.07 - WinXP - Desktop - Release - Notes
54 pages
Kartik Internship Report
No ratings yet
Kartik Internship Report
12 pages
Simplify Your Palette: BS VR VB GB
No ratings yet
Simplify Your Palette: BS VR VB GB
14 pages
1-Creating New Ship Drawings
100% (1)
1-Creating New Ship Drawings
7 pages
Palletet2: Color 1 Color 2 Color 3 Color 4 Color 5
No ratings yet
Palletet2: Color 1 Color 2 Color 3 Color 4 Color 5
4 pages
PRACTICAL EXERCISES AR VR - EX 3
No ratings yet
PRACTICAL EXERCISES AR VR - EX 3
7 pages
Image Processing
No ratings yet
Image Processing
13 pages
13 Interpolative Shading
No ratings yet
13 Interpolative Shading
12 pages
11.EdgeDetection
No ratings yet
11.EdgeDetection
35 pages
HDR+ Talk (SIGGRAPH Asia 2016)
No ratings yet
HDR+ Talk (SIGGRAPH Asia 2016)
33 pages
00 MultiLit PDF
No ratings yet
00 MultiLit PDF
34 pages
03 Laboratory Exercise 1
No ratings yet
03 Laboratory Exercise 1
4 pages
Business Card Design in CorelDraw
No ratings yet
Business Card Design in CorelDraw
37 pages
Read First Visa Secure21
No ratings yet
Read First Visa Secure21
1 page
Real Estate Company Profile Presentation Brown Variant
No ratings yet
Real Estate Company Profile Presentation Brown Variant
26 pages
Artificial Intelligence - 14 - Data Visualization With Python
No ratings yet
Artificial Intelligence - 14 - Data Visualization With Python
58 pages
Histogram Processing
No ratings yet
Histogram Processing
17 pages
Sculpt in Blender
No ratings yet
Sculpt in Blender
9 pages
PITHAGI2012-072.Software Development For Interactive 2D Gravity and Magnetic Forward Modeling, Ca
No ratings yet
PITHAGI2012-072.Software Development For Interactive 2D Gravity and Magnetic Forward Modeling, Ca
4 pages
LECTURE NOTES On Computer Graphics and Multimedia: Dr. Ankur Pachauri MCA Department RATM, Mathura
100% (1)
LECTURE NOTES On Computer Graphics and Multimedia: Dr. Ankur Pachauri MCA Department RATM, Mathura
59 pages
Ip Unit 4 One Shot
No ratings yet
Ip Unit 4 One Shot
20 pages

Computer

Uploaded by

Computer

Uploaded by

NANDHA ENGINEERING COLLEGE

UNIT - 1 INT RODUC T ION TO IM AGE

 Image acquisition  Segmentation

 Feature extraction  Motion analysis

 Image recognition  Machine learning Integration

 Object detection  Applications

Convolutional Layers Activation Functions

Convolutional Filters Weight Sharing

Rotation Affine Transformation

Illumination Reflection Su ace Prope ies Shading Models

Ambient, Diffuse & Light Source Camera Characteristics

In-built image processing

Pixel Transformation Gamma Correction

You might also like