Computer Vision QnA
Computer Vision QnA
All parts of question-1 are compulsory. However, internal choices are given. Each part carries ONE
[3X1=3]
Q1.
- The depth of field (DoF) refers to the range of distances within a scene that appear acceptably
sharp in an image. It depends on factors like aperture size, focal length, and distance between the
camera and the subject. A smaller aperture increases the depth of field, making more of the scene
appear sharp.
(b) How does the size of the pinhole affect the image quality?
- The size of the pinhole in a pinhole camera affects image sharpness and brightness.
- A smaller pinhole improves sharpness by reducing light scattering but may make the image
dimmer.
- A larger pinhole allows more light, making the image brighter but less sharp due to increased
diffraction.
OR
(b) Consider a 3-bit gray scale image with dimensions 32 × 32. What will be the range of values in
its X-axis?
- For a 3-bit grayscale image, pixel intensity values range from 0 to 7 (since 2^3 = 8).
- The X-axis in a 32 × 32 image will span 0 to 31, corresponding to the column indices.
1. Object detection: Used in autonomous vehicles to identify objects like pedestrians and traffic
signs.
3. Medical imaging: Enhances diagnostic accuracy by analyzing X-rays, MRIs, and CT scans.
---
Section - B
All questions are compulsory. However, internal choices are given. Each question carries THREE
[4 X 3 =12]
1. Translation: Shifts an image by moving every pixel a fixed distance in a specified direction.
- Represented by:
[x'] = [x + tx]
[y'] [y + ty]
2. Rotation: Rotates an image around a point (usually the origin) by a specified angle (theta).
- Represented by:
3. Scaling: Changes the size of an image by multiplying pixel coordinates by scaling factors (sx, sy).
- Represented by:
---
|---------------------|--------------------------------------------|-------------------------------------------|
| Effect on Image | Smoothens the image, reducing noise and details. | Enhances edges and fine
details. |
OR
Q3. How are edges detected using the Laplacian of Gaussian (LoG) method?
2. Second Derivative: The Laplacian operator (second derivative) is applied to identify regions of
4. Implementation: LoG combines smoothing and edge detection into a single operation using a
convolution kernel.
---
Q4. Examine how a higher stride value contributes to the downsampling of an image and suggest
Effect of Stride:
- A higher stride reduces the overlap between the receptive fields of a convolution operation,
- While this decreases computational cost, it may result in loss of important spatial details.
Methods to Address:
Image:
[0 1 4 3]
[2 4 3 0]
[0 1 2* 7]
[4 1 5 6]
[-1 0 1]
[-1 0 1]
[-1 0 1]
[ 1 1 1]
[ 0 0 0]
[-1 -1 -1]
Apply kernels and calculate Gx and Gy, then find G = sqrt(Gx^2 + Gy^2).
---
Section - C
Question-6 is compulsory. However, internal choices are given. Each question carries Five marks.
[1X5=5]
1. Edge Detection: Apply an edge detection algorithm (e.g., Canny) to extract edges from the image.
2. Hough Space: Define a parameter space (rho, theta) where lines are represented.
3. Voting: For each edge pixel, calculate rho = x costheta + y sintheta for various theta values and
5. Line Reconstruction: Map the detected peaks back to the image space to draw the lines.
OR
Double Thresholding:
- Pixels with gradient magnitude above Thigh are strong edges, while those between Tlow and
Hysteresis:
- Links weak edges to strong edges if they are connected, ensuring continuity.
- Weak edges not connected to strong edges are suppressed as noise.