Group 17 Computer Vision @Lcd-1
Group 17 Computer Vision @Lcd-1
VISION
TRANSFORMING THE WAY
MACHINES SEE
GROUP 17
GROUP MEMBERS
1) DIVYANSH GANDHI
(00911503123)
2) PIYUSH NEGI (00611503123)
3) UTSAV GUPTA (02111503123)
CONTENT OVERVIEW
Introduction to Computer Vision
Contemporary Issues
BY DIVYANSH GANDHI
PEEK INTO THE FIELD OF VISUAL
PERCEPTION FOR COMPUTERS
Analysis:
ffExtracting features from the processed
ffimages, such as edges, textures,
ffshapes, or specific patterns.
Why is Computer Vision Important?
Automation of Repetitive Complex Visual Tasks
Computer vision automates tasks that would otherwise
require human vision and intervention, saving time and
reducing the chance of human error.
• Machine learning, particularly deep learning, fuels the core of modern computer vision.
• Deep learning models, especially Convolutional Neural Networks (CNNs), excel at extracting complex patterns from images.
• CNNs employ layers that learn to identify and process various features like edges, shapes, and textures, culminating in an understanding of the
image's content.
Specialized Algorithms
• Object detection algorithms like YOLO and Faster R-CNN pinpoint objects within an image, determining their location and category.
• Image segmentation algorithms, such as U-Net and Mask R-CNN, divide an image into segments, highlighting distinct regions of interest.
segmentation, are crucial for teaching models to recognize and interpret visual information.
Bringing Computer Vision to Life: Building a Model
To build a functional computer vision Next, select a suitable model based on Finally, evaluating model performance
model, you'll need the right tools and your specific task. Consider factors is crucial. Metrics like accuracy,
techniques. Start by setting up your like the complexity of the task, the size precision, recall, and F1 score provide
environment. Popular choices include of your dataset, and the desired insights into how well your model
Python as a programming language, accuracy. After selecting a model, performs. These metrics help you
TensorFlow or PyTorch as deep you'll need to train it using a carefully understand the model's ability to
learning frameworks, and a GPU for curated and preprocessed dataset. classify, identify, and segment objects
Sports Detecting
Analytics Events
Device
Facial Unlocking
Recognition
This technology identifies and
verifies individuals by
analyzing facial features. It is
widely used in security
systems, unlocking devices,
and in social media for Filters and
tagging friends in photos. Masks in
Algorithms analyze unique social
characteristics such as the
distance between eyes, nose media
shape, and jawline.
Autonomous
Vehicles
Computer vision plays a
crucial role in the operation
of self-driving cars. It
enables vehicles to detect
and recognize objects (like
pedestrians, other vehicles,
and traffic signs),
understand their
surroundings, and make
real-time decisions to
navigate safely.
Medical
Imaging
In healthcare, computer
vision algorithms analyze
medical images (e.g., X-
rays, MRIs, and CT scans) to
assist in diagnosing
diseases. These systems can
detect anomalies such as
tumors or fractures with
high accuracy, helping
radiologists make informed
decisions.
Sports
Analytics
Computer vision in sports HAWK-EYE TECHNOLOGY IN SPORTS
analyzes player movements,
strategies, and game
dynamics. It helps coaches
assess performance,
develop training programs,
and enhance strategies by
providing detailed data on
player statistics and game
situations.
Pokemon Go
game
Augmented
Reality (AR)
AR applications use
computer vision to overlay
digital information onto the
real world. This technology
is popular in mobile games AR Cricket game
(like Pokemon GO) and
training simulations,
providing interactive
experiences by recognizing
real-world objects and
environments.
Fashion and Visual
Search
Style features
from fashion
Recognition apps
Object
Introductio
Image Image Detection
n to Advanced Challenges
Processing Classificati &
Computer Topics & Ethics
Basics on Segmentati
Vision
on
3D vision,
Defined Preprocessing:
Traditional Techniques: depth Data bias,
computer filtering, edge
methods to YOLO, Mask R- estimation interpretability,
vision and its detection,
CNNs. CNN. (e.g., LiDAR). privacy issues.
evolution. normalization.
Key
applications: Generative
Uses: security,
facial Role of transfer models (GANs)
medical
recognition, learning. for image
imaging.
autonomous creation.
vehicles.
THANKYOU !