0% found this document useful (0 votes)

11 views

Lec01 Intro

This presentation provides an introduction to the field of computer vision. It covers the basic goals of the field, challenges, and its relevance across different applications, from personal photo albums to robotics. It also introduces course requirements and the connection of computer vision to other disciplines

Uploaded by

cap.cafu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

Lec01 Intro

Uploaded by

cap.cafu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 61

COMP 776: Computer Vision

Today
• Introduction to computer vision
• Course overview
• Course requirements
The goal of computer vision
• To bridge the gap between pixels and “meaning”

What we see What a computer sees

Source: S. Narasimhan
What kind of information can we extract
from an image?
• Metric 3D information
• Semantic information
Vision as measurement device

Reconstruction from
Real-time stereo Structure from motion Internet photo collections

NASA Mars Rover

Pollefeys et al. Goesele et al.

Vision as a source of semantic information

slide credit: Fei-Fei, Fergus & Torralba

Object categorization

sky
building

flag

face
banner
wall
street lamp
bus bus

cars slide credit: Fei-Fei, Fergus & Torralba

Scene and context categorization
• outdoor
• city
• traffic
•…

slide credit: Fei-Fei, Fergus & Torralba

Qualitative spatial information

slanted

non-rigid moving
object

vertical

rigid moving rigid moving

object object
horizontal slide credit: Fei-Fei, Fergus & Torralba
Why study computer vision?
• Vision is useful: Images and video are everywhere!

Personal photo albums Movies, news, sports

Surveillance and security Medical and scientific images

Why study computer vision?

• Vision is useful
• Vision is interesting
• Vision is difficult
– Half of primate cerebral cortex is devoted to visual processing
– Achieving human-level visual perception is probably “AI-complete”
Why is computer vision difficult?
Challenges: viewpoint variation

Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba

Challenges: illumination

image credit: J. Koenderink

Challenges: scale

slide credit: Fei-Fei, Fergus & Torralba

Challenges: deformation

Xu, Beihong 1943

slide credit: Fei-Fei, Fergus & Torralba

Challenges: occlusion

Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba

Challenges: background clutter
Challenges: Motion
Challenges: object intra-class
variation

slide credit: Fei-Fei, Fergus & Torralba

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

Challenges or opportunities?
• Images are confusing, but they also reveal the structure of
the world through numerous cues
• Our job is to interpret the cues!

Image source: J. Koenderink

Depth cues: Linear perspective
Depth cues: Aerial perspective
Depth ordering cues: Occlusion

Source: J. Koenderink
Shape cues: Texture gradient
Shape and lighting cues: Shading

Source: J. Koenderink
Position and lighting cues: Cast shadows

Source: J. Koenderink
Grouping cues: Similarity (color, texture,
proximity)
Grouping cues: “Common fate”

Image credit: Arthus-Bertrand (via F. Durand)

Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a particular 2D picture
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a particular 2D picture

• Possible solutions
– Bring in more constraints (more images)
– Use prior knowledge about the structure of the world
• Need a combination of different methods
Connections to other disciplines

Artificial Intelligence

Robotics Machine Learning

Computer Vision

Computer Graphics Cognitive science

Neuroscience

Image Processing
Origins of computer vision

L. G. Roberts, Machine Perception

of Three Dimensional Solids,
Ph.D. thesis, MIT Department of
Electrical Engineering, 1963.
Computer Vision in the Real World
Special effects: shape and motion capture

Source: S. Seitz
3D urban modeling

Bing maps, Google Streetview

Source: S. Seitz
3D urban modeling: Microsoft Photosynth

http://labs.live.com/photosynth/ Source: S. Seitz

Face detection

Many new digital cameras now detect faces

• Canon, Sony, Fuji, …

Source: S. Seitz
Smile detection

Sony Cyber-shot® T70 Digital Still Camera Source: S. Seitz

Face recognition: Apple iPhoto software

http://www.apple.com/ilife/iphoto/
Biometrics

How the Afghan Girl was Identified by Her Iris

Patterns

Source: S. Seitz
Biometrics

Face recognition systems now

Fingerprint scanners on
beginning to appear more widely
many new laptops, http://www.sensiblevision.com/
other devices

Source: S. Seitz
Optical character recognition (OCR)
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR software

Digit recognition, AT&T labs License plate readers

http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Source: S. Seitz
Mobile visual search: Google Goggles
Mobile visual search: iPhone Apps
Automotive safety

Mobileye: Vision systems in high-end BMW, GM, Volvo models

• “In mid 2010 Mobileye will launch a world's first application of full
emergency braking for collision mitigation for pedestrians where
vision is the key technology for detecting pedestrians.”
Source: A. Shashua, S. Seitz
Vision in supermarkets

LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching for items.
When an item is detected and recognized, the cashier verifies the quantity of items that
were found under the basket, and continues to close the transaction. The item can remain
under the basket, and with LaneHawk,you are assured to get paid for it… “
Source: S. Seitz
Vision-based interaction (and games)

Sony EyeToy

Nintendo Wii has camera-based IR

tracking built in. See Lee’s work at
CMU on clever tricks on using it to
create a multi-touch display!

Assistive technologies
Source: S. Seitz
Vision for robotics, space exploration

NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.

Vision systems (JPL) used for several tasks

• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
Source: S. Seitz
The computer vision industry
• A list of companies here:

http://www.cs.ubc.ca/spider/lowe/vision.html
Course overview
I. Early vision: Image formation and processing
II. Mid-level vision: Grouping and fitting
III. Multi-view geometry
IV. Recognition
V. Advanced topics
I. Early vision
• Basic image formation and processing

* =
Linear filtering
Edge detection
Cameras and sensors
Light and color

Feature extraction: corner and blob detection

II. “Mid-level vision”
• Fitting and grouping

Alignment

Fitting: Least squares

Hough transform
RANSAC
III. Multi-view geometry

Stereo Epipolar geometry

Tomasi & Kanade (1993)

Affine structure from motion Projective structure from motion

IV. Recognition

Patch description and matching Clustering and visual vocabularies

Bag-of-features models Classification

Sources: D. Lowe, L. Fei-Fei

V. Advanced Topics
• Time permitting…

Segmentation Face detection

Articulated models Motion and tracking

Basic Info

• Instructor:
Svetlana Lazebnik ([email protected])

• Office hours:
By appointment, FB 244

• Textbooks (suggested):
Forsyth & Ponce, Computer Vision: A Modern Approach
Richard Szeliski, Computer Vision: Algorithms and
Applications (draft available online)

• Class webpage:
http://www.cs.unc.edu/~lazebnik/spring10
Course requirements
• Philosophy: computer vision is best experienced hands-on

• Programming assignments: 50%

– Four assignments
– Expect the first one in the next couple of classes
– Brush up on your MATLAB skills (see web page for tutorial)

• Final assignment: 30%

– Recognition competition
– Winner gets a prize!

• Participation: 20%
– Come to class regularly
– Ask questions
– Answer questions
Collaboration policy
• Feel free to discuss assignments with each other, but coding
must be done individually

• Feel free to incorporate code or tips you find on the Web,

provided this doesn’t make the assignment trivial and you
explicitly acknowledge your sources

• Remember: I can Google too (and I have the copies of

everybody’s assignments from the last two years this class
was offered)
For next time
• Self-study: MATLAB tutorial
• Reading: cameras and image formation (F&P chapter 1)

PHOTOSHOP FOR ARCHITECTURE.
No ratings yet
PHOTOSHOP FOR ARCHITECTURE.
10 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
Lec00 Intro For Web Highlighted
No ratings yet
Lec00 Intro For Web Highlighted
72 pages
00CV Intro Full
No ratings yet
00CV Intro Full
58 pages
1 Intro Visión Artificial
No ratings yet
1 Intro Visión Artificial
50 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Computer Vision Intorduction
No ratings yet
Computer Vision Intorduction
57 pages
CS 143: Introduction To Computer Vision
No ratings yet
CS 143: Introduction To Computer Vision
38 pages
Computer Vision: Linda Shapiro
No ratings yet
Computer Vision: Linda Shapiro
73 pages
1 Vision Lec 1
No ratings yet
1 Vision Lec 1
49 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
Administrivia: CMPSCI 370: Introduction To Computer Vision
No ratings yet
Administrivia: CMPSCI 370: Introduction To Computer Vision
12 pages
Computer Vision
100% (1)
Computer Vision
48 pages
Computer Vision
No ratings yet
Computer Vision
41 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
Computer Vision Introduction
No ratings yet
Computer Vision Introduction
42 pages
Lec 00
No ratings yet
Lec 00
76 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
Introduction to Data Science: (Khoa học dữ liệu)
No ratings yet
Introduction to Data Science: (Khoa học dữ liệu)
91 pages
CV Module 1
No ratings yet
CV Module 1
166 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
What Computer Vision With The OpenCV
100% (5)
What Computer Vision With The OpenCV
137 pages
Overview of Computer Vision: CS491E/791E
No ratings yet
Overview of Computer Vision: CS491E/791E
55 pages
Computer Vision and Artificial Intelligence
No ratings yet
Computer Vision and Artificial Intelligence
55 pages
IT5409 Ch1 Intro New Template
No ratings yet
IT5409 Ch1 Intro New Template
14 pages
T2310 TDS3651 L01 Introduction
No ratings yet
T2310 TDS3651 L01 Introduction
73 pages
02 Feature Extraction & DLCV
No ratings yet
02 Feature Extraction & DLCV
165 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
Introduction To Computer Vision: by James Hays
No ratings yet
Introduction To Computer Vision: by James Hays
32 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
01 Lecture No. 1
No ratings yet
01 Lecture No. 1
52 pages
Week-16 Lecture-32
No ratings yet
Week-16 Lecture-32
65 pages
Book
No ratings yet
Book
2 pages
Computer_vision_part1
No ratings yet
Computer_vision_part1
96 pages
Introduction To Computer Vision
No ratings yet
Introduction To Computer Vision
34 pages
CS 474 Lec 01 Introduction
No ratings yet
CS 474 Lec 01 Introduction
69 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
62 pages
Fundamentals of Computer Vision
No ratings yet
Fundamentals of Computer Vision
30 pages
intro
No ratings yet
intro
66 pages
Lecture1 - Introduction
No ratings yet
Lecture1 - Introduction
35 pages
Computer_Vision_1_introduction
No ratings yet
Computer_Vision_1_introduction
44 pages
Lec01 - Intro To Computer Vision
No ratings yet
Lec01 - Intro To Computer Vision
43 pages
C280 Computer Vision C280, Computer Vision: Prof. Trevor Darrell
No ratings yet
C280 Computer Vision C280, Computer Vision: Prof. Trevor Darrell
68 pages
1 Sirg Bsu - 1
No ratings yet
1 Sirg Bsu - 1
46 pages
Cv Unit 1 Overview of Computer Vison and Application
No ratings yet
Cv Unit 1 Overview of Computer Vison and Application
51 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
01 Introduction
No ratings yet
01 Introduction
62 pages
MODULE-1
No ratings yet
MODULE-1
18 pages
Computer Vision: Evolution and Promise
No ratings yet
Computer Vision: Evolution and Promise
5 pages
Lec1 - Computer Vision - v1
No ratings yet
Lec1 - Computer Vision - v1
38 pages
01 Introduction 2023
No ratings yet
01 Introduction 2023
83 pages
CV-1.1
No ratings yet
CV-1.1
18 pages
Computer Vision
No ratings yet
Computer Vision
7 pages
Computer Vision: Cse 576 Ali Farhadi
No ratings yet
Computer Vision: Cse 576 Ali Farhadi
90 pages
Lecture 01
No ratings yet
Lecture 01
79 pages
CV - Lec01 - Introduction
No ratings yet
CV - Lec01 - Introduction
50 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
From Everand
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
Fouad Sabry
No ratings yet
It Verticals
No ratings yet
It Verticals
106 pages
Harnham - Data & Analytics Recruitment - US Salary Guide 2021
No ratings yet
Harnham - Data & Analytics Recruitment - US Salary Guide 2021
33 pages
Current Affairs JWT Guess Paper 2025 (1)
No ratings yet
Current Affairs JWT Guess Paper 2025 (1)
135 pages
Dip Lab 8 Muhammad Ali 18-Se-16: Example 1
No ratings yet
Dip Lab 8 Muhammad Ali 18-Se-16: Example 1
6 pages
These Graph Match PDF
No ratings yet
These Graph Match PDF
214 pages
A Review of Image Enhancement Techniques for Underwater Images
No ratings yet
A Review of Image Enhancement Techniques for Underwater Images
5 pages
Make Round Stars
No ratings yet
Make Round Stars
14 pages
PhotoshopCAFE Blending Modes Ebook
No ratings yet
PhotoshopCAFE Blending Modes Ebook
13 pages
AI_With_CPP
No ratings yet
AI_With_CPP
119 pages
Basic Operation On Images
No ratings yet
Basic Operation On Images
2 pages
GEO424 Lect15 Unsup and Object Based PDF
No ratings yet
GEO424 Lect15 Unsup and Object Based PDF
23 pages
07 Representation Learning
No ratings yet
07 Representation Learning
11 pages
Competitionacademy: Competition Academy
No ratings yet
Competitionacademy: Competition Academy
7 pages
### Abstract - Image Classification Using Convolut...
No ratings yet
### Abstract - Image Classification Using Convolut...
2 pages
GE' Pre-Owned 1.5T Signa HDXT MRI Scanner 16 Channel (16X)
No ratings yet
GE' Pre-Owned 1.5T Signa HDXT MRI Scanner 16 Channel (16X)
4 pages
Download ebooks file Computer Vision and Machine Intelligence Paradigms for SDGs Select Proceedings of ICRTAC CVMIP 2021 R. Jagadeesh Kannan all chapters
100% (1)
Download ebooks file Computer Vision and Machine Intelligence Paradigms for SDGs Select Proceedings of ICRTAC CVMIP 2021 R. Jagadeesh Kannan all chapters
55 pages
Stecno
No ratings yet
Stecno
2 pages
PDF Download Graphic Designer
No ratings yet
PDF Download Graphic Designer
12 pages
Diploma in Digital Photography Low Res
No ratings yet
Diploma in Digital Photography Low Res
2 pages
Thesis Title For Computer Science Philippines
100% (2)
Thesis Title For Computer Science Philippines
5 pages
Shading Methods
No ratings yet
Shading Methods
28 pages
RGPV Syllabus 6 Sem
No ratings yet
RGPV Syllabus 6 Sem
12 pages
10 1108 - LHT 07 2021 0242
No ratings yet
10 1108 - LHT 07 2021 0242
24 pages
Artificial Intelligence Graduate Program Course Planning
No ratings yet
Artificial Intelligence Graduate Program Course Planning
4 pages
Hand Gesture Recognition
No ratings yet
Hand Gesture Recognition
28 pages
Instant download MATLAB Image Processing Toolbox User s Guide The Mathworks pdf all chapter
100% (4)
Instant download MATLAB Image Processing Toolbox User s Guide The Mathworks pdf all chapter
65 pages
AI 2ND SEM - AI 2ND SEM One of the key technologies underlying _________________ vehicles is deep - Studocu
No ratings yet
AI 2ND SEM - AI 2ND SEM One of the key technologies underlying _________________ vehicles is deep - Studocu
40 pages
Classical Computer Vision - Session 1
No ratings yet
Classical Computer Vision - Session 1
130 pages

Lec01 Intro

Uploaded by

Lec01 Intro

Uploaded by

COMP 776: Computer Vision

What we see What a computer sees

NASA Mars Rover

Pollefeys et al. Goesele et al.

slide credit: Fei-Fei, Fergus & Torralba

cars slide credit: Fei-Fei, Fergus & Torralba

slide credit: Fei-Fei, Fergus & Torralba

rigid moving rigid moving

Personal photo albums Movies, news, sports

Surveillance and security Medical and scientific images

Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba

image credit: J. Koenderink

slide credit: Fei-Fei, Fergus & Torralba

Xu, Beihong 1943

slide credit: Fei-Fei, Fergus & Torralba

Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba

slide credit: Fei-Fei, Fergus & Torralba

slide credit: Fei-Fei, Fergus & Torralba

Image source: J. Koenderink

Image credit: Arthus-Bertrand (via F. Durand)

Robotics Machine Learning

Computer Graphics Cognitive science

L. G. Roberts, Machine Perception

Bing maps, Google Streetview

http://labs.live.com/photosynth/ Source: S. Seitz

Many new digital cameras now detect faces

Sony Cyber-shot® T70 Digital Still Camera Source: S. Seitz

How the Afghan Girl was Identified by Her Iris

Face recognition systems now

Digit recognition, AT&T labs License plate readers

Mobileye: Vision systems in high-end BMW, GM, Volvo models

Nintendo Wii has camera-based IR

Vision systems (JPL) used for several tasks

Feature extraction: corner and blob detection

Fitting: Least squares

Stereo Epipolar geometry

Tomasi & Kanade (1993)

Affine structure from motion Projective structure from motion

Patch description and matching Clustering and visual vocabularies

Bag-of-features models Classification

Sources: D. Lowe, L. Fei-Fei

Segmentation Face detection

Articulated models Motion and tracking

• Programming assignments: 50%

• Final assignment: 30%

• Feel free to incorporate code or tips you find on the Web,

• Remember: I can Google too (and I have the copies of

You might also like