Lecture-21-Semantic-Segmentation

The document discusses semantic segmentation, which involves assigning labels to each pixel in an image for pixel-level analysis. It highlights common datasets like PASCAL VOC and MSCOCO, and applications including autonomous navigation, medical diagnosis, and image editing. The lecture also covers techniques before deep learning, the use of Fully Convolutional Networks (FCN), and various architectures such as SegNet and U-Net for semantic segmentation tasks.

Uploaded by

muneebke

Semantic Segmentation

Lecture 17
21

11/30/2021 CAP5415 - Lecture 17 2

Semantic Segmentation
Assign a label to each pixel in an image:
• Pixel-level image annotation/analysis (vs. object-level analysis)

Common datasets: PASCAL VOC (2012) and MSCOCO

Semantic Segmentation
▪ A key part of Scene Understanding
▪ Applications
  ▪ Autonomous navigation
  ▪ Assisting the partially sighted
  ▪ Medical diagnosis
  ▪ Image editing

(Clockwise from top) [1] Cityscapes Dataset. [2] ISBI Challenge 2015, dental x-ray images. [3] Royal National Institute of Blind People
Segmentation tasks
A sample image from the PASCAL VOC2011 dataset

Panels: Original (input) Image | Instance (object) Segmentation | Semantic (class) Segmentation

Question: How about the frame, the fruit (tomato?) and the papers on the table?
Image Source: http://host.robots.ox.ac.uk/pascal/VOC/voc2012/segexamples/index.html
Before deep learning
Many techniques, such as:
• TextonBoost
• J. Shotton, J. Winn, C. Rother, and A. Criminisi,
TextonBoost: Joint Appearance, Shape And Context Modeling For Multi-class Object
Recognition And Segmentation, ECCV 2006.

• TextonForest
• J. Shotton, M. Johnson, and R. Cipolla, Semantic Texton Forests for Image Categorization and Segmentation, CVPR 2008.

• Conditional Random Field (CRF) based approaches:

• SuperParsing (J. Tighe and S. Lazebnik, SuperParsing: Scalable Nonparametric Image
Parsing with Superpixels, ECCV 2010)

Fully Convolutional Networks (FCN)

J. Long, E. Shelhamer, and T. Darrell, Fully Convolutional Networks for Semantic Segmentation, CVPR 2015

Fully “CONVOLUTIONAL” Networks (FCN)

• Reuse networks pre-trained for classification (VGG, AlexNet, etc.) for segmentation!

• Re-interpret the fully-connected layers as convolutional layers.

• Use skip connections (the skip-layer concept) to improve segmentation accuracy.

Fully Convolutional Networks (FCN)

Interpret the FC layers as conv layers.

J. Long, E. Shelhamer, and T. Darrell, Fully Convolutional Networks for Semantic Segmentation, CVPR 2015
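
The FC-to-conv reinterpretation can be checked numerically. The sketch below is a minimal pure-Python illustration, not the FCN authors' code. It assumes a single-channel feature map, and the helper names (`fc_unit`, `correlate_valid`, `fc_weights_to_kernel`) and all values are hypothetical. A fully connected unit over an H × W input is rewritten as one H × W kernel. On an input of the original size it gives the same number, and on a larger input the kernel slides to produce a spatial map of scores.

```python
def fc_unit(feature_map, weights, bias):
    """Fully connected unit: flatten the map, then take a dot product."""
    flat = [v for row in feature_map for v in row]
    return sum(w * v for w, v in zip(weights, flat)) + bias


def correlate_valid(feature_map, kernel, bias):
    """'Valid' sliding-window correlation (what deep learning calls convolution)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(feature_map) - kh + 1
    out_w = len(feature_map[0]) - kw + 1
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            acc = bias
            for a in range(kh):
                for b in range(kw):
                    acc += kernel[a][b] * feature_map[i + a][j + b]
            row.append(acc)
        out.append(row)
    return out


def fc_weights_to_kernel(weights, height, width):
    """Reshape the flat FC weight vector into a height x width kernel."""
    return [weights[r * width:(r + 1) * width] for r in range(height)]


# A 2 x 2 feature map and an FC unit with 4 weights (illustrative values).
small = [[1.0, 2.0], [3.0, 4.0]]
fc_w = [0.5, -1.0, 2.0, 0.25]
fc_b = 0.1

kernel = fc_weights_to_kernel(fc_w, 2, 2)
fc_out = fc_unit(small, fc_w, fc_b)
conv_out = correlate_valid(small, kernel, fc_b)  # 1 x 1 output

# The same kernel applied to a larger 3 x 4 map yields a 2 x 3 score map.
large = [[1.0, 2.0, 0.0, 1.0], [3.0, 4.0, 1.0, 0.0], [0.0, 1.0, 2.0, 3.0]]
score_map = correlate_valid(large, kernel, fc_b)
```

Because the converted layer accepts inputs of any size, the network can emit a coarse score map for a whole image in one pass instead of classifying one fixed-size crop at a time.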
FCN

(21 is the number of classes here: the 20 PASCAL VOC object classes plus background.)
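
To make the output concrete: with 21 classes the network produces 21 score maps, and the predicted label at each pixel is the class with the highest score. A minimal sketch follows, using 3 hypothetical classes instead of 21 to keep the numbers readable. The function name and the scores are made up for illustration.

```python
def label_map_from_scores(scores):
    """scores[c][y][x] is the score of class c at pixel (y, x).

    Returns a height x width map holding the arg-max class per pixel.
    """
    num_classes = len(scores)
    height, width = len(scores[0]), len(scores[0][0])
    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            best = max(range(num_classes), key=lambda c: scores[c][y][x])
            row.append(best)
        labels.append(row)
    return labels


# Three classes (0 = background, 1 and 2 = objects) on a 2 x 3 image.
scores = [
    [[0.9, 0.2, 0.1], [0.8, 0.1, 0.3]],   # class 0
    [[0.05, 0.7, 0.2], [0.1, 0.6, 0.3]],  # class 1
    [[0.05, 0.1, 0.7], [0.1, 0.3, 0.4]],  # class 2
]
labels = label_map_from_scores(scores)
```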
FCN

Panels: no skip connection | 1 skip connection | 2 skip connections

J. Long, E. Shelhamer, and T. Darrell, Fully Convolutional Networks for Semantic Segmentation, CVPR 2015
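
One way to picture a skip connection: the coarse score map from the deepest layer is upsampled 2× and added element-wise to a score map predicted from a shallower, higher-resolution layer. The sketch below rests on simplifying assumptions. It uses nearest-neighbour upsampling for brevity (the FCN paper describes upsampling layers initialised to bilinear interpolation), and the function names and score values are invented.

```python
def upsample_nearest_2x(score_map):
    """Repeat every value in a 2 x 2 block (nearest-neighbour upsampling)."""
    out = []
    for row in score_map:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def fuse_skip(coarse, fine):
    """Upsample the coarse scores 2x and add the finer-resolution scores."""
    up = upsample_nearest_2x(coarse)
    return [[u + f for u, f in zip(up_row, fine_row)]
            for up_row, fine_row in zip(up, fine)]


coarse = [[1.0, 2.0]]                 # 1 x 2 scores from the deepest layer
fine = [[0.1, 0.2, 0.3, 0.4],         # 2 x 4 scores from a shallower layer
        [0.5, 0.6, 0.7, 0.8]]
fused = fuse_skip(coarse, fine)
```

Repeating the fusion with an even shallower layer gives the 2-skip variant. Each added skip brings in finer spatial detail.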
Upsampling

Upsampling
Bi-linear interpolation
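
A minimal sketch of bilinear upsampling on a single-channel map follows. It assumes the corner-aligned convention (output corners coincide with input corners). Libraries offer other conventions, so the exact values can differ. The function name is hypothetical.

```python
def bilinear_upsample(image, out_h, out_w):
    """Resize a 2-D list with bilinear interpolation.

    Output corners are aligned with input corners; out_h and out_w must be >= 2.
    """
    in_h, in_w = len(image), len(image[0])
    scale_y = (in_h - 1) / (out_h - 1)
    scale_x = (in_w - 1) / (out_w - 1)
    out = []
    for i in range(out_h):
        y = i * scale_y
        y0 = min(int(y), in_h - 1)
        y1 = min(y0 + 1, in_h - 1)
        wy = y - y0
        row = []
        for j in range(out_w):
            x = j * scale_x
            x0 = min(int(x), in_w - 1)
            x1 = min(x0 + 1, in_w - 1)
            wx = x - x0
            # Interpolate along x on the two neighbouring rows, then along y.
            top = (1 - wx) * image[y0][x0] + wx * image[y0][x1]
            bottom = (1 - wx) * image[y1][x0] + wx * image[y1][x1]
            row.append((1 - wy) * top + wy * bottom)
        out.append(row)
    return out


small = [[0.0, 2.0], [4.0, 6.0]]
upsampled = bilinear_upsample(small, 3, 3)
# upsampled == [[0.0, 1.0, 2.0], [2.0, 3.0, 4.0], [4.0, 5.0, 6.0]]
```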

Upsampling

Deconvolution
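
Deconvolution in this context is more precisely called transposed convolution: every input value scatters a scaled copy of the kernel into the output, and overlapping copies are summed. A minimal single-channel sketch with no padding follows. The function name and values are illustrative, and in a network the kernel weights would be learned.

```python
def transposed_conv2d(x, kernel, stride):
    """Single-channel transposed convolution with no padding."""
    in_h, in_w = len(x), len(x[0])
    kh, kw = len(kernel), len(kernel[0])
    out_h = (in_h - 1) * stride + kh
    out_w = (in_w - 1) * stride + kw
    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(in_h):
        for j in range(in_w):
            # Scatter a copy of the kernel, scaled by x[i][j], into the output.
            for a in range(kh):
                for b in range(kw):
                    out[i * stride + a][j * stride + b] += x[i][j] * kernel[a][b]
    return out


x = [[1.0, 2.0], [3.0, 4.0]]
ones = [[1.0, 1.0], [1.0, 1.0]]

no_overlap = transposed_conv2d(x, ones, stride=2)  # 4 x 4 output
overlap = transposed_conv2d(x, ones, stride=1)     # 3 x 3 output
# overlap == [[1.0, 3.0, 2.0], [4.0, 10.0, 6.0], [3.0, 7.0, 4.0]]
```

With stride 2 and a 2 × 2 kernel the copies do not overlap, so each input value simply fills its own 2 × 2 block. With stride 1 the copies overlap and their contributions add up.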

Deconvolution Network for Semantic Segmentation

H. Noh, S. Hong, and B. Han, Learning Deconvolution Network for Semantic Segmentation, ICCV 2015
Panels (left to right, top to bottom):
• Input image
• 14 × 14 deconvolutional layer
• 28 × 28 unpooling layer
• 28 × 28 deconvolutional layer
• 56 × 56 unpooling layer
• 56 × 56 deconvolutional layer
• 112 × 112 unpooling layer
• 112 × 112 deconvolutional layer
• 224 × 224 unpooling layer
• 224 × 224 deconvolutional layer
Image source: H. Noh, S. Hong, and B. Han, Learning Deconvolution Network for Semantic Segmentation, ICCV 2015
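
The unpooling layers listed above reverse max pooling by reusing the locations of the maxima recorded during pooling (often called switches). A minimal sketch under simplifying assumptions: single channel, 2 × 2 pooling with stride 2, even input sizes, and hypothetical function names.

```python
def max_pool_2x2(x):
    """2 x 2 max pooling with stride 2; also returns the arg-max positions."""
    pooled, switches = [], []
    for i in range(0, len(x), 2):
        pooled_row, switch_row = [], []
        for j in range(0, len(x[0]), 2):
            window = [(x[i + a][j + b], (i + a, j + b))
                      for a in range(2) for b in range(2)]
            value, position = max(window, key=lambda item: item[0])
            pooled_row.append(value)
            switch_row.append(position)
        pooled.append(pooled_row)
        switches.append(switch_row)
    return pooled, switches


def unpool_2x2(pooled, switches, out_h, out_w):
    """Place each pooled value back at its recorded position; zeros elsewhere."""
    out = [[0.0] * out_w for _ in range(out_h)]
    for pooled_row, switch_row in zip(pooled, switches):
        for value, (y, x) in zip(pooled_row, switch_row):
            out[y][x] = value
    return out


x = [[1.0, 5.0, 2.0, 0.0],
     [3.0, 4.0, 1.0, 6.0],
     [7.0, 0.0, 2.0, 2.5],
     [1.0, 2.0, 9.0, 3.0]]
pooled, switches = max_pool_2x2(x)
restored = unpool_2x2(pooled, switches, 4, 4)
```

The unpooled map is sparse: only the recorded positions are non-zero. The deconvolutional layer that follows each unpooling layer densifies it again.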
Learned upsampling architectures

Figure source
SegNet

• Uses the VGG architecture!
• No FC layer!

Image source: http://mi.eng.cam.ac.uk/projects/segnet/
V. Badrinarayanan, et al., SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling, 2015
U-Net

Source: O. Ronneberger, P. Fischer, and T. Brox, U-Net: Convolutional Networks for Biomedical Image Segmentation, MICCAI 2015
Questions?

Sources for this lecture include materials from works by Sedat Ozer, Ulas
Bagci, and Svetlana Lazebnik
