Cats and Dogs: A Summary
This paper addresses the fine-grained image classification problem of classifying 37 breeds of cats and dogs: 12 cat breeds and 25 dog breeds. Classification between breeds of the same species is particularly challenging, since robust models must learn the subtle differences in fur texture and other phenotypic details present in pet images. Most famous computer-vision datasets and classification benchmarks at the time of publication, e.g. ImageNet, contained classes with little visual similarity to one another, making classification on them comparatively easy. Beyond the literature of the time identifying breed classification as a challenging task, one practical application is reporting a pet's breed to its owner, many of whom are misinformed about their pet's breed.
The paper makes the following contributions: 1) it introduces a new cats-and-dogs dataset, the Oxford-IIIT dataset; 2) it provides a model for species-only, or species-plus-breed, classification; and 3) it demonstrates that the model breaks the ASIRRA challenge with considerable accuracy.
The Oxford-IIIT dataset, with 12 cat breeds and 25 dog breeds, is annotated with a breed label, and a bounding box for the animal's head is provided. Further, a pixel-level segmentation of the animal in each picture is also given, making the dataset very promising and easily usable for further research. Performance is measured as average per-class classification accuracy (the mean of the diagonal of the row-normalized confusion matrix).
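This metric can be sketched in a few lines; the toy 2x2 confusion matrix below is hypothetical, used only to illustrate the computation:

```python
import numpy as np

# Mean per-class accuracy: normalize each row of the confusion matrix
# (rows = true class, columns = predicted class), then average the diagonal.
C = np.array([[8, 2],
              [1, 9]], dtype=float)   # hypothetical 2-class confusion matrix
per_class = np.diag(C) / C.sum(axis=1)
mean_acc = per_class.mean()
print(per_class, mean_acc)  # [0.8 0.9] 0.85
```

Averaging per class, rather than over all samples, prevents well-represented breeds from dominating the score.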
The proposed model combines shape (the pet's face) and appearance (a bag-of-words model) to classify. It is a two-part model: 1) Shape model: a 'deformable part model' (DPM) that captures the face 'shape' from an image; this information becomes part of the feature descriptor. 2) Appearance model: a bag-of-words model that captures the animal's fur-texture information.
Size is an important indicator of animal breed, but the lack of an absolute scale reference in the dataset forces the model to rely instead on shape and on the appearance of fur as features.
Shape model: as in a standard deformable part model, a root part is connected by springs to eight smaller parts. Histogram of Oriented Gradients (HOG) filters are used to capture the local distribution of image edges. Dynamic programming finds the best compromise between matching parts to the image and minimally distorting the springs. The DPM is used to detect distinctive components of the animal body; the head annotations are used to learn one DPM for cat faces and one for dog faces.
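The HOG filters underlying the DPM can be illustrated with a minimal single-cell sketch; this is a simplified stand-in (random patch, one cell, unsigned orientations), not the paper's actual filter pipeline:

```python
import numpy as np

# Minimal HOG-style descriptor for one cell of a grayscale patch:
# bin gradient orientations, weighted by gradient magnitude.
rng = np.random.default_rng(2)
img = rng.random((8, 8))                       # hypothetical 8x8 patch

gy, gx = np.gradient(img)                      # image gradients
mag = np.hypot(gx, gy)                         # gradient magnitude
ang = np.mod(np.arctan2(gy, gx), np.pi)        # unsigned orientation in [0, pi)

bins = 9
idx = np.minimum((ang / np.pi * bins).astype(int), bins - 1)
hist = np.bincount(idx.ravel(), weights=mag.ravel(), minlength=bins)
hist /= np.linalg.norm(hist) + 1e-12           # simplified block normalization
print(hist.shape)  # (9,)
```

A full HOG descriptor tiles the image into many such cells and normalizes over overlapping blocks; the DPM scores these features for the root and the eight part filters.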
Appearance model: SIFT descriptors are extracted, then quantized against a vocabulary of 4,000 visual words learned by running k-means on randomly sampled features from the data. The quantized SIFT features are pooled into a spatial histogram of dimension 4,000 x the number of spatial bins. After normalization, an SVM with an exponential chi-squared kernel is used for classification. Different variants of the spatial histogram can be obtained by placing the spatial bins in accordance with different features of the pet; three spatial layouts for computing image descriptors are shown, and spatial histograms computed on separate spatial components are concatenated to obtain the final image descriptor. The first layout uses the whole image plus a 2x2 subdivision into four sub-regions. The second uses the head as one spatial tile and the remaining image as a single second tile. The last combines the descriptors of the first two with the foreground object (the pet) divided into five spatial bins and the background as a single bin without spatial subdivision. The resulting feature vectors grow to 20,000, 28,000, and 48,000 dimensions respectively.
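The pooling step can be sketched as follows; the word assignments and keypoint coordinates here are random placeholders standing in for real quantized SIFT output:

```python
import numpy as np

# Pool visual-word counts into a spatial histogram.
# Hypothetical inputs: `words` = visual-word index per keypoint,
# `xy` = keypoint coordinates normalized to [0, 1).
rng = np.random.default_rng(0)
V = 4000                                   # vocabulary size from the paper
words = rng.integers(0, V, size=500)
xy = rng.random((500, 2))

def spatial_histogram(words, xy, grid):
    """Count words per (grid x grid) tile, then L1-normalize."""
    gx, gy = grid
    hist = np.zeros((gx * gy, V))
    tile = (xy[:, 0] * gx).astype(int) * gy + (xy[:, 1] * gy).astype(int)
    for t, w in zip(tile, words):
        hist[t, w] += 1
    hist = hist.ravel()
    return hist / max(hist.sum(), 1)

# First layout: whole image (1 bin) + 2x2 subdivision (4 bins)
# -> 5 bins x 4,000 words = 20,000 dimensions.
desc = np.concatenate([spatial_histogram(words, xy, (1, 1)),
                       spatial_histogram(words, xy, (2, 2))])
print(desc.shape)  # (20000,)
```

The other layouts are built the same way, swapping in head/body or foreground/background tiles for the grid tiles.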
Automatic segmentation: the foreground and background regions, containing the pet and the rest of the scene respectively, are obtained using grab-cut segmentation. An SVM classifier assigns a confidence score to each superpixel, and these scores are used to initialize grab-cut. Berkeley's ultrametric contour map is used to obtain the superpixels of the image; a color histogram of the image provides the feature map, and a SIFT-BoW histogram is computed on it. 65% segmentation accuracy is achieved.
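Segmentation accuracy of this kind is typically measured as foreground overlap; a minimal sketch with two tiny hypothetical binary masks:

```python
import numpy as np

# Overlap (intersection over union) between a predicted foreground
# mask and the ground-truth mask; toy 2x3 masks for illustration.
pred = np.array([[1, 1, 0],
                 [1, 0, 0]], dtype=bool)
gt   = np.array([[1, 0, 0],
                 [1, 1, 0]], dtype=bool)
iou = np.logical_and(pred, gt).sum() / np.logical_or(pred, gt).sum()
print(iou)  # 0.5
```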
Two approaches are adopted to run the model: 1) a flat approach and 2) a hierarchical approach. The flat approach fits both parts of the model simultaneously, while the hierarchical approach first uses shape features to decide whether the image shows a cat or a dog, then uses the bag-of-words texture model to predict the breed. A VLFeat BoW classifier on the dataset is used as the baseline.
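The control flow of the hierarchical approach can be sketched as below; the decision functions are hypothetical stubs standing in for the trained shape and appearance SVMs:

```python
import numpy as np

def species_clf(x):
    # Stand-in for the shape-based species SVM.
    return "cat" if x[0] > 0.5 else "dog"

# Stand-ins for the per-species breed classifiers (12 cat, 25 dog breeds).
breed_clfs = {
    "cat": lambda x: f"cat_breed_{int(x[1] * 12)}",
    "dog": lambda x: f"dog_breed_{int(x[1] * 25)}",
}

def hierarchical_predict(x):
    # Stage 1: species from shape; stage 2: breed from appearance,
    # using only the classifier for the predicted species.
    species = species_clf(x)
    return species, breed_clfs[species](x)

print(hierarchical_predict(np.array([0.9, 0.3])))  # ('cat', 'cat_breed_3')
```

The flat approach would instead train a single 37-way classifier over the combined features.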
Pet-family discrimination: i) Shape only: the face detectors are run and their scores fed to a linear SVM to discriminate, giving 94.21% accuracy. ii) Appearance only: a non-linear SVM is trained on the spatial histograms of visual words. Accuracy depends on the spatial-bin layout used and increases as more bins are added, which in turn increases the feature-vector size. Using ground-truth segmentation rather than the automatic segmentation adds about 1% accuracy. iii) Shape and appearance: the linear kernel applied to the shape scores is summed with the exponential chi-squared kernel on appearance to combine the two sources of information, yielding 95.37% accuracy.
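The kernel combination can be sketched in numpy; the score and histogram matrices below are random placeholders, and the chi-squared implementation is a straightforward reading of the standard formula rather than the paper's exact code:

```python
import numpy as np

def exp_chi2_kernel(X, Y, gamma=1.0):
    # Exponential chi-squared kernel on non-negative histograms:
    # K(x, y) = exp(-gamma * sum_i (x_i - y_i)^2 / (x_i + y_i)).
    d = np.zeros((X.shape[0], Y.shape[0]))
    for i, x in enumerate(X):
        num = (x - Y) ** 2
        den = x + Y
        d[i] = np.where(den > 0, num / np.where(den > 0, den, 1), 0.0).sum(axis=1)
    return np.exp(-gamma * d)

rng = np.random.default_rng(1)
shape_scores = rng.random((6, 2))          # hypothetical detector scores
app_hists = rng.random((6, 8))             # hypothetical BoW histograms
app_hists /= app_hists.sum(axis=1, keepdims=True)

# Summing a linear kernel (shape) with the exp-chi2 kernel (appearance)
# gives a single valid kernel for the combined SVM.
K = shape_scores @ shape_scores.T + exp_chi2_kernel(app_hists, app_hists)
print(K.shape)  # (6, 6)
```

Summing two positive-definite kernels yields another positive-definite kernel, which is why the combination can be dropped directly into a standard SVM solver.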
Similarly, breed-classification accuracy increases as larger feature vectors are used and as the segmentation becomes more accurate. Results are shown in the table below.
Improvements: the paper was published in 2012, when its image segmentation achieved only 65% accuracy. Multiple better image-segmentation models have since been proposed. Since we are told that classification accuracy increases with segmentation accuracy, using a better segmentation model should increase classification accuracy.