Class Notes6 F24
Class Notes6 F24
Sensing Data
Image Classification
40
35
30
Band 4
25
Vegetation
20
Soil
15
10
5
0
0 5 10 15 20
Band 3
Basic Strategy: Dealing with
variability
45 With variability, the vegetation
40
35
pixels now occupy a region, not a
30 point, of n-dimensional space
25
20
15
10
0
Band 1 Band 2 Band 3 Band 4 Band 5 Band 7
45
40
35
30
Band 4
25
different region of n- 10
5
dimensional space 0
0 2 4 6 8 10 12 14 16 18 20
Band 3
Basic strategy: Dealing with variability
• Classification:
• Delineate boundaries of 45
35
• Assign class names to pixels 30
Band 4
25
20
15
10
5
0
0 2 4 6 8 10 12 14 16 18 20
Band 3
Image Classification Techniques
1) Supervised classification
2) Unsupervised classification
3) Contextual classification
4) Fuzzy classification
Supervised Classification
• The process of using samples of known
informational classes (training sets) to
classify pixels of unknown identity.
• Identification and delineation of training
areas is key to successful implementation
Training Sets
• # of pixels - want to statistically characterize the
spectral properties of an informational class (i.e. forest,
crop, water), should have >= 100 pixels total for an
informational class
• location - geographically dispersed, boundaries away
from edge/mixed pixels
• number of areas - depends on number of information
categories, 10 at a minimum, enough for accuracy
assessment and incorporation of spectral subclasses
• uniformity - unimodal distributions, use training areas
to characterize mean, variance, covariances -
sometimes not easy due to spectral variation present
Supervised Classification Steps
Known Conifer
Area
Water
Known Water
Area
Deciduous
Known Deciduous
Area
Digital Image
Supervised Classification
Mean Spectral Information
Signatures Multispectral Image (Classified Image)
Conifer
Deciduous
Water Unknown
Spectral Signature
of Next Pixel to be
Classified
The Result is Information--in this case a Land Cover map...
Legend:
Water
Conifer
Deciduous
Supervised classifiers
• Minimum distance to means classifier: uses the central
values (means) of the spectral data clusters (defined by
training data) to assign pixels to information categories
• Parallelepiped classifier : uses ranges of pixel values within
the training data to define classification regions in
multivariate space; one of the earliest classification
algorithms developed
• Maximum likelihood classifier : estimates the means and
variances of information classes defined by training data to
estimate probabilities for each pixel in an image, most
commonly used method
– takes into account the shape of the cluster distribution, as well
as overlapping regions, requires normal distributions
Minimum distance to means
– Find mean value of
pixels of training sets 45
40
in n-dimensional 35
30
Band 4
25
space 20
15
classified according 0 2 4 6 8 10
Band 3
12 14 16 18 20
Candidate Pixel
Mean Signature 2
Candidate Pixel
Mean Signature 2
– Pro:
• Most sophisticated; achieves good separation of
classes
– Con:
• Requires strong training set to accurately describe
mean and covariance structure of classes
Some advanced techniques
– Neural networks
• Use flexible, not-necessarily-linear functions to
partition spectral space
– Contextual classifiers
• Incorporate spatial or temporal conditions
– Linear regression
• Instead of discrete classes, apply proportional values of
classes to each pixel; ie. 30% forest + 70% grass
Unsupervised Classification
• The identification of natural groups, or
structures/patterns, within multispectral data
• Spectral classes are defined by the computer
through statistical clustering method;
informational classes are assigned to output
spectral clusters
Unsupervised Classification
The analyst requests the computer to examine
the image and extract a number of spectrally
distinct clusters… Spectrally Distinct Clusters
Cluster 3 Cluster 6
Cluster 5 Cluster 2
Cluster 1 Cluster 4
Digital Image
Unsupervised Classification
Saved Clusters
Output Classified Image
Cluster 3 Cluster 6
Next Pixel to
be Classified
Cluster 5 Cluster 2
Cluster 1 Cluster 4
Unknown
Unsupervised Classification
The result of the The analyst determines the
unsupervised classification ground cover for each of
is not yet information until… the clusters…
??? Water
??? Water
??? Conifer
??? Conifer
??? Hardwood
??? Hardwood
Unsupervised Classification
It is a simple process to The result is essentially the
regroup (recode) the clusters same as that of the supervised
into meaningful information classification:
classes (the legend).
Land Cover Map Legend
Labels
Water
Water
Water
Conif.
Conifer
Hardw.
Conifer
Hardwood
Hardwood
Unsupervised Classification
• Pros
– Takes maximum advantage of spectral
variability in an image
• Cons
– The maximally-separable clusters in spectral
space may not match our perception of the
important classes on the landscape
ISODATA -A Special Case of
Minimum Distance Clustering
• “Iterative Self-Organizing Data Analysis Technique”
• Parameters you must enter include:
– N - the maximum number of clusters that you want
– T - a convergence threshold and
– M - the maximum number of iterations to be
performed.
ISODATA Procedure
• Program begins with randomly selected cluster
centroids
• Distance of each pixel to cluster centroids is
computed
• New cluster centroids are developed based on
distances
• Distance of each pixel to new cluster centroids is
computed
• Iterates until cluster centroids no longer change
(or convergence definitions are met
ISODATA Procedure
• After each iteration, the algorithm calculates
the percentage of pixels that remained in the
same cluster between iterations
• When this percentage exceeds T (convergence
threshold), the program stops or…
• If the convergence threshold is never met, the
program will continue for M iterations and
then stop.
ISODATA Pros and Cons
• Not biased to the top pixels in the image (as
sequential clustering can be)
• Non-parametric--data does not need to be
normally distributed
• Very successful at finding the “true” clusters
within the data if enough iterations are allowed
• Cluster signatures saved from ISODATA are easily
incorporated and manipulated along with
(supervised) spectral signatures
• Slowest (by far) of the clustering procedures.
Unsupervised Classification