Framework For Network-Level Pavement Condition Assessment Using Remote Sensing Data Mining
Framework For Network-Level Pavement Condition Assessment Using Remote Sensing Data Mining
Abstract. Pavement condition monitoring is fundamental for the efficient allocation of resources in transportation
asset management. However, data collection involves laborious and costly procedures. This study intends to
investigate the usage of remote sensing data for network-level pavement condition assessment offering a more cost-
effective alternative and a rapid infrastructure assessment tool which can be used in the aftermath of natural disasters.
Based on an extensive literature review, a data mining framework was established to train models that predict the
pavement condition of different road segments. The framework exploits the inherent information of multispectral
images by generating spectral related attributes. To identify pavement sampling areas, an automated procedure using
image segmentation replaces manual surface digitizing. Unlike previous research, different classification models to
approximate the mapping function from spectral information to pavement conditions. A preliminary case study was
conducted with data provided by the City of Dallas and multispectral images acquired from the Texas Natural
Resources Information System. The mean-shift segmentation algorithm was used to locate noise introducing areas on
the pavement surface. Four different classification models were trained using k-nearest neighbors, naïve Bayes,
support vector machines, and a multilayer perceptron. The developed models were employed to predict the road
surface condition class of a test set not included in the training procedure. The multilayer perceptron presented the
highest accuracy level of 71 percent, showing that the framework might have the potential for future implementation.
Keywords: classification, data mining, image processing, infrastructure management, pavement condition, remote
sensing.
1 Introduction
Pavement data collection technologies have seen a widespread application during the past
decade 1. Traditional manual field inspections and video logging used to collect pavement
condition data are getting replaced by mobile automated systems. These systems include high-
speed laser, acoustic and infrared imaging sensors. Using measurements of surface distress,
roughness, deflection and skid resistance, pavements are assigned an index score that reflects their
1
overall condition. Transportation infrastructure management agencies have developed various
such indices and use them as a basis for resource allocation and maintenance decisions.
mounted to vans designed for collecting pavement and roadway characteristic data. Assessment of
the condition of pavements through existing inspection approaches can be expensive, laborious,
and time-consuming 2. The costs of automated pavement condition data collection and processing
vary. Full-featured collection and processing averages more than $30 per lane-km ($50 per lane-
mi) and may reach $125 per lane-km ($200 per lane-mi) 3. The cost can be even higher in urban,
high-traffic areas. By considering the vast network size that a typical transportation agency
manages, the total cost of pavement condition surveying and processing can reach several million
dollars per year. Furthermore, traditional pavement data collection involves disruptions of traffic
due to its stop-and-go, low-speed nature 4. Significant safety hazards arise due to that fact and there
developments in remote sensing and data science have shown potential in using high-resolution
such as unmanned aerial vehicles, airplanes and satellites can capture these images.
Data collection requirements for network and project-level decisions present major
differences 6. At the network level, agencies collect a large amount of pavement condition data,
which is transformed into composite indices or scores. This level of information is most
projections for the overall network condition. On the other hand, project-level data collection
involves detailed distress identification and severity assessments. Engineers can use these
assessments to select specific maintenance and rehabilitation treatments and estimate project costs.
2
7
To assist in data collection and satisfy the corresponding requirements, Paterson and Scullion
proposed dividing data needs into different information quality levels. Each level correlates to the
degree of sophistication required for transportation asset management decision making. In this
context, they can aggregate very detailed data into progressively higher-level forms. Such forms
can be specific key performance measures or indicators that combine key factors from several
pieces of information. Bennett and Paterson 8 defined five such information quality levels (IQL).
These are ranging from project-level (IQL 1) to high-level system performance monitoring (IQL
5) data. The methods employed in this paper focus more on the planning and performance
evaluation information level (IQL 4) based on this classification. Consequently, this research
focuses on assessing the overall pavement condition and does not consider detecting individual
distresses.
After delimitating the scope of this paper, a review of the remote sensing applications for
pavement performance assessment is provided. The applications that this review entails, involve
pavement condition information in the planning and performance evaluation information level
(IQL 4) and utilize aerial and satellite images of high and very high spatial resolution. Although
common pavement distresses, such as cracking and rutting, cannot be directly detected in these
aerial and satellite images, there have been many interesting efforts that try to model the
relationship between some spectral attributes and higher-level condition indicators. For that
reason, both hyperspectral and multispectral data have been used. A summary of the sensors and
data types that has been used in the literature for pavement condition assessment can be found in
Table 1. It is noted that the spatial resolution mentioned in that table refers to pan-sharpened
images for sensors with panchromatic bands. For the ground spectrometers the resolution refers to
3
the diameter of the corresponding circular areas that are measured on ground based on the
Table 1 Spectral data used in previous research for pavement condition assessment.
Authors Sensor name Platform Data type Spectral range Spatial resolution
Herold et al. 9 IKONOS Satellite Multispectral 0.45 - 0.85 μm ~ 0.8 m
AVIRIS Airborne Hyperspectral 0.35 - 2.50 μm ~ 3.6 m
ASD FR FieldSpec Ground Hyperspectral 0.35 - 2.50 μm ~ 0.4 m
Herold et al. 10 HyperSpectir Airborne Hyperspectral 0.45 - 2.45 μm ~ 0.5 m
ASD FR FieldSpec Ground Hyperspectral 0.35 - 2.40 μm ~ 0.4 m
Kavzoglu et al.11 IKONOS Satellite Multispectral 0.45 - 0.85 μm ~ 0.8 m
ASD FR FieldSpec Ground Hyperspectral 0.35 - 2.50 μm ~ 0.4 m
Pascucci et al.12 MIVIS Airborne Hyperspectral 0.43 - 12.70 μm ~ 3.0 m
Mei et al. 13,14 MIVIS Airborne Hyperspectral 0.43 - 12.70 μm ~ 3.0 m
ASD FieldSpec3 Ground Hyperspectral 0.35 - 2.50 μm ~ 0.1 m
Mei & Salvatori 15 IKONOS Satellite Multispectral 0.45 - 0.85 μm ~ 4.0 m
ASD FieldSpec3 Ground Hyperspectral 0.35 - 2.50 μm ~ 0.1 m
Andreou et al. 16 CASI 550 Airborne Hyperspectral 0.40 - 1.00 μm ~ 0.5 m
SVC GER-1500 Ground Hyperspectral 0.28 - 1.09 μm ~ 0.2 m
Mettas et al. 17 Landsat 7 ETM+ Satellite Multispectral 0.44 - 2.40 μm ~ 15.0 m
SVC HR-1024 Ground Hyperspectral 0.35 - 2.50 μm ~ 0.1 m
Mohammadi 18 HyMap Satellite Hyperspectral 0.45 - 2.49 μm ~ 4.0 m
Pan et al. 19 WorldView-2 Satellite Multispectral 0.45 - 0.80 μm ~ 0.5 m
Shahi et al. 20 WorldView-2 Satellite Multispectral 0.45 - 0.80 μm ~ 0.5 m
Many attempts have been made to utilize hyperspectral field spectroscopy in order to
characterize pavement conditions 9–11,13–17. The objective of these efforts is the development of a
relationship that is later extrapolated for usage in remote sensing images to provide assessments
9
with high area coverage. Herold et al. observed that for recently paved roads, the spectral
reflectance was generally lower across the range of the measured spectrum. On the contrary, for
old and deteriorated road surfaces, a general increase in reflectance was found. This difference
was higher in the infrared spectrum. The corresponding spectral signatures derived from the
imagery was presented and compared to the ground measurements. It was observed that there were
large differences that can be attributed to atmospheric and systematic noise, as well as spectral
mixture effects of surfaces other than pavements. The authors also identified several bands that
could be used to identify raveling and aging effects. A similar methodology was also followed by
4
11 10
Kavzoglou et al. . In later work, Herold et al. integrated hyperspectral data with in situ
pavement condition surveys. This was the only effort found in the cited literature that utilized
pavement condition data collected according to established protocols. The research focused only
on the empirical relationship between the reflectance difference of two specific bands with the
pavement condition index and a structural index. The two bands were in the blue visible and near-
infrared spectrum.
12
Pascucci et al. focused on pavement surface reflectance in the long-wave infrared
spectrum. The authors tried to exploit the absorption property of limestone at a specific band of
that spectrum. The notion behind this research was that since limestone was the dominant
aggregate of asphalt concrete in the study area, it could be captured in the thermal infrared
spectrum. This threshold was used to discriminate between pavements in good condition and those
that had to be checked for maintenance. However, the threshold that the authors identified was
based on limited field observations. Later, Mei et al. 13–15 explored pavement aggregate exposure
using a quantitative metric in the form of an index. This metric was based on area calculation of
exposed aggregate coverage in natural color band images. The authors identified four asphalt
surface brightness clusters in commercial multispectral images. These clusters were assumed to be
Andreou et al. 16 developed a spectral library for asphalt from field measurements, defining
five potential categories of asphalt condition and minimizing the dimension of the hyperspectral
space. These categories ranged from good condition to highly distressed pavements. Different
processing methods were used, and it was found that principal component analysis performed well
at distinguishing between asphalt conditions. Mohammadi 18 used hyperspectral data and asphalt
signatures assuming that the mean reflectance from the visible to the short-wave infrared spectrum
5
would be disproportional with the pavement condition. Three states of condition (good,
intermediate, and bad) were defined, and the classification results were compared to limited field
17,21,22
visits. Mettas et al. , developed separability indices for cracked and uncracked pavement.
The authors recognized that the blue visible and the short-wave infrared spectrum presented the
highest separability values. The indices were extended for an in-band analysis of a multispectral
satellite sensor.
There is a general shift in the literature from approaches that tried to find the highest
correlation between specific bands of the spectrum and a performance indicator. Recent works 19,20
have instead tried to move away from costly and generally unavailable hyperspectral datasets and
try to exploit as much information given overall by the spectral and spatial patterns of commercial
multispectral images. Shahi et al. 20 used object-based image analysis to differentiate between good
and poor condition classes using satellite images. Spatial, spectral, textural, and color-related
attributes were generated, and different feature selection techniques were used and compared.
These techniques included support vector machine, random forest, and chi-square algorithms that
were evaluated to select the most effective one in identifying the best set of attributes. Pan et al. 19
used also commercial satellite images and spectral mixture analysis to classify the age of the
pavements. Some asphalt pavement pixels were covered by other objects (e.g., vegetation,
vehicles, sidewalks), and spectral unmixing was performed to calculate the percentage of
pavement surface found in the mixed pixels. Based on field investigation and in situ measurements,
aged asphalt pavements were categorized into four stages: preliminarily aged, moderately aged,
A very important issue found in the review of the cited literature is that most of the existing
applications are not based on actual condition data but rather use proxy indicators such as
6
assessments of pavement age and brightness that are interrelated with aggregate exposure and
bitumen removal. Also, the vast majority of developed models are not adequately validated. The
validation is most usually based on limited subjective on-site evaluations conducted by the
researchers rather than using standardized pavement data collection protocols. The only exception
found was the work of Herold et al. 10 that utilized Roadware PCI data in developing a relationship
with a specific spectral attribute. However, even in that research the proposed relationship was not
applied in unseen data and there is no estimate concerning the generalization error of the model.
The relationship was based on a limited number of pavement condition data samples and exhibited
relatively low correlation. Therefore, applicability of remote sensing data in pavement condition
Furthermore, previous research in the field of pavement condition assessment using remote
sensing has not fully exploited developments in computer vision and data science. Most of the
studies focus only on limited image processing steps and consider specific prediction model
the aforementioned shortcomings. The main research challenge that we intend to tackle is
exploring whether remote sensing can be practically used to assess pavement condition. To address
this question different machine learning models are trained and tested against unseen data in order
techniques to analyze image data. Also, instead of manually digitizing pavement sampling areas
3 Methodological Framework
will provide high-level information to transportation asset managers. This will contribute to
reducing pavement data collection costs. To achieve this goal, we use different supervised data
mining techniques to classify road pavements’ condition based on the inherent multispectral ortho-
image information. A synthesis of best practices found in the literature is used in developing the
framework. The methodological framework is generic. That means it describes several basic steps
for analysis using readily available data, as shown in Fig. 1. It is divided into three main stages,
which include data selection, analysis, and data mining techniques that are embedded for extracting
Fig. 1 Framework for pavement condition assessment using remote sensing data.
The first stage of the framework deals initially with collecting and preparing pavement condition
data for analysis. The critical factors of this process are the selection of specific pavement types
8
and condition updates based on maintenance projects completed between pavement inspections
and imagery sensing. Pavements that are made of different materials (e.g. asphalt or cement
concrete) have different characteristics and thus separate models might have to be developed.
However, in this research we focus on asphalt pavements that usually comprise the majority of a
city’s street network. Except for pavement condition inspection data, transportation agencies
usually publish an inventory of geospatial data that corresponds to the centerlines of the road
network. This inventory is usually provided in vector format and is made available for public use.
In this research, it is assumed that readily available centerline information will provide a sufficient
pavement surface sample while advanced road centerline extraction techniques can be used to
increase location precision 23. In case the road segments’ location and geometry are not available,
roadway centerlines can be extracted using remote sensing data. This topic has been addressed
extensively by many researchers, but it is not part of the scope of this investigation.
Multispectral images capture data within specific wavelength ranges of the electromagnetic
spectrum. The wavelengths are separated using instruments that are particularly sensitive to these
or by using specific filters. These images may capture data from frequencies beyond the visible
range, i.e. infrared and ultra-violet. Thus, spectral imaging allows the extraction of additional
information the human eye cannot sense. Contrary to hyperspectral imaging where often hundreds
of contiguous spectral bands are captured, in multispectral images a small number typically 3 to
15 of spectral bands are included. Many organizations perform imaging acquisition programs on
a recurrent basis, which has made multispectral images easily accessible comparing to
hyperspectral data that are generally limited. For this reason, multispectral images are utilized in
the framework to extract spectral attributes that can be used for pavement condition classification.
9
3.2 Data Analysis
The first step for data analysis is data cleaning. This involves removing irrelevant attributes from
the pavement condition inspection tables, as well as the attribute table of the roadway linework
vector data. The removed attributes include street names and numbers, roadway type and other
information. After this process, each road segment is characterized only by the corresponding
pavement condition index attribute that is selected. Pavement condition data can be cross-checked
with maintenance and rehabilitation history for changes and inaccurate records. The pavement
condition data can be integrated with road centerline and project information data, using a linear
referencing tool or a unique key attribute. In this way, the inspection values of the pavement
After integrating the linework with the condition inspection data, the vector features that
contain the information must be re-projected to the coordinate system of the multispectral image.
This is necessary for the two datasets to align spatially (Fig. 2a) in support of geoprocessing
operations to take place later. By aligning the two datasets, buffer zones with variable width are
then created around the linework so that pavement sections can be sampled in the multispectral
images (Fig. 2b). In locations where two street segments intersect, a small circular buffer zone is
radiometrically corrected to reduce errors in their intensity. The process improves the
interpretability and quality of remotely sensed data. Radiometric calibration and correction are of
intensity from digital numbers to apparent reflectance. This topic is of high importance when using
multiple images from different sensing periods or different sensors. Reflectance is a spectral unit
10
that provides a common measure between different sensing systems. The spectral signature of a
pavement surface is not transferable if the intensity values of the electromagnetic radiation are
recorded in digital numbers. Digital numbers are image specific. They depend on the gain and bias
of the sensor at each band, the viewing geometry, the location of the sun, as well as weather and
other factors. In certain applications, it is more useful to convert the digital number values to
reflectance. This conversion will enable direct application of classification models to other images
for rapid assessment of pavement conditions. In cases where the sensing specifications are not
known, the empirical line method along with pseudo-invariant surfaces can be proved useful tools
11
Fig. 2 Study area subregion: (a) centerline, (b) buffer zones, (c) segmentation and (d) extracted pavement sections.
An attribute generation procedure can be applied to extract not only spectral but also
textural and geometric information from the different bands. Many such metrics can be used, some
of which were employed in Shahi et al. 20. The different vector polygons are used to extract the
information. If the distribution of each attribute value cannot be used as an input, basic descriptive
12
statistics can be calculated for each polygon. These might include minimum and maximum values,
After forming the attribute table, a supervised classification scheme is used to identify and
extract the clusters that represent pavement surfaces. The objective is to remove all the other
sections covered by non-pavement surfaces automatically. There are two ways to deal with this
classification procedure. The first is to create a training set by manually assigning a binary class
that separates pavement and non-pavement covered polygons. Some classification algorithms that
can be used for this classification are described in Sec. 3.4. An alternative method would be using
existing libraries of the spectral signatures of different materials. For this research the first method
has been employed and object-based binary classification has been used to extract pavement
surfaces. In this process the clustering algorithm used for segmenting the image and creating the
corresponding objects has a crucial role in the final accuracy of the results. The objects that are
extracted are subsequently dissolved within each road segment (Fig. 2d). Following this procedure,
the main noise-introducing areas, such as trees, cars, and shadows, are excluded from the sampling.
By removing the noisy areas, the zonal attributes are subsequently recalculated. The new
calculation involves the collection of remaining polygons that are located within a buffer zone of
each road segment of the linework. A spectral unmixing technique can be employed for pixels
adjacent to noisy areas. Spectral unmixing is the procedure that decomposes the measured
corresponding fractions indicates the proportion of each material present in the pixel 19.
to guarantee the efficiency of the classification algorithms. There are many approaches to deal
with this issue 20. Attribute selection can be based in terms of correlation and information gain.
13
Dimensionality reduction and feature extraction can be performed by principal components
analysis 16, linear discriminant analysis, canonical components, non-negative matrix factorization,
or other techniques. Since the pavement condition classes might be dominated by imbalanced
datasets leading to overrated accuracy results, a resampling method can optionally eliminate this
negative effect. Examples of such methods are undersampling, oversampling, synthetic data
3.3 Clustering
A clustering module is embedded to extract pavement surface areas from remote sensing data,
excluding surfaces that might introduce noise in the distributions of the pixel intensities. It can be
observed that noisy areas, such as those covered with vehicles, vegetation, and shadows, are
included in those resulting buffer zones. To deal with this problem, an image segmentation
procedure coupled with binary classification can be utilized. The intention of image segmentation
is to separate areas with different spectral characteristics, such as markings, vegetation, building,
and vehicles, from pure pavement surfaces. Also, based on the quality of the road centerline
geospatial data and the remote sensing images, the roads might not always align properly. As a
result, the buffer zones might not capture all the pavement surfaces. Many clustering algorithms
can be explored for image segmentation starting from a simple distance-based k-means to a more
sophisticated mean-shift algorithm, shown in Fig. 2c. However, there might be areas in which the
segmentation algorithm might not discern different objects accurately. Consequently, errors in the
14
3.3.1 Mean-shift
For segmentation, the mean-shift clustering algorithm 25 can be selected to be used among
others. This algorithm does not require specifying the number of clusters in the data ex ante. On
the contrary, mean-shift builds upon the probability density function for a set of data. The
algorithm works by placing a kernel on each point in the data set. By considering a set of points in
an input space corresponding to all the pixels of a multi-band image, a window is assumed to be
centered on each point. This window has a specific radius and corresponds to the kernel. If p is an
initial estimate and pξ is an input sample point, the Gaussian kernel or radial basis function K,
K ( p − p ) = e
2
− p −p
, (1)
where γ is a parameter that sets the “spread” of the kernel. The above function determines the
weight of nearby points for the re-estimation of the mean. The weighted mean of the density in the
K ( p − p ) p
p B ( p )
g (p ) = , (2)
K ( p − p )
p B ( p )
where B(p) is the neighborhood of p, i.e. a set of points for which K(pξ) is non-zero.
density region at each iteration until convergence. The difference g(p) – p is called mean-shift and
at each iteration, the algorithm sets p = g(p) until g(p) converges. At convergence, there is no
direction at which a shift can accommodate more points inside the kernel.
15
3.4 Classification
For both the noisy area detection and the pavement condition classification schemes, the samples
under study must be divided into three groups: training, validation, and test. The training dataset
is used to fit the parameters of the model. A validation dataset is useful for fine-tuning the
hyperparameters of the classifier. The test data group is used to provide an unbiased evaluation of
the final model fit on the data and provide an estimate of the generalization error. Whether a class
resampling technique is used or not, the three groups should have the same level of variability to
Many classifiers can be used to predict the condition class of a pavement segment. These
classifiers, and ensemble techniques that enrich base classifiers with metaheuristic techniques. We
can compare the performance of these classifiers based on specific metrics. The computational
effort and the accuracy that each classifier achieves are some of the most important performance
factors. The accuracy of a classifier on the test set is the percentage of test set tuples that were
The classifier with the best performance can be selected for pavement management system
application. In the following sections, four classifiers that are used in the preliminary case study
are discussed. The classifiers that are selected are simple enough to demonstrate the applicability
of the framework and are available in most data mining platforms and software.
The k-nearest neighbor algorithms use instance-based learning, where the function is only
16
approximated locally and are considered among the simplest of classification techniques 26. A k-
nearest neighbor classifier may use Euclidean distance D as a metric among others. It searches the
pattern space for the “k” training tuples closest to the unknown tuple. For example, for the first
data tuple to be classified based on the information of the other given instances, the following
M
D ( t1 , t n ) = (t − tnm ) n = 2,
2
1m ,N , (4)
m =1
where tn the vector tn1,…,tnM of the attribute values of the n-th data tuple, N the number of data
All the tuples are sorted based on the minimum calculated distance from the tuple to be
classified. The number of nearest neighbors “k” is specified explicitly. If only one neighbor is
Predictions from more than one neighbor can be based on a majority vote or weights according to
A naïve Bayes classifier is based on the assumption of attribute independence 27. Let the n-th data
instance be represented as an M-dimensional vector tn, where M is the number of attributes after
the dimensionality reduction procedure. The classifier assigns to each instance probabilities that
its class Cn is equal to all existing classes ck. According to the Bayes theorem, the posterior
probability equals the product of the prior with the likelihood of an instance given the class, divided
17
P ( Cn = ck ) P ( t n Cn = ck )
P ( Cn = ck t n ) = k = 1, ,K n = 1, ,N . (6)
P (tn )
Since the probability of the evidence is fixed, the posterior probabilities are proportional to the
nominator while the probability acts as a scaling factor. Then, according to the assumption of the
naïve Bayes classifier, each attribute is assumed to be independent of each other. Consequently,
the class label, which has the maximum value, is selected as the class label of the test data:
M
Cn = arg max P ( Cn = ck ) P ( tnm Cn = ck ) n = 1, ,N . (7)
k1, , K m =1
The multilayer perceptron is a specific feed-forward artificial neural network. It comprises at least
three layers of nodes 2. In this study, the number of layers is equal to the number of attributes
selected after the dimensionality reduction process. In each node i, a sigmoid activation function
is used to map the weighted sum input xi to the output f (xi). A logistic function can be used as an
1
f ( xi ) = . (8)
1 + e − xi
The training of the perceptrons is performed by changing the weights wij of the links
between the nodes i and j of two layers after each data tuple is processed. The change of weights
is related to the amount of the output error compared to the expected result yj. The most widely
used loss function is the squared error, which can be expressed at the n-th data point as:
E (n) =
1
(
yj − f (xj ) . )
2
(9)
2 j
The training process is carried out through backpropagation of the activation function. Using
18
E ( n )
wij ( n ) = −r f ( xi ( n ) ) , (10)
x j ( n )
where r is the learning rate, which is selected so that the weights converge to a set of values
efficiently.
Support vector machines use nonlinear mapping to transform the original attribute space into a
higher dimensional feature space 28. Within this higher dimensional space, this classifier searches
for the linear optimal separating hyperplane, which corresponds to the decision boundary
separating the tuples of one class from another. With an appropriate nonlinear mapping to a
sufficiently high dimension, data can always be separated by a hyperplane. The classifier finds this
hyperplane using support vectors. These are the essential training tuples that define the hyperplane
margins. For a binary classification problem, the classifier seeks to minimize the hinge loss:
1 n
n max ( 0,1 − ci ( w z i − b ) ) + w , (11)
i =1
where zi is the i-th tuple in the higher dimensional feature space, ci is the class of that tuple, and w
is the normal vector of the hyperplane. The parameter λ denotes the trade-off between increasing
the margin size and ensuring that each tuple lies on the correct side of the hyperplane. Thus, for
sufficiently small values of λ, the second term in the loss function will become negligible. For
multiclass problems, the labels are drawn from a finite set of several elements by reducing the
The applicability of the framework was explored with a case study utilizing road inspection data
19
provided by the Department of Public Works of the City of Dallas. We used a subset of 362 data
instances covering a wide street network paved with asphalt concrete. The pavement condition
data were collected using in situ measurements from a van-mounted sensor by an external vendor.
The derived pavement condition index for each data instance corresponds to average values of
street segments with a length of 30 m approximately. The pavement condition index numerically
denotes the average surface condition of each 30-m pavement segment with respect to the distress
29
level and size . An index value of 100 signifies pavements in excellent condition and no
distresses. On the other hand, a value of 0 indicates a pavement that has completely failed and is
closed to traffic. We discretized the pavement condition data into three bins corresponding to a
custom rating scale. The labels of these three classes along with their corresponding pavement
condition index values were poor (0-55), fair (55-70), and good (70-100). Multispectral ortho-
images with a 30-cm pixel resolution were acquired from the Texas Natural Resources Information
System. These were captured using a Leica ADS100 digital camera sensor system as part of the
Statewide Texas Orthoimagery Project under the Texas Orthoimagery Program. The flights were
conducted from October 28th to November 8th and the images depict leaf-on conditions. The image
acquisition period is very important as it might drastically help in segregating vegetated noisy
objects from asphalt surfaces. Each image tile consists of four bands while. These bands along
with their corresponding spectral range are: red (619-651 nm); green (525-585 nm); blue (435-495
nm); and near-infrared (808-892 nm). The imagery covers a study area of 2.25 km2.
a subset of one image is provided in Fig. 3. The three sampled areas are marked with green, pink,
and blue. The area sampled in green color depicts a pavement in a relatively poor condition. This
is indicated by its lower pavement condition index, as there is an evident amount of cracking and
20
rutting. On the other hand, the pavement area in pink is sampled from a section with fair condition.
The blue depicts a pavement area covered with parked vehicles. The corresponding distributions
of the pixel intensities included in these areas are shown in the scatter diagram on the right. Each
grey dot represents the intensity of each pixel in the image and the corresponding colored dots
refer to the sampled areas on the left image. The intensity of the near-infrared band of the
multispectral image is on the vertical axis. In the horizontal axis, the corresponding intensity of
the green band is shown. The scatter plot indicates that the pavement in the worse condition shows
a higher variation of the intensity values in both channels. Finally, pavement covered with vehicles
introduces a significant amount of noise in the distribution of pixel intensities, as the variance of
these areas gets significantly higher. Thus, it is vital to exclude from sampling areas that induce
Fig. 3 Sampled pavement areas: (a) location and (b) scatter plot of pixel intensities for these areas.
21
The framework for data pre-processing, image segmentation, and pavement condition
classification was implemented through open source geographic information systems and machine
30–35
learning software . After running the mean-shift segmentation algorithm a supervised
procedure was followed in order to label the generated objects. One label was given to objects
corresponding to pavement surfaces and a second one to non-pavement noisy areas. This procedure
led to a binary classification problem that achieved ~98% accuracy within the established buffer
zone. Thus, in forming the final pavement sampling areas most probably only a very small
percentage of noisy objects was included in the analysis. In this initial stage, only spectral related
attributes were generated, and a simple feature selection procedure based on information gain was
employed. The mean and the standard deviation from the near-infrared band as well as attributes
Over-optimistic estimates due to overfitting might occur if the accuracy of the classifier is
calculated using the same data that were used for training. The classifier’s accuracy should be
measured using a test set comprising class-labeled tuples not used to train the model. Thus, the
original dataset was divided: 70 percent for training and 30 percent for testing. In other words, out
of the 362 pavements segments in the dataset, 253 were used to derive the model. The rest 109
segments were kept out of the training procedure to estimate the accuracy. This test sample that is
practically “unseen” by the classification algorithms is later used to estimate the accuracy of each
classification model. Thus, the accuracy results that we report and the segments that have been
misclassified do not correspond to the training error but rather the generalization error of the
models. This is the common practice of verifying reliability in data mining applications.
To demonstrate the effect of the clustering module, two runs were conducted for each
classifier. The first run involved pavement sampling without applying the clustering module. Thus,
22
zonal statistics calculations included noisy area intensities in this case. In the second run, the
sampling was conducted only in clusters that represent pavement surfaces after having excluded
the noisy areas. Figure 4 shows that there was a significant increase in the accuracy of each
classifier by using the clustering procedure and ruling out non-pavement covered areas.
Fig. 4 Accuracy level achieved for each pavement condition classification model.
The accuracy of the naïve Bayes, multilayer perceptron, support vector machines, and k-
nearest neighbors was 62.15, 70.71, 69.38 and 56.07 percent respectively. Without following the
clustering and noise filtering process, the accuracy achieved was much lower for all the algorithms.
The corresponding results were 38.67, 40.33, 39.50 and 37.29 percent accordingly. The multilayer
perceptron achieved the highest accuracy. Artificial neural networks are universal approximators
and it seems that they adjust better to the intricate nature of the problem.
A downside of the multilayer perceptron is that its training time is slightly higher than the
rest of the algorithms. The theoretical time complexity of the specific implementations of the
classifiers along with the actual training and testing times is shown in Table 1. Time complexity
is expressed in terms of the limiting behavior of the algorithms where M is the number of features
and N the number of instances and K the number of classes. The complexity of testing time for
23
support vector machines depends on the number of the support vectors nsv. For the multilayer
perceptron the corresponding time complexity depends also on the network architecture. This
includes the number of hidden layers h, the number of perceptrons npr of each layer and the number
of epochs t. It is assumed that all hidden layers have an equal number of nodes.
Using the trained multilayer perceptron, we predict the pavement condition classes of the
362 segments. Figure 5 presents the model results in contrast with the actual inspection data. The
three classes poor, fair, good are marked with red, yellow, and green. The misclassified predictions
Fig. 5 Pavement condition classes: (a) ground truth and (b) predicted by the multilayer perceptron model.
24
6 Discussion
This study investigated remote sensing data usage for network-level pavement condition
assessment. A detailed data mining framework was established, utilizing the inherent spectral
characteristics of pavement surfaces. These are used to train models that will be capable of
predicting the pavement condition of different road segments. In contrast with previous literature
we explore the applicability of such models by testing them in unseen pavement condition data. A
preliminary case study was conducted with data provided by the City of Dallas and remote sensing
images acquired from the Texas Natural Resources Information System. Contrary to previous
research that utilizes hyperspectral and commercial image products, the methodology that we
develop can be applied with images that are publicly available. The case study examined four
accuracy in classifying pavements in three different condition states. However, due to the
algorithms complexity it might require higher training time depending on the network architecture.
This issue was of little importance in the case study since training times of all four classifiers were
less than one second. However, significant efficiency issues might arise for larger datasets. Further
analysis based on the trade-off between accuracy and efficiency should be conducted in that case.
Research has shown that measurement variation and errors can influence pavement
36
inspection data depending on the data source . For this reason, the accuracy achieved is
satisfactory compared to a ~33.3 percent equal probability of a segment belonging to one of the
three classes at random. This is an indicator of an underlying connection between image inherent
spectral information and pavement condition. The results also show a significant increase on the
accuracy when the clustering module is employed. Thus, the framework might have the potential
for future implementation if further research is conducted on the different constituent steps. For
25
that effort it is also important to develop models trained to classify the conditions of rigid
pavements. The usage of larger training datasets will contribute to better approximations of the
classification models to the underlying mapping functions of remote sensing data to pavement
conditions.
7 Conclusion
Based on the results generated from the classification models, it seems that the methodological
framework can be potentially used for high-level information checks. More specifically, the
predicted classes can be compared with existing pavement condition datasets. The comparison will
help to identify potential errors in data collection. The framework is based on synthesizing
knowledge from the literature and is directed towards guiding future research on this topic.
Applying more sophisticated data processing techniques and classification algorithms will provide
Acknowledgments
The authors wish to express their gratitude to Mr. Cosmin Spiridon and Ms. Jennifer Nicewander,
Department of Public Works of the City of Dallas, for providing pavement condition inspection
data. We also extend our appreciation to the Texas Water Development Board for providing public
access to the Texas Natural Resources Information System. This includes the multispectral ortho-
Disclosures
The authors have no relevant financial interests in the manuscript and no other potential conflicts
of interest to disclose.
26
References
2. I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and
Construction, Maintenance, and Service Equipment Warning Lights and Pavement Data
(1998).
Guidelines on System Design and Data Issues,” The World Bank, Washington, D.C.
(1990).
27
Infrastructure,” Online J. Sp. Commun.(3), 1–29 (2003).
10. M. Herold et al., “Imaging Spectrometry and Asphalt Road Surveys,” Transp. Res. Part C
11. T. Kavzoglu, Y. E. Sen, and M. Cetin, “Mapping urban road infrastructure using remotely
[doi:10.1080/01431160802639582].
12. S. Pascucci et al., “Road Asphalt Pavements Analyzed by Airborne Thermal Remote
Sensing: Preliminary Results of the Venice Highway,” Sensors 8(2), 1278–1296 (2008)
[doi:10.3390/s8021278].
13. A. Mei, R. Salvatori, and A. Allegrini, “Analysis of Paved Areas with Field Data and
[doi:10.5721/ItJRS201143212].
14. A. Mei et al., “Integration of Field and Laboratory Spectral Data with Multi-Resolution
Remote Sensed Imagery for Asphalt Surface Differentiation,” Remote Sens. 6(4), 2765–
15. A. Mei and R. Salvatori, “Urban Mapping Using Ikonos Imagery,” Int. J. Remote Sens.
Sensing for Mapping Asphalt Road Conditions,” Int. J. Remote Sens. 32(21), 6315–6333
(2011).
17. C. Mettas et al., “Risk Provision Using Field Spectroscopy to Identify Spectral Regions
for the Detection of Defects in Flexible Pavements,” Nat. Hazards 83(1), 83–96, Springer
28
18. M. Mohammadi, “Road Classification and Condition Determination Using Hyperpsectral
Information Sciences 39(B7), pp. 141–146, International Society for Photogrammetry and
19. Y. Pan et al., “Mapping Asphalt Pavement Aging and Condition Using Multiple
Endmember Spectral Mixture Analysis in Beijing, China,” J. Appl. Remote Sens. 11(1),
21. C. Mettas et al., “Monitoring asphalt pavement damages using remote sensing
22. C. Mettas et al., “Detection of Asphalt Pavement Cracks Using Remote Sensing
23. W. Wang et al., “A Review of Road Extraction From Remote Sensing Images,” J. Traffic
24. A. Mei et al., “The Use of Suitable Pseudo-invariant Targets for MIVIS Data Calibration
by the Empirical Line Method,” ISPRS J. Photogramm. Remote Sens. 114, 102–114,
29
25. J. Michel, D. Youssefi, and M. Grizonnet, “Stable Mean-Shift Algorithm and its
Trans. Geosci. Remote Sens. 53(2), 952–964, Institute of Electrical and Electronics
Machine Learning 6(1), pp. 37–66, Kluwer Academic Publishers, Boston (1991)
[doi:10.1007/BF00153759].
28. C. C. Chang and C. J. Lin, “LIBSVM: A Library for Support Vector Machines,” ACM
Trans. Intell. Syst. Technol. 2(3), Association for Computing Machinery, New York, NY
(2011) [doi:10.1145/1961189.1961199].
29. ASTM, “D6433-18: Standard Practice for Roads and Parking Lots Pavement Condition
[doi:10.1520/D6433-18].
30. QGIS Development Team, “QGIS Geographic Information System,” 2.18, Open Source
31. OTB Development Team, “Orfeo ToolBox - Open Source Processing of Remote Sensing
33. GRASS Development Team, “Geographic Resources Analysis Support System (GRASS
30
GIS) Software,” 7.4, Open Source Geospatial Foundation (2018).
34. O. Conrad et al., “System for Automated Geoscientific Analyses (SAGA) v. 2.1.4,”
35. E. Frank, M. A. Hall, and I. H. Witten, “The WEKA Workbench. Online Appendix,” in
Data Mining: Practical Machine Learning Tools and Techniques, 4th ed., Morgan
Kaufmann (2016).
36. P. A. Serigos et al., “Automated Distress Surveys: Analysis of Network Level Data (Phase
and holds the John E. Kasch Endowed Graduate Fellowship. His current research interests include
remote sensing and machine learning applications in transportation asset management and
to infrastructure management for more than 20 years. Dr. Zhang earned his
31
PhD degree in civil engineering from the University of Texas at Austin in 1996. He is an author
statistical models for pavement performance. She received her Ph.D. degree
in Civil Engineering from the University of Texas at Austin in 2019 and her
MS degree in Road and Transportation Engineering from the Iran University of Science and
Technology in 2013. She also earned a MS degree in Statistics from the University of Texas at
Austin in 2017. She has published five peer-reviewed articles in top-tier transportation journals
engineering and project management. His research interests include information technologies and
sensors for construction engineering and project management, advanced data analysis techniques,
32