Principal Components Analysis and Track Quality Index: A Machine Learning Approach
Principal Components Analysis and Track Quality Index: A Machine Learning Approach
net/publication/329377226
CITATIONS READS
5 648
1 author:
Ahmed Lasisi
University of Manitoba
28 PUBLICATIONS 75 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Principal Component Analysis in the Evaluation of Track Quality Index View project
All content following this page was uploaded by Ahmed Lasisi on 03 December 2018.
A R T IC LE I N F O ABS TRA CT
Keywords: Track geometry data exhibits classical big data attributes: value, volume, velocity, veracity and
Track quality index variety. Track Quality Indices-TQI are used to obtain average-based assessment of track segments
Principal components and schedule track maintenance. TQI is expressed in terms of track parameters like gage, cross-
Machine learning level, etc. Though each of these parameters is objectively important but understanding what they
Railway engineering
collectively convey for a given track segment often becomes challenging. Several railways in-
cluding passenger and freight have developed single indices that combines different track
parameters to assess overall track quality. Some of these railways have selected certain para-
meters whilst dropping others. Using track geometry data from a sample mile track, we de-
monstrate how to combine track geometry parameters into a low dimensional form (TQI) that
simplifies the track properties without losing much variability in the data. This led us to principal
components. To validate the use of principal components as TQI, we employed a two-phase
approach. First phase was to identify a classic machine learning technique that works well with
track geometry data. The second step was to train the identified machine learning technique on
the sample mile-track data using combined TQIs and principal components as defect predictors.
The performance of the predictors were compared using true and false positive rates. The results
show that three principal components were better at predicting defects and revealing salient
characteristics in track geometry data than combined TQIs even though there were some cor-
relations that are potentially useful for track maintenance.
1. Introduction
This paper examines the potential of machine learning applications in railway track engineering. The railroad industry across the
world is experiencing an increased demand in operations and services due to the world’s increasing human needs and global trade
advancements. The US Class I freight railroads alone is worth over $60 billion operating approximately 140,000 miles of track (FRA,
2013). This is second to none in the world. Other major railway networks include but not limited to Russia, China, Japan, etc. are also
undergoing similar growth. The United Kingdom currently boasts of about 20,000 miles of track which currently ranks it in the first
20 rail networks in the world (Odlyzko, 2016). The global railway network is expected to reach an estimated 900,000 miles-route in
2020 (Statista, 2017). As promising as the industry is, a significant amount of the industry’s budget is continuously committed
towards repairs and maintenance. The annual capital program for US railroads alone is about $15 billion (Zarembski, 2015). A
railway disaster is often very ruinous and sometimes fatal. Track geometry-caused derailments constitute a significant portion of
railway accidents worldwide. In the United States alone, track-caused accidents have consistently constituted 30–40% of total
This article belongs to the Virtual Special Issue on “Big Data Railway”.
⁎
Corresponding author.
E-mail addresses: [email protected] (A. Lasisi), [email protected] (N. Attoh-Okine).
https://doi.org/10.1016/j.trc.2018.04.001
Received 26 July 2017; Received in revised form 14 March 2018; Accepted 2 April 2018
0968-090X/ © 2018 Elsevier Ltd. All rights reserved.
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
accidents for the past decade (FRA, 2015). This is only second to human as a primary cause category.
Understanding track geometry from a design or mechanistic perspective is often not far-fetched. However, it is difficult to monitor
the performance of track infrastructure and geometry throughout its life cycle with the same mechanistic models used during design
without considering inevitable influences of the loading environment, climate, track-train dynamics and a host of other factors (DYK,
2014). To continuously, assess track geometry performance, it is common practice to take ‘per-foot’ measurements of crucial track
geometry parameters for every mile of track using track geometry cars (Kelshaw, 1995). With varying geometry parameters and
wavelength complexities, measurements taken on every foot of mile across 140,000 miles of track in the US results in about a billion
observations for a given inspection date. If monthly routine inspections are scheduled, the data explodes to about several billion
observations. Other measurement systems include high resolution track imaging, wheel impact load detection, strain gauge mea-
surements, LIDAR and GPR-based ballast measurements, rail wear inspection system, etc. (Zarembski et al., 2013). The scale of data
measurements in US railroads has grown from megabytes and gigabytes in the past three decades to terabytes and petabytes today.
With this ‘mess’ of perpetual data collection, we are confronted with two major challenges that are characteristic of a typical ‘big data’
problem (Ishwarappa and Anuradha, 2015). Firstly, the problem of storage has been significantly confronted by modern cloud
solutions. Secondly, data analysis and processing time remains a major challenge in order to make sensible conclusions from the
entire ocean of data collected. Therefore, the fact that track geometry data exhibits classical big data attributes cannot be over-
emphasized.
In this paper we investigate the possibility of reducing multivariate track geometry indices into a low-dimensional form without
losing much information. Similar to the Pavement Condition Index in highways wherein weights are assigned to each parameter and
then summed up (Karim et al., 2016). However, our proposed approach takes cognizance of the fact the observed multidimensional
data often lies in an unknown subspace of two to three dimensions (Hastie et al., 2009). Hence, detecting this subspace in track
geometry data can significantly enable us to eliminate redundant information. This will make it possible to visualize multi-
dimensional track geometry data in two or three dimensions which was hitherto impossible with the raw parameters obtained from
track geometry cars. The second section of this paper focuses on introducing track geometry parameters, data collection and track
quality indices. The third section considers selected machine learning methods that are used to train, test and validate the use of
single and combined track quality indices including the proposed principal components. Low-dimensional representation of multi-
variate track geometry parameters in terms of principal components was validated and compared to existing TQIs in the penultimate
section. The last section of this paper highlights key findings with concluding remarks.
A major component of railway accidents and derailments are due to track geometry irregularities. These irregularities include
wide gage, excessive warp/twist, horizontal and vertical rail deformities which could also culminate in broken rails and derailments.
The Federal Railroad Administration (FRA) reported over 450 track geometry-caused derailments in 2016 according to major cause
categories. In the preceding year, the worth of track geometry derailments constituted about 15% of all major accidents (FRA, 2015).
While derailments are often misunderstood to imply all train accidents, there are in fact other train accidents but statistics show that a
significant proportion of accidents are derailments (Scientific American, 2016). As a result, major railways incur huge expenditures
on track maintenance activities alone. These activities include inspection programs, rail grinding, tamping, etc. so as to reduce the
risk of track geometry failure. In 2008 alone, about $8 billion was spent by US Class I railroads on track maintenance activities (Peng,
2011). A substantial portion of this cost go into geometry inspection. Historically, track geometry inspections were carried out
manually. In recent times, the use of automated inspection cars to collect and assess track geometry cars has significantly expedited
the inspection process spatially and temporally. The increase in the deployment of track geometry cars and several cutting-edge
technologies has aided the performance of risk-based scheduling of track maintenance activities with the philosophy to simulta-
neously minimize cost, optimize operations and reduce risk (Jamshidi et al., 2017).
1.2. Background
In the area of machine learning and track geometry data analytics, some researchers have focused on track classification, track
structure, components and defect detection (Molodova and Li, 2014). Significant contributions have also been made in the area of
predictive maintenance and condition monitoring. Yella et al. (2009) employed a classifier fusion to categorize sleepers in railway
tracks using machine vision. Using support vector machine and classifier fusion, the researchers arrived at a prediction accuracy of
92%. Fernando et al. (2011) worked on a machine vision inspection of railroad track sorting different track components from one
another. They outlined an algorithm using edge detection and texture information to inspect fasteners, cut spikes and switch com-
ponents. In the area of geometry forecasting, Peng (2011) proposed a short-range prediction method to forecast the amplitude of
future measurements by track geometry cars. With Bayesian statistics, a work which focused on a classification-based learning using a
tree augmented naïve Bayes track quality index (TAN-TQI) established better prediction accuracy than the short-ranged method (Bai
et al., 2016) used to predict track irregularity of each day in a future short period (Xu et al., 2012). Hu and Liu (2016) completed a
recent work on modelling track geometry degradation using support vector machine. The trained classifier was 70% accurate when
tested on a validation set. Understanding the behavior of track geometry data is an important step towards developing an effective
classification or regression tool. Investing some resources in this area prior classification would greatly improve on the results of the
above-cited papers. Li et al. (2008) conducted some correlation analysis between track geometry and surface defects like squats. The
research found a close correlation between short-wave track irregularities and the latter. Whether track quality is an indicator of
231
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
performance was the subject of the investigation by Hendry (2015), the results of the research proved to be affirmative. To assess the
importance of machine learning in predictive track maintenance, Li et al. (2014) investigated how rail network velocity can be
improved. While all of these studies aimed to solve different railway/track related problems, the ability to reduce track parameters
into simple variables can significantly simplify many complex problems formulated in the above-mentioned studies. Although, it has
been established that some of the challenges posed by point-based regression has been solved by profile-based prediction (Pedankar
et al., 2003) by segmenting tracks into chunks of specified length (e.g. 100 ft, 500 ft, 200 m, etc.), the challenge that remain to be
addressed is simplifying the variety of TQIs from these track segments into one or two meaningful variables. There is no doubt that
high dimensional data can be difficult to visualize or even understand but with effective dimension reduction tools that can possibly
explain a great portion of the data’s variance in 2D or 3D, it is possible to interact with track defects and data in a Cartesian
coordinate and possibly specify maintenance thresholds. Sequel to previous attempts to develop artificial indices by using linear
combination of track parameters, Liu et al. (2015) demonstrated an application of such for Beijing Metro. However, it was established
in this work that with two principal components only, over 90% variance of track geometry data was effectively summarized and
could therefore be a reasonable substitute for the multidimensional track quality indices. In the following section we introduce the
commonly used track quality indices just before discussing principal components.
Track geometry is a description of the track in terms of its longitudinal (alignment), transverse (gage) and vertical properties
(surface/profile and cross level). Other track parameters combine these track irregularities in two-dimensions or more, e.g. vertical
and longitudinal (warp/twist). Track quality index on the other hand is a quantitative representation of ride quality in an attempt to
distinguish a good track from a ‘bad’ one. At this point, it is important to distinguish between track index, defects, irregularities and
how they contribute to derailments. Firstly, tracks are laid to meet very stringent construction standards. Wear and tear as a result of
track usage and tonnage results in deviations from construction standards. These deviations are often found in rails, track geometry,
structure, etc. Since track parameters are often defined by a nominal value which is the characteristic of an ideal track. Deviations
from these nominal values develop into track irregularities (Ciobanu, 2016). These irregularities grow gradually until it reaches an
unacceptable limit (maintenance threshold) that requires intervention. Nominal values for a parameter beyond this limit defines a
defect as seen in Fig. 1. Track geometry defects left to propagate is likely to lead to derailments as discussed in Section 1.
To evaluate, assess and make decisions based on each parameter per unit length of track is almost practically impossible because it
results in tremendous data-points and hypersensitivities in variations. Therefore, TQI is employed as an aggregate measure of a given
track geometry parameter over a specific length of track. Standard deviation, mean, power spectral density (PSD), etc. are among the
common average-based measures used as TQIs. Next, we discuss crucial track parameters and track quality indices expressed in terms
of individual parameters.
TQI = f (wi ) (1)
where TQI is the track quality indicator and wi is a single track parameter (Fazio, 1986).
There is not a single set of track geometry parameters across the world. Europe and North America play a dominant role in all
major sectors of the industry (Thompson, 1989; Woodburn, 2014). Hence, while most of the track geometry parameters are similar
across the industry globally, there are subtle variations in the usage and understanding of these terms. A fundamental difference is
Gage for example. Variations such as chord length in taking measurements on vertical or horizontal irregularities also exist. For the
purpose of this research, the following track parameters have been chosen; Gage, Cross level, Surface, Alignment and Warp. They are
defined as follows: Gage is the distance between the inner faces of two adjacent rails measured at a certain distance (e.g. 5/8 in.)
below the rail surface. Surface (or profile) is a measure of uniformity in the vertical plane of each rail measured at the mid-point of a
given chord length (e.g. 31 ft or 62 ft). Cross level (or cant) is the vertical difference in height (or angle) between two adjacent rails
while Warp (or twist) measures changes in cross level (or cant) within a specific length interval (e.g. 62 ft) along the track length.
Alinement (or alignment) is the measure of uniformity in the horizontal plane of each rail measured at the mid-point of a given chord
length (FRA, 2002). The alternative terms in parentheses following the parameters used above are the European equivalents of the
232
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
North American track geometry parameters. While these terms are often used interchangeably they are not essentially the same. A
comprehensive detail of these terms, their subtle similarities and differences have been sufficiently discussed in Hay (1982) and
Esveld (2016). All of the said parameters are either collected manually or via automated track inspection vehicles which obviously is
the order of the day. For safety reasons, each of these parameters are better kept within defined thresholds. A violation of a track
parameter threshold is tantamount to a defect. While there are several types of thresholds (construction, maintenance and safety – in
order of increasing tolerance limits), it suffices to examine only safety in this work since it is the major threshold crucial to de-
railment.
For objective TQIs, surface, cross-level, gage, alignment, etc. are often expressed as separate TQIs to address different aspects of
track quality (FRA, 2005). A simple overview of the parameter-based TQIs for each parameter amongst other TQIs are discussed
below:
L
TQIs = ⎛ s −1⎞ × 106
⎜ ⎟
⎝ Lo ⎠ (3)
where Lo is the theoretical length of section.
As mentioned in Section 2.2, the number of TQIs per parameter for each track section becomes challenging to handle. It grows
rapidly if several wavelengths of the same parameters are also considered and therefore not effective for maintenance decisions. To
solve this problem, many railroads have developed a way to combine all TQIs into a single measure of ride quality that can be used for
maintenance planning (as highlighted in the case of Canadian National Railway above). We have selected TQIs from both freight and
passenger railways in order to observe the inter-correlations and possibly discern any relationship they would have with the principal
components proposed in this work. Some of those considered are given below.
233
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
where SZ ,SY ,SW and SE are standard deviation values of vertical irregularities (surface), horizontal irregularities (alignment), twist/
warp, and gauge, respectively (Berawi et al., 2010).
where UI ,TI ,GI , and AI are the indices for unevenness (surface), twist, gauge, and alignment respectively (Berawi et al., 2010).
Later on, we examine the performance of each of these TQIs as a predictor for defects in track sections and propose an alternative.
In one of the earliest definition of machine learning, it was defined as a sub-field of computer science that teaches computer to
learn without being explicitly programmed (Munoz, 2014). It is the ability of a system to learn and improve (performance) from
experience. Machine learning is increasingly gaining relevance in many engineering disciplines and civil or railway engineering is not
an exception (Bergmeir et al., 2013; Li et al., 2014). Hence, machine learning is closely related to computational statistics which
focuses on predictive analysis leveraging on modern day computing capabilities. A common synonym found in statistical learning is
also sometimes called data mining if the motive is to find critical anomalies in data or to provide exploratory data analysis. This type
of learning is referred to as unsupervised. Supervised learning however deals with inferential or predictive analysis based on ob-
servable outcome (Mannila, 2016). In the next subsection, an exploration of the potentials of unsupervised learning in track geometry
data was examined with principal components analysis after which some resources of supervised learning were deployed for binary
classification based on the presence of geometry defects.
Another synonym for dimension reduction is feature extraction or selection. It is the idea that data can be ‘squeezed’ in such a way
that the variables can be reduced and replaced with principal dimensions (or selected features) without losing relevant information in
the data. Factor analysis, t-distributed stochastic neighbor embedding, principal components, etc., are popular examples of dimension
reduction techniques. It suffices that we focus only on principal components in this work since dimension reduction has hardly been
considered in railway engineering.
234
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Geometry parameters on a given section of track are either within acceptable safety thresholds or otherwise. This is a classical
binary categorical problem since the outputs are discrete as opposed to being continuous or quantitative. A few of the common
classification methods are thus introduced:
ξi ⩾ 0 (11)
for i = 1,…,N
where c is the trade-off between the margin and misclassification error, w is the vector of coefficients, b is a constant, and ξi represents
a slack parameter for handling non separable data. Support vector machine was used in this study and it proved to be a very efficient
classifier especially with the cost parameter tuning. One additional advantage is that the SVM is easy to visualize with 2 principal
components only. However, a Gaussian or radial basis kernel was employed rather than the traditional linear or polynomial decision
boundary. This is defined as follows:
1
For interpretation of color in Figs. 2 and 11, the reader is referred to the web version of this article.
235
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
(x i−x j )2
Kij = e−
σ (12)
where Kij is the kernel matrix, σ or c is a positive constant and x i,x j are pairs of training observations. The reason for choosing a radial
kernel becomes clear after the principal component analysis.
4. Case study
Dataset was collected from a US Class I railroad. 1 mile of track was selected with 28 inspection dates. Specific information on the
particular section of track has been withheld due to privacy. The dates and the given mile was chosen because it had a section that
underwent a major maintenance work. This section of track had a history of sub-grade failure which was later improved by the
installation of a geocell layer that gave it the required stability to support load transfer. The mile of track was divided into segments of
specific lengths (150 ft and 500 ft) over which the TQIs were computed. This can further be used to assess quality of track based on
the measurements and defects collected over the given segments of track over time (El-Sibaie and Zhang, 2014). It is expected that the
segment length or locations do not change for accurate monitoring of the uncertainties and variations in track. See Fig. 3 below for
variations in Gage and Surface TQIs before and after maintenance. We believe that the miscellanies expressed in the data collected
after and prior the major maintenance are varied enough to support some of the objectives of this research.
Data was in a matrix format for each inspection date with about 31 features (e.g. gage, cross level, alignment, surface and profile
including wavelength variations) for every foot of a mile track. After data cleansing and preprocessing, adjacent points of track were
grouped into sections. This paved the way for profile-based prediction rather than point-based analysis. We considered two section
lengths, 150 ft and 500 ft. Therefore, the 150 ft and 500 ft section lengths are equivalent to 35 and 11 sections respectively. For a
500 ft section, each inspection date is an 11 sections by 31 parameters matrix. The data was further processed into section matrices
with 28 inspection dates by 11 parameters as features.
236
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Fig. 3. Exploratory data: effect of geocell installation on gage and surface parameters.
groupings for horizontal and vertical parameters-i.e. Gage and alignment are horizontal parameters while surface, cross level and
warp are vertical Soleimanmeigouni et al. (2016)-than Fig. 5a. Other exploratory plots are shown with the principal components in
Figs. 6 and 7.
237
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
238
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
$OLgQPeQW5LgKWfW
$OLgQPeQW5LgKWfW
$OLgQPeQW/efWfW
SurfaFe5LgKWfW
$lLgQPeQW/efWfW
SurfaFe5LgKWfW
SurfaFe/efWfW
SurfaFe/efWfW
WarSfW
Crosslevel
Gage
Gage
SurfaFe5LgKWfW
SurfaFe/efWfW
SurfaFe5LgKWfW
WarSfW
$lLgQPeQW/efWfW
í
$lLgQPeQW5LgKWfW
í
Crosslevel
í
$lLgQPeQW/efWfW
í
$lLgQPeQW5LgKWfW
$lLgQPeQW/efWfW
SurfaFe5LgKWfW
$lLgQPeQW/efWfW
SurfaFe5LgKWfW
SurfaFe/efWfW
SurfaFe/efWfW
WarSfW
Crosslevel
Gage
Crosslevel
WarSfW
$lLgQPeQW5LgKWfW
$lLgQPeQW/efWfW
SurfaFe/efWfW
SurfaFe/efWfW
í
Gage
í
$lLgQPeQW5LgKWfW
í
$lLgQPeQW/efWfW
í
(b) Correlation plot for track geometry parameters considering all sections in miletrack
Fig. 5. Exploratory data: correlation plots.
239
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
í04 í02 00 02 04 í08 í06 í04 í02 00 02
V23 Gage
6
V12
04
2
V13
02
V23 V16
4
V24 V7
V9
V10
V15 V11
V8
V14
AL_124
AR_124
02
Warp
00
V17
0
V25 V24 V20
V21
2
V19V18
V22
V25 SL_124
V12 V26 Xlevel AL_62
V16V13 V27
PC2
PC2
V15V14 V1
SR_124
í02
V28
Xlevel V26 V6
00
Gage V5
0 AL_62
V17 AL_124 V3V4
í2
V22V18
V8V20
V21 V19 Warp
AR_62 V7
V9 AR_124 V2
V2 V10
V11
V3
V5
V6
V4
í04
SL_62
í2
V27
í02
V1
AR_62
í4
V28 SR_62
í06
í4
í04
SR_62
SL_62
SL_124
í6
í08
í6
SR_124
í6 í4 í2 0 2 4 6 í6 í4 í2 0 2
PC1 PC1
Fig. 6. Biplots for a defective Section, 2 (left) and non-defective section, 5 (right).
Fig. 7. Principal component score plots for 1st 2nd PCs (left) and 1st and 3rd PCs (right).
Fig. 7 is a sample plot for sections with defects. It is clear from the score plot that the decision boundary for sections with defect is
not linear. We therefore hypothesize that a machine learning procedure capable of effectively separating sections with defects from
those without would require an elliptical/circular decision boundary if the principal component scores are not to be mapped onto a
higher dimensional space. To test this hypothesis, we train selected models in Section 4.2 below.
Scree plot samples for sections in both 150 ft and 500 ft lengths give elbows at two principal components as shown in Fig. 8. It is
also obvious that the first two principal components summarize about 90% of the data. As such, it is proposed that multidimensional
track geometry parameters could be effectively expressed as a bi-variate or tri-variate data.
Altogether, three sections were found to have at least one defect going by the 500 ft segments (Sections 5, 6 and 11). Afterwards, a
binary response variable was created based on presence of defect in a section. The number of observations (rows) are 28 inspection
dates for each of the Sections 5, 6 and 11 vertically stacked into a total of 84 inspections (28 for each section). This stacking was
necessary to provide enough observations for the model training and testing. The binary classification problem to be solved is
expressed as follows:
y = f (x ) (13)
where
240
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Table 1
Summary of principal components for each section.
Section Length = 150 feet, 35 Sections Section Length = 500 feet, 11 Sections
Section PCs, Variance Sum > 90% %Variance 1st PC (%) Section PCs, Variance Sum > 90% %Variance 1st 2PCs (%)
1. 1ST, 2ND & 3RD 94.68 50.48 1 1ST & 2ND 91.50 91.50
2. 1ST, 2ND & 3RD 95.38 51.25
3. 1ST, 2ND, 3RD & 4TH 94.34 41.49
14. 1ST & 2ND 97.00 81.56 5 1ST, 2ND & 3RD 97.61 89.05
15. 1ST 91.79 91.79
16. 1ST, 2ND, 3RD & 4TH 95.70 62.84
17. 1ST & 2ND 92.45 81.06
18. 1ST & 2ND 90.48 59.10 6 1ST, 2ND & 3RD 94.96 85.40
19. 1ST, 2ND & 3RD 93.97 69.25
20. 1ST, 2ND & 3RD 95.49 55.17
21. 1ST, 2ND, 3RD, 4TH & 5TH 93.56 54.30 7 1ST, 2ND & 3RD 96.14 88.57
22. 1ST, 2ND & 3RD 92.93 74.81
23. 1ST, 2ND & 3RD 97.16 61.00
28. 1ST, 2ND & 3RD 94.20 71.56 9 1ST, 2ND & 3RD 98.00 86.28
29. 1ST & 2ND 97.55 87.61
30. 1ST & 2ND 90.48 65.66
31. 1ST, 2ND, 3RD & 4TH 96.84 50.73 10 1ST 91.30 97.33
32. 1ST & 2ND 96.21 85.91
33. 1ST 91.70 91.70
34. 1ST, 2ND & 3RD 92.90 52.99 11 1ST & 2ND 95.52 95.52
35. 1ST & 2ND 98.10 88.83
⎛ 0−no defects ⎞
⎜ 1−yes defect ⎟
y=⎜ : ⎟
⎜ : ⎟
⎝ 0−no defects ⎠ (14)
x x . . ,x1p
⎛ 11 12 ⎞
x= : : :
⎜x x . . ,x ⎟
⎝ n1 n2 np ⎠ (15)
The response variable above, y is a binary type where 1 or “Yes/True” stands for the presence of defect and 0 or “No/False” is no
defect. The predictor variables, x, provided from the Class I railroad (operating on FRA-Class 7 track) is about thirty parameters, some
of which are repeated due to varying wavelengths. However, other parameters like “Curve” and “Speed” are not TQIs while others
like “Cross level rate” were dropped in place of Warp 62 ft. After eliminating further redundancies, the selected TQIs (explanatory
variables) are as follows: Gage, Crosslevel, Surface Right 62 ft, Surface Right 124 ft, Surface Left 62 ft, Surface Left 124 ft, Alignment
Right 62 ft, Alignment Right 124 ft, Alignment Left 62 ft, “Alignment Left 124 ft and Warp 62 ft. Therefore, after having identified
which inspection dates had at least one defect according to the FRA thresholds (FRA, 2002), we employed machine learning tools to
241
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Table 2
Defects, section found and count.
S/No Parameters for Class 7 Track Sections with defects (500 ft) Location in feet(s) Count(s) Total
TOTAL 75
Fig. 8. Variance (LHS) and Cumulative Variance (RHS) explained by principal components for sample sections in 150 ft (above) and 500 ft (below)
lengths.
learn from this dataframe and be able to predict should there be a new observation with a defect. The purpose of this is to select
which method/model works best before deploying it to validate principal components as artificial TQI.
The classification methods trained on these stacked sections are three, two of which are parametric (Linear Discriminant Analysis
and Support Vector Machine) while the other is non-parametric (Random Forest). The sections were combined so as to enable the
learning tool recognize sufficient defect pattern for better prediction. To evaluate the accuracy of the predictors, we employ two
measures: True positive rate (TPR) and False positive rate (FPR), the former measures the proportion of inspection dates with defects
in a section that are actually identified while the latter identifies the proportion of dates identified with defects that are not true. Our
focus here is to be able to identify defects and that’s why the TPR is our utmost priority. The FPR can be seen as a false alarm.
Based on Table 3, it is interesting to see that the support vector machine overfits the training data but performs almost at par with
Table 3
Error rates for selected training models.
S/No LEARNING TOOL TRAINING (%) TESTING (%)
1.0 Linear Discriminant Analysis (LDA) 100 2.60 97.62 66.67 7.69 90.48
2.0 Support Vector Machine (SVM) 100 0.00 100 66.67 0.00 97.62
3.0 Random Forest 66.67 1.15 92.70 0.00 2.63 92.86
242
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
the LDA on the test sample. This overfitting can be attributed to the limited observations in the training data. And because the
percentage of dates with defect is very low, it is possible for the training algorithm to classify with maximum accuracy. The SVM
performed woefully initially but after tuning out the parameters c (cost of constraint violation-regularization term) and gamma
(required by all kernels except a linear decision boundary) for a radial kernel through a ten-fold cross validation, it turned out with a
better classifier. Being non-parametric, the random forest could not spot any section with defects on the TPR for the test data even
with a thousand trees. A possible reason here is because it could not effectively learn from the data structure since the random trees
are uncorrelated. Hence, the SVM with a zero FPR on both training and testing data has been selected for model validation with PCA
and other TQIs listed earlier in Section 2. Fig. 9 provides a simple illustration of the model performance for the selected machine
learning tools.
Given that track sections with defects are significantly smaller than non-defective sections, the binary classification problem
solved is very highly unbalanced. Hence, an overall misclassification rate may not be appropriate, the more reason why the TPR has
been proposed for validation.
243
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Table 4
Prediction performance for selected TQIs with SVM.
Sections Alla J-Coeffc ITGIa CHTQIb CNTQIa 1st 3 PCsa
TPR FPR TPR FPR TPR FPR TPR FPR TPR FPR TPR FPR
Section 11 100 0.00 67.00 0.00 66.67 0.00 100 0.00 100 0.00 100 0.00
Section 5 100 0.00 50.00 96.15 0.00 0.00 50.00 0.00 100 0.00 100 0.00
a
cost = 3.4, gamma = 1.3.
b
cost = 4.3, gamma = 1.0.
c
cost = 2.9, gamma = 1.0.
Fig. 10a shows the important variable critical to defect classification in order of importance. The cross entropy and the Gini index
have been used as the node impurity measures in this case (Hastie et al., 2009). SVM classification plot-for a sample section with
defects using just two principal components in Fig. 10b-enunciates the fact that the defect decision boundary for a track geometry
data may not be linear, rather the plot suggests it to be radial or elliptical. Hence, any classification tool for track geometry defects
has to take cognizance of this to ensure accurate prediction.
To validate the possibility of identifying sections with defects using principal components as predictors, we trained support vector
machine on 2 sections with defects using all the parameters, J-synthetic coefficient, the Indian Track Geometry Index (ITGI), Chinese
track quality index (CHTQI) and the Canadian National rail TQI (CNTQI). TPR and FPR for the corresponding parameters are
recorded in Table 4.
First of all, it is important to note that there are no false-positives except with J-Coefficient in Section 5, the table also shows that 3
principal components and the CNTQI are sufficient substitutes for all 11 parameters as predictors. While J-coefficient and CHTQI
could not do better than a random guess on the TPR in Section 5, the ITGI’s performance is surprisingly poor at the same section, even
with zero FPR. A 100% TPR from most of the predictors is nothing but a consequence of over-fitting. While the results are quite
encouraging, the SVM is not guaranteed to perform this incredibly on every new data set despite using a radial kernel. Another
explanation for this phenomenon is because of the limited dataset. The radial kernel successfully wraps around the inspections with
defects contained in the test set as shown in Fig. 10. Plots of the principal components, TQIs as a function of time (inspection dates) as
well as a correlation plot between the different artificial TQIs are given in Figs. 11 and 12.
4.4. Discussion
In this section, we discuss the output features of the artificial TQIs, how they vary with the presence of defects, time, and whether
they represent horizontal, vertical and transverse track properties based on the model results. We conclude with a discussion on the
principal component scores and how they represent most of the original track parameters than most of the artificial TQIs discussed in
Section 2. From Fig. 11, the red dots in each plot indicates the corresponding TQIs for an inspection date featuring at least one track
geometry defect. For the principal components, the plot suggests that the scores maintained a relatively smooth profile until track
geometry defects on the highlighted dates were observed. This observation also holds for the other TQIs as shown except for the
CNTQI and CHTQI. While the J-coefficient and the Indian TGI had hitherto maintained similar profile, their graphs crossed at
inspection date ‘1115’. However, the same inspection date featured a spike for the CNTQI and CHTQI despite the fact that they are
profile-based rather than point-based observations as explained in Section 4. These spikes suggest an indication for outliers that could
be further investigated. A closer look at this date revealed that certain cross level defects were picked up. With the principal com-
ponents scores in Fig. 12a, we observe a trend similar to those of the original single TQIs in Fig. 13. Especially with surface and the
first principal components. The correlation plots in Fig. 11 shows the salient dependencies between the mix of track geometry
parameters and principal components. The plot was ordered by hierarchical clustering based on a package developed on the R
programming statistical software (Wei and Simko, 2016). The most-correlated parameters are ordered from top-left to bottom-right in
decreasing order of interdependence.
Fig. 11, we observe that the first principal component can be said to represent vertical track parameters as it is closely positively
correlated to both surface wavelengths, buttressing the findings in previous studies that surface parameters are sufficient to assess
ride quality (Audley and Andrews, 2013; Khouy, 2013). Secondly, the second principal components explains longitudinal variations
of the track parameter which is seen from its close correlation to both waveforms of alignment on both rails. As for the combined
TQIs, PC2 from Fig. 11 suggests a strong correlation with CNTQI but negative correlations with the Chinese TQI, Polish J-coefficient
as well as the Indian TGI. This finding implies that while the afore-stated railways might have used their respective TQIs to measure
ride quality, the TQIs actually emphasize longitudinal deviations. Hence, their use should be appropriately channeled, perhaps to
monitor longitudinal degradation. What more, the third principal component appears to be a measure of transverse irregularities
(Gage and cross level), see Fig. 11.
244
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
S8)_R_124
S8)_L_124
ALI_R_124
S8)_R_62
ALI_L_124
S8)_L_62
WARP_62
ALI_R_62
ALI_L_62
XLEVEL
J_Coeff
CNTQI
CHTQI
GAGE
PC2
PC3
TGI
PC1
1
CNTQI
PC2
08
ALI_R_124
ALI_L_124 06
J_Coeff
TGI 04
ALI_R_62
02
ALI_L_62
GAGE
0
PC3
XLEVEL
í02
CHTQI
WARP_62 í04
S8)_R_62
S8)_L_62 í06
PC1
í08
S8)_R_124
S8)_L_124
í1
We can therefore summarize the 3D properties of track geometry parameters with 3 principal components with the first being
vertical, second being longitudinal and the third principal component represents transverse irregularities. With this, we propose that
the use of 3 principal components can simplify track geometry parameters instead of dealing with the numerous explanatory vari-
ables and wavelength variations (sometimes up to 30) often rolled out by track geometry cars. Traditionally, most of the combined
TQIs used by the railways selected for this study selects about 7 of these variables and neglect the rest due to subjective reasons. But
with principal components we can still keep most of the variance and the three dimensional irregularities in our track geometry data
with comparatively higher objectivity and much simplicity.
5. Concluding remarks
This paper formally described the work on principal components and track quality indices. To summarize heterogeneous track
geometry data, some railways assign weights to selected track geometry parameter. This assignment is followed by the sum of all the
products of the weights and the parameters to arrive at a value that is used as a measure of overall track quality. While the assigned
weights are often subjective, the parameters selected vary from one railway to the other. Also, relevant information is lost through
neglected parameters and subjective weight assignment. In order to prevent this, the use of principal components as combined TQIs
was proposed in this work. This made it possible to simplify track geometry data in a way that most of the variance in the data is
captured.
The use of principal components as TQIs was tested using classical machine learning algorithms and the following conclusions are
highlighted: (1) With a sample mile track, ⩾ 90% variance in the geometry data was explained by 1st 3 components in 100% of 500 ft
sections and 88% of 150 ft sections. The first principal component captured track variations in the vertical plane, the second principal
component in the longitudinal plane and the third correlated well with transverse irregularities. This information can be used to plan
maintenance activities such as tamping or stone-blowing (1st PC and 2nd PC) as well as gage correction (3rd PC). (2) Support Vector
Machine (SVM) was the most effective learning tool for classifying track sections with geometry defects among other selected ma-
chine learning tools; and (3) Using principal components and other combined TQIs from different railways, SVM predicted track
defects better with 3 principal components and Canadian TQI than any other TQIs considered in this study. The prediction perfor-
mance was measured using TPR (True Positive Rate) and FPR (False Positive Rate) since the defect data is highly unbalanced.
This approach will help railways and track engineers assess track geometry monitoring from a different perspective as a novel
245
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
method of combined/artificial TQI for maintenance scheduling. This work is a first step in incorporating dimension reduction in track
geometry data analysis using classical techniques. Future work will consider the development of thresholds for principal components
through correlation studies with vertical or lateral accelerations on train; and the use of classifier fusion to obtain better predictions.
Because dimension reduction/feature extraction with machine learning have not been widely adopted in track geometry data and
analysis, there is great potential for optimized maintenance scheduling under this approach.
246
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Acknowledgment
The authors would like to acknowledge the University of Nevada, Las Vegas (UNLV/USDOT) University Transportation Center
(UTC) on Improving Rail Transportation Infrastructure Sustainability and Durability for partly sponsoring this research.
References
Audley, M., Andrews, J., 2013. The effects of tamping on railway track geometry degradation. Proc. Inst. Mech. Eng., Part F: J. Rail Rapid Transit 227 (4), 376–391.
Bai, L., Liu, R., Sun, Q., Wang, F., Wang, F., 2016. Classification-learning-based framework for predicting railway track irregularities. Inst. Mech. Eng., Part F: J. Rail
Rapid Transit 230 (2), 598–610.
Berawi, A.R.B., Delgado, R., Calçada, R., Vale, C., 2010. Evaluating track geometrical quality through different methodologies. Int. J. Technol. 1 (1), 38–47.
Bergmeir, C., Sáinz, G., Martínez Bertrand, C., Benítez, J.M., 2013. A study on the use of machine learning methods for incidence prediction in high-speed train tracks.
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 7906 LNAI, pp. 674–683.
Ciobanu, C., 2016. Evaluation of the Track Quality. < https://pwayblog.com/2016/09/11/evaluation-of-track-quality/?blogsub=confirming#blog_
subscription-3 > .
DYK, B.J.V., 2014. Characterization of the Loading Environment for Shared-Use Railway Superstructure in North America (Ph.D. Thesis). University of Illinois Urbana
Champaign.
El-Sibaie, M., Zhang, Y.-J., 2014. Objective track quality indices. J. Transp. Res. Board 1863. <http://trrjournalonline.trb.org/doi/pdf/10.3141/1863-11> .
Esveld, C., 2016. Modern Railway Track, second ed. MRT-Productions.
Fazio, J.L.C., 1986. Track quality index for high speed tracks. Transp. Eng. 1 (112), 46–61.
Fernando, L., Hart, J., Al., E., 2011. Machine Vision Inspection of Railroad Track. Tech. Rep., University of Illinois Urbana-Champaign.
FRA, 2002. Track Safety Standards Compliance Manual. Track and Rail and Infrastructure Integrity Compliance Manual 4.
FRA, 2005. Development of Objective Track Quality Indices. US Department of transportation Federal Railroad Administration Research Results, 6–9. < https://www.
fra.dot.gov/Elib/Document/2146 > .
FRA, 2013. Track Safety Standards; Improving Rail Integrity; Final Rule. < https://www.gpo.gov/fdsys/pkg/FR-2014-01-24/pdf/2014-01387.pdf > .
FRA, 2015. FRA Office of Safety Analysis. < http://safetydata.fra.dot.gov/officeofsafety/default.aspx > .
Hastie, T., Tibshirani, R., Friedman, J., 2009. The Elements of Statistical Learning, vol. 1 Springer. <http://www.springerlink.com/index/10.1007/b94608> .
Hay, W.W., 1982. Railroad Engineering, second ed. Wiley.
Hendry, M., 2015. JRC2015-5654. Joint Rail Conference, 1–7.
Hu, C., Liu, X., 2016. Modeling track geometry degradation using support vector machine technique. In: The ASME Joint Rail Conference, 2016. Article Number: V
001T01A011 (JRC2016-5739).
Ishwarappa, Anuradha, J., 2015. A brief introduction on big data 5Vs characteristics and hadoop technology. Procedia Comput. Sci. 48 (C), 319–324.
James, G., Witten, D., Hastie, T., Tibshirani, R., 2013. An Introduction to Statistical Learning, first ed. Springer. <http://www-bcf.usc.edu/∼gareth/ISL/
ISLRFirstPrinting.pdf> .
Jamshidi, A., Faghih-Roohi, S., Hajizadeh, S., Núñez, A., Babuska, R., Dollevoet, R., Li, Z., De Schutter, B., 2017. A big data analysis approach for rail failure risk
assessment. Risk Anal. 37 (8), 1495–1507.
247
A. Lasisi, N. Attoh-Okine Transportation Research Part C 91 (2018) 230–248
Karim, F.M.A., Rubasi, K.A.H., Saleh, A.A., 2016. The road pavement condition index (PCI) evaluation and maintenance: a case study of Yemen. Organ., Technol.
Manage. Construct.: Int. J. 8 (1), 1446–1455.
Kelshaw, M.T.T.-L.Y.R., 1995. Developments and applications in the Amtrak track geometry measurement car. Nondestructive Evaluation of Aging Railroads 2458.
< http://spie.org/Publications/Proceedings/Paper/10.1117/12.212678 > .
Khouy, I.A., 2013. Cost-Effective Maintenance of Railway Track Geometry, A Shift from Safety Limits to Maintenance Limits (Ph.D. Thesis). Luleå University of
Technology.
Li, H., Parikh, D., He, Q., Qian, B., Li, Z., Fang, D., Hampapur, A., 2014. Improving rail network velocity: a machine learning approach to predictive maintenance.
Transp. Res. Part C: Emerg. Technol. 45, 17–26.
Li, Z., Zhao, X., Esveld, C., Dollevoet, R., Molodova, M., 2008. An investigation into the causes of squats—correlation analysis and numerical modeling. Wear 265
(9–10), 1349–1355.
Liu, R.K., Xu, P., Sun, Z.Z., Zou, C., Sun, Q.X., 2015. Establishment of track quality index standard recommendations for beijing metro. Discr. Dyn. Nat. Soc.
Madejski, J., Grabczyk, J., 2002. Continuous geometry measurement for diagnostics of tracks and switches. In: Proceedings of the International Conference on
Switches.
Mannila, H.., 2016. Data mining: machine learning, statistics, and databases. In: Int’l Conf. Scientific and Statistical Database Management. IEEE Computer Society.
Martinez, A.M., Kak, A.C., 2001. PCA versus LDA. IEEE Trans. Pattern Anal. Mach. Intell. 23 (2), 228–233.
Molodova, M., Li, Z., 2014. Automatic detection of squats in railway infrastructure maria. IEEE Trans. Intell. Transp. Syst. 15 (5), 1980–1990.
Munoz, A., 2014. Machine Learning and Optimization. Courant Institute of Mathematical Sciences. <https://www.cims.nyu.edu/∼munoz/files/ml_optimization.
pdf> .
Odlyzko, A., 2016. The early British railway system, the Casson counterfactual, and the effectiveness of central planning. Essays Econ. Bus. History 34 (1), 60–94.
Pedankar, N., Rao, B.V., Kapur, P., 2003. Profile-based prediction: an enhanced methodology for prediction of track degradation. In: American Railway and
Maintenance of Way Association (AREMA) 2003 Conference Proceedings, pp. 1543–1548.
Peng, F., 2011. Scheduling of Track Inspection and Maintenance Activities in Railroad Networks (Ph.D. Thesis). University of Illinois at Urbana-Champaign.
Scientific American, 2016. Broken Rails Are Leading Cause of Train Derailments. < http://www.scientificamerican.com/article/broken-rails-are-leading-cause-of-
train-derailments/ > .
Soleimanmeigouni, I., Ahmadi, A., Kumar, U., 2016. Track geometry degradation and maintenance modelling: a review. Proc. Inst. Mech. Eng., Part F: J. Rail Rapid
Transit 0 (0), 1–30.
Statista, 2017. Transportation and Logistics, Development of Metro Networks Worldwide. < http://www.statista.com/statistics/619184/metro-networks-worldwide-
track-distribution/ > .
Thompson, L.S., 1989. Railways In Eastern Europe. Tech. Rep., The World Bank.
Wei, T., Simko, V., 2016. An Introduction to Corrplot Package. < https://cran.r-project.org/web/packages/corrplot/vignettes/corrplot-intro.html > .
Woodburn, A., 2014. Development of rail freight in Europe: What regulation can and cannot do United Kingdom Case Study. CERRE: Center Regul. Europe 32
(December), 1–43.
Xu, P., Liu, R., Sun, Q., Wang, F., 2012. A novel short-range prediction model for railway track irregularity. Discr. Dyn. Nat. Soc. http://dx.doi.org/10.1155/2012/
591490.
Yella, S., Pasha, S.M., Dougherty, P.M., 2009. Classifier fusion for condition monitoring of wooden railway sleepers. In: Systems, Man and Cybernetics, 2009. SMC
2009. IEEE International Conference on 1 (May), pp. 2–4.
Zarembski, A.M., 2015. Some examples of big data in railroad engineering. In: Proceedings – 2014 IEEE International Conference on Big Data, IEEE Big Data 2014, pp.
96–102.
Zarembski, A.M., Grissom, G.T., Euston, T.L., 2013. Use of ballast inspection technology for the prioritization, planning and management of ballast delivery and
placement. In: American Railway Engineering Association Annual Conference.
248