An Ensemble Learning Model For Failure Rate Predic - 2020 - Procedia Manufacturi
An Ensemble Learning Model For Failure Rate Predic - 2020 - Procedia Manufacturi
com
ScienceDirect
Procedia Manufacturing 42 (2020) 41–48
* Corresponding author. Tel.: +39 050 2218139; fax: +39 050 2218140. E-mail address: [email protected]
Abstract
In the Industry 4.0 era, Preventive Maintenance (PM) is still an attractive solution to prevent breakdowns and failures and to reduce
maintenance and failure costs. A PM program is part of both the Total Productive Maintenance (TPM) philosophy and the Reliability Centered
Maintenance (RCM) process. A prerequisite to carry out effective PM activities is the availability of a reliable estimate of the equipment failure
rate. Assessing it may be a hard task, as it requires analysing a large set of maintenance data, which includes both quantitative and qualitative
variables. To this aim, it is possible to exploit advanced data analysis techniques that permit extracting information and knowledge from big
datasets. This paper presents an ensemble-learning model to estimate the failure rate of equipment subject to different operating conditions. At
the same time, the method permits to identify the most important working parameters affecting the failure rate. An industrial application is
considered to show the potentialities and the effectiveness of the proposed method. In particular, a sample of 143 centrifugal pumps installed in
an oil refinery plant is analysed.
The refinery is characterized by processing and service identified. While the name of some variables is self-
systems which occupy a surface of nearly 650,000 m2 with explanatory, some others require a brief description. “Plant
3,000 km of piping. The plant has a storage capacity of more type” defines the part of the refinery plant where the pump
than 1,500,000 m3, an annual production capability of about operates. “Soot” identifies the solid carbon-based particles
390,000 tons of oil, and an oil tanker receiving capability up present in the fluid, which typically have disruptive actions on
to 400,000 tons displacement. A closed-loop water system the seals. Finally, we clustered the seal type into four
capable of delivering 700 m3/h of water and a fire system able categories: single-seal (S), dual-seal (DS), lip-seal (SL), and
to bring in up to 3,000 m3/h of seawater are included in the tandem-seal (T).
plant. An integrated combined cycle plant assures the The net operating time (NOT) included in Table 1 is
necessary power supply capable of about 280 MW, which defined as the cumulated functioning time, from the start up
operates by burning a synthesis gas obtained in the refining until the last observed failure. In formulas, the net operating
cycle.
time of the jth pump, , can be expressed as follows:
After a primary distillation phase, which is known as
topping, the materials flow into two separate distillation units:
(1
)
The atmospheric distillation unit (i.e., unifining) treats light
fractions by separating petrol from liquefied petroleum gas
(LPG). Then, petrol undergoes two further transformation where is the number of observed failures for the jth pump,
phases (i.e., platforming and isomerization) required to is the failure time of failure i on pump j, and
increase the octane number and to eliminate aromatic are the opening time and the closing time, respectively, of the
compounds. maintenance order regarding failure i on pump j.
The vacuum unit treats the middle distillation fraction The exact values of were not available in the database.
(mainly kerosene) and feeds a desulphurization process. Hence, we replaced with in the evaluation of the net
operating time. This approximation can be justified according
Finally, the heavy fraction and all distillation residuals are to the following argument. Reactive maintenance is the first
processed by means of thermal cracking and visbreaking. option, while condition maintenance applies only to the pumps
These treatments are required to improve oil conversion rate that do not operate in active redundancy. However, even in
and to increase the overall production of lighter products. this case, when an operating threshold limit is trespassed (e.g.,
excessive vibration and/or leakage) and a maintenance order is
issued, the remaining useful life of the pump can be
considered nearly null. Hence, the approximation
can be assumed reasonable, and the MTBF is finally given by:
Fig. 1. Oil refinery processing scheme. To apply ensemble modelling to the pumps’ dataset, the
stacked generalization and the bagging methods were used.
3.2. Structure One of the most interesting aspects of stacking is that it may
be used to combine models of different types. The most
The analysis considered 143 centrifugal pumps installed in important goal of stacking is, possibly, the reduction of bias.
the oil refinery plant described in Section 2.1. These pumps To begin with, the algorithm splits the training set into two
were monitored over a period of 18 months. In this period, separate subsets and trains several learners on the first subset.
operating time, failures and maintenance tasks were recorded The remaining data are used to test and validate the model
in a standard computerized maintenance management system but, instead of using a monolithic approach in which the best
(CMMS). Table 1 shows a sample of the records included into learner becomes the winner, the outcomes of all the models
the CMMS. Within the database, ten potential predictors were
At this point, the dataset is partitioned into two different The “Base learner” includes three algorithms: k-nearest
samples, to leave the 40% of data unseen by the learner and neighbours (k-NN), naïve Bayes, and random forest. The
available for a later validation. To this aim, the well-known “Stacking model learner” makes use of decision trees. All
stratified sampling algorithm is used. It builds random subsets these are widely known methods in machine learning [18]:
and ensures that the class distribution in the subsets is the the k-NN algorithm is a non-parametric method for
same as in the whole dataset. classification, whose output is a class membership;
Following, the ensemble model was built as follows: first, naïve Bayes is a probabilistic model-based algorithm that
an “Optimise parameters (Grid)” was introduced to perform relies on Bayes’ theorem with independence assumptions
multiple optimizations on the various algorithms (these between the features;
settings are reported in detail in Table 2). This is a nested
operator that executes all given sub-processes for all
Braglia Marcello et al. / Procedia Manufacturing 42 (2020) 41– 4
random forests are an ensemble learning method which Table 3. Optimization parameters.
builds a multitude of decision trees at training time and
gives as output the class that is (typically) the mode of the
classes of the individual trees; Operator Parameter Range
decision tree is non-parametric method used to go from Validation Split ratio 0.5 to 0.8
Random forest Number of trees 10 to 100
observations about an item to conclusions about the item’s Decision Tree Criterion Gain ratio, Information gain, Gini
target value. index, Accuracy
Decision Tree Min. leaf size 2 to 6
Decision Tree Min. size for split 4 to 12
Finally, the ensemble model is used to perform a validation
cycle.
This activity involves using those samples (40% of the Table 4. Optimal values.
entire dataset) that had been split in a previous step. The
“Optimise parameter” operator is used to evaluate the optimal Operator Parameter Range
configuration acting on the parameters reported in Table 3. Validation Split ratio 0.79
Random forest Number of trees 40
Running the ensemble model yields the following results. Decision Tree Criterion Accuracy
To begin with, the optimal values for the above-mentioned Decision Tree Min. leaf size 4
parameters are summarised in Table 4. Decision Tree Min. size for split 8
Marcello Braglia graduated (with distinction) in 1988 in Luca Padellini graduated in 2018 in Mechanical Engineering
Electronic Engineering at Politecnico di Milano. Since 1995, at the University of Pisa. In the same year, he began his PhD
he has been a Researcher in Mechanical Technology and studies in Smart Industry within the same University. His
Production Systems at the Università degli Studi di Brescia. research activities mainly concern the digitalization of the
Since 1998, he has been employed as a researcher and since supply chain in the Tuscan Fashion district. His research
2002, as a Full Professor, in Industrial Plants at the Università focuses on models and techniques able to improve the
di Pisa. His research activities mainly concern maintenance, communication and the transfer of information between the
reliability, production planning, lean production, logistics and members of the supply chain.
statistical quality control. He is the author of about 180
technical papers published in national and international