ISH2017-151
ISH2017-151
Abstract: This paper proposes the use of conventional data analysis and computational
intelligence methods for the estimation of critical flashover voltage on polluted Cap & Pin
porcelain insulators. Specifically, modeling using Artificial Neural Networks (ANNs) and
Multiple Linear Regression (MLR) has been attempted, based on related application data.
The database, used for the analysis, consists of 168 cases (i.e. series of related
measurements) represented by six (6) numeric variables, namely, the diameter, the
height, the creepage distance, the manufacturing constant, the pollution of insulators and
the critical flashover voltage. Part of these data derives from a specialized data
generation model corresponding to incidents of flashover voltage on polluted Cap & Pin
insulators (simulation data), while the rest of the data consist of real experimental
observations (real data). The comparison showed that ANNs are proved to be superior
for modeling the estimation problem of the critical flashover voltage on polluted insulators.
Finally, comparison between the results of this work with other similar approaches,
previously existing in literature, shows that the results of the ANNs application are
satisfactory, yet there is certainly still room for further improvement.
According to statistical analysis, held in the set of As already mentioned above, the conventional and
the real data, it was observed that the only intelligent methods, used for the estimation of
variables, which follow normal distribution, are the flashover voltage on polluted Cap & Pin porcelain
variables C and UC. The correlation factor (r) insulators, were Artificial Neural Networks (ANNs)
among the individual input variables and between and Multiple Linear Regression (MLR). Evaluation
input–output variables is presented in Table 3. was performed according to the known measures
According to Table 3, it is observed that the of correlation factor (r), the mean-absolute error
correlations among all variables are potent and (MAE), the root-mean-square error (RMSE), the
range approximately within the same level. root-absolute error (RAE) and the relative-root-
squared error (RRSE), between of the actual value
Table 3: Correlation among problem variables for (Uc(act)) and predicted value (Uc(pr)) of critical
real data. flashover voltage.
Real Data, N=28 4.1 ANN METHOD
Correlation Factor (r), p<0.05
Var. Dm H L F C Uc
Dm 1.0 -1.0 -0.61 -0.95 0.44 -0.51 The ANN model used in this study belongs to the
H -1.0 1.0 0.61 0.95 -0.44 0.51 category of the feed-forward artificial neural
L -0.6 0.6 1.00 0.83 -0.57 0.67 networks (Multilayer Perceptrons–MLPs) [11, 12].
F -0.9 0.9 0.83 1.00 -0.52 0.62
C 0.4 -0.4 -0.57 -0.52 1.00 -0.87 On finding the optimal MLP for the estimation of
UC -0.5 0.5 0.67 0.62 -0.87 1.00 critical flashover voltage, 1895 architectures of
MLPs, in total, have been developed and
The regression equations expressing the linear compared with each other. All MLPs had one
dependence of the variable UC on the other hidden layer and the activation function of hidden
problem variables are in this case as follows: nodes was the logistic function of (17), [11].
UC(Exp.) = 76.749 − 2.550 ∙ Dm (12) 1
φ�uj � = −α∙uj (17)
1+e
UC(Exp.) = −34.58 + 3.1881 ∙ H (13)
The online learning for the supervised training of
UC(Exp.) = −4.366 + 0.50413 ∙ L (14) each MLP into batch mode has been performed
using the back-propagation algorithm (BP) [11, 12],
UC(Exp.) = 2.8882 + 11.719 ∙ F (15) while validation of each MLP has been carried out
using 10-fold cross validation method [12]. The
UC(Exp.) = 19.226 − 20.63 ∙ C (16) training process was divided into four (4) separate
stages, each one of them for the selection of the
optimum value for ANN parameters applying
3 SELECTION AND PRE-PROCESSING OF consecutive iterations. The goal in each stage was
DATASETS to minimize MAE, RMSE, RAE, and RRSE and to
maximize the correlation factor (r) between the
The entire database was divided into a training set UC(actual) and UC(predicted).
and a test set. The training set was used for the
development of the predictive models of critical The aim of the first stage of the training process
flashover voltage and the test set was used to was to find the optimum combinations among the
validate the final predictive models. learning rate (η), the momentum term (α), and the
number of neurons in the hidden layer (Nn) for
The test set is derived from the random selection specific numbers of epochs (e) (Table 4).
of 6 cases of actual data and was used solely for
the final test of predictive models with a view to Table 4: First stage of the training process.
their generalization ability. The training set consists
of the rest 162 data cases (140 cases of simulation First Stage
data and 22 cases of actual data). It is sorted in No e η α Nn
ascending order (according to the values of the 0.1–0.3 0.1–0.9 2-20
variable UC) and it was used in both modeling 1 500
with step 0.1 with step 0.1 with step 2
methods (ANNs and MLR). Before the 2 2500
0.1–0.3 0.1–0.9 2-20
experimentation with the datasets, all variables with step 0.1 with step 0.1 with step 2
(Dm, H, L, F, and C) were normalized in the interval 0.1–0.3 0.1–0.9 2-20
3 4500
with step 0.1 with step 0.1 with step 2
[-1, 1], a usual practice for ANNs.
0.1–0.3 0.1–0.9 2-20
4 6500
with step 0.1 with step 0.1 with step 2
0.1–0.3 0.1–0.9 2-20
5 8500
with step 0.1 with step 0.1 with step 2
0.1–0.3 0.1–0.9 2-20
6 10 500
with step 0.1 with step 0.1 with step 2
The search which took place at the second stage
of the training process concerns the finding of the
best combinations between the number of epochs
and the number of neurons in the hidden layer of
MLPs, for the combinations η-α, resulted from the
first stage (Table 5).
In the third stage, having taken into account the
best combinations of e–η–α–Nn, yielded from the
second stage of the experimental procedure, the
behavior of MLPs was observed by altering the
number of neurons in the hidden layer (Nn) while
keeping fixed the combinations of e–η–α (Table 6).
Table 5: Second stage of the training process. Figure 3: Scatter plot for the training–validation
set.
Second Stage Table 8: ANN model for the estimation of critical
No c η α Nn flashover voltage.
12 500–20 500 2–20
1 0.1 0.8
with step 2000 with step 2
ANN Architecture
12 500–20 500 2–20
2 0.1 0.9
with step 2000 with step 2 Name ANN1
12 500–20 500 2–20 Type MLP
3 0.2 0.5
with step 2000 with step 2 Number of Layers 3
12 500–20 500 2–20 Number of Hidden Layers 1
4 0.2 0.6
with step 2000 with step 2 Number of Hidden Units (Nn) 21
12 500–20 500 2–20 Number of Inputs 6
5 0.2 0.7
with step 2000 with step 2 Number of Outputs 1
12 500–20 500 2–20 Inputs Dm, H, L, F, C
6 0.3 0.5
with step 2000 with step 2 Outputs UC
12 500–20 500 2–20
7 0.3 0.6 Cases of Training-Validation Set 162
with step 2000 with step 2
Training Procedure Supervised
Training Type Batch Mode
Validation Procedure 10-fold cross validation
Table 6: Third stage of the training process.
Training Algorithm Back Propagation
Learning Rate (η) 0.1
Third Stage Momentum Term (α) 0.8
No e η α Nn Number of Epochs (e) 36 500
1 20 500 0.1 0.8 19,20,21,22 Performance
2 20 500 0.2 0.6 13,14,15 Training–validation set Test set
r 0.9994 0.9842
MAE (kV) 0.1370 1.3249
In the last fourth stage, using the combinations of
RMSE (kV) 0.1886 1.6957
the third one, the behavior of MLPs was observed
RAE (%) 2.9136 -
by altering the number of epochs (e) (Table 7). RRSE (%) 3.3936 -
Table 7: Fourth stage of the training process. Finally, Figures 3 and 4 present the correlation (r)
between the actual and estimated values of UC.
Fourth Stage
No e η α Nn
1 20 500–38 500 with step 2000 0.1 0.8 21
2 20 500–28 500 with step 2000 0.2 0.6 14
The application of multiple linear regression (MLR) The comparison between the models of this paper
method [14] has been performed on the same (ANN1 and MLR) shows that ANNs are superior for
datasets with those used in the ANN methodology the estimation of flashover voltage on polluted Cap
(i.e. training and test set). The MLR model for the & Pin insulators. The final performance results for
estimation of flashover voltage is described by both models are tabulated in Table 10.
(18).
Table 10: Comparison between ANN1 and MLR.
Therefore, Table 9 presents the performance of
the MLR model and Figures 5 and 6 give the Comparison ANN1–MLR
corresponding scatter diagrams.
Modeling ANN1 MLR
UC = W + Q + 12.6931 (18) r 0.9994 0.8748
MAE (kV) 0.1370 2.2036
where: RMSE (kV) 0.1886 2.6868
RAE (%) 2.9136 46.8765
W = 0.7663 ∙ D𝑚 + 0.6931 ∙ H (19) RRSE (%) 3.3936 48.3468
Test ANN1 MLR
Q = 2.0396 ∙ L − 0.1124 ∙ F − 6.7013 ∙ C (20) r 0.9842 0.9322
MAE (kV) 1.3249 2.7646
Table 9: Performance of the MLR model. RMSE (kV) 1.6957 3.2932
Comparison ANN1–ΑΝΝ
Training–validation ANN1 ANN
r 0.9994 0.9972
Test ANN1 ANN
r 0.9842 0.9853
MAPE (%) 7.48 3.84