Comparative Dissolved Gas Analysis with Machine Learning and Traditional Methods
Comparative Dissolved Gas Analysis with Machine Learning and Traditional Methods
net/publication/352806309
CITATIONS READS
8 960
3 authors:
M. Cengiz Taplamacioglu
Gazi University
81 PUBLICATIONS 2,114 CITATIONS
SEE PROFILE
All content following this page was uploaded by Merve demirci on 21 September 2021.
Abstract— Power transformers are one of the vital TC 10 dataset and the data in the literature, and performed
equipment for power systems. Therefore, in case of outage of the transformer fault diagnosis using the Extreme learning
service, the effects on the system are fatal. The fault diagnosis is machine method, which uses gas ratios and percentages as
of great importance. In this study, interpretation methods of input data, depending on the gas concentrations in this data
dissolved gas analysis used in diagnosis of power transformers set. They showed that the method used has a higher diagnostic
are examined. In the MATLAB GUI, a user interface has been accuracy than Fuzzy logic and ANN [11]. In 2014, Shari et al.
created for traditional fault diagnosis. Fault diagnosis is made used the K-nearest Neighbor method as a classifier and the K-
and the shortcomings of traditional methods have been shown. nearest Neighbor method as a classifier, using the IEC TC 10
Support Vector Machine, K-Nearest Neighbors and Decision
dataset and the data set they obtained from the Malaysian
Tree algorithms are used for diagnosis of machine learning
methods. Between these methods, the Python programming
Utility Company [12]. In 2017, Ma et al. performed fault
language is used for fault diagnosis and fault classification is classification with SVM using three gas ratios from 118 DGA
made to the IEC TC 10 database. Confusion matrices and data from the IEC TC 10 dataset and used Particle Swarm
classification performance measurements of machine learning Optimization for the optimization of SVM parameters. The
methods are obtained. The classification accuracy of the results obtained are compared with SVM and Genetic
methods has been investigated and compared to each other. Algorithm -SVM [13]. In 2019, Zhang et al. used gas ratios as
input data for the SVM, as well as gas concentrations obtained
Keywords—Dissolved gas analysis, traditional methods, SVM, from an IEC TC 10 dataset and a dataset of 117 samples. They
KNN, decision tree, power transformer, Python used Krill Herd Optimization algorithm for optimization of
SVM parameters and compared the results with different
I. INTRODUCTION optimization algorithms [14].
A power transformer is one of the most important and most In this study, DGA method used for diagnosis of power
expensive parts of power systems. Diagnosis of transformers transformers is mentioned, traditional methods and smart
is of great importance in order to ensure the continuity of methods are explained. The application and results of
energy and to avoid costly problems [1,2]. traditional methods in MATLAB GUI are given in detail.
Dissolved gas analysis (DGA) method has been widely Applications and results of smart methods used to eliminate
used in transformer fault diagnosis in recent years. Using the the shortcomings of traditional methods in Python are also
gas concentrations obtained by these methods, the fault given in detail simultaneously.
diagnosis is made with traditional and intelligent methods.
While traditional methods provide fault classification using II. DISSOLVED GAS ANALYSIS
rule bases created using expert knowledge and experience, Gas molecules of different densities are formed during
intelligent methods are used for performing the fault failure in power transformers. The formation temperature and
classification with computer-assisted learning methods [3]. formation energies of these gas molecules are different from
each other. Since the type and amount of gas molecules
Rogers Ratios Method [3], IEC Ratios Method [4],
formed during the fault differ, these gases are can be used for
Doernenburg Ratios Method [3] and Duval triangle method
fault detection. Dissolved gas analysis method is one of the
[3] are widely used in the diagnosis of power transformers.
successful methods used to detect dissolved gases in
These methods make fault classification using different ratios
transformer insulation liquid. With the chromatograph
of gas concentrations and gas percentages [3, 4]. Intelligent
method, the amount of gases in the insulating liquid is
methods are used to overcome the existing shortcomings of
measured and used for fault detection [3]. The measured
these methods. Commonly used intelligent methods are
gases by DGA method and used for transformer fault
artificial neural networks [5], expert systems [6], fuzzy
diagnosis are as follows: Hydrogen, Acetylene, Methane,
methods [7] and machine learning methods [8, 9].
Ethylene, Ethane, Carbon monoxide, Carbon dioxide [3]. The
One of the frequently used data sets in the literature in gas concentrations obtained by DGA are diagnosed by using
diagnosing power systems is the IEC TC 10 dataset. Using this both traditional and intelligent methods.
data set, a fault diagnosis has been made with different
A. Classification of Transformer Fault Types
classification algorithms. Liu et al. used five characteristic gas
ratios from the IEC TC 10 dataset and 115 data, TPC datasets There are six different types of possible faults in
in 2012 and made fault classification with Decision tree [10]. transformers [3, 4]. These:
In 2015, Malik et al. created a data set by combining the IEC
Commonly used traditional methods: Rogers Ratio Duval %CH4 Partial Discharge (PD)
Method, Doernenburg Ratio Method, IEC Ratio Method and Triangle % C2H4 Low Energy Discharge (D1)
Duval Triangle Method. Information on traditional methods Method % C2H2 High Energy Discharge (D2)
is given in Table I. Thermal Fault(T1) (<300°C)
Thermal Fault (T2) (300°C-
1) Rogers Ratio Method:
700°C)
In order to make fault classification with this method, Thermal Fault (T3) (>700°C)
three ratios of gas densities obtained from DGA analysis are Electrical/ thermal fault (DT)
used to classify transformer conditions in six different
situations [3]. 1) Decision Tree:
2) Doernenburg Ratio Method: It is a method with high diagnostic accuracy, where the
rules and decision process can be obtained. The rule base
In order to make fault classification with this method, four consists of branches and nodes [9]. Nodes represent tags
ratios of gas densities obtained from DGA analysis are used while branches represent classification conditions. The rule
to classify transformer conditions in three different situations base of the decision tree can be obtained by following the
[3]. nodes and branches.
3) IEC Ratio Method: Thanks to the decision tree used in transformer fault
This method makes fault classification in nine different diagnosis, user classification rules and decision mechanisms
situations using three ratios of gas obtained from DGA can be learned. This helps maintenance personnel to diagnose
analysis [4]. the situation of the oil [9]. Data quality is very important in
the Decision tree method, the higher the data quality, the
4) Duval Triangle Method: higher the diagnostic accuracy is obtained.
In order to make fault classification with this method, a 2) K-Nearest Neighbors (KNN):
triangular rule base is formed by using the percentages of KNN is an easy-to-apply and easy-to-understand method
three different gas densities. Transformer condition is that classifies unlabelled data according to the label of the
classified in six different ways [3]. closest data [15].
First, the distance of the unlabeled data to the labeled data
is obtained by the distance measurement method. Next, the
C. Intelligent Methods
number of k neighborhoods is determined and the tag of the
These are computer-aided intelligent methods used to closest neighbors is determined. Finally, the label of the data
increase diagnostic accuracy and overcome the shortcomings with the highest number of nearby parts is taken as the label
of traditional methods. Commonly used intelligent methods: of the data to be classified [15].
Expert systems, Artificial Neural Network, Fuzzy Systems,
Machine Learning Methods.
3) Support Vector Machine (SVM):
In this study, fault classification is made using K-Nearest It is a method that aims to find a suitable hyperplane with
Neighbors and Decision tree methods, which are machine better generalization ability and good performance that will
learning methods, and the results are compared.
maximize the margin between the data and the differential
hyperplane [16, 17].
SVM does not easily fall to the local optimum. It can
easily optimize model structure and parameters [16]. It also
has high diagnostic accuracy. Because of these advantages, it
is one of the most frequently used machine learning methods
in transformer fault diagnosis.
III. APPLICATION FOR TRADITIONAL METHODS
An application is made to diagnose power transformers
using traditional methods from DGA interpretation
techniques in MATLAB GUI. The diagnostic rule base has
been established using the rules in the existing standards for
traditional methods. A user interface has been created where
DGA gas concentrations are entered by the user, gas rates and
percentages are calculated and the type of failure is obtained
and used in diagnosis. Fig. 2. Example of MATLAB GUI
Recall= (2)
Fig. 4. Data pre-processing (Log- normalized and standardizing) ∗ ∗
F1 Score= (3)
C. Machine Learning Methods
In the present study, Support Vector Machine, Decision The confusion matrix is used to show actual values and
Tree and k-Nearest Neighbors algorithms are used for predicted values and to see them comparatively. The
transformer fault classification using IEC TC 10 dataset. diagonals of the matrix show the correctly predicted classes,
First, the dataset is prepared for the entry of preprocessed while the non-diagonal components show incorrect
algorithms. Then, the data set is randomly divided into two prediction results.
as training data set and test data set. While 80% of the data The confusion matrices and classification measures of
set is reserved as training data set, 20% of it is used as test the classification methods trained using the IEC TC 10
data set. However, data sets are arranged to include examples database are given in Table II, Table III and Table IV.
from each class. The classification algorithm used is trained
with the training data set, and then the classification Table II contains the confusion matrix and classification
performance of the algorithm is measured with the test data measures of the Decision Tree method. Decision tree method
set. Finally, the classification performances of the algorithms correctly predicted one of the PD failures in the test data set.
are compared. However, the wrong predicted one sample. At the same time,
this method correctly predicted three thermal failures while
The flow chart for the study is given in the Fig. 5. The predicting three different failures. In total, the number of
classification process is completed by applying these steps. correctly predicted faults in the matrix diagonal is 28, while
Classification algorithms are made using Python the number of faults outside the diagonal is 6.
programming language.
TABLE II. CONFUSION MATRICE AND CLASSIFICATION MEASURES OF
DECISION TREE
D. Classification Result
Decision Tree
A confusion matrix is used to demonstrate the Actual Classification
classification ability of classification algorithms. The Class\ measures
Thermal Fault
Arching Fault
Predicted
false positives, and false negatives [20]. Of these Class
PD
Precision
Thermal Fault
Arching Fault methods. Python programming language is used for
Normal State
Predicted
Class classification with intelligent methods. In the classification
PD
Precision
process, the IEC TC 10 dataset is used. The classification
F1 Score
Recall
results are obtained by randomly dividing the data set so that
the train-test ratio is 80-20%.
PD 2 0 0 0 1 1 1
The obtained classification results show that SVM has a
higher performance than other classification methods. The
Arching 0 12 0 2 1 0,86 0,92
Fault obtained results showed that SVM is the most appropriate
Thermal 0 0 3 3 1 0,50 0,67 fault classification method for the dataset used.
Fault
Normal 0 0 0 12 0,71 1 0,83
State
REFERENCES
TABLE IV. CONFUSION MATRICES AND CLASSIFICATION MEASURES [1] Fu Yang, Jin Xi and Lan Zhida, "A neural network approach to power
OF SVM transformer fault diagnosis," Sixth International Conference on
Electrical Machines and Systems, 2003. ICEMS 2003., Beijing, China,
SVM 2003, pp. 351-354 vol.1.
Actual Classification [2] M. Duval, "Dissolved gas analysis: It can save your transformer," in
Class\ measures IEEE Electrical Insulation Magazine, vol. 5, no. 6, pp. 22-27, Nov.-
Thermal Fault
Arching Fault
Normal State
Precision
F1 Score