0% found this document useful (0 votes)

70 views

Anova Regression Correlation Analysis A Portfolio of Work in Statistical Techniques With SPSS

Uploaded by

bensalah.ons

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views

Anova Regression Correlation Analysis A Portfolio of Work in Statistical Techniques With SPSS

Uploaded by

bensalah.ons

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 35

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/358106271

Anova Regression Correlation analysis. A portfolio of work in Statistical

Techniques with SPSS

Article · January 2022

CITATION READS

1 2,155

1 author:

Ilias Kalemis
University of Derby
5 PUBLICATIONS 1 CITATION

SEE PROFILE

All content following this page was uploaded by Ilias Kalemis on 25 January 2022.

The user has requested enhancement of the downloaded file.

Anova Regression Correlation analysis. A portfolio of
work in Statistical Techniques with SPSS.

Ilias Kalemis [1]

Derby University, Kedleston Rd, Derby DE2, GB.

[email protected]
2

Contents
1 Introduction ..................................................................................................... 4
2 Regression and correlation analysis techniques ............................................... 4
2.1 Regression analysis techniques ............................................................... 4
2.3 Correlation analysis techniques ............................................................ 10
2.4 Regression model first execution .......................................................... 11
2.5 Anova .................................................................................................... 12
2.6 Coefficient ............................................................................................ 13
2.7 Regression model second execution ..................................................... 18
2.8 Parametric and non-parametric models ................................................. 23
2.9 Normality test ....................................................................................... 25
2.10 First T-Test ........................................................................................... 26
2.11 Second T-Test ....................................................................................... 28
3 Conclusion and purpose of the research ........................................................ 29
4 References ..................................................................................................... 31
5 Appendices .................................................................................................... 31
6 Datasets.......................................................................................................... 34
3

Abstract. The specific report will focus on statistical technics that may be ap-
plied in any multivariate dataset. Nowadays the data can give us as clear answers
for queries and hypothesis that may observed as it is easier than ever to collect
data and analyze them. As we do know a standard statistical procedure involves
the collection from dataset where the model deployment such as mean, standard
deviation, regression bivariate analysis, anova, chi square test, correlation’ can
either give us a pattern recognition or a range within a decision maker can have
the best results from it. These techniques are helpful in providing insights about
data however, the final decision should remain into human favor where we must
have or give to others our structural opinion based on statistical analysis. The
subject report will analyze two different datasets and try to implement some of
the structural techniques in order to give to the readers results based on the anal-
ysis charts and tables where it may lead him into valuable decisions. The report
will extract the results from a statistical software which is one of the best tools
for these techniques SPSS [2].

Keywords: SPSS, Statistical analysis, multivariate analysis, regression analy-

sis, bivariate analysis, anova, chi square test, correlation, t – tests analysis.
4

1 Introduction

The subject report will distinct the main vulnerabilities that statistical techniques may
have on the implementation in two datasets. In the following sections we will have one
multivariate analysis, regression analysis and five bivariate analysis such as anova, chi
square test, correlation, and 2 t – tests. The first datasets that we will use on the subject
report is Gas Turbine CO and NOx Emission. The dataset contains 36733 instances of
11 sensor measures aggregated over one hour, from a gas turbine located in Turkey for
the purpose of studying flue gas emissions, namely CO and NOx [3]. Second dataset is
the AI4I 2020 Predictive Maintenance Dataset is a synthetic dataset that reflects real
predictive maintenance data encountered in industry [4].

2 Regression and correlation analysis techniques

In this section will start with implementation of regression correlation techniques on

gas turbine Co and NOx emissions dataset. In statistical techniques we use correlation
to denote association between two quantitative variables. We also assume that the as-
sociation is linear, that one variable increase or decreases a fixed amount for a unit
increase or decrease in the other metrics.

2.1 Regression analysis techniques

The other technique that is often used in these circumstances is regression, which in-
volves estimating the best straight line to summarize the association. a regression anal-
ysis model is presented with dependent variable the CO emissions from gas turbines
and independent variables the ambient temperature (AT), pressure (AP) mbar, the hu-
midity (AH) (%), the air filter difference pressure (AFDP) mbar, the gas turbine exhaust
pressure (GTEP) mbar, the turbine inlet temperature (TIT), the turbine after tempera-
ture (TAT) C, the compressor discharge pressure (CDP) mbar and the turbine energy
yield (TEY) MWH (Table 1. Descriptive statistics). The data were part from the UCI
Machine Learning Repository in regards of Gas Turbine CO and NOx Emission Data
Set Data Set. The data concerned the article were written from Heysem, PÄ±nar &
ErdinÃ§ Uzun (2019) and aimed to predict CO and NOx emissions from gas tur-
bines[3]. They were collected in an operating range of the turbine between partial load
(75%) and full load (100%). The initial data set contained 36733 instances and con-
cerned five years, 2011 – 2015. In this study we analyzed the data from year 2015 which
contained 7384 observations.

Mini- Maxi- Std. Devia-

N mum mum Mean tion

Ambient temperature 738 -62,35 99,97 25,581 22,96355

4 2
5

Ambient pressure 738 989,4 1036,6 1014,5 6,8954

4 09
Ambient humidity 738 ,03 96,67 61,880 24,17223
4 0
Air filter difference pres- 738 ,00 5,24 3,2558 1,16020
sure 4
Gas turbine exhaust pres- 738 ,02 40,72 23,534 8,92576
sure 4 0
Turbine inlet temperature 738 1016,0 1100,4 1078,9 19,7624
4 75

Turbine after temperature 738 516,04 550,59 546,64 5,48907

4 25
Compressor discharge 738 ,01 100,00 11,030 5,84373
pressure 4 4
Carbon monoxide 738 ,00 10,00 2,5968 1,74456
4
Nitrogen oxides 738 ,00 10,00 5,3126 2,05560
4
Turbine energy yield 738 100,02 179,50 133,99 16,17921
4 34

Table 1. Descriptive statistics

In the next set of charts graphs, we saw how each independent variable is related to the
dependent variable. Thus we took each variable and tried to scale it, based on the graph
charts it can be easily determined that there is a relationship between the dependent
variable and each of the independent variables or in the case there is a relationship it
does not seem that it has a linear form.
6

Figure 1. Scatterplot between CO and temperature

Figure 2. Scatterplot between CO and pressure

Figure 3. Scatterplot between CO and humidity

Figure 4. Scatterplot between CO and air filter pressure

Figure 5. Scatterplot between CO and gas turbine pressure

Figure 6. Scatterplot between CO and turbine inlet temperature

Figure 7. Scatterplot between CO and turbine after temperature

Figure 8. Scatterplot between CO and compressor discharge pressure

2.3 Correlation analysis techniques

In this section will analyze for the first dataset the correlation within the variables. A
commonly definition of correlation is the following. We can describe the degree which
of two variables are linearly related. This is an important step in bi-variate data analysis.
In the broadest sense correlation is actually any statistical relationship, whether causal
or not, between two random variables in bivariate data. Moreover, the correlation coef-
ficient is a statistical measure of the strength of the relationship between the relative
movements of two variables. The values range between -1.0 and 1.0. A correlation of -
1.0 shows a perfect negative correlation, while a correlation of 1.0 shows a perfect pos-
itive correlation. A correlation of 0.0 shows no linear relationship between the move-
ment of the two variables. From the summary review on the class we saw that the Pear-
son correlation [5] can evaluate only a linear relationship between two continuous var-
iables. A relationship is linear only when a change in one variable is associated with a
proportional change in the other variable. However, the Spearman correlation [6] can
evaluate a monotonic relationship between two variables where can be continuous or
ordinal and it is based on the ranked values for each variable rather than the raw data.
The Spearman’s rho index was used to investigate the relationships among the varia-
bles. This nonparametric test is more suitable when it does not know whether the nor-
mality assumption is met and due to the fact that the linearity assumption is not met
according to figures 1 – 8 that we have seen above [7]. It can be seen that the dependent
variable, carbon monoxide is correlated negatively or positively with the dependent
variables. The strongest correlation is with the turbine inlet temperature (rho = -.553, p
< .01) and the weakest is with ambient temperature (rho = -.063, p < .01). Also, it can
be seen that there is a statistically significant positive correlation of strong intensity
between turbine inlet temperature and the turbine energy yield (rho = .958, p < .01).
Because the correlation is very strong, we have excluded the turbine energy yield vari-
able from the regression analysis in order to avoid a multicollinearity problem.
Ambient pressure (AP)

Compressor discharge
Ambient humidity (AH)

Turbine after tempera-

Turbine inlet tempera-
Ambient temperature

Turbine energy yield

Gas turbine exhaust
Air filter difference

Carbon monoxide
pressure

pressure

pressure
(AT)

ture

Am- 1,000 -,023* -,085** ,159** ,079** ,152** -,139** ,101** -,063** ,127**
bient
temper-
ature
(AT)
Am- -,023* 1,000 ,063** -,108** -,059** -,099** -,107** -,033** ,148** -,016
bient
pressure
(AP)
11

Am- -,085** ,063** 1,000 -,150** -,212** -,228** ,057** -,162** ,130** -,170**
bient hu-
midity
(AH)
Air ,159** -,108** -,150** 1,000 ,568** ,764** -,420** ,593** -,467** ,723**
filter dif-
ference
pressure
Gas ,079** -,059** -,212** ,568** 1,000 ,758** -,439** ,624** -,388** ,771**
turbine
exhaust
pressure

Tur- ,152** -,099** -,228** ,764** ,758** 1,000 -,511** ,773** -,553** ,958**
bine inlet
tempera-
ture

Tur- -,139** -,107** ,057** -,420** -,439** -,511** 1,000 -,457** ,257** -,562**
bine af-
ter tem-
perature

Com- ,101** -,033** -,162** ,593** ,624** ,773** -,457** 1,000 -,422** ,778**
pressor
dis-
charge
pressure
Car- -,063** ,148** ,130** -,467** -,388** -,553** ,257** -,422** 1,000 -,494**
bon
monox-
ide
Tur- ,127** -,016 -,170** ,723** ,771** ,958** -,562** ,778** -,494** 1,000
bine en-
ergy
yield

Table 1. Correlation table

2.4 Regression model first execution

In table 3 it can be seen the adjusted R square of each model where the stepwise method
was used in order to result to the most suitable model and the Durbin Watson index [8]
for the last one. As it can be seen the adjusted coefficient of determination is equal to
0.373 which means that the independent variables explain 37.3% of the variability of
the dependent variable. Also, it can be seen that the Durbin Watson index is equal to
1.495 as acceptable values from Andy’s Field model can be between 1 to 3 [9].

Mode Adjusted R Std. Error of Durbin-Wat-

l R R Square Square the Estimate son
12

1 ,596a ,355 ,355 1,40119

2 ,607b ,369 ,369 1,38600

3 ,610c ,372 ,372 1,38295

d
4 ,611 ,373 ,373 1,38178

5 ,611e ,374 ,373 1,38109 1,495

Table 2. Regression Model Summary

2.5 Anova

Our next statistical indicator of the dataset will be the ANOVA. Where ANOVA is
Analysis of variance and it is a statistical method that separates observed variance data
into different components to use for additional tests. A one-way ANOVA is used for
three or more groups of data, to gain information about the relationship between the
dependent and independent variables. In table 4 it can be seen that the regression model
is statistically significant, F (5, 7378) = 880.469, p =.000.

Sum of Mean
Model Squares df Square F Sig.

1 Regres- 7976,671 1 7976,671 4062,80 ,000b

sion 6

Residual 14493,377 7382 1,963

Total 22470,049 7383

2 Regres- 8291,191 2 4145,595 2158,04 ,000c

sion 7

Residual 14178,858 7381 1,921

Total 22470,049 7383

3 Regres- 8355,363 3 2785,121 1456,22 ,000d

sion 7

Residual 14114,686 7380 1,913

Total 22470,049 7383

4 Regres- 8381,298 4 2095,324 1097,42 ,000e

sion 9
13

Residual 14088,751 7379 1,909

Total 22470,049 7383

5 Regres- 8397,116 5 1679,423 880,469 ,000f

sion

Residual 14072,933 7378 1,907

Total 22470,049 7383

Table 3. Anova

2.6 Coefficient

In table 5 it can be seen that the predictor variables are all statistical significant, Turbine
inlet temperature (b = -.054, p < .01), Ambient temperature (b = .009, p < .01), Turbine
after temperature (b = -.013, p < .01), Ambient pressure (b = .010, p < .01) and com-
pressor discharge pressure (b = .008, p < .01). Furthermore, there is not a multicollin-
earity problem since all VIFs < 10. In addition, in figures 9 and 10 it can be seen that
the normality assumption is not met while the homoscedasticity assumption is partially
met (Figure 11).
14

Figure 9. Histogram

Figure 10. Normality test

Figure 11. Heteroscedasticity test

Stand-
ardized
Unstandardized Coeffi- Collinearity Sta-

Coefficients cients tistics

Std. Tol-
Model B Error Beta t Sig. erance VIF

1 (Con- 59,347 ,890 66,646 ,000

stant)

Turbine -,053 ,001 -,596 - ,000 1,000 1,000

inlet temper- 63,740
ature

2 (Con- 58,577 ,883 66,348 ,000

stant)
16

Turbine -,052 ,001 -,590 - ,000 ,998 1,002

inlet temper- 63,753
ature

Ambient ,009 ,001 ,118 12,796 ,000 ,998 1,002

temperature
(AT)

3 (Con- 70,926 2,307 30,748 ,000

stant)

Turbine -,054 ,001 -,613 - ,000 ,839 1,191

inlet temper- 60,911

ature

Ambient ,009 ,001 ,115 12,470 ,000 ,994 1,006

temperature
(AT)

Turbine -,019 ,003 -,058 -5,792 ,000 ,840 1,190

after temper-
ature

4 (Con- 58,343 4,119 14,164 ,000

stant)

Turbine -,053 ,001 -,605 - ,000 ,800 1,251

inlet temper- 58,700
ature

Ambient ,008 ,001 ,112 11,990 ,000 ,982 1,018

temperature
(AT)

Turbine -,014 ,003 -,045 -4,149 ,000 ,738 1,355

after temper-
ature

Ambient ,009 ,003 ,037 3,686 ,000 ,859 1,164

pressure
(AP)

5 (Con- 57,638 4,124 13,975 ,000

stant)
17

Turbine -,054 ,001 -,606 - ,000 ,799 1,252

inlet temper- 58,793
ature

Ambient ,009 ,001 ,112 12,075 ,000 ,981 1,019

temperature

Turbine -,013 ,003 -,042 -3,891 ,000 ,733 1,365

after temper-
ature

Ambient ,010 ,003 ,038 3,784 ,000 ,858 1,165

pressure
(AP)

Compres- ,008 ,003 ,027 2,880 ,004 ,985 1,015

sor discharge
pressure

Table 4. Coefficients Dependent Variable: Carbon monoxide

Mini- Maxi- Std. Devia-

mum mum Mean tion N

Predicted Value 1,0789 6,5142 2,5968 1,06647 7384

Std. Predicted Value -1,423 3,673 ,000 1,000 7384

Standard Error of Pre- ,017 ,253 ,036 ,017 7384

dicted Value

Adjusted Predicted 1,0780 6,5408 2,5968 1,06675 7384

Value
Residual - 7,77733 ,00000 1,38063 7384
5,79314

Std. Residual -4,195 5,631 ,000 1,000 7384

Stud. Residual -4,201 5,634 ,000 1,001 7384

Deleted Residual - 7,78559 ,00000 1,38299 7384

5,81162
18

Stud. Deleted Residual -4,206 5,646 ,000 1,001 7384

Mahal. Distance ,097 245,988 4,999 12,659 7384

Cook's Distance ,000 ,073 ,000 ,002 7384

Centered Leverage ,000 ,033 ,001 ,002 7384

Value

Table 5. Residuals statistics a. Dependent Variable: Carbon monoxide

2.7 Regression model second execution

In table 7 it can be seen that the cook’s distance and the leverage are below 1. Thus,
based on these criteria there is no need to exclude from the analysis observations. How-
ever, they are residuals greater than the absolute value of 3.29. In addition, from the M
distance we have found the multivariate outliers based on the values of the new proba-
bility variable (Probability variable =1- (CDF.CHISQ (MAH_1,5)) which are less than
.001 (Identifying Multivariate Outliers in SPSS - Statistics Solutions, 2021). Thus, we
have implemented the following filter (Probability.MAH_1 <= .001 & -3.29 < ZRE_1
<3.29) and 99 outliers were excluded from the analysis. The remaining observations
are 7285 and we repeated the analysis.

Mode R Adjusted R Std. Error of Durbin-Wat-

l R Square Square the Estimate son

1 ,610a ,373 ,373 1,37133

2 ,624b ,389 ,389 1,35297

3 ,629c ,395 ,395 1,34635

4 ,629d ,396 ,396 1,34585 1,525

Table 6. Model summary

As it can be seen in table 7 the adjusted coefficient of determination is equal to .396

which means that the independent variables explain 39.6% of the variability of the de-
pendent variable. Also, it can be seen that the Durbin Watson index is equal to 1.525
(acceptable values between 1 – 3, Field, 2005).
19

Sum of Mean
Model Squares df Square F Sig.

1 Regres- 8136,937 1 8136,937 4326,92 ,000b

sion 2

Residual 13695,951 7283 1,881

Total 21832,889 7284

2 Regres- 8502,891 2 4251,446 2322,50 ,000c

sion 8

Residual 13329,997 7282 1,831

Total 21832,889 7284

3 Regres- 8634,831 3 2878,277 1587,86 ,000d

sion 5

Residual 13198,058 7281 1,813

Total 21832,889 7284

4 Regres- 8646,453 4 2161,613 1193,38 ,000e

sion 9

Residual 13186,435 7280 1,811

Total 21832,889 7284

Table 7. Anova

In table 5 it can be seen that the regression model is statistical significant, F(4, 7280) =
1193.389, p =.000.

Stand-
ardized

Unstandardized Coeffi- Collinearity Sta-

Coefficients cients tistics

Std. Tol-

Model B Error Beta t Sig. erance VIF

1 (Con- 61,612 ,897 68,658 ,000

stant)

Turbine -,055 ,001 -,610 - ,000 1,000 1,000

inlet temper- 65,779
ature

2 (Con- 60,886 ,887 68,654 ,000

stant)

Turbine -,054 ,001 -,606 - ,000 ,999 1,001

inlet temper- 66,100
ature

Ambient ,010 ,001 ,130 14,139 ,000 ,999 1,001

temperature
(AT)

3 (Con- 80,957 2,513 32,220 ,000

stant)

Turbine -,058 ,001 -,645 - ,000 ,794 1,260

inlet temper- 63,078
ature

Ambient ,009 ,001 ,118 12,862 ,000 ,979 1,022

temperature
(AT)

Turbine -,030 ,003 -,088 -8,532 ,000 ,785 1,274

after temper-
ature
4
(Con- 72,460 4,190 17,292 ,000
stant)

Turbine -,057 ,001 -,639 - ,000 ,755 1,324

inlet temper- 61,014
ature

Ambient ,009 ,001 ,115 12,338 ,000 ,956 1,045

temperature

(AT)
21

Turbine -,027 ,004 -,080 -7,406 ,000 ,717 1,395

after temper-
ature

Ambient ,006 ,003 ,025 2,533 ,011 ,873 1,145

pressure
(AP)

Table 8. Coefficients a. Dependent Variable: Carbon monoxide

In table 9 it can be seen that the predictor variables are all statistical significant (re-
duced by one in comparison the previous model), Turbine inlet temperature (b = -.057,
p < .01), Ambient temperature (b = .009, p < .01), Turbine after temperature (b = -.027,
p < .01) and ambient pressure (b = .006, p < .05) . Furthermore, there is not a multicol-
linearity problem since all VIFs < 10. In addition, in figure 12 it can be seen that the
normality assumption is not met while the homoscedasticity assumption is partially met
(Figure 13). A potential solution to this is the implementation of a bootstrap regression
analysis. However, this was not possible even though we reduced the number of sam-
ples (100). The following message was printed from the SPSS22.0: “Available memory
was exhausted while compiling output. All the output for this command has been de-
leted”. Table 10 Residuals Statistics.

Figure 12.Normality test

Figure 13. Heteroscedasticity test

Residuals Statistics

Mini- Maxi- Std. Devia-

mum mum Mean tion N

Predicted Value ,9275 6,0134 2,5676 1,05717 7259

Std. Predicted Value -1,551 3,259 ,000 1,000 7259

Standard Error of Pre- ,016 ,079 ,032 ,012 7259

dicted Value

Adjusted Predicted ,9226 6,0287 2,5676 1,05736 7259

Value
Residual - 4,5397 ,00000 1,31030 7259
5,71335 6

Std. Residual -4,359 3,464 ,000 1,000 7259

Stud. Residual -4,366 3,466 ,000 1,000 7259

Deleted Residual - 4,5466 ,00000 1,31175 7259

5,73096 9
23

Stud. Deleted Residual -4,371 3,469 ,000 1,001 7259

Mahal. Distance ,053 25,390 3,999 3,980 7259

Cook's Distance ,000 ,012 ,000 ,001 7259

Centered Leverage ,000 ,003 ,001 ,001 7259

Value

Table 10. Residuals Statistics Dependent Variable: Carbon monoxide

2.8 Parametric and non-parametric models

In this section we present the implementation of the parametric Anova, t – test and non-
parametric tests chi square on a data file retrieved from the website UCI Machine
Learning Repository: AI4I 2020 Predictive Maintenance Dataset Data Set [4]. The
dataset consists of 10000 observations. We have used the following variables: the prod-
uct quality variants (low, medium, high) and machine failures (yes / no) to implement
a chi square test, the product quality variants (low, medium, high) and rotational speed
[rpm] to implement an ANOVA test, machine failures (yes /no) and rotational speed
[rpm] to implement a t – test and tool wear failure (yes / no) and rotational speed [rpm]
to implement a t – test. The variables from the dataset that we will analyze in this section
are product I is consisting of a letter L, M, or H. For low (50% of all products), medium
(30%) and high (20%) as product quality variants and a variant-specific serial number.
The air temperature K generated using a random walk process later normalized to a
standard deviation of 2 K around 300 K. The process temperature K generated using a
random walk process normalized to a standard deviation of 1 K, added to the air tem-
perature plus 10 K. The rotational speed rpm calculated from a power of 2860 W, over-
laid with a normally distributed noise. The torque Nm torque values are normally dis-
tributed around 40 Nm with a Ïƒ = 10 Nm and no negative values. The tool wear min
quality variants H/M/L add 5/3/2 minutes of tool wear to the used tool in the process.
And last one a machine failure label that indicates whether the machine has failed in
this particular datapoint for any of the following failure modes are true.

Machine failure

No Yes Total

Type H N 982a 21b 1003

% 97,9% 2,1% 100,0%

Std. Residual ,4 -2,2

L N 5765a 235b 6000

% 96,1% 3,9% 100,0%

Std. Residual -,4 2,2

M N 2914a 83b 2997

% 97,2% 2,8% 100,0%

Std. Residual ,3 -1,8

Total N 9661 339 10000

% 96,6% 3,4% 100,0%

Table 9. Crosstabulation. Each subscript letter denotes a subset of Machine failure

categories whose column proportions do not differ significantly from each other at the
,05 level.

In table 11 it can be seen that 2.1% of the high variant quality products presented ma-
chine failure compared to 3.9% of the low variant quality products and the 2.8% quality
of the medium variant. This difference is statistical significant according to chi square
test, X2(2) = 13.752, p = .001 (Table 12) (The Fisher test results to the sample conclu-
sion even though the assumptions of the chi square test are met, non-zero cell and less
than 20% of the cells with frequency below 5). However, the intensity of the relation-
ship is very low according to Cramer’s V index (Table 13).

Monte Carlo Sig. (2-sided)

99% Confidence Interval

Asymp. Lower Upper

Value df Sig. (2-sided) Sig. Bound Bound
b
Pearson Chi- 13,75 2 ,001 ,001 ,000 ,002
a
Square 2

Likelihood Ratio 14,53 2 ,001 ,001b ,000 ,001

Fisher's Exact 14,02 ,001b ,000 ,002

Test 1

N of Valid Cases 1000

0
25

Table 12. Chi square test

a. 0 cells (0,0%) have expected count less than 5. The minimum expected count is 34,00.
b. Based on 10000 sampled tables with starting seed 299883525.

Monte Carlo Sig.

99% Confidence Interval

Approx. Lower Upper

Value Sig. Sig. Bound Bound

Nominal by Phi ,037 ,001 ,001c ,000 ,002

Nominal Cramer' ,037 ,001 ,001 c
,000 ,002
sV

N of Valid Cases 1000

Table 13. Cramer’s V

c. Based on 10000 sampled tables with starting seed 299883525.

2.9 Normality test

In table 14 it can be seen that the rotational speed does not follow the normal distribu-
tion in each level of the variant quality product, High, Statistic(1003) = .110, p =.000,
Low, Statistic (6000) = .107, p =.000 and medium, Statistic(2997) = .101, p =.000. In
this case the Kruskal Wallis test was used.

Kolmogorov-Smirnova Shapiro-Wilk

Ty Sta- Sta-
pe1 tistic df Sig. tistic df Sig.

Rotational speed H ,110 1003 ,000 ,863 1003 ,000

[rpm]
L ,107 6000 ,000
26

M ,101 2997 ,000 ,871 2997 ,000

Table 14. Normality test

a. Lilliefors Significance Correction

In table 15 it can be seen that there is not a statistically significant difference among the
three variant quality products in relation to the rotational speed, X2(2) = .248, p = .883.

Rotational
speed [rpm]

Chi-Square ,248

df 2

Asymp. Sig. ,883

Monte Carlo Sig. Sig. ,884c

99% Confidence Interval Lower Bound ,876

Upper Bound ,892

Table 15. Kruskal Wallis test

a. Kruskal Wallis Test

b. Grouping Variable: Type1
c. Based on 10000 sampled tables with starting seed 1335104164.

2.10 First T-Test

In table 16 it can be seen that the rotational speed does not follow the normal distribu-
tion in each level of the machine failure, yes, Statistic (339) = .339, p =.000 and no,
Statistic(9661) = .096, p =.000. In this case the bootstrap method must be used. How-
ever, because it is very difficult to present the bootstrap method (calculation problems)
we present the t – test.

Kolmogorov-Smirnova Shapiro-Wilk
27

Machine Sta- Sta-

failure tistic df Sig. tistic df Sig.

Rotational speed No ,096 9661 ,000

[rpm]
Yes ,339 339 ,000 ,548 339 ,000

Table 16. Normality test

a. Lilliefors Significance Correction

In table 17 it can be seen that homogeneity assumption is not met, F = 295.620, p =

.001. It can be seen that in the case of mechanical failure in comparison to the case of
no mechanical failure the average of rotational speed is higher, t (342.500) = 2.087, p
= .038.

Levene's Test for

Equality of Vari-
ances t-test for Equality of Means

95% Confidence
Interval of the Differ-

ence
Sig. Mean Std.
(2- Differ- Error Dif-
F Sig. t df tailed) ence ference Lower Upper

Ro- Equal 295,620 ,000 4,423 9998 ,000 43,773 9,898 24,372 63,175

ta- vari-
tional ances
speed as-

[rpm] sumed
28

Equal 2,087 342,500 ,038 43,773 20,977 2,514 85,032

vari-
ances
not as-
sumed

Table 10. t – test

2.11 Second T-Test

In table 18 it can be seen that the rotational speed does not follow the normal distribu-
tion in each level of the tool wear failure, yes, Statistic (46) = .156, p =.007 and no,
Statistic(9954) = .104, p =.000. In this case the bootstrap method could be used. How-
ever, because it is very difficult to present the bootstrap method, we present the t – test.

Kolmogorov-Smirnova Shapiro-Wilk

T Sta- Sta-
WF tistic df Sig. tistic df Sig.

Rotational speed N ,104 9954 ,000

[rpm] o

Ye ,156 46 ,007 ,852 46 ,000

Table 11. Normality test

a. Lilliefors Significance Correction

In table 19 it can be seen that the homogeneity assumption is met, F = 1.507, p = .200.
It can be seen that in the case of tool wear mechanical failure in comparison to the case
of no mechanical failure the average of rotational speed is higher, t(9998) = .220, p =
.299.
29

Levene's Test
for Equality of
Variances t-test for Equality of Means

95% Confidence
Interval of the Differ-
ence
Sig. Mean Std.

(2- Differ- Error Dif-

F Sig. t df tailed) ence ference Lower Upper

Ro- Equal 1,507 ,220 - 9998 ,299 - 26,495 - 24,411

ta- vari- 1,039 27,524 79,460
tional ances
speed assumed
[rpm]

Equal - 45,316 ,370 - 30,392 - 33,676

vari- ,906 27,524 88,725

ances
not as-
sumed

Table 19. T - test

3 Conclusion and purpose of the research

The statistical analysis of these two datasets prove that we can have a better understand-
ing of what models prove. For the first model where the dataset can be well used for
predicting turbine energy yield (TEY) using ambient variables as features. We saw that
the initial target which initially aimed to predict CO and NOx emissions from gas tur-
bines had strong relation of the carbon monoxide and the ambient temperature. How-
ever the data as they were collected in an operating range of the turbine between partial
load (75%) and full load (100%) can give us a strong indicator as it was used almost
30

in the pick of the usage as it a=was greater than 75% of its load. The second dataset of
our analysis was about the machine failure where it consists of five independent failure
modes. The tool wear failure (TWF) the tool will be replaced of fail at a randomly
selected tool wear time between 200 and 240 mins. At this point in time, the tool is
replaced 69 times, and fails 51 times we also saw that the heat dissipation causes a
process failure, if the difference between air- and process temperature is below 8.6 K
and the tools rotational speed is below 1380 rpm. This is the case for 115 data points.
power failure (PWF). The product of torque and rotational speed (in rad/s) equals the
power required for the process. If this power is below 3500 W or above 9000 W, the
process fails, which is the case 95 times in our dataset. Finally, the random failures
(RNF) of each process has a chance of 0,1 % to fail regardless of its process parameters.
This is the case for only 5 datapoints, less than could be expected for 10,000 datapoints
in the dataset.
31

4 References
1. Author, Ilias Kalemis. A portfolio of work in Statistical Techniques on SPSS software.
2. SPSS. Software https://www.ibm.com/analytics/spss-statistics-software , last accessed on
2021/5/20.
3. 1st dataset from the paper: Heysem Kaya, PÄ±nar TÃ¼fekci and ErdinÃ§ Uzun. 'Predict-
ing CO and NOx emissions from gas turbines: novel data and a benchmark PEMS', Turk-
ish Journal of Electrical Engineering & Computer Sciences, vol. 27, 2019, pp. 4783-4796.
Heysem Kaya, Department of Information and Computing Sciences, Utrecht University,
3584 CC, Utrecht, The Netherlands Email: h.kaya'@' uu.nl and PÄ±nar TÃ¼fekci, Ã‡orlu
Faculty of Engineering, NamÄ±k Kemal University, TR-59860 Ã‡orlu, TekirdaÄŸ, Tur-
key and Email: ptufekci '@' nku.edu.tr , link: https://archive.ics.uci.edu/ml/da-
tasets/Gas+Turbine+CO+and+NOx+Emission+Data+Set , last accessed on 2021/5/20.
4. 2nd dataset from the paper: Stephan Matzka, 'Explainable Artificial Intelligence for Predic-
tive Maintenance Applications', Third International Conference on Artificial Intelligence
for Industries (AI4I 2020), 2020 (in press). Stephan Matzka, School of Engineering -
Technology and Life, Hochschule fÃ¼r Technik und Wirtschaft Berlin, 12459 Berlin,
Germany, stephan.matzka '@' htw-berlin.de https://archive.ics.uci.edu/ml/da-
tasets/AI4I+2020+Predictive+Maintenance+Dataset , last accessed on 2021/5/20.
5. Pearson correlation https://libguides.library.kent.edu/SPSS/PearsonCorr , last accessed on
2021/5/20.
6. Spearman correlation Spearman C. (1904). "The proof and measurement of association be-
tween two things". American Journal of Psychology. 15 (1): 72–101.
doi:10.2307/1412159. JSTOR 1412159, last accessed on 2021/5/20.
7. Mavuto Mukaka (2012). Statistics corner: A guide to appropriate use of correlation coeffi-
cient in medical research. Malawi medical journal : the journal of Medical Association.
8. Durbin, J.; Watson, G. S. (1950). "Testing for Serial Correlation in Least Squares Regres-
sion, I". Biometrika. 37 (3–4): 409–428. doi:10.1093/biomet/37.3-4.409. JSTOR 2332391.
9. Discovering Statistics Using IBM SPSS Statistics, Field, A. (2005). Discovering statistics
using SPSS (2nd ed.). Sage Publications, Inc

5 Appendices
Regression & Correlation
DATASET ACTIVATE DataSet1.
GRAPH
/SCATTERPLOT(BIVAR)=AT WITH CO
/MISSING=LISTWISE.
GRAPH
/SCATTERPLOT(BIVAR)=AP WITH CO
/MISSING=LISTWISE.
GRAPH
/SCATTERPLOT(BIVAR)=AH WITH CO
/MISSING=LISTWISE.
GRAPH
32

/SCATTERPLOT(BIVAR)=AFDP WITH CO
/MISSING=LISTWISE.
GRAPH
/SCATTERPLOT(BIVAR)=GTEP WITH CO
/MISSING=LISTWISE.

GRAPH
/SCATTERPLOT(BIVAR)=TIT WITH CO
/MISSING=LISTWISE.
GRAPH
/SCATTERPLOT(BIVAR)=TAT WITH CO
/MISSING=LISTWISE.

GRAPH
/SCATTERPLOT(BIVAR)=CDP WITH CO
/MISSING=LISTWISE.
NONPAR CORR
/VARIABLES=AT AP AH AFDP GTEP TIT TAT CDP CO TEY
/PRINT=SPEARMAN TWOTAIL NOSIG
/MISSING=PAIRWISE.
DESCRIPTIVES VARIABLES=AT AP AH AFDP GTEP TIT TAT CDP CO
TEY
/STATISTICS=MEAN STDDEV MIN MAX.
REGRESSION
/MISSING LISTWISE
/STATISTICS COEFF OUTS R ANOVA COLLIN TOL
/CRITERIA=PIN(.05) POUT(.10)
/NOORIGIN
/DEPENDENT CO
/METHOD=STEPWISE AT AP AH AFDP GTEP TIT TAT CDP
/SCATTERPLOT=(*ZPRED ,*ZRESID)
/RESIDUALS DURBIN HISTOGRAM(ZRESID) NORMPROB(ZRESID)
/CASEWISE PLOT(ZRESID) OUTLIERS(3)
/SAVE MAHAL COOK LEVER ZRESID SDBETA SDFIT.

COMPUTE Probability.MAH_1=1- (CDF.CHISQ (MAH_1,5)).

EXECUTE.
USE ALL.
COMPUTE filter_$=(Probability.MAH_1 > .001 & -3.29<
ZRE_1 <3.29).
VARIABLE LABELS filter_$ 'Probability.MAH_1 > .001 & -
3.29< ZRE_1 <3.29 (FILTER)'.
VALUE LABELS filter_$ 0 'Not Selected' 1 'Selected'.
FORMATS filter_$ (f1.0).
33

FILTER BY filter_$.
EXECUTE.
Chi square

CROSSTABS
/TABLES=Type BY Machinefailure
/FORMAT=AVALUE TABLES
/STATISTICS=CHISQ PHI
/CELLS=COUNT ROW SRESID BPROP
/COUNT ROUND CELL
/METHOD=MC CIN(99) SAMPLES(10000).

Anova
EXAMINE VARIABLES=Rotationalspeedrpm BY Type1
/PLOT BOXPLOT STEMLEAF NPPLOT
/COMPARE GROUPS
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

NPAR TESTS
/K-W=Rotationalspeedrpm BY Type1(1 3)
/MISSING ANALYSIS
/METHOD=MC CIN(99) SAMPLES(10000).

T – test (1st)
EXAMINE VARIABLES=Rotationalspeedrpm BY Machinefailure
/PLOT BOXPLOT STEMLEAF NPPLOT
/COMPARE GROUPS
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

BOOTSTRAP
/SAMPLING METHOD=SIMPLE
/VARIABLES TARGET=Rotationalspeedrpm
INPUT=Machinefailure
/CRITERIA CILEVEL=95 CITYPE=BCA NSAMPLES=5000
/MISSING USERMISSING=EXCLUDE.
T-TEST GROUPS=Machinefailure(0 1)
/MISSING=ANALYSIS
/VARIABLES=Rotationalspeedrpm
34

/CRITERIA=CI(.95).

T – test (2nd)

EXAMINE VARIABLES=Rotationalspeedrpm BY TWF

/PLOT BOXPLOT STEMLEAF NPPLOT
/COMPARE GROUPS
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.

T-TEST GROUPS=TWF(0 1)
/MISSING=ANALYSIS
/VARIABLES=Rotationalspeedrpm
/CRITERIA=CI(.95).

6 Datasets

1) Gas Turbine CO and NOx Emission Data Set Data Set

WebSource:
https://archive.ics.uci.edu/ml/datasets/Gas+Tu
bine+CO+and+NOx+Emission+Data+Set#

2) AI4I 2020 Predictive Maintenance Dataset Data Set

WebSource:
https://archive.ics.uci.edu/ml/datasets/AI4I+2020+Predictive+Maintenance+Dataset#

View publication stats

Advanced Temperature Measurement and Control, Second Edition
From Everand
Advanced Temperature Measurement and Control, Second Edition
Gregory K. McMillan
No ratings yet
Multiple Regression Analysis
100% (2)
Multiple Regression Analysis
75 pages
Stirring: Theory and Practice
From Everand
Stirring: Theory and Practice
Marko Zlokarnik
No ratings yet
Submitted To: Sir Saeed Meo: MBEW-F17-17
No ratings yet
Submitted To: Sir Saeed Meo: MBEW-F17-17
13 pages
0004 Exploratory Analysis and Feature Selection For The
No ratings yet
0004 Exploratory Analysis and Feature Selection For The
11 pages
AI Lab File - C
No ratings yet
AI Lab File - C
52 pages
JNU
No ratings yet
JNU
6 pages
Injection Molding Processing Data
From Everand
Injection Molding Processing Data
Alberto Naranjo C.
No ratings yet
A Fair Share: Doing the Math on Individual Consumption and Global Warming
From Everand
A Fair Share: Doing the Math on Individual Consumption and Global Warming
Steffen Erik Eikenberry
No ratings yet
AnAnalysisofCombined CyclePowerPlantData - Docx 0
No ratings yet
AnAnalysisofCombined CyclePowerPlantData - Docx 0
32 pages
July 24, 2008 10:37 Spi-B626 9in X 6in FM: Empirical-Statistical Downscaling © World Scientific Publishing Co. Pte. LTD
No ratings yet
July 24, 2008 10:37 Spi-B626 9in X 6in FM: Empirical-Statistical Downscaling © World Scientific Publishing Co. Pte. LTD
6 pages
Bản Sao Của Report
No ratings yet
Bản Sao Của Report
26 pages
PROJECT REPORT Regression Analysis SPECU PDF
No ratings yet
PROJECT REPORT Regression Analysis SPECU PDF
8 pages
Dma 89
No ratings yet
Dma 89
21 pages
A First Course in Dimensional Analysis: Simplifying Complex Phenomena Using Physical Insight
From Everand
A First Course in Dimensional Analysis: Simplifying Complex Phenomena Using Physical Insight
Juan G. Santiago
No ratings yet
doc
No ratings yet
doc
7 pages
XSTK Project PDF
No ratings yet
XSTK Project PDF
26 pages
Regression Analysis
No ratings yet
Regression Analysis
10 pages
Air Quality Prediction Using Artificial Neural Networks
No ratings yet
Air Quality Prediction Using Artificial Neural Networks
5 pages
UNIT 2 Notes
No ratings yet
UNIT 2 Notes
8 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
10 pages
Harness Oil and Gas Big Data with Analytics: Optimize Exploration and Production with Data-Driven Models
From Everand
Harness Oil and Gas Big Data with Analytics: Optimize Exploration and Production with Data-Driven Models
Keith R. Holdaway
No ratings yet
Practical Time Series Analysis in Natural Sciences 1st Edition Victor Privalsky - The ebook is available for online reading or easy download
100% (1)
Practical Time Series Analysis in Natural Sciences 1st Edition Victor Privalsky - The ebook is available for online reading or easy download
70 pages
Air Quality Prediction Through Regression Model
No ratings yet
Air Quality Prediction Through Regression Model
6 pages
Where can buy Practical Time Series Analysis in Natural Sciences 1st Edition Victor Privalsky ebook with cheap price
100% (1)
Where can buy Practical Time Series Analysis in Natural Sciences 1st Edition Victor Privalsky ebook with cheap price
50 pages
Analyzing and Predicting Factors Effecting Environmental Pollution
No ratings yet
Analyzing and Predicting Factors Effecting Environmental Pollution
11 pages
Correlation and Regression
No ratings yet
Correlation and Regression
61 pages
Energy-Efficiency-Prediction
No ratings yet
Energy-Efficiency-Prediction
8 pages
R_corregr
No ratings yet
R_corregr
147 pages
Practical Time Series Analysis in Natural Sciences 1st Edition Victor Privalsky 2024 Scribd Download
100% (1)
Practical Time Series Analysis in Natural Sciences 1st Edition Victor Privalsky 2024 Scribd Download
40 pages
Data Analytics For Performance Monitoring of Gas Turbine Engine
No ratings yet
Data Analytics For Performance Monitoring of Gas Turbine Engine
9 pages
MATH3806 Group Report
No ratings yet
MATH3806 Group Report
13 pages
Handbook of Measurement in Science and Engineering, Volume 2
From Everand
Handbook of Measurement in Science and Engineering, Volume 2
Myer Kutz
No ratings yet
Multi-Objective Optimization in Chemical Engineering: Developments and Applications
From Everand
Multi-Objective Optimization in Chemical Engineering: Developments and Applications
Gade Pandu Rangaiah
No ratings yet
Modeling, Analysis and Optimization of Process and Energy Systems
From Everand
Modeling, Analysis and Optimization of Process and Energy Systems
F. Carl Knopf
No ratings yet
Assignment - 01 - SEC - B - GROUP No. 11
No ratings yet
Assignment - 01 - SEC - B - GROUP No. 11
14 pages
Anomaly Detection and Failure Prediction in Gas Turbines
No ratings yet
Anomaly Detection and Failure Prediction in Gas Turbines
98 pages
Data Analysis
No ratings yet
Data Analysis
8 pages
Automobile Engine Performance Analysis Using Regression Technique
No ratings yet
Automobile Engine Performance Analysis Using Regression Technique
2 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
35 pages
UNITIV.BtechIot
No ratings yet
UNITIV.BtechIot
43 pages
Applied Data Analysis and Modeling for Energy Engineers and Scientists - The latest ebook edition with all chapters is now available
100% (1)
Applied Data Analysis and Modeling for Energy Engineers and Scientists - The latest ebook edition with all chapters is now available
51 pages
Spina 2002
No ratings yet
Spina 2002
10 pages
827b551be7606030c4c1ca693fb54a0ed875
No ratings yet
827b551be7606030c4c1ca693fb54a0ed875
12 pages
EXCEL 9
No ratings yet
EXCEL 9
3 pages
IRJET_DATA_SCIENCE_PROJECT_ON_GDP_ANALYS
No ratings yet
IRJET_DATA_SCIENCE_PROJECT_ON_GDP_ANALYS
10 pages
Integrated Chemical Processes: Synthesis, Operation, Analysis and Control
From Everand
Integrated Chemical Processes: Synthesis, Operation, Analysis and Control
Kai Sundmacher
No ratings yet
Forecasting Report
No ratings yet
Forecasting Report
15 pages
Exemplar report_Carl_Ansell
No ratings yet
Exemplar report_Carl_Ansell
110 pages
AA_Project_Report_PDF_112253_113208_112309
No ratings yet
AA_Project_Report_PDF_112253_113208_112309
13 pages
Regression Linaire Python Tome I
No ratings yet
Regression Linaire Python Tome I
9 pages
Monika Project
No ratings yet
Monika Project
34 pages
OMG355 Multivariate Data Analysis Full Book PDF
No ratings yet
OMG355 Multivariate Data Analysis Full Book PDF
113 pages
Comparison of The Performance of GaussianNB Algorithm, The K Neighbors Classifier Algorithm
No ratings yet
Comparison of The Performance of GaussianNB Algorithm, The K Neighbors Classifier Algorithm
11 pages
Time Series Analysis and Its Applications With R Examples 4th Edition Robert H. Shumway download
100% (1)
Time Series Analysis and Its Applications With R Examples 4th Edition Robert H. Shumway download
56 pages
Assignment 2 - LP1
No ratings yet
Assignment 2 - LP1
7 pages
A Quick Tutorial For Statistical Analysis by The ITACS
No ratings yet
A Quick Tutorial For Statistical Analysis by The ITACS
32 pages
ENVE 6075 Spring 2023 Project 1 - 001224
No ratings yet
ENVE 6075 Spring 2023 Project 1 - 001224
4 pages
Full download Statistical Modeling for Degradation Data 1st Edition Ding-Geng (Din) Chen pdf docx
100% (6)
Full download Statistical Modeling for Degradation Data 1st Edition Ding-Geng (Din) Chen pdf docx
55 pages
Presentation AirQuality Prediction Using Machine Learning
No ratings yet
Presentation AirQuality Prediction Using Machine Learning
16 pages
Raymond - Kassy - 202105 - MSC - EDA Signal Processing
No ratings yet
Raymond - Kassy - 202105 - MSC - EDA Signal Processing
141 pages
Eta Test PDF
No ratings yet
Eta Test PDF
4 pages
Tutorial Sheet 2 1
No ratings yet
Tutorial Sheet 2 1
1 page
4.2 Correlation Regression TQ
No ratings yet
4.2 Correlation Regression TQ
9 pages
CBR-Index Soil Properties
No ratings yet
CBR-Index Soil Properties
5 pages
Financial Risk Measurement and Management: Volatility Forecast: GARCH Estimation
No ratings yet
Financial Risk Measurement and Management: Volatility Forecast: GARCH Estimation
5 pages
Tle
50% (2)
Tle
2 pages
Midterm Exam in Statistics and Probability Grade 11
No ratings yet
Midterm Exam in Statistics and Probability Grade 11
3 pages
Statistical Functions
No ratings yet
Statistical Functions
2 pages
Spatio Temporal Methods in Environmental Epidemiology with R 2nd Edition Gavin Shaddick download pdf
100% (3)
Spatio Temporal Methods in Environmental Epidemiology with R 2nd Edition Gavin Shaddick download pdf
81 pages
G-Test: Statistics Likelihood-Ratio Maximum Likelihood Statistical Significance Chi-Squared Tests
No ratings yet
G-Test: Statistics Likelihood-Ratio Maximum Likelihood Statistical Significance Chi-Squared Tests
5 pages
CFA 1. CFA Only Without Deleting Item, Using AMOS Version 20
No ratings yet
CFA 1. CFA Only Without Deleting Item, Using AMOS Version 20
8 pages
WEEK 5 Modular
No ratings yet
WEEK 5 Modular
10 pages
Sat Comp0142
No ratings yet
Sat Comp0142
4 pages
Cointegration Vector Estimation by Panel DOLS and Long-Run Money Demand
No ratings yet
Cointegration Vector Estimation by Panel DOLS and Long-Run Money Demand
26 pages
Self Check Exercises: Exercise 13.2
No ratings yet
Self Check Exercises: Exercise 13.2
19 pages
Tutorial 4 Sim
No ratings yet
Tutorial 4 Sim
2 pages
Final Statistical Table
No ratings yet
Final Statistical Table
9 pages
Estimating Lifetime Distributions
No ratings yet
Estimating Lifetime Distributions
8 pages
11.hypothesis Test For Means - Biostatistics
No ratings yet
11.hypothesis Test For Means - Biostatistics
2 pages
4 Rejection Region For The Population Mean
No ratings yet
4 Rejection Region For The Population Mean
2 pages
Tutorial Answers
No ratings yet
Tutorial Answers
5 pages
Forecasting
No ratings yet
Forecasting
16 pages
Tutorial Stat 322 PDF
No ratings yet
Tutorial Stat 322 PDF
58 pages
Adf
50% (2)
Adf
15 pages
004-5-MATH 361 Probability & Statistics
No ratings yet
004-5-MATH 361 Probability & Statistics
1 page
Maths Kulaliny CE 15
No ratings yet
Maths Kulaliny CE 15
47 pages
1 Collecting and Interpreting Data Edexcel PDF
No ratings yet
1 Collecting and Interpreting Data Edexcel PDF
3 pages
Module 4 Measures of Variability
No ratings yet
Module 4 Measures of Variability
9 pages
Logistic Regression Tutorial
No ratings yet
Logistic Regression Tutorial
25 pages