ANALISIS DATA EPIDEMIOLOGI
MENORMALKAN DATA DAN ANALISIS REGRESI LOGISTIK
Nama : Meldi Nur Anisa
NIM : N1A118078
Dosen Pengampu: Dr. Ummi Kalsum, S.KM., M.KM.
PROGRAM STUDI ILMU KESEHATAN MASYARAKAT
FAKULTAS KEDOKTERAN DAN ILMU KESEHATAN
UNIVERSITAS JAMBI
2021
Data Usair
Interpretasi
. use "C:\Users\BC5720\Downloads\USAIR.DTA"
. des
Contains data from C:\Users\BC5720\Downloads\USAIR.DTA
obs: 41
vars: 8 28 Apr 2012 10:34
storage display value
variable name type format label variable label
town str9 %9s
so2 int %8.0g
temp float %9.0g
manuf int %8.0g
pop int %8.0g
wind float %9.0g
precip float %9.0g
days int %8.0g
Sorted by:
Variabel independen (temperatur, jumlah pabrik, jumlah populasi, angin, precip, dan hari.
Variabel dependen (populasi udara). Dengan total 8 variabel.
. sum
Variable Obs Mean Std. Dev. Min Max
town 0
so2 41 30.04878 23.47227 8 110
temp 41 55.76341 7.227716 43.5 75.5
manuf 41 463.0976 563.4739 35 3344
pop 41 608.6098 579.113 71 3369
wind 41 9.443902 1.428644 6 12.7
precip 41 36.76902 11.77155 7.05 59.8
days 41 113.9024 26.50642 36 166
. list
town so2 temp manuf pop wind precip days
1. Phoenix 10 70.3 213 582 6 7.05 36
2. Lrock 13 61 91 132 8.2 48.52 100
3. Sfran 12 56.7 453 716 8.7 20.66 67
4. Denver 17 51.9 454 515 9 12.95 86
5. Hartford 56 49.1 412 158 9 43.37 127
6. Wilming 36 54 80 80 9 40.25 114
7. Washing 29 57.3 434 757 9.3 38.89 111
8. Jackson 14 68.4 136 529 8.8 54.47 116
9. Miami 10 75.5 207 335 9 59.8 128
10. Atlanta 24 61.5 368 497 9.1 48.34 115
11. Chicago 110 50.6 3344 3369 10.4 34.44 122
12. Indian 28 52.3 361 746 9.7 38.74 121
13. DesM 17 49 104 201 11.2 30.85 103
14. Wichita 8 56.6 125 277 12.7 30.58 82
15. Louisv 30 55.6 291 593 8.3 43.11 123
16. NewO 9 68.3 204 361 8.4 56.77 113
17. Baltim 47 55 625 905 9.6 41.31 111
18. Detroit 35 49.9 1064 1513 10.1 30.96 129
19. Minn 29 43.5 699 744 10.6 25.94 137
20. Kansas 14 54.5 381 507 10 37 99
21. StLouis 56 55.9 775 622 9.5 35.89 105
22. Omaha 14 51.5 181 347 10.9 30.18 98
23. Alburq 11 56.8 46 244 8.9 7.77 58
24. Albany 46 47.6 44 116 8.8 33.36 135
25. Buffalo 11 47.1 391 463 12.4 36.11 166
26. Cincinn 23 54 462 453 7.1 39.04 132
27. Cleve 65 49.7 1007 751 10.9 34.99 155
28. Colum 26 51.5 266 540 8.6 37.01 134
29. Philad 69 54.6 1692 1950 9.6 39.93 115
30. Pittsb 61 50.4 347 520 9.4 36.22 147
31. Provid 94 50 343 179 10.6 42.75 125
32. Memphis 10 61.6 337 624 9.2 49.1 105
33. Nashville 18 59.4 275 448 7.9 46 119
34. Dallas 9 66.2 641 844 10.9 35.94 78
35. Houston 10 68.9 721 1233 10.8 48.19 103
36. SLC 28 51 137 176 8.7 15.17 89
37. Norfolk 31 59.3 96 308 10.6 44.68 116
38. Richmond 26 57.8 197 299 7.6 42.59 115
39. Seattle 29 51.1 379 531 9.4 38.79 164
40. Charlest 31 55.2 35 71 6.5 40.75 148
41. Milwak 16 45.7 569 717 11.8 29.07 123
Objek penelitian terdiri dari 41 kota.
. ladder so2
Transformation formula chi2(2) P(chi2)
cubic so2^3 43.63 0.000
square so2^2 33.57 0.000
identity so2 16.19 0.000
square root sqrt(so2) 6.53 0.038
log log(so2) 2.82 0.244
1/(square root) 1/sqrt(so2) 6.27 0.043
inverse 1/so2 5.44 0.066
1/square 1/(so2^2) 7.72 0.021
1/cubic 1/(so2^3) 14.99 0.001
. g log_so2= log(so2)
. ladder log_so2
Transformation formula chi2(2) P(chi2)
cubic log_so2^3 7.67 0.022
square log_so2^2 3.84 0.146
identity log_so2 2.82 0.244
square root sqrt(log_so2) 3.90 0.142
log log(log_so2) 4.91 0.086
1/(square root) 1/sqrt(log_so2) 5.52 0.063
inverse 1/log_so2 5.53 0.063
1/square 1/(log_so2^2) 4.75 0.093
1/cubic 1/(log_so2^3) 4.99 0.083
Setelah di analisis kita dapat melihat kenormalannya. Diketahui variabel polusi udara tidak normal. Maka
di lakukan penormalan dengan menglogin kan data data menjadi normal. Sebelumnya p= 0,000 menjadi
p= 0,2444
. ladder temp
Transformation formula chi2(2) P(chi2)
cubic temp^3 12.45 0.002
square temp^2 8.86 0.012
identity temp 5.67 0.059
square root sqrt(temp) 4.29 0.117
log log(temp) 2.87 0.238
1/(square root) 1/sqrt(temp) 1.61 0.446
inverse 1/temp 0.71 0.703
1/square 1/(temp^2) 0.00 0.998
1/cubic 1/(temp^3) 1.16 0.560
P=0,059 (normal)
. ladder manuf
Transformation formula chi2(2) P(chi2)
cubic manuf^3 61.00 0.000
square manuf^2 57.71 0.000
identity manuf 41.74 0.000
square root sqrt(manuf) 19.73 0.000
log log(manuf) 0.66 0.718
1/(square root) 1/sqrt(manuf) 12.70 0.002
inverse 1/manuf 24.90 0.000
1/square 1/(manuf^2) 38.26 0.000
1/cubic 1/(manuf^3) 45.25 0.000
. g log_manuf = log(manuf)
. ladder log_manuf
Transformation formula chi2(2) P(chi2)
cubic log_ma~f^3 10.40 0.006
square log_ma~f^2 3.62 0.164
identity log_ma~f 0.66 0.718
square root sqrt(log_ma~f) 1.93 0.381
log log(log_ma~f) 4.34 0.114
1/(square root) 1/sqrt(log_ma~f) 6.83 0.033
inverse 1/log_ma~f 9.65 0.008
1/square 1/(log_ma~f^2) 15.62 0.000
1/cubic 1/(log_ma~f^3) 21.29 0.000
Variabel jumlah pabrik awalnya tidak normal deng p=0.000, kemudian data di normalkan dengan meng-
log-an data. Kemudian data menjadi p=0,728 (normal).
. ladder pop
Transformation formula chi2(2) P(chi2)
cubic pop^3 59.51 0.000
square pop^2 54.09 0.000
identity pop 35.81 0.000
square root sqrt(pop) 15.82 0.000
log log(pop) 1.13 0.569
1/(square root) 1/sqrt(pop) 11.87 0.003
inverse 1/pop 23.51 0.000
1/square 1/(pop^2) 38.03 0.000
1/cubic 1/(pop^3) 44.70 0.000
. g log_pop = log(pop)
. ladder log_pop
Transformation formula chi2(2) P(chi2)
cubic log_pop^3 7.70 0.021
square log_pop^2 2.66 0.265
identity log_pop 1.13 0.569
square root sqrt(log_pop) 2.24 0.326
log log(log_pop) 4.15 0.126
1/(square root) 1/sqrt(log_pop) 6.06 0.048
inverse 1/log_pop 8.21 0.017
1/square 1/(log_pop^2) 12.90 0.002
1/cubic 1/(log_pop^3) 17.67 0.000
Variabel populasi awalnya tidak normal deng p=0.000, kemudian data di normalkan dengan meng-log-an
data. Kemudian data menjadi p=0,569 (normal).
. ladder wind
Transformation formula chi2(2) P(chi2)
cubic wind^3 7.20 0.027
square wind^2 2.87 0.239
identity wind 0.60 0.742
square root sqrt(wind) 1.65 0.439
log log(wind) 4.31 0.116
1/(square root) 1/sqrt(wind) 7.40 0.025
inverse 1/wind 11.07 0.004
1/square 1/(wind^2) 19.04 0.000
1/cubic 1/(wind^3) 26.56 0.000
P=0,742 (normal)
. ladder precip
Transformation formula chi2(2) P(chi2)
cubic precip^3 11.34 0.003
square precip^2 2.35 0.309
identity precip 5.41 0.067
square root sqrt(precip) 12.98 0.002
log log(precip) 21.58 0.000
1/(square root) 1/sqrt(precip) 29.27 0.000
inverse 1/precip 35.24 0.000
1/square 1/(precip^2) 42.42 0.000
1/cubic 1/(precip^3) 45.80 0.000
P=0,067 (normal)
. ladder days
Transformation formula chi2(2) P(chi2)
cubic days^3 8.02 0.018
square days^2 1.72 0.424
identity days 4.88 0.087
square root sqrt(days) 11.69 0.003
log log(days) 20.75 0.000
1/(square root) 1/sqrt(days) 30.37 0.000
inverse 1/days 39.15 0.000
1/square 1/(days^2) 51.68 0.000
1/cubic 1/(days^3) 57.88 0.000
p=0,087 (normal)
Data Sebaran
. scatter log_so2 temp
Hubungan Antara SO2 dengan temperatur
5
4
log_so2
3
2
40 50 60 70 80
temp
. scatter log_so2 log_manuf
Hubungan antara SO2 dengan jumlah pabrik
5
4
log_so2
3
2
3 4 5 6 7 8
log_manuf
. scatter log_so2 log_pop
Hubungan antara SO2 dengan populasi
5
4
log_so2
3
2
4 5 6 7 8
log_pop
Hubungan antara SO2 dengan angin
5
4
log_so2
3
2
6 8 10 12 14
wind
. scatter log_so2 precip
. scatter log_so2 days
Hubungan antara SO2 dengan precip
5
4
log_so2
3
2
10 20 30 40 50 60
precip
Hubungan antara SO2 dengan hari
5
4
log_so2
3
2
0 50 100 150 200
days
. twoway (lfitci so2 temp) (scatter log_so2 temp)
. twoway (lfitci log_so2 temp) (scatter log_so2 temp)
. twoway (lfitci log_so2 log_manuf) (scatter log_so2 log_manuf)
. twoway (lfitci log_so2 log_pop) (scatter log_so2 log_pop)
. twoway (lfitci log_so2 wind) (scatter log_so2 wind)
. twoway (lfitci log_so2 wind) (scatter log_so2 wind)
. twoway (lfitci log_so2 precip) (scatter log_so2 precip)
. twoway (lfitci log_so2 days) (scatter log_so2 days)
twoway (lfitci so2 temp) (scatter log_so2 temp)
5
4
3
2
1
40 50 60 70 80
temp
95% CI Fitted values
log_so2
twoway (lfitci log_so2 log_manuf) (scatter log_so2 log_manuf)
5
4
3
2
3 4 5 6 7 8
log_manuf
95% CI Fitted values
log_so2
Log_pop
5
4
3
2
4 5 6 7 8
log_pop
95% CI Fitted values
log_so2
Wind
5
4
3
2
6 8 10 12 14
wind
95% CI Fitted values
log_so2
Precip
5
4
3
2
10 20 30 40 50 60
precip
95% CI Fitted values
log_so2
Days
5
4
3
2
1
0 50 100 150 200
days
95% CI Fitted values
log_so2
REGRESI LINIER
INTERPRETASI
. reg log_so2 temp
Source SS df MS Number of obs = 41
F(1, 39) = 16.60
Model 5.89142105 1 5.89142105 Prob > F = 0.0002
Residual 13.8375076 39 .354807887 R-squared = 0.2986
Adj R-squared = 0.2806
Total 19.7289286 40 .493223216 Root MSE = .59566
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
temp -.0530981 .0130306 -4.07 0.000 -.079455 -.0267412
_cons 6.113935 .7325632 8.35 0.000 4.632186 7.595684
Dari hasil analisis, di temukan bahwa adanya pengaruh temperatur terhadap polusi udara. Variabel
temperatur dapat memprediksi variabel polusi udara sebesar 29,86%. Persamaan garis So2 = 6,114 + ( –
0,053) x temp.
. sum
Variable Obs Mean Std. Dev. Min Max
town 0
so2 41 30.04878 23.47227 8 110
temp 41 55.76341 7.227716 43.5 75.5
manuf 41 463.0976 563.4739 35 3344
pop 41 608.6098 579.113 71 3369
wind 41 9.443902 1.428644 6 12.7
precip 41 36.76902 11.77155 7.05 59.8
days 41 113.9024 26.50642 36 166
log_so2 41 3.153004 .7022985 2.079442 4.70048
log_manuf 41 5.692094 .9634348 3.555348 8.114923
log_pop 41 6.101705 .8044345 4.26268 8.122372
. reg log_so2 log_manuf
Source SS df MS Number of obs = 41
F(1, 39) = 5.04
Model 2.25607411 1 2.25607411 Prob > F = 0.0306
Residual 17.4728545 39 .448021911 R-squared = 0.1144
Adj R-squared = 0.0916
Total 19.7289286 40 .493223216 Root MSE = .66934
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
log_manuf .2465042 .1098493 2.24 0.031 .024313 .4686954
_cons 1.749878 .6339505 2.76 0.009 .4675924 3.032164
Dari hasil analisis, di temukan bahwa ada pengaruh jumlah pabrik terhadap polusi udara. Variabel jumlah
pabrik dapat memprediksi variabel polusi udara sebesar 11,44%. Persamaan garis So2 = 1,750 + ( 0,247)
x manuf.
. reg log_so2 log_pop
Source SS df MS Number of obs = 41
F(1, 39) = 0.40
Model .199203126 1 .199203126 Prob > F = 0.5319
Residual 19.5297255 39 .500762193 R-squared = 0.0101
Adj R-squared = -0.0153
Total 19.7289286 40 .493223216 Root MSE = .70765
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
log_pop .0877258 .1390897 0.63 0.532 -.1936097 .3690614
_cons 2.617727 .85585 3.06 0.004 .8866066 4.348847
Dari hasil analisis, di temukan bahwa tidak ada pengaruh jumlah populasi terhadap polusi udara.
Variabel jumlah populasi dapat memprediksi variabel polusi udara sebesar 1,01%. Persamaan garis So2 =
2,612 + ( 0,088) x pop.
. reg log_so2 wind
Source SS df MS Number of obs = 41
F(1, 39) = 0.01
Model .003610137 1 .003610137 Prob > F = 0.9331
Residual 19.7253185 39 .505777398 R-squared = 0.0002
Adj R-squared = -0.0255
Total 19.7289286 40 .493223216 Root MSE = .71118
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
wind -.0066498 .0787092 -0.08 0.933 -.1658542 .1525546
_cons 3.215804 .7515743 4.28 0.000 1.695601 4.736006
Dari hasil analisis, di temukan bahwa tidak ada pengaruh angin terhadap polusi udara. Variabel
angin dapat memprediksi variabel polusi udara hanya sebesar 0,02%. Persamaan garis So2 = 3,216+ ( -
0,007) x wind.
. reg log_so2 precip
Source SS df MS Number of obs = 41
F(1, 39) = 0.09
Model .04557428 1 .04557428 Prob > F = 0.7654
Residual 19.6833544 39 .504701394 R-squared = 0.0023
Adj R-squared = -0.0233
Total 19.7289286 40 .493223216 Root MSE = .71042
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
precip .0028675 .0095423 0.30 0.765 -.0164337 .0221686
_cons 3.04757 .3679858 8.28 0.000 2.303249 3.791892
Dari hasil analisis, di temukan bahwa tidak ada pengaruh presipitasi terhadap polusi udara.
Variabel presipitasi dapat memprediksi variabel polusi udara hanya sebesar 0,23%. Persamaan garis So2
= 3,047 + ( 0,003) x precip.
. reg log_so2 days
Source SS df MS Number of obs = 41
F(1, 39) = 11.59
Model 4.51844453 1 4.51844453 Prob > F = 0.0016
Residual 15.2104841 39 .390012413 R-squared = 0.2290
Adj R-squared = 0.2093
Total 19.7289286 40 .493223216 Root MSE = .62451
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
days .0126798 .0037253 3.40 0.002 .0051448 .0202149
_cons 1.70874 .4353826 3.92 0.000 .8280956 2.589385
Dari hasil analisis, di temukan bahwa ada pengaruh hari terhadap polusi udara. Variabel jumlah
hari dapat memprediksi variabel polusi udara sebesar 22,90%, sisanya diprediksi oleh variabel lain.
Persamaan garis So2 = =1,709 + ( 0,013) x days.
Regresi Linier Multivariat
. reg log_so2 temp log_manuf days
Source SS df MS Number of obs = 41
F(3, 37) = 9.07
Model 8.36229392 3 2.78743131 Prob > F = 0.0001
Residual 11.3666347 37 .307206344 R-squared = 0.4239
Adj R-squared = 0.3771
Total 19.7289286 40 .493223216 Root MSE = .55426
log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]
temp -.0382498 .0134931 -2.83 0.007 -.0655894 -.0109103
log_manuf .1700487 .0926046 1.84 0.074 -.017586 .3576834
days .0071896 .0036838 1.95 0.059 -.0002746 .0146538
_cons 3.499097 1.148032 3.05 0.004 1.172963 5.825231
Berdasarkan hasil regresi multivariate. Dapat diketahui bahwa adanya pengaruh antara
temperatur, manuf dan hari terhadap polusi udara dengan p value 0,0001 maka dapat di
simpulkan bahwa terdapat hubungan antara temperatur, manuf dan hari terhadap polusi udara.
[Koefisien x1 – 0,038; 0,17; 0,007 dan konstanta 3,499]
Persamaan garis So2 = [3,499 + (- 0,038) x temp] + [3,499 + 0,170 x manuf] + [3,499 +
(- 0,007) x days].
Artinya temp, manuf, dan hari hanya bisa menjelaskan variasi So2 sebesar 42,39%. F=
9,07 dan P= 0,0001 membuktikan bahwa koefisien β tidak sama dengan 0 atau ada pengaruh
temp, manuf dan hari terhadap So2.