0% found this document useful (0 votes)
38 views11 pages

Meldi Nur Anisa - N1A118078 - AAE - 4 - USAIR

This document analyzes epidemiological data and performs logistic regression analysis after normalizing the data. It contains information on 8 variables for 41 cities, including independent variables (temperature, number of factories, population, wind, precipitation, days) and the dependent variable (air pollution). Some of the variables were found to not be normally distributed, so logarithmic transformations were performed to normalize the data and make it suitable for regression analysis. Specifically, the number of factories and air pollution variables were transformed using logarithms to achieve normal distributions.

Uploaded by

meldi Nur Anisa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views11 pages

Meldi Nur Anisa - N1A118078 - AAE - 4 - USAIR

This document analyzes epidemiological data and performs logistic regression analysis after normalizing the data. It contains information on 8 variables for 41 cities, including independent variables (temperature, number of factories, population, wind, precipitation, days) and the dependent variable (air pollution). Some of the variables were found to not be normally distributed, so logarithmic transformations were performed to normalize the data and make it suitable for regression analysis. Specifically, the number of factories and air pollution variables were transformed using logarithms to achieve normal distributions.

Uploaded by

meldi Nur Anisa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

ANALISIS DATA EPIDEMIOLOGI

MENORMALKAN DATA DAN ANALISIS REGRESI LOGISTIK

Nama : Meldi Nur Anisa

NIM : N1A118078

Dosen Pengampu: Dr. Ummi Kalsum, S.KM., M.KM.

PROGRAM STUDI ILMU KESEHATAN MASYARAKAT


FAKULTAS KEDOKTERAN DAN ILMU KESEHATAN
UNIVERSITAS JAMBI
2021
Data Usair

Interpretasi
. use "C:\Users\BC5720\Downloads\USAIR.DTA"

. des

Contains data from C:\Users\BC5720\Downloads\USAIR.DTA


obs: 41
vars: 8 28 Apr 2012 10:34

storage display value


variable name type format label variable label

town str9 %9s


so2 int %8.0g
temp float %9.0g
manuf int %8.0g
pop int %8.0g
wind float %9.0g
precip float %9.0g
days int %8.0g

Sorted by:

Variabel independen (temperatur, jumlah pabrik, jumlah populasi, angin, precip, dan hari.
Variabel dependen (populasi udara). Dengan total 8 variabel.

. sum

Variable Obs Mean Std. Dev. Min Max

town 0
so2 41 30.04878 23.47227 8 110
temp 41 55.76341 7.227716 43.5 75.5
manuf 41 463.0976 563.4739 35 3344
pop 41 608.6098 579.113 71 3369

wind 41 9.443902 1.428644 6 12.7


precip 41 36.76902 11.77155 7.05 59.8
days 41 113.9024 26.50642 36 166

. list

town so2 temp manuf pop wind precip days

1. Phoenix 10 70.3 213 582 6 7.05 36


2. Lrock 13 61 91 132 8.2 48.52 100
3. Sfran 12 56.7 453 716 8.7 20.66 67
4. Denver 17 51.9 454 515 9 12.95 86
5. Hartford 56 49.1 412 158 9 43.37 127

6. Wilming 36 54 80 80 9 40.25 114


7. Washing 29 57.3 434 757 9.3 38.89 111
8. Jackson 14 68.4 136 529 8.8 54.47 116
9. Miami 10 75.5 207 335 9 59.8 128
10. Atlanta 24 61.5 368 497 9.1 48.34 115

11. Chicago 110 50.6 3344 3369 10.4 34.44 122


12. Indian 28 52.3 361 746 9.7 38.74 121
13. DesM 17 49 104 201 11.2 30.85 103
14. Wichita 8 56.6 125 277 12.7 30.58 82
15. Louisv 30 55.6 291 593 8.3 43.11 123

16. NewO 9 68.3 204 361 8.4 56.77 113


17. Baltim 47 55 625 905 9.6 41.31 111
18. Detroit 35 49.9 1064 1513 10.1 30.96 129
19. Minn 29 43.5 699 744 10.6 25.94 137
20. Kansas 14 54.5 381 507 10 37 99

21. StLouis 56 55.9 775 622 9.5 35.89 105


22. Omaha 14 51.5 181 347 10.9 30.18 98
23. Alburq 11 56.8 46 244 8.9 7.77 58
24. Albany 46 47.6 44 116 8.8 33.36 135
25. Buffalo 11 47.1 391 463 12.4 36.11 166

26. Cincinn 23 54 462 453 7.1 39.04 132


27. Cleve 65 49.7 1007 751 10.9 34.99 155
28. Colum 26 51.5 266 540 8.6 37.01 134
29. Philad 69 54.6 1692 1950 9.6 39.93 115
30. Pittsb 61 50.4 347 520 9.4 36.22 147

31. Provid 94 50 343 179 10.6 42.75 125


32. Memphis 10 61.6 337 624 9.2 49.1 105
33. Nashville 18 59.4 275 448 7.9 46 119
34. Dallas 9 66.2 641 844 10.9 35.94 78
35. Houston 10 68.9 721 1233 10.8 48.19 103

36. SLC 28 51 137 176 8.7 15.17 89


37. Norfolk 31 59.3 96 308 10.6 44.68 116
38. Richmond 26 57.8 197 299 7.6 42.59 115
39. Seattle 29 51.1 379 531 9.4 38.79 164
40. Charlest 31 55.2 35 71 6.5 40.75 148

41. Milwak 16 45.7 569 717 11.8 29.07 123

Objek penelitian terdiri dari 41 kota.


. ladder so2

Transformation formula chi2(2) P(chi2)

cubic so2^3 43.63 0.000


square so2^2 33.57 0.000
identity so2 16.19 0.000
square root sqrt(so2) 6.53 0.038
log log(so2) 2.82 0.244
1/(square root) 1/sqrt(so2) 6.27 0.043
inverse 1/so2 5.44 0.066
1/square 1/(so2^2) 7.72 0.021
1/cubic 1/(so2^3) 14.99 0.001

. g log_so2= log(so2)

. ladder log_so2

Transformation formula chi2(2) P(chi2)

cubic log_so2^3 7.67 0.022


square log_so2^2 3.84 0.146
identity log_so2 2.82 0.244
square root sqrt(log_so2) 3.90 0.142
log log(log_so2) 4.91 0.086
1/(square root) 1/sqrt(log_so2) 5.52 0.063
inverse 1/log_so2 5.53 0.063
1/square 1/(log_so2^2) 4.75 0.093
1/cubic 1/(log_so2^3) 4.99 0.083

Setelah di analisis kita dapat melihat kenormalannya. Diketahui variabel polusi udara tidak normal. Maka
di lakukan penormalan dengan menglogin kan data data menjadi normal. Sebelumnya p= 0,000 menjadi
p= 0,2444

. ladder temp

Transformation formula chi2(2) P(chi2)

cubic temp^3 12.45 0.002


square temp^2 8.86 0.012
identity temp 5.67 0.059
square root sqrt(temp) 4.29 0.117
log log(temp) 2.87 0.238
1/(square root) 1/sqrt(temp) 1.61 0.446
inverse 1/temp 0.71 0.703
1/square 1/(temp^2) 0.00 0.998
1/cubic 1/(temp^3) 1.16 0.560

P=0,059 (normal)

. ladder manuf

Transformation formula chi2(2) P(chi2)

cubic manuf^3 61.00 0.000


square manuf^2 57.71 0.000
identity manuf 41.74 0.000
square root sqrt(manuf) 19.73 0.000
log log(manuf) 0.66 0.718
1/(square root) 1/sqrt(manuf) 12.70 0.002
inverse 1/manuf 24.90 0.000
1/square 1/(manuf^2) 38.26 0.000
1/cubic 1/(manuf^3) 45.25 0.000

. g log_manuf = log(manuf)
. ladder log_manuf

Transformation formula chi2(2) P(chi2)

cubic log_ma~f^3 10.40 0.006


square log_ma~f^2 3.62 0.164
identity log_ma~f 0.66 0.718
square root sqrt(log_ma~f) 1.93 0.381
log log(log_ma~f) 4.34 0.114
1/(square root) 1/sqrt(log_ma~f) 6.83 0.033
inverse 1/log_ma~f 9.65 0.008
1/square 1/(log_ma~f^2) 15.62 0.000
1/cubic 1/(log_ma~f^3) 21.29 0.000

Variabel jumlah pabrik awalnya tidak normal deng p=0.000, kemudian data di normalkan dengan meng-
log-an data. Kemudian data menjadi p=0,728 (normal).
. ladder pop

Transformation formula chi2(2) P(chi2)

cubic pop^3 59.51 0.000


square pop^2 54.09 0.000
identity pop 35.81 0.000
square root sqrt(pop) 15.82 0.000
log log(pop) 1.13 0.569
1/(square root) 1/sqrt(pop) 11.87 0.003
inverse 1/pop 23.51 0.000
1/square 1/(pop^2) 38.03 0.000
1/cubic 1/(pop^3) 44.70 0.000

. g log_pop = log(pop)

. ladder log_pop

Transformation formula chi2(2) P(chi2)

cubic log_pop^3 7.70 0.021


square log_pop^2 2.66 0.265
identity log_pop 1.13 0.569
square root sqrt(log_pop) 2.24 0.326
log log(log_pop) 4.15 0.126
1/(square root) 1/sqrt(log_pop) 6.06 0.048
inverse 1/log_pop 8.21 0.017
1/square 1/(log_pop^2) 12.90 0.002
1/cubic 1/(log_pop^3) 17.67 0.000

Variabel populasi awalnya tidak normal deng p=0.000, kemudian data di normalkan dengan meng-log-an
data. Kemudian data menjadi p=0,569 (normal).
. ladder wind

Transformation formula chi2(2) P(chi2)

cubic wind^3 7.20 0.027


square wind^2 2.87 0.239
identity wind 0.60 0.742
square root sqrt(wind) 1.65 0.439
log log(wind) 4.31 0.116
1/(square root) 1/sqrt(wind) 7.40 0.025
inverse 1/wind 11.07 0.004
1/square 1/(wind^2) 19.04 0.000
1/cubic 1/(wind^3) 26.56 0.000

P=0,742 (normal)
. ladder precip

Transformation formula chi2(2) P(chi2)

cubic precip^3 11.34 0.003


square precip^2 2.35 0.309
identity precip 5.41 0.067
square root sqrt(precip) 12.98 0.002
log log(precip) 21.58 0.000
1/(square root) 1/sqrt(precip) 29.27 0.000
inverse 1/precip 35.24 0.000
1/square 1/(precip^2) 42.42 0.000
1/cubic 1/(precip^3) 45.80 0.000

P=0,067 (normal)
. ladder days

Transformation formula chi2(2) P(chi2)

cubic days^3 8.02 0.018


square days^2 1.72 0.424
identity days 4.88 0.087
square root sqrt(days) 11.69 0.003
log log(days) 20.75 0.000
1/(square root) 1/sqrt(days) 30.37 0.000
inverse 1/days 39.15 0.000
1/square 1/(days^2) 51.68 0.000
1/cubic 1/(days^3) 57.88 0.000

p=0,087 (normal)

Data Sebaran

. scatter log_so2 temp

Hubungan Antara SO2 dengan temperatur


5
4
log_so2
3
2

40 50 60 70 80
temp

. scatter log_so2 log_manuf

Hubungan antara SO2 dengan jumlah pabrik


5
4
log_so2
3
2

3 4 5 6 7 8
log_manuf

. scatter log_so2 log_pop

Hubungan antara SO2 dengan populasi


5
4
log_so2
3
2
4 5 6 7 8
log_pop

Hubungan antara SO2 dengan angin


5
4
log_so2
3
2

6 8 10 12 14
wind

. scatter log_so2 precip

. scatter log_so2 days

Hubungan antara SO2 dengan precip


5
4
log_so2
3
2

10 20 30 40 50 60
precip

Hubungan antara SO2 dengan hari


5
4
log_so2
3
2

0 50 100 150 200


days
. twoway (lfitci so2 temp) (scatter log_so2 temp)

. twoway (lfitci log_so2 temp) (scatter log_so2 temp)

. twoway (lfitci log_so2 log_manuf) (scatter log_so2 log_manuf)

. twoway (lfitci log_so2 log_pop) (scatter log_so2 log_pop)

. twoway (lfitci log_so2 wind) (scatter log_so2 wind)

. twoway (lfitci log_so2 wind) (scatter log_so2 wind)

. twoway (lfitci log_so2 precip) (scatter log_so2 precip)

. twoway (lfitci log_so2 days) (scatter log_so2 days)

twoway (lfitci so2 temp) (scatter log_so2 temp)


5
4
3
2
1

40 50 60 70 80
temp

95% CI Fitted values


log_so2

twoway (lfitci log_so2 log_manuf) (scatter log_so2 log_manuf)


5
4
3
2

3 4 5 6 7 8
log_manuf

95% CI Fitted values


log_so2

Log_pop
5
4
3
2

4 5 6 7 8
log_pop

95% CI Fitted values


log_so2
Wind

5
4
3
2
6 8 10 12 14
wind

95% CI Fitted values


log_so2

Precip
5
4
3
2

10 20 30 40 50 60
precip

95% CI Fitted values


log_so2

Days
5
4
3
2
1

0 50 100 150 200


days

95% CI Fitted values


log_so2
REGRESI LINIER

INTERPRETASI

. reg log_so2 temp

Source SS df MS Number of obs = 41


F(1, 39) = 16.60
Model 5.89142105 1 5.89142105 Prob > F = 0.0002
Residual 13.8375076 39 .354807887 R-squared = 0.2986
Adj R-squared = 0.2806
Total 19.7289286 40 .493223216 Root MSE = .59566

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

temp -.0530981 .0130306 -4.07 0.000 -.079455 -.0267412


_cons 6.113935 .7325632 8.35 0.000 4.632186 7.595684

Dari hasil analisis, di temukan bahwa adanya pengaruh temperatur terhadap polusi udara. Variabel
temperatur dapat memprediksi variabel polusi udara sebesar 29,86%. Persamaan garis So2 = 6,114 + ( –
0,053) x temp.
. sum

Variable Obs Mean Std. Dev. Min Max

town 0
so2 41 30.04878 23.47227 8 110
temp 41 55.76341 7.227716 43.5 75.5
manuf 41 463.0976 563.4739 35 3344
pop 41 608.6098 579.113 71 3369

wind 41 9.443902 1.428644 6 12.7


precip 41 36.76902 11.77155 7.05 59.8
days 41 113.9024 26.50642 36 166
log_so2 41 3.153004 .7022985 2.079442 4.70048
log_manuf 41 5.692094 .9634348 3.555348 8.114923

log_pop 41 6.101705 .8044345 4.26268 8.122372

. reg log_so2 log_manuf

Source SS df MS Number of obs = 41


F(1, 39) = 5.04
Model 2.25607411 1 2.25607411 Prob > F = 0.0306
Residual 17.4728545 39 .448021911 R-squared = 0.1144
Adj R-squared = 0.0916
Total 19.7289286 40 .493223216 Root MSE = .66934

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

log_manuf .2465042 .1098493 2.24 0.031 .024313 .4686954


_cons 1.749878 .6339505 2.76 0.009 .4675924 3.032164

Dari hasil analisis, di temukan bahwa ada pengaruh jumlah pabrik terhadap polusi udara. Variabel jumlah
pabrik dapat memprediksi variabel polusi udara sebesar 11,44%. Persamaan garis So2 = 1,750 + ( 0,247)
x manuf.

. reg log_so2 log_pop

Source SS df MS Number of obs = 41


F(1, 39) = 0.40
Model .199203126 1 .199203126 Prob > F = 0.5319
Residual 19.5297255 39 .500762193 R-squared = 0.0101
Adj R-squared = -0.0153
Total 19.7289286 40 .493223216 Root MSE = .70765

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

log_pop .0877258 .1390897 0.63 0.532 -.1936097 .3690614


_cons 2.617727 .85585 3.06 0.004 .8866066 4.348847
Dari hasil analisis, di temukan bahwa tidak ada pengaruh jumlah populasi terhadap polusi udara.
Variabel jumlah populasi dapat memprediksi variabel polusi udara sebesar 1,01%. Persamaan garis So2 =
2,612 + ( 0,088) x pop.

. reg log_so2 wind

Source SS df MS Number of obs = 41


F(1, 39) = 0.01
Model .003610137 1 .003610137 Prob > F = 0.9331
Residual 19.7253185 39 .505777398 R-squared = 0.0002
Adj R-squared = -0.0255
Total 19.7289286 40 .493223216 Root MSE = .71118

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

wind -.0066498 .0787092 -0.08 0.933 -.1658542 .1525546


_cons 3.215804 .7515743 4.28 0.000 1.695601 4.736006

Dari hasil analisis, di temukan bahwa tidak ada pengaruh angin terhadap polusi udara. Variabel
angin dapat memprediksi variabel polusi udara hanya sebesar 0,02%. Persamaan garis So2 = 3,216+ ( -
0,007) x wind.
. reg log_so2 precip

Source SS df MS Number of obs = 41


F(1, 39) = 0.09
Model .04557428 1 .04557428 Prob > F = 0.7654
Residual 19.6833544 39 .504701394 R-squared = 0.0023
Adj R-squared = -0.0233
Total 19.7289286 40 .493223216 Root MSE = .71042

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

precip .0028675 .0095423 0.30 0.765 -.0164337 .0221686


_cons 3.04757 .3679858 8.28 0.000 2.303249 3.791892

Dari hasil analisis, di temukan bahwa tidak ada pengaruh presipitasi terhadap polusi udara.
Variabel presipitasi dapat memprediksi variabel polusi udara hanya sebesar 0,23%. Persamaan garis So2
= 3,047 + ( 0,003) x precip.
. reg log_so2 days

Source SS df MS Number of obs = 41


F(1, 39) = 11.59
Model 4.51844453 1 4.51844453 Prob > F = 0.0016
Residual 15.2104841 39 .390012413 R-squared = 0.2290
Adj R-squared = 0.2093
Total 19.7289286 40 .493223216 Root MSE = .62451

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

days .0126798 .0037253 3.40 0.002 .0051448 .0202149


_cons 1.70874 .4353826 3.92 0.000 .8280956 2.589385

Dari hasil analisis, di temukan bahwa ada pengaruh hari terhadap polusi udara. Variabel jumlah
hari dapat memprediksi variabel polusi udara sebesar 22,90%, sisanya diprediksi oleh variabel lain.
Persamaan garis So2 = =1,709 + ( 0,013) x days.

Regresi Linier Multivariat


. reg log_so2 temp log_manuf days

Source SS df MS Number of obs = 41


F(3, 37) = 9.07
Model 8.36229392 3 2.78743131 Prob > F = 0.0001
Residual 11.3666347 37 .307206344 R-squared = 0.4239
Adj R-squared = 0.3771
Total 19.7289286 40 .493223216 Root MSE = .55426

log_so2 Coef. Std. Err. t P>|t| [95% Conf. Interval]

temp -.0382498 .0134931 -2.83 0.007 -.0655894 -.0109103


log_manuf .1700487 .0926046 1.84 0.074 -.017586 .3576834
days .0071896 .0036838 1.95 0.059 -.0002746 .0146538
_cons 3.499097 1.148032 3.05 0.004 1.172963 5.825231

Berdasarkan hasil regresi multivariate. Dapat diketahui bahwa adanya pengaruh antara
temperatur, manuf dan hari terhadap polusi udara dengan p value 0,0001 maka dapat di
simpulkan bahwa terdapat hubungan antara temperatur, manuf dan hari terhadap polusi udara.
[Koefisien x1 – 0,038; 0,17; 0,007 dan konstanta 3,499]

Persamaan garis So2 = [3,499 + (- 0,038) x temp] + [3,499 + 0,170 x manuf] + [3,499 +
(- 0,007) x days].

Artinya temp, manuf, dan hari hanya bisa menjelaskan variasi So2 sebesar 42,39%. F=
9,07 dan P= 0,0001 membuktikan bahwa koefisien β tidak sama dengan 0 atau ada pengaruh
temp, manuf dan hari terhadap So2.

You might also like