6 Continuous Data Analysis
6 Continuous Data Analysis
Haramaya University
Feb 2025
Session Objectives
BY ADISU B.
One sample t-test
BY ADISU B.
Independent two sample t-test
BY ADISU B.
Example
Research question: To test if there is significant difference in
birth weight of male and female infant→ Independent t-test is
appropriate
BY ADISU B.
Interpretation
BY ADISU B.
Paired t- test
Compare means if each observation in one sample has
one and only one pair in the other sample dependent
to each other.
BY ADISU B.
One way ANOVA (Analysis of Variance)
For two normal distributions the two sample means are
compared by t-test.
BY ADISU B.
One way ANOVA…
The t-test methodology generalizes to the one-way analysis
of variance (ANOVA) for categorical variables with more
than two categories.
BY ADISU B.
One way ANOVA…
Ho : µ1 = µ2 = : : : =µ k ,
BY ADISU B.
One way ANOVA…
BY ADISU B.
STATA CODE: oneway weight place
BY ADISU B.
One way ANOVA…
We reject the null hypothesis (p value < 0.05) and
BY ADISU B.
Correlation
Dependent variable: Y
Constant or intercept:
Parameter represents the expected response when xi =0
Slope
Parameter represents the expected increment in the response per
unit change in xi
Note: Both α and β are population parameters which are usually
unknown and hence estimated from the data by a and b
Assumptions of linear regression
Linearity :- Relationship between independent and dependent variable is
linear
To check this assumptions we draw a scatter plot of residuals and y
values
If the scatter plot follows a linear pattern (i.e. not a curvilinear pattern)
that shows that linearity assumption is met
Linear Regression Assumptions
Normality (Normally Distributed Error Terms): - Error terms follow
the normal distribution. We can use `qnorm' and `pnorm' to check
the normality of the residuals.
stepwise
forward selection
Backward/stepwise/forward selection
Backward selection:
All variables will be entered in the model
Then remove step by step until significantly contributing
variables are left in model
Least contributing variable will be removed first
Then second least contributor will be removed and so on
Forward selection:
Model starts with empty (null model)
Then most significantly contributing variable will enter first
This continuous step by step until only significantly
contributing variables enter in the model
Stepwise selection
Same as forward selection