STATS: Introduction to Statistical Analysis
Pawel Skuza
Statistical Consultant
eResearch@Flinders / Central Library
http://www.flinders.edu.au/staffdev/enrol/courses.php?BCZ
Reproduced from
Health Services
Research Methods
(Shi, 2008, p. 36)
[Figure: a (random) sample is drawn from the population. Statistics summarize the characteristics of the sample; parameters summarize the characteristics of the population. Inferences are made from the sample to the population.]
• Disadvantages
– Need a list of the whole population
– Can be costly, time-consuming, and logistically difficult
• Advantages
– Cheap to implement if strata are convenient groupings
– More precise results than simple random sampling
– Representativeness of stratifying variable
• Disadvantages
– Need information on stratifying variable
– Sampling frame needed for each stratum
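The trade-offs listed above can be illustrated with a minimal sketch, assuming the bullets contrast simple random sampling with proportionate stratified sampling. The population, the function names, and the 600/400 split are hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict

def simple_random_sample(population, n, seed=0):
    """Draw n units without replacement; needs a list of the whole population."""
    rng = random.Random(seed)
    return rng.sample(population, n)

def stratified_sample(population, stratum_of, n, seed=0):
    """Proportionate stratified sample: each stratum is sampled separately,
    so the stratifying variable must be known for every unit."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in population:
        strata[stratum_of(unit)].append(unit)
    sample = []
    for label in sorted(strata):
        members = strata[label]
        # Allocate the sample in proportion to stratum size
        # (rounding may make the total differ slightly from n in general).
        k = round(n * len(members) / len(population))
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical population: 600 female and 400 male respondents.
population = [("F", i) for i in range(600)] + [("M", i) for i in range(400)]

srs = simple_random_sample(population, 100)
strat = stratified_sample(population, lambda unit: unit[0], 100)

share_f_srs = sum(1 for sex, _ in srs if sex == "F") / len(srs)
share_f_strat = sum(1 for sex, _ in strat if sex == "F") / len(strat)
print(share_f_srs, share_f_strat)
```

In the stratified sample the share of each group matches the population exactly, which is the "representativeness of stratifying variable" advantage; in the simple random sample it varies from draw to draw.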
Trimmed Mean
Median
Mode
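As a rough illustration of how these measures differ, the sketch below computes them for a small, made-up set of scores containing one extreme value. The `trimmed_mean` helper and the 10% trimming proportion are assumptions for this example, not definitions taken from the slides.

```python
import statistics

def trimmed_mean(values, proportion=0.10):
    """Mean after dropping the given proportion of cases from each tail."""
    ordered = sorted(values)
    k = int(len(ordered) * proportion)  # number of cases cut from each end
    kept = ordered[k:len(ordered) - k] if k else ordered
    return statistics.mean(kept)

# Hypothetical scores with one extreme value (60).
scores = [2, 3, 3, 4, 5, 5, 5, 6, 7, 60]

mean = statistics.mean(scores)        # pulled upwards by the outlier
trimmed = trimmed_mean(scores, 0.10)  # drops the lowest and highest score
median = statistics.median(scores)    # middle of the ordered scores
mode = statistics.mode(scores)        # most frequent score

print(mean, trimmed, median, mode)
```

The ordinary mean is sensitive to the single extreme score, while the trimmed mean, median, and mode stay close to the bulk of the data.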
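Deviation from the mean:

$$d_i = X_i - \bar{X}$$

Variance (population and sample):

$$\sigma^2 = \frac{\sum (X - \mu)^2}{N} \qquad s^2 = \frac{\sum (X - \bar{X})^2}{n - 1}$$

$$\sigma^2 = \frac{\sum d^2}{N} \qquad s^2 = \frac{\sum d^2}{n - 1}$$

Standard deviation (population and sample):

$$\sigma = \sqrt{\frac{\sum (X - \mu)^2}{N}} \qquad s = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}}$$

$$\sigma = \sqrt{\frac{\sum d^2}{N}} \qquad s = \sqrt{\frac{\sum d^2}{n - 1}}$$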
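The variance and standard deviation can be computed step by step from the deviations. The sketch below does this for a small, made-up data set: the sum of squared deviations from the mean is divided by N for the population versions and by n - 1 for the sample versions, and the results are compared with Python's `statistics` module.

```python
import math
import statistics

# Hypothetical data, chosen so the arithmetic is easy to follow.
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)
mean = sum(data) / n

# Deviation of each value from the mean, then the sum of squared deviations.
deviations = [x - mean for x in data]
sum_sq = sum(d ** 2 for d in deviations)

pop_var = sum_sq / n           # population variance: divide by N
sample_var = sum_sq / (n - 1)  # sample variance: divide by n - 1
pop_sd = math.sqrt(pop_var)
sample_sd = math.sqrt(sample_var)

print(pop_var, pop_sd)
print(sample_var, sample_sd)

# The standard library gives the same results.
print(statistics.pvariance(data), statistics.pstdev(data))
print(statistics.variance(data), statistics.stdev(data))
```

Dividing by n - 1 rather than N always gives the slightly larger value, which compensates for a sample tending to underestimate the spread of its population.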
Gender
                  Frequency   Percent
Valid   Female          216      45.6
        Male            258      54.4
        Total           474     100.0
$$z = \frac{x - \bar{x}}{s}$$
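[Figure: standard normal curve with the z-score on the horizontal axis (-2 to 2); the lower-tail area P(Z ≤ -z) equals the upper-tail area 1 - P(Z ≤ z).]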
$$z = \frac{x - \mu}{\sigma} = \frac{130 - 100}{15} = 2$$
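The worked z-score (a value of 130 on a scale with mean 100 and standard deviation 15) can be checked with a short sketch. It uses `statistics.NormalDist` from the Python standard library for the tail areas; the `z_score` helper is a name introduced here for illustration.

```python
from statistics import NormalDist

def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (or below) the mean."""
    return (x - mean) / sd

z = z_score(130, 100, 15)
print(z)  # 2.0

std_normal = NormalDist()          # standard normal: mean 0, sd 1
p_below = std_normal.cdf(z)        # P(Z <= z), about 0.977
p_above = 1 - p_below              # 1 - P(Z <= z), about 0.023
p_lower_tail = std_normal.cdf(-z)  # P(Z <= -z), same area by symmetry

print(round(p_below, 4), round(p_above, 4), round(p_lower_tail, 4))
```

Under a normal distribution, only about 2.3% of values lie more than two standard deviations above the mean.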
[Figure: distributions with positive skew and negative skew]
• Hypothesis Testing
– Determine how much evidence the data provide for or against a
hypothesised relationship
• Point estimates
– A single value or statistic is used to estimate the
parameter
• Interval estimate
– Based upon the point estimate
– But also conveys the degree of accuracy of that point
estimate
• That accuracy will be affected by
– Sampling error
– Measurement error
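A point estimate and an interval estimate of a mean might be computed as in the sketch below. It assumes a 95% confidence level and a normal approximation, which is reasonable for larger samples; a t-distribution would be the usual choice for small ones. The sample is simulated and the function name is hypothetical.

```python
import math
import random
import statistics
from statistics import NormalDist

def mean_confidence_interval(sample, confidence=0.95):
    """Return the point estimate of the mean and a normal-approximation
    confidence interval around it."""
    n = len(sample)
    point = statistics.mean(sample)               # point estimate
    se = statistics.stdev(sample) / math.sqrt(n)  # estimated standard error
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    return point, (point - z * se, point + z * se)

# Hypothetical sample of 100 scores, simulated for illustration only.
rng = random.Random(1)
sample = [rng.gauss(100, 15) for _ in range(100)]

point, (low, high) = mean_confidence_interval(sample)
print(f"point estimate: {point:.1f}")
print(f"95% interval:   {low:.1f} to {high:.1f}")
```

The width of the interval conveys how accurate the point estimate is: a larger sample or less variable data gives a narrower interval, and a higher confidence level gives a wider one.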
Central Limit Theorem (CLT)
2) The sampling distribution of the mean has a standard deviation
(also called the "standard error" or "standard error of the mean")
equal to the population standard deviation, σ_x, divided by the
square root of the sample size, N:

$$\sigma_{\bar{x}} = \frac{\sigma_x}{\sqrt{N}}$$

• http://wise.cgu.edu/wise-tutorials/tutorial-central-limit-theorem/
Pawel Skuza 2013
Standard Error of the Mean
• Imagine we took lots of samples
– Each of 100 students
– And calculated the mean each time
• Then we would be able to make a graph (histogram) of
the means – sampling distribution
– The standard deviation of that graph is the standard error of the
mean
$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$$
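The thought experiment above can be simulated directly. The sketch below draws many samples of 100 students from a hypothetical normal population (mean 100, standard deviation 15, both assumed for illustration), records each sample mean, and compares the standard deviation of those means with the population standard deviation divided by the square root of the sample size.

```python
import math
import random
import statistics

rng = random.Random(42)

POP_MEAN, POP_SD = 100, 15  # hypothetical population parameters
SAMPLE_SIZE = 100           # each sample has 100 students
N_SAMPLES = 2000            # "lots of samples"

# Take many samples and calculate the mean each time.
sample_means = []
for _ in range(N_SAMPLES):
    sample = [rng.gauss(POP_MEAN, POP_SD) for _ in range(SAMPLE_SIZE)]
    sample_means.append(statistics.mean(sample))

# The standard deviation of the sample means is the standard error of the mean.
empirical_se = statistics.stdev(sample_means)
theoretical_se = POP_SD / math.sqrt(SAMPLE_SIZE)  # 15 / 10 = 1.5

print(f"empirical SE:   {empirical_se:.3f}")
print(f"theoretical SE: {theoretical_se:.3f}")
```

A histogram of `sample_means` would show the sampling distribution; its spread is much narrower than that of the individual scores.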
• It does not:
– Imply that the effect is large
– “Prove” the alternative hypothesis (rather, it provides “support
of” or “evidence for” it)
• Number of groups
• Whether measures are from same subjects
(paired, repeated) or independent samples
Example 2
Table from Pallant, J. (2007). SPSS Survival Manual : A step by step guide
to data analysis using SPSS for Windows (SPSS Version 15) (3rd ed.).
Maidenhead, Berkshire. U.K. ; New York, NY: Open University Press.
Example 3
Flowchart from http://gjyp.nl/marta/Flowchart%20(English).pdf
• Boushey, C. J., Harris, J., Bruemmer, B., & Archer, S. L.
(2008). Publishing nutrition research: A review of
sampling, sample size, statistical analysis, and other
key elements of manuscript preparation, Part 2.
Journal of the American Dietetic Association, 108(4),
679-688.