Essential Medical Stats - MSC Clin Res
Essential Medical Stats - MSC Clin Res
Priya Ranganathan
Professor, Anaesthesiology
Tata Memorial Hospital
[email protected]
Overview
2
Sample versus population
3
Average height of people aged 35 years
Population =
Sample =
all people in
people in this
the world aged
meeting aged
35 years
35 years
4
Sample versus population
160
cm
165 cm
162
cm 164
cm
5
Take home message 1
• We do studies in samples
• Samples are representative of the population
• Results of sample may differ from population
(“error”)
• Since we do not know true population results, we
accept the possibility of error
6
Types and Distribution of Data
7
• Categorical or qualitative data
– What type ?
• Eye colour: black, brown, blue
• Severity of pain: mild, moderate, severe
8
Categorical or qualitative data
Nominal data Ordinal data
No order between classes Order between classes
Binary / Non-binary
Binary Binary
Only 2 categories More than 2 categories
Continuous Discrete
Objects are measured Objects are counted in
on a continuous scale whole numbers
10
Summarising data
• Categorical data
– Proportions
• Numerical data
11
Normally distributed data
(Gaussian distribution)
Mean + 2 SD = 95%
of observations
•Mean = median
12
Distribution-free (Skewed) data
•Median value
•Range of values
•Inter-quartile ranges
13
Why is type and distribution of data
important?
• Determines
– how you report the results
– the choice of statistical test for analysis
14
Take home message 2
Data
Categorical Numerical
15
Take home message 2…
• For example,
– Pain score of 5/10 can be classified as moderate
– But moderate pain can mean anything from 4 to 7
16
Quiz - 1
17
18
Statistical errors
19
Study question
20
Statistical errors
Dexa = Placebo √
Dexa = Placebo
Dexa is different False positive error
from Placebo Type I error
False negative error
Dexa is different Dexa = Placebo
Type 2 error
from Placebo
Dexa is different √
from Placebo
21
Type 1 error or α error
22
Type 2 error or β error
23
24
“p“ value
25
• α is what we set as the acceptable limit for
finding a result by chance
– Usually set as 5 % or 0.05
• P value
– Probability that the result has occurred by
chance
Quiz – 2(a)
Standard Error
and
Confidence Intervals
32
Sample statistic
differs from the
population
statistic
33
• Standard error
– Difference between sample result and true
population result
• Confidence intervals
– Range of values you could expect to see in
the population
40
• Type 1 error
• Type 2 error (power)
• Expected values in the two groups (margin of
difference)
42
43
Numerical Categorical
Normal Distribution-
free
Chi-square test
Fisher’s exact test
2 groups 2 groups
Unpaired-t Mann-Whitney
48
49
Numerical Categorical
Normal Distribution-
free
53
54
Pearson’s Spearman’s
correlation Kendall’s
coefficient
57
Screening tests
Diagnostic tests
Inter-rater validation
• Continuous data
• Agreement compared using a Bland Altman plot
61
• We do studies in samples
• Samples are representative of the population
• Results of sample may differ from population
(“error”)
• Since we do not know true population results,
we accept the possibility of error
63
Take home message 2
Data
Categorical Numerical
64
Take home message 2…
• For example,
– Pain score of 5/10 can be classified as
moderate
– But moderate pain can mean anything from 4
to 7
65
Take home message 3
• Alpha error or Type 1 error
– False positive result
• P value
– Probability that the result has occurred by
chance
Take home message 4
• Standard error
– Difference between sample result and true
population result
• Confidence intervals
– Range of values you could expect to see in
the population
Take home message 5