0% found this document useful (0 votes)
22 views

Biostatistics Module: Amulya Gupta - Manraj Singh Sra - Archisman Mazumder

The document discusses various measures of central tendency including mean, median and mode. It explains their properties and when each should be used. It also covers standard deviation, normal distribution, central limit theorem and confidence intervals.

Uploaded by

AIIMS Student
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views

Biostatistics Module: Amulya Gupta - Manraj Singh Sra - Archisman Mazumder

The document discusses various measures of central tendency including mean, median and mode. It explains their properties and when each should be used. It also covers standard deviation, normal distribution, central limit theorem and confidence intervals.

Uploaded by

AIIMS Student
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Biostatistics Module

Amulya Gupta | Manraj Singh Sra | Archisman Mazumder


Central Tendencies
• Single value that attempts to describe the whole data.
• Mean, Median and Mode are measures of central tendencies but are
used in different set of conditions.
• A part of descriptive statistics
Mean
• =“Average”
• =
• Disadvantage- It is significantly affected by the outliers.
• 1,1,2,2,3,3,3– Mean would be 2.14…
• If the data is 1,1,2,2,3,3,20 – Mean would be 6. Which is not the true
measure of central tendency.
• It’s meaningless in Likert Scales
Example of Likert scale and mean
Suppose you want to measure whether a test population is sad or happy- you want to
open mental health clinics there based on that data.
So you administer a Likert scale: Very happy (5), Happy (4), Okayish (3), Sad (2), Very
Sad (1)
Test population: 400

150 people say they are very happy, 150 people say they are very sad, 10 say they are
okay, 45 people are happy and 45 people are sad.
Mean= (150x 5 + 150x 1 + 10 x 3 + 45 x 2 + 45 x 4)/400 = 3

This result says my population is okayish, but is that so?


Median
• The middle number; found by ordering all data points and picking out
the one in the middle (or if there are two middle numbers, taking the
mean of those two numbers).
• Relatively immune to outliers. If you have a very spread out data, you
can use Median.
• For example, usually Median years of survival after diagnosis of
cancer.
• However, it does not use all the information available and hence is
not used in most statistical tests.
Mode
• A particular data point that’s occurring at a higher frequency compared to the
rest.
• It’s not affected by extreme values.
• Disadvantages: Sometimes there are more than one mode.
• It is easy to understand only in discrete frequency distributions.
• When small numbers used, it is susceptible to minor variations:
• For example: 1,1,2,2,3,3,3,4,4,5,5 has 3 as mode
• But in 1,1,1,1,2,2,3,3,3,4,4,5,5 has 1 as mode – which doesn’t really
represents a ‘central tendency’
• Rarely used in Medical research.
Distribution of Data
Normal Distribution
The Normal Distribution has:
•mean = median = mode
•symmetry about the centre
•50% of values less than the mean
and 50% greater than the mean

Heights of people
Blood Pressure
Body temperatures
Standard Deviation
• Standard deviation = square root of the variance
• Variance =
Central Limit Theorem
• The central limit theorem states that if you have a population with mean μ and standard
deviation σ and take sufficiently large random samples from the population with
replacement, then the distribution of the sample means will be approximately normally
distributed. This will hold true regardless of whether the source population is normal or
skewed, provided the sample size is sufficiently large (usually n > 30). If the population is
normal, then the theorem holds true even for samples smaller than 30.
Population of 1 million people, Measuring systolic blood pressure

In each study, if you sample 100 people. And you conduct the study infinite number of times.
Frequency

BP
Confidence Intervals
• A Confidence Interval is a range of values we are fairly sure our true
value lies in.
• We measure the heights of 40 randomly chosen men, and get a mean
height of 175cm.
• We also know the standard deviation of men's heights is 20cm.
• The 95% Confidence Interval (will show to calculate it later) is:
175cm ± 6.2cm
• X  ±  Zs/√n
• Where:
Confidence
• X is the mean Z
Interval
• Z is the chosen Z-value 80% 1.282
85% 1.440
• s is the standard deviation
90% 1.645
• n is the number of observations
95% 1.960
99% 2.576
99.5% 2.807
99.9% 3.291

You might also like