Report Group D
Report Group D
VARIATION, SKEWNESS
AND KURTOSIS
GROUP D
MEASURES OF VARIATION, SKEWNESS AND KURTOSIS
VARIABILITY
What is variability?
• is a summary statistic that
represents the amount of
dispersion in a dataset.
• it measures how much your scores
differ from each other.
0, 2, 6, 10, 12 mean = 6
8, 7, 6, 5, 4 mean = 6
6, 6, 6, 6, 6 mean = 6
Using the mean to predict a highly
variable data set is much less useful
because the score is less
representative, but with low
variability, we have high predictability
LOW DISPERSION
easier to make predictions about the
population
HIGH DISPERSION
harder to make predictions about the
population
Example
Range = 54 – 23
= 31
Find the range of given observations:
Range = 93 – 48
=45
Find the range of the given grouped data.
Scores of Grade 7 in Mathematics class.
26-30 14
21-25 10
Find the range of the given grouped data.
Scores of Grade 8 in English class.
21-25 14
16-20 10
LET’S HAVE A
SHORT REVIEW!
SHORT
REVIEW?!?
Quartiles are the values that divide a list of numerical data
into three quarters. There are three quartiles, first,
second and third, denoted by Q1, Q2 and Q3. Here, Q2 is
nothing but the median of the given data.
Ex: 12 13 14 14 15 16 17 18 19 19 20
In Statistics, the range is the smallest of all the measures of dispersion.
It is the difference between the two extreme conclusions of the
distribution. In other words, the range is the difference between
the maximum and the minimum observation of the distribution.
It is defined by
Range = Xmax – Xmin
20-12= 8
Where Xmax is the largest observation and Xmin is the smallest
observation of the variable values.
The interquartile range defines the difference between the
third and the first quartile. Quartiles are the partitioned
values that divide the whole series into 4 equal parts. So,
there are 3 quartiles. First Quartile is denoted by Q1 known
as the lower quartile, the second Quartile is denoted by
Q2 and the third Quartile is denoted by Q3 known as the
upper quartile. Therefore, the interquartile range is equal
to the upper quartile minus lower quartile.
The difference between the upper and lower quartile is
known as the interquartile range. The formula for the
interquartile range is
44 42 50 47 49
44 43 42 45 48
41 40 41 46 48
The semi-interquartile range is defined as the measures
of dispersion. Semi interquartile range also is defined as
half of the interquartile range. It is computed as one half
the difference between the 75th percentile (Q3) and the
25th percentile (Q1). The semi-interquartile range is one-
half of the difference between the first and third
quartiles. The Formula for Semi Interquartile Range is
● Semi Interquartile Range = (Q3– Q1) / 2
Let’s try.
44 42 50 47 49
44 43 42 45 48
41 40 41 46 48
INTERQUARTILE AND SEMI- INTERQUARTILE IN
RESEARCH
THANK YOU?!?
Reference:
https://byjus.com/maths/interquartile-range/
MEAN ABSOLUTE
DEVIATION
Mybel B. Candido
Definition
• is a statistical measure that quantifies the
average distance between each data point in a
dataset and the mean of that dataset.
• is a valuable tool for assessing variability and
dispersion, providing insights into the consistency
of data.
FORMULA
Example:
dataset:
Calculate the Mean:
Calculate Absolute
Deviations:
Sum of Absolute Deviations:
1.2+2.8+0.8+0.2+2.2=7.2
Calculate MAD:
VARIANCE
GROUP D
VARIANCE
In statistics, variance measures variability from
the average or mean. It is calculated by taking
the differences between each number in the
data set and the mean, squaring the differences
to make them positive, and then dividing the
sum of the squares by the number of values in
the data set.
POPULATION VARIANCE
If we have a population data set, the formula is written as
σ2 = ∑ (xi – x̄)2
n
where,
•x̄ is the mean of population data set
•n is the total number of observations
SAMPLE VARIANCE
If the data set is a sample the formula of variance is given by
s2 = ∑ (xi – x̄)2
(n – 1)
● where,
• x̄ is the mean of sample data set
• n is the total number of observations
STANDARD DEVIATION
Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
Sample Standard Deviation
σ 𝒙𝒊 − 𝒙 𝟐
𝒔=
𝒏−𝟏
Sample Standard Deviation
82, 88, 89, 93, 98
𝒔𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒂𝒍𝒍 𝒔𝒄𝒐𝒓𝒆𝒔
𝒙=
𝑵
𝟖𝟐 + 𝟖𝟖 + 𝟖𝟗 + 𝟗𝟑 + 𝟗𝟖
𝒙=
𝟓
𝟒𝟓𝟎
𝒙=
𝟓
𝒙 = 𝟗𝟎
Sample Standard Deviation
82, 88, 89, 93, 98
Score Deviation Squared Deviation
𝒙𝒊 𝒙𝒊 − 𝒙 𝒙𝒊 − 𝒙 𝟐
98 8 64
93 3 9
89 -1 1
88 -2 4
82 -8 64
Sample Standard Deviation
σ 𝒙𝒊 − 𝒙 𝟐
𝒔=
𝒏− 𝟏
𝟏𝟒𝟐
𝒔=
𝟓−𝟏
𝟏𝟒𝟐
𝒔=
𝟒
Sample Standard Deviation
𝒔 = 𝟑𝟓. 𝟓
𝒔 = 𝟓. 𝟗𝟓𝟖 𝒐𝒓 𝟔
Population Standard Deviation
58, 69, 73, 76, 77, 84, 85, 89, 92,
97
𝝁=
𝒔𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒂𝒍𝒍 𝒔𝒄𝒐𝒓𝒆𝒔
𝑵
58 + 69 + 73 + 76 + 77 + 84 + 85 + 89 + 92 + 97
𝝁=
𝟏𝟎
𝟏𝟔𝟎
𝝁=
𝟓
𝝁= 𝟑𝟐
Population Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
Population Standard Deviation
Score Deviation Squared Deviation
𝒙𝒊 𝒙𝒊 − 𝝁 𝒙𝒊 − 𝝁 𝟐 𝝁 = 𝟑𝟐
97 65 4,225
92 60 3,600
89 57 3,249
85 53 2,809 σ 𝒙𝒊 − 𝝁 𝟐 = 24,274
84 52 2,704
77 45 2,025
76 44 1,936
73 41 1,681
69 37 1,369
58 26 676
Population Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
24,274
𝝈=
𝟏𝟎
𝝈= 𝟐, 𝟒𝟐𝟕. 𝟒
𝝈 = 𝟒𝟗. 𝟐𝟔
WHY DO WE USE SKEWNESS IN
REASEARCH?
jessa p. arago
Group d reporter
Understanding Kurtosis
▪ Kurtosis is a descriptive statistic used to help measure how data
disperse between a distribution's center and tails, with larger values
indicating a data distribution may have “heavy” tails that are thickly
concentrated with observations or that are long with extreme
observations.