0% found this document useful (0 votes)
22 views100 pages

Report Group D

Uploaded by

Leni Mae A. Arca
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views100 pages

Report Group D

Uploaded by

Leni Mae A. Arca
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 100

MEASURES OF

VARIATION, SKEWNESS
AND KURTOSIS
GROUP D
MEASURES OF VARIATION, SKEWNESS AND KURTOSIS

TOPICS REPORTERS TOPIC REPORTERS

1. VARIABILITY Mizzie Carvajal 7. STANDARD DEVIATION Julius Gilbert


Mallen
2. RANGE Eyra Jane Velasco
8. COEFFICAIENT OF Ronel Alde
3. INTERQUARTILE RANGE Ericah Joy Lesigues VARIATION
4. SEMI- INTERQUARTILE

5. MEAN ABSOLUTE Mybel Candido


9. SKEWNESS Gina Mae
DEVIATION Gabilan

6. VARIANCE Edson Louis Para 10. KURTOSIS Jessa Arago


Let's Learn

VARIABILITY
What is variability?
• is a summary statistic that
represents the amount of
dispersion in a dataset.
• it measures how much your scores
differ from each other.
0, 2, 6, 10, 12 mean = 6
8, 7, 6, 5, 4 mean = 6
6, 6, 6, 6, 6 mean = 6
Using the mean to predict a highly
variable data set is much less useful
because the score is less
representative, but with low
variability, we have high predictability
LOW DISPERSION
easier to make predictions about the
population

HIGH DISPERSION
harder to make predictions about the
population
Example

You are investigating the amounts of time spent on


phones daily by different groups of people.

Sample A: HIGH SCHOOL STUDENTS


Sample B: COLLEGE STUDENTS
Sample C: ADULT FULL-TIME EMPLOYEES
In statistics, the range of
observations is the value that is
obtained from the difference
between the highest and lowest
values.
Formula for finding the range:

R = Highest observation – Lowest observation


or
R = Highest value – Lowest value
Find the range of given observations:
32, 41, 28, 54, 35, 26, 23, 33, 38, 40

23, 26, 28, 32, 33, 35, 38, 40, 41, 54

Range = 54 – 23
= 31
Find the range of given observations:

48, 50, 50, 51, 53, 90, 90, 91, 92, 93

Range = 93 – 48
=45
Find the range of the given grouped data.
Scores of Grade 7 in Mathematics class.

Scores f Range = Highest boundary -Lowest boundary


of the higher and lower class interval
36-40 3
R = Hb - Lb
31-35 11

26-30 14

21-25 10
Find the range of the given grouped data.
Scores of Grade 8 in English class.

Scores f Range = Highest boundary -Lowest boundary


of the higher and lower class interval
31-35 6
R = Hb - Lb
26-30 18

21-25 14

16-20 10
LET’S HAVE A
SHORT REVIEW!

SHORT
REVIEW?!?
Quartiles are the values that divide a list of numerical data
into three quarters. There are three quartiles, first,
second and third, denoted by Q1, Q2 and Q3. Here, Q2 is
nothing but the median of the given data.

Ex: 12 13 14 14 15 16 17 18 19 19 20
In Statistics, the range is the smallest of all the measures of dispersion.
It is the difference between the two extreme conclusions of the
distribution. In other words, the range is the difference between
the maximum and the minimum observation of the distribution.
It is defined by
Range = Xmax – Xmin
20-12= 8
Where Xmax is the largest observation and Xmin is the smallest
observation of the variable values.
The interquartile range defines the difference between the
third and the first quartile. Quartiles are the partitioned
values that divide the whole series into 4 equal parts. So,
there are 3 quartiles. First Quartile is denoted by Q1 known
as the lower quartile, the second Quartile is denoted by
Q2 and the third Quartile is denoted by Q3 known as the
upper quartile. Therefore, the interquartile range is equal
to the upper quartile minus lower quartile.
The difference between the upper and lower quartile is
known as the interquartile range. The formula for the
interquartile range is

Interquartile range = Upper Quartile – Lower Quartile


IQR = Q3 – Q1
How to Calculate the Interquartile Range?
The procedure to calculate the interquartile range is given as follows:
● Arrange the given set of numbers into increasing or decreasing order.
● Then count the given values. If it is odd, then the center value is median
otherwise obtain the mean value for two center values. This is known as
Q2 value. If there are even number of values, the median will be the average
of the middle two values.
● Median equally cuts the given values into two equal parts. They are
described as Q1 and Q3 parts.
● The median of data values below the median represents Q1.
● The median of data values above the median value represents Q3.
● Finally, we can subtract the median values of Q1 and Q3.
● The resulting value is the interquartile range.
Let’s try.

In the given data set, solve for the


Interquartile Range.
12 13 14 14 15 16 17 18
19 19 20
Let’s try.

The following are the exam scores of fifteen


grade 7 students,solve for the Interquartile
Range.

44 42 50 47 49
44 43 42 45 48
41 40 41 46 48
The semi-interquartile range is defined as the measures
of dispersion. Semi interquartile range also is defined as
half of the interquartile range. It is computed as one half
the difference between the 75th percentile (Q3) and the
25th percentile (Q1). The semi-interquartile range is one-
half of the difference between the first and third
quartiles. The Formula for Semi Interquartile Range is
● Semi Interquartile Range = (Q3– Q1) / 2
Let’s try.

In the given data set, solve for the


Interquartile Range.
12 13 14 14 15 16 17 18
19 19 20
Let’s try.

The following are the exam scores of fifteen


grade 7 students,solve for the Interquartile
Range.

44 42 50 47 49
44 43 42 45 48
41 40 41 46 48
INTERQUARTILE AND SEMI- INTERQUARTILE IN
RESEARCH

•Descriptive Statistics: IQR helps summarize data by


indicating the range within which the central 50% of values lie,
giving a clear picture of variability without being affected by
outliers.
•Comparative Studies: Researchers often use IQR to
compare the spread of different groups or conditions, helping to
identify differences in data distributions.
THANK YOU

THANK YOU?!?
Reference:
https://byjus.com/maths/interquartile-range/
MEAN ABSOLUTE
DEVIATION

Mybel B. Candido
Definition
• is a statistical measure that quantifies the
average distance between each data point in a
dataset and the mean of that dataset.
• is a valuable tool for assessing variability and
dispersion, providing insights into the consistency
of data.
FORMULA
Example:
dataset:
Calculate the Mean:
Calculate Absolute
Deviations:
Sum of Absolute Deviations:

1.2+2.8+0.8+0.2+2.2=7.2
Calculate MAD:
VARIANCE

GROUP D
VARIANCE
In statistics, variance measures variability from
the average or mean. It is calculated by taking
the differences between each number in the
data set and the mean, squaring the differences
to make them positive, and then dividing the
sum of the squares by the number of values in
the data set.
POPULATION VARIANCE
If we have a population data set, the formula is written as

σ2 = ∑ (xi – x̄)2
n

where,
•x̄ is the mean of population data set
•n is the total number of observations
SAMPLE VARIANCE
If the data set is a sample the formula of variance is given by

s2 = ∑ (xi – x̄)2
(n – 1)
● where,
• x̄ is the mean of sample data set
• n is the total number of observations
STANDARD DEVIATION
Standard Deviation

It is a measure of how dispersed the


data is in relation to the mean. It tells
us how far each score lies from the
mean.
Standard Deviation

In normal distribution, a high standard


deviation means the values are generally
far from the mean, while a low standard
deviation indicates that values are
clustered close to the mean.
Population Standard Deviation

σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
Sample Standard Deviation

σ 𝒙𝒊 − 𝒙 𝟐
𝒔=
𝒏−𝟏
Sample Standard Deviation
82, 88, 89, 93, 98
𝒔𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒂𝒍𝒍 𝒔𝒄𝒐𝒓𝒆𝒔
𝒙=
𝑵
𝟖𝟐 + 𝟖𝟖 + 𝟖𝟗 + 𝟗𝟑 + 𝟗𝟖
𝒙=
𝟓

𝟒𝟓𝟎
𝒙=
𝟓
𝒙 = 𝟗𝟎
Sample Standard Deviation
82, 88, 89, 93, 98
Score Deviation Squared Deviation
𝒙𝒊 𝒙𝒊 − 𝒙 𝒙𝒊 − 𝒙 𝟐
98 8 64

93 3 9

89 -1 1

88 -2 4

82 -8 64
Sample Standard Deviation
σ 𝒙𝒊 − 𝒙 𝟐
𝒔=
𝒏− 𝟏

𝟏𝟒𝟐
𝒔=
𝟓−𝟏

𝟏𝟒𝟐
𝒔=
𝟒
Sample Standard Deviation

𝒔 = 𝟑𝟓. 𝟓

𝒔 = 𝟓. 𝟗𝟓𝟖 𝒐𝒓 𝟔
Population Standard Deviation
58, 69, 73, 76, 77, 84, 85, 89, 92,
97
𝝁=
𝒔𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒂𝒍𝒍 𝒔𝒄𝒐𝒓𝒆𝒔
𝑵
58 + 69 + 73 + 76 + 77 + 84 + 85 + 89 + 92 + 97
𝝁=
𝟏𝟎
𝟏𝟔𝟎
𝝁=
𝟓
𝝁= 𝟑𝟐
Population Standard Deviation

σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
Population Standard Deviation
Score Deviation Squared Deviation
𝒙𝒊 𝒙𝒊 − 𝝁 𝒙𝒊 − 𝝁 𝟐 𝝁 = 𝟑𝟐
97 65 4,225
92 60 3,600
89 57 3,249
85 53 2,809 σ 𝒙𝒊 − 𝝁 𝟐 = 24,274
84 52 2,704
77 45 2,025
76 44 1,936
73 41 1,681
69 37 1,369
58 26 676
Population Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵

24,274
𝝈=
𝟏𝟎

𝝈= 𝟐, 𝟒𝟐𝟕. 𝟒

𝝈 = 𝟒𝟗. 𝟐𝟔
WHY DO WE USE SKEWNESS IN
REASEARCH?

-Skewness is used to detect outliers in a data set. Outliers


are to be removed when a good analysis is required in a
data set. Outliers are also indicators of bad data and may
be worth investigating.
An outlier is a single data point that goes far outside the
average value of a group of statistics. Outliers may be
exceptions that stand outside individual samples of
populations as well.
KURTOSIS

jessa p. arago
Group d reporter
Understanding Kurtosis
▪ Kurtosis is a descriptive statistic used to help measure how data
disperse between a distribution's center and tails, with larger values
indicating a data distribution may have “heavy” tails that are thickly
concentrated with observations or that are long with extreme
observations.

▪ Kurtosis is a measure of the combined weight of a distribution’s tails


relative to the center of the distribution curve (the mean).

▪ Kurtosis is sometimes confused with a measure of the peakedness of


a distribution.

▪ kurtosis measures “tailedness,” not “peakedness.”


FORMULA AND CALCULATIONS:

You might also like