Sa3 - 2
Sa3 - 2
Objectives
At the end of this lesson, you should be able to:
• Gain deeper understanding about the z-score.
• Identify the position of a data value in a data set, using
various measures of position, such as percentiles, deciles,
and quartiles.
• Interpret the analysis reported by box-and-whisker plots
Measures of Position
In addition to the measures of central tendency and
measures of variation, there are measures of position or
location. These measures include standard scores, percentiles,
deciles, and quartiles. They are used to locate the relative
position of a data value in the data set.
Standard Score (z score)
Case A Case B
𝜇1 = 𝜇2 𝜇1 ≠ 𝜇2
𝜎1 = 𝜎2 𝜎1 = 𝜎2
Case C Case D
𝜇1 = 𝜇2 𝜇1 ≠ 𝜇2
𝜎1 ≠ 𝜎2 𝜎1 ≠ 𝜎2
Standard Score
A standard score or z score tells how many standard
deviations a data value is above or below the mean for a
specific distribution of values. If a standard score is zero, then
the data value is the same as the mean.
Standard Score
Example
Charlie got a mark of 85 on a math test which had a
mean of 75 and a standard deviation of 5. Daisy got a mark of
75 on an English test which had a mean of 69 and a standard
deviation of 2. Relative to their respective mean and standard
deviation, who got the better grade?
Solution
Using the z score formula, we calculate the z scores for
each student and find:
Charlie
𝑥−𝑥ҧ 85−75 10
𝑧= = = =2
𝑠 5 5
Daisy
75−69 6
𝑧= = =3
2 2
Solution
The results tell us that Charlie got a test mark 2 standard
deviations higher than the mean of the class, while Daisy got a
mark that is 3 standard deviations higher than the mean in her
class. Therefore, proportionally speaking, Daisy did better
within her class in comparison to Charlie.
Example
Test Scores. A student scored 65 on a calculus test that
had a mean of 50 and a standard deviation of 10; she scored
30 on a history test with a mean of 25 and a standard
deviation of 5. Compare her relative positions on the two
tests.
Solution
Calculus
𝑥−𝑥ҧ 65−50 15
𝑧= = = = 1.5
𝑠 10 10
History
30−25 5
𝑧= = =1
5 5
Since the z score for calculus is larger, her relative position in the
calculus class is higher than her relative position in the history class.
Note: A z-score specifies the precise location of each X value
within a distribution. The sign of the z-score (+ or –) signifies
whether the score is above the mean (positive) or below the
mean (negative). The numerical value of the z-score specifies
the distance from the mean by counting the number of
standard deviations between X and 𝜇. When all data for a
variable are transformed into z-scores, the resulting
distribution will have a mean of 0 and a standard deviation of
1.
Example
Test Scores. Find the z score for each test, and state
which is higher.
Test A x = 38 𝑥ҧ = 40 s=5
Test B x = 94 𝑥ҧ = 100 s = 10
Solution
Test A
𝑥−𝑥ҧ 38−40 −2
𝑧= = = = −0.4
𝑠 5 5
Test B
94−100 −6
𝑧= = = −0.6
10 10
The score for test A is relatively higher than the score for test B.
Percentiles, Deciles, and
Quartiles
To locate a specific point in any distribution, percentiles,
quartiles and deciles are the tools that can be used. These
parameters percentiles, deciles, and quartiles are called
quantiles.
Percentiles
where
𝑙𝑏𝑃𝑘 – lower boundary of the kth percentile class
𝑐𝑢𝑚𝑓𝑏 – cumulative frequency before the kth percentile class
𝑓𝑃𝑘 – frequency of the kth percentile class
𝑐 – class interval
Example
Calculate thirty-seventh, forty-fifth and ninetieth percentile
from the frequency distribution of weights of 120 students, by using
the this table.
Solution (37th Percentile)
𝑘(𝑛) 37(120)
= = 44.4
100 100
44.4−22
𝑃37 = 139.5 + 10 = 147.5 pounds
28
Solution (45th Percentile)
𝑘(𝑛) 45(120)
= = 54
100 100
𝑙𝑏𝑃𝑘 = 149.5
𝑃45
𝑐𝑢𝑚𝑓𝑏 = 50
𝑓𝑃𝑘 = 25
𝑐 = 10
54−50
𝑃45 = 149.5 + 10 = 151.1 pounds
25
Solution (90th Percentile)
𝑘(𝑛) 90(120)
= = 108
100 100
𝑙𝑏𝑃𝑘 = 179.5
𝑐𝑢𝑚𝑓𝑏 = 106
𝑃90 𝑓𝑃𝑘 = 6
𝑐 = 10
108−106
𝑃90 = 179.5 + 10 = 182.83 pounds
6
The results of P37, P45 and P90 indicates that 37% of the
students weigh 147.5 pounds or less. Similarly, 45% of the
students weigh 151.1 pounds or less and 90% weigh 182.83
pounds or less.
Deciles
Deciles divide the distribution into 10 groups. They are
denoted by 𝐷1 , 𝐷2 , 𝐷3 , . . . , 𝐷9 .
Decile (Ungrouped Data)
The formula for finding a data value corresponding to a
kth decile is
𝑘(𝑛+1) 𝑡ℎ
𝐷𝑘 = observation
10
where
n – total number of observations
k = 1, 2, 3, …, 9
Example
The Shell company has a number of outlets in the
Metropolitan Manila area. The numbers of changes at the
EDSA Heritage outlet in the past 20 days are:
65 98 55 62 79 79 59 51 90 72
56 70 62 66 80 94 63 73 71 85
Calculate the 3rd decile, 5th decile, and 8th decile.
Solution
Arrange the data in order from lowest to highest.
51, 55, 56, 59, 62, 62, 63, 65, 66, 70, 71, 72, 73, 79, 79, 80, 85 90, 94, 98.
Then substitute into the formula. For 3rd decile,
3(20+1) 𝑡ℎ
𝐷3 = observation
10
3(21) 𝑡ℎ
𝐷3 = observation
10
𝐷3 = (6.3)𝑡ℎ observation
𝐷3 = 6𝑡ℎ observation + 0.3(7th – 6th)
𝐷3 = 62 + 0.3(63 – 62)
𝐷3 = 62.3
Solution
For 5th decile,
5(20+1) 𝑡ℎ
𝐷5 = observation
10
5(21) 𝑡ℎ
𝐷5 = observation
10
𝐷5 = (10.5)𝑡ℎ observation
𝐷5 = 10𝑡ℎ observation + 0.5(11th – 10th)
𝐷5 = 70 + 0.5(71 – 70)
𝐷5 = 70.5
Solution
For 8th decile,
8(20+1) 𝑡ℎ
𝐷8 = observation
10
8(21) 𝑡ℎ
𝐷8 = observation
10
𝐷8 = (16.8)𝑡ℎ observation
𝐷8 = 16𝑡ℎ observation + 0.8(17th – 16th)
𝐷8 = 80 + 0.8(85 – 80)
𝐷8 = 84
Example
The following data array are the test scores of 12
students in a statistics class:
56 62 65 70 73 77 79 82 85 87 92 99
Find the 1st and 7th decile.
Solution
For 1st decile,
1(12+1) 𝑡ℎ
𝐷1 = observation
10
1(13) 𝑡ℎ
𝐷1 = observation
10
𝐷1 = (1.3)𝑡ℎ observation
𝐷1 = 1𝑠𝑡 observation + 0.3(2nd – 1st)
𝐷1 = 56 + 0.3(62 – 56)
𝐷1 = 57.8
Solution
For 7th decile,
7(12+1) 𝑡ℎ
𝐷7 = observation
10
7(13) 𝑡ℎ
𝐷7 = observation
10
𝐷7 = (9.1)𝑡ℎ observation
𝐷7 = 9𝑡ℎ observation + 0.1(10th – 9th)
𝐷7 = 85 + 0.1(87 – 85)
𝐷7 = 85.2
Decile (Grouped Data)
Deciles can also be calculated for grouped data which is done
with the help of following formula
𝑘(𝑛)
−𝑐𝑢𝑚𝑓𝑏
10
𝐷𝑘 = 𝑙𝑏𝐷𝑘 + 𝑐
𝑓𝐷𝑘
where
𝑙𝑏𝐷𝑘 – lower boundary of the kth decile class
𝑐𝑢𝑚𝑓𝑏 – cumulative frequency before the kth decile class
𝑓𝐷𝑘 – frequency of the kth decile class
𝑐 – class interval
Example
Calculate third, fifth and ninth decile from the frequency
distribution of weights of 120 students, by using the this table.
Solution (3rd Decile)
𝑘(𝑛) 3(120)
= = 36
10 10
𝐷3 𝑙𝑏𝐷𝑘 = 139.5
𝑐𝑢𝑚𝑓𝑏 = 22
𝑓𝐷𝑘 = 28
𝑐 = 10
36−22
𝐷3 = 139.5 + 10 = 144.5 pounds
28
Solution (5th Decile)
𝑘(𝑛) 5(120)
= = 60
10 10
𝑙𝑏𝐷𝑘 = 149.5
𝐷5
𝑐𝑢𝑚𝑓𝑏 = 50
𝑓𝐷𝑘 = 25
𝑐 = 10
60−50
𝐷5 = 149.5 + 10 = 153.5 pounds
25
Solution (9th Decile)
𝑘(𝑛) 9(120)
= = 108
10 10
𝑙𝑏𝐷𝑘 = 179.5
𝑐𝑢𝑚𝑓𝑏 = 106
𝐷9 𝑓𝐷𝑘 = 6
𝑐 = 10
108−106
𝐷9 = 179.5 + 10 = 182.83 pounds
6
The results of D3, D5 and D9 means that 30% of the
students weigh 144.5 pounds or less. Similarly, 50% of the
students weigh 153.5 pounds or less and 90% weigh 182.83
pounds or less.
Quartiles
Quartiles divide the distribution into four groups,
separated by 𝑄1 , 𝑄2 , 𝑄3 .
Quartile (Ungrouped Data)
The formula for finding a data value corresponding to a
kth quartile is
𝑘(𝑛+1) 𝑡ℎ
𝑄𝑘 = observation
4
where
n – total number of observations
k = 1, 2, 3
Example
The Shell company has a number of outlets in the
Metropolitan Manila area. The numbers of changes at the
EDSA Heritage outlet in the past 20 days are:
65 98 55 62 79 79 59 51 90 72
56 70 62 66 80 94 63 73 71 85
Calculate the 1st quartile, 2nd quartile, and 3rd quartile.
Solution
Arrange the data in order from lowest to highest.
51, 55, 56, 59, 62, 62, 63, 65, 66, 70, 71, 72, 73, 79, 79, 80, 85 90, 94, 98.
Then substitute into the formula. For 1st quartile,
1(20+1) 𝑡ℎ
𝑄1 = observation
4
1(21) 𝑡ℎ
𝑄1 = observation
4
𝑄1 = (5.25)𝑡ℎ observation
𝑄1 = 5𝑡ℎ observation + 0.25(6th – 5th)
𝑄1 = 62 + 0.25(62 – 62)
𝑄1 = 62
Solution
For 2nd quartile,
2(20+1) 𝑡ℎ
𝑄2 = observation
4
2(21) 𝑡ℎ
𝑄2 = observation
4
𝑄2 = (10.5)𝑡ℎ observation
𝑄2 = 10𝑡ℎ observation + 0.5(11th – 10th)
𝑄2 = 70 + 0.5(71 – 70)
𝑄2 = 70.5
Solution
For 3rd quartile,
3(20+1) 𝑡ℎ
𝑄3 = observation
4
3(21) 𝑡ℎ
𝑄3 = observation
4
𝑄3 = (15.75)𝑡ℎ observation
𝑄3 = 15𝑡ℎ observation + 0.75(16th – 15th)
𝑄3 = 79 + 0.75(80 – 79)
𝑄3 = 79.75
Example
Calculate first, second and third quartile from the
following data array;
20 28 29 30 36 37 39 42 53 54
55 58 61 67 68 70 74 81 82 93
Solution
For 1st quartile,
1(20+1) 𝑡ℎ
𝑄1 = observation
4
1(21) 𝑡ℎ
𝑄1 = observation
4
𝑄1 = (5.25)𝑡ℎ observation
𝑄1 = 5𝑡ℎ observation + 0.25(6th – 5th)
𝑄1 = 36 + 0.25(37 – 36)
𝑄1 = 36.25
Solution
For 2nd quartile,
2(20+1) 𝑡ℎ
𝑄2 = observation
4
2(21) 𝑡ℎ
𝑄2 = observation
4
𝑄2 = (10.5)𝑡ℎ observation
𝑄2 = 10𝑡ℎ observation + 0.5(11th – 10th)
𝑄2 = 54 + 0.5(55 – 54)
𝑄2 = 54.5
Solution
For 3rd quartile,
3(20+1) 𝑡ℎ
𝑄3 = observation
4
3(21) 𝑡ℎ
𝑄3 = observation
4
𝑄3 = (15.75)𝑡ℎ observation
𝑄3 = 15𝑡ℎ observation + 0.75(16th – 15th)
𝑄3 = 68 + 0.75(70 – 68)
𝑄3 = 69.5
Quartile (Grouped Data)
Quartiles can also be calculated for grouped data which is done
with the help of following formula
𝑘(𝑛)
−𝑐𝑢𝑚𝑓𝑏
4
𝑄𝑘 = 𝑙𝑏𝑄𝑘 + 𝑐
𝑓𝑄𝑘
where
𝑙𝑏𝑄𝑘 – lower boundary of the kth quartile class
𝑐𝑢𝑚𝑓𝑏 – cumulative frequency before the kth quartile class
𝑓𝑄𝑘 – frequency of the kth quartile class
𝑐 – class interval
Example
Calculate first, second and third quartile from the frequency
distribution of weights of 120 students, by using the this table.
Solution (1st Quartile)
𝑘(𝑛) 1(120)
= = 30
4 4
𝑄1 𝑙𝑏𝑄𝑘 = 139.5
𝑐𝑢𝑚𝑓𝑏 = 22
𝑓𝑄𝑘 = 28
𝑐 = 10
30−22
𝑄1 = 139.5 + 10 = 142.36 pounds
28
Solution (2nd Quartile)
𝑘(𝑛) 2(120)
= = 60
4 4
𝑙𝑏𝑄𝑘 = 149.5
𝑄2
𝑐𝑢𝑚𝑓𝑏 = 50
𝑓𝑄𝑘 = 25
𝑐 = 10
60−50
𝑄2 = 149.5 + 10 = 153.5 pounds
25
Solution (3rd Quartile)
𝑘(𝑛) 3(120)
= = 90
4 4
𝑙𝑏𝑄𝑘 = 159.5
𝑄3 𝑐𝑢𝑚𝑓𝑏 = 75
𝑓𝑄𝑘 = 18
𝑐 = 10
90−75
𝑄3 = 159.5 + 10 = 167.83 pounds
18
The results from Q1, Q2 and Q3 indicates that 25% of the
students weigh 142.36 pounds or less. Similarly, 50% of the
students weigh 153.5 pounds or less and 75% weigh 167.83
pounds or less.
Relationships among Percentiles, Deciles, and Quartiles