Basic Statistics Module 5 QA Final - 2
Basic Statistics Module 5 QA Final - 2
OUTCOME-BASED EDUCATION
LEARNING QUARTER I
1
MODULE IN
BASIC STATISTICS
QUARTER I
WEEKS 5-7
Development Teams
2
What I Need to Know
This module is all about the measures of central tendency or more popularly known as
averages. The module is divided into four lessons: Mean, Median, Mode, and Fractiles. The
topics cover both the ungrouped and grouped data
Learning Competencies
The learners…
Objectives:
What I Know
Direction: Read the statements carefully and answer each item correctly. Write the letter of
your
choice on the space provided for.
22, 29, 28, 23, 25, 27, 24, 26, 25, 21, 25
3
_____2. What is the median of the distribution?
A. 22 B. 23 C. 24 D. 25
_____7. Among the measures of central tendency, which measure is sometimes not found
in the distribution?
A. Mode B. Median C. Mean D. Fractile
_____8. Which measure of central tendency has half of the cases lying below it?
A. Mode B. Median C. Mean D. Fractile
4
_____18. Which is the most common measure of central tendency?
A. Mode B. Median C. Mean D. Fractile
_____19. In a dart game, Flor scored a triple 19, a double 18, and a single 20 using three
darts. What is her average per dart?
A. 18.17 B. 19 C. 19. 17 D. 37.67
_____20. The components of Gio’s grade in mathematics are written output, performance
tasks, and quarterly assessment with weights 40%, 40%, and 20%, respectively.
What will be his grade if he has 207 out of 225 for written tasks, 187 out of 200
performance task and 44 out of 50 for the quarterly assessment. Round off the grade
to the nearest integer.
A. 91 B. 92 C. 93 D. 94
Lesson
1 THE MEAN
What’s In
Scores F
94 - 90 2
89 – 85 2
84 - 80 4
79- 75 7
74 - 70 15
69 - 65 9
64 - 60 6
59 - 55 4
54 - 50 1
5
What’s New
What Is It
.
𝑛
∑ 𝑥
𝑋̅ = 𝑖=1 𝑖, where 𝑋̅ = mean, n = number of observations, xi = observation
𝑛
∑𝑥
For brevity and simplicity, we write 𝑋̅ =
𝑛
Example 1. Find the mean of the scores: 39, 44, 35, 46, 42, 40.
Solution:
There are 6 scores or observations or values, so n = 6
∑𝑥 39+44+35+46+42+40 246
𝑋̅ = = = = 41
𝑛 6 6
Example 2. Gio is playing darts and has thrown 5 darts with scores 18, 20, 19, 19, and 18. If
he wants an average of 19, what must be his score in his last throw?
Solution.
Let x = Gio’s score in his last throw
n = 6 (since there are a total of 6 throws)
̅
𝑋 = 19
∑𝑥
𝑋̅ =
𝑛
18+20+19+19+18+𝑥
19 = (multiply the entire equation by 6)
6
19(6) = 94 + x
114 – 94 = x
20 = x
Hence, the score of Gio in his last throw is 20.
6
The weighted mean of observations x1, x2, x3, …, xn which occur w1, w2, w3, …, wn ,
respectively is given by
∑ 𝑤𝑥
𝑋̅ = ∑𝑤
Where 𝑋̅ = mean, x = observation, and w is the corresponding weight of the observation
Example 3. Find the average rating of a Board passer if her ratings are as follows:
Solution:
Note: Percentages must be converted first into decimals.
Let x1 = 83; w1 = 20% = 0.2; x2 = 85; w2 = 40% = 0.4; x3 = 80; w3 = 40% = 0.4
Example 4. Flor received her card and recorded her grades as follows:
Three subjects: 93, two subjects: 95, Three subjects: 94, One subject: 92. Find her
mean grade.
Solution:
Let x1 = 93; w1 = 3; x2 = 95; w2 = 2; x3 = 94; w3 = 3; x4 = 92; w4 = 1
The mean of grouped data can be found in two ways: Midpoint Method and Deviation
Method
This method uses the class mark or class midpoint. To find the mean of grouped data
using class mark, the following formula can be used: The mean for grouped data is given by
∑ 𝑓𝑚
𝑋̅ =
𝑛
where: 𝑋̅ = mean
𝑓 = frequency
𝑚 = class mark/midpoint
𝑛 = total number of observations/ total frequency
7
To illustrate the method, study the following example.
Scores 𝑓
94 - 90 2 Steps in solving the mean using the midpoint method
89 – 85 2
1. Construct the column for class midpoints (𝑚).
84 - 80 4
2. Construct the column for 𝑓𝑚, the product of the
79- 75 7
frequency (𝑓) and class midpoint (𝑚).
74 - 70 15
3. Find the total of the column 𝑓𝑚 (∑ 𝑓𝑚)
69 - 65 9
4. Substitute ∑ 𝑓𝑚 and 𝑛 into the formula
64 - 60 6
5. Evaluate
59 - 55 4
54 - 50 1
Solution:
Follow the steps in solving the mean using the midpoint method. The steps are shown
in the next table.
Scores 𝑓 𝑚 𝑓𝑚
94 - 90 2 92 184
89 – 85 2 87 174
84 - 80 4 82 328 Now,
79- 75 7 77 539 ∑ 𝑓𝑚
𝑋̅ =
74 - 70 15 72 1080 𝑛
69 - 65 9 67 603 3560
=
64 - 60 6 62 372 50
59 - 55 4 57 228 = 71.2
54 - 50 1 52 52
n = 50 ∑ 𝑓𝑚 = 3560
This method uses deviations or codes from the class containing the assumed mean.
∑ 𝑓𝑑
The mean for grouped data is given by 𝑋̅ = 𝑥𝑎 + ( )𝑖
𝑛
where, 𝑋̅ = mean
𝑥𝑎 = assumed mean (class mark of the chosen class interval)
𝑓 = frequency
𝑑 = coded deviation
𝑛 = total number of observations/ total frequency
𝑖 = class size
8
To illustrate the method, study the following example
Frequency Distribution of a
100-Item Math Proficiency Test
Scores 𝑓 𝑑 𝑓𝑑 Now,
94 - 90 2 4 8 ∑ 𝑓𝑑
89 – 85 2 3 6 𝑋̅ = 𝑥𝑎 + ( )𝑖
𝑛
84 - 80 4 2 8 −8
79- 75 7 1 7 = 72 + ( ) 5
50
74 - 70 15 0 0 = 72 − 0.8
69 - 65 9 -1 -9 ̅
𝑋 = 71.2
64 - 60 6 -2 -12
59 - 55 4 -3 -12
54 - 50 1 -4 -4
n = 50 ∑ 𝑓𝑑 = −8
9
Note: The difference of the frequency distributions in Example 6 and Example 7 should be
clear to you. The previous is a descending frequency distribution while the current frequency
distribution is an ascending frequency distribution.
Solution:
Follow the steps in solving the mean using the deviation method. The steps are shown
in the next table. Let us choose a class interval say, 54 – 56, so 𝑥𝑎 = 55 and 𝑖 = 3
Height (in) 𝑓 𝑑 𝑓𝑑
45-47 1 -3 -3
Now,
48-50 2 -2 -4
∑ 𝑓𝑑
51-53 4 -1 -4 𝑋̅ = 𝑥𝑎 + ( )𝑖
54-56 10 0 0 𝑛
58
57-59 12 1 12 = 55 + ( ) 3
48
60-62 8 2 16
= 55 + 3.62
63-65 5 3 15
𝑋̅ = 58.62
66-68 4 4 16
69-71 2 5 10
Total n = 48 ∑ 𝑓𝑑 = 58
Remarks:
1. The mean is the most used measure of central tendency.
2. The mean is the mathematical center of the distribution.
3. It is used with symmetric, unimodal distributions.
4. Data is at least interval in nature.
5. It is sensitive to “outliers” (extremely low or extremely high score) in the distribution.
What’s More
1. A: 34, 45, 43, 29, 30, 36, 39 42, 33, 35, 40, 38, 35, 33, 37
2. B: 78, 75, 79, 81, 70, 78, 76, 72, 74, 78, 73, 76, 71, 81, 80, 73
10
Solve the problems
1. The mean score of the first 5 games of Alaska Aces is 92.6. What must be their score in
the next game if the team wants to increase their mean score to 93?
2. A tricycle driver’s income in the past 6 days were 870, 920, 790, 880, 900, and 910. What
must be his income on the 7th day to have a daily average income of 880?
1. The mean of ungrouped data is the sum of all observations divided by the number of
observations. The mean is denoted by 𝑋̅. The formula is given by
∑𝑛 𝑥
𝑋̅ = 𝑖=1 𝑖, where 𝑋̅ = mean, n = number of observations, xi = observations
𝑛
2. The weighted mean of observations x1, x2, x3, …, xn which occur w1, w2, w3, …, wn,
respectively is given by
∑ 𝑤𝑥
𝑋̅ = ∑𝑤
𝑋̅ = mean, 𝑥 = observation, and 𝑤 is the corresponding weight of the observation
3. The midpoint method uses the class mark or class midpoint. The mean for grouped
∑ 𝑓𝑚
data is given by 𝑋̅ =
𝑛
Where: 𝑋̅ = mean
𝑓 = frequency
𝑚 = class mark/midpoint
𝑛 = total number of observations/ total frequency
4. This deviation method uses deviations or codes from the class containing the assumed
∑ 𝑓𝑑
mean. The mean for grouped data is given by 𝑋̅ = 𝑥𝑎 + ( )𝑖
𝑛
where, 𝑋̅ = mean
𝑥𝑎 = assumed mean (class mark of the chosen class interval)
𝑓 = frequency
𝑑 = deviations/codes
𝑛 = total number of observations/ total frequency
𝑖 = class size
What I Can Do
Take your log of your household’s daily consumption of electricity and water. Take the
mean of these consumptions.
11
Lesson
2 THE MEDIAN
What’s In
1. A: 109, 120, 123, 112, 108, 117, 114, 119, 110, 115, 116, 122
What’s New
Another measure of central tendency is presented here. The median is the
midpoint of the distribution. It is a point where half of the observations lie below it and the
other half of the observations lie above it.
12
What Is It
.
A. The Median (for Ungrouped Data)
Steps:
1. Arrange the observations in an ascending order or descending order
2. Observe 𝑛, the number of observations, it is either odd or even.
3. For the odd case, the median is the middle observation. For the even case, the median is
the average of the two middle observations.
Example 1: Find the median of the distribution 34, 45, 43, 29, 30, 36, 39 42, 33, 35, 40, 38,
35, 33, 37
Solution:
29, 30, 33, 33, 34, 35, 35, 36, 37, 38, 39, 40,42, 43, 45,
Note: If the distribution is arranged in a descending order, the same answer is arrived at.
Example 2. Find the median of the distribution 78, 75, 79, 81, 70, 78, 76, 72, 74, 78, 73, 76,
71, 81, 80, 73
Solution:
Arrange the observations in a descending order.
81, 81, 80, 79, 78, 78, 78, 76, 76, 75, 74, 73, 73, 72, 71, 70
There are two ways of finding the median, using the lower class boundary or using
the upper class boundary.
13
Method 1: Using the lower class boundary
The formula in finding the median of grouped data using the lower class boundary is
given by
𝑛
−<𝑐𝑓𝑏
2
𝑀𝑑 = 𝐿𝐶𝐵 + ( )𝑖
𝑓𝑚
Where: 𝑀𝑑 = median
𝐿𝐶𝐵 = lower class boundary of the median class
𝑛
= locator of the median class
2
𝑛 = total frequency
< 𝑐𝑓𝑏 = cumulative frequency just below the median class
𝑓𝑚 = frequency of the median class
𝑖 = class size
Solution:
Follow the steps in finding the median. Step 1 is shown in the following table.
Scores 𝑓 < 𝑐𝑓 𝑛 50
Step 2: = = 25. The locator is 25.
94 - 90 2 50 2 2
14
Steps 4 and 5 are shown below.
𝑛
−<𝑐𝑓𝑏 25−20
𝑀𝑑 = 𝐿𝐶𝐵 + ( 2 ) 𝑖 = 69.5 + ( )5 = 69.5 + 1.67 = 71.17
𝑓𝑚 15
The formula in finding the median of grouped data using the lower class boundary is
given by
𝑛
− >𝑐𝑓𝑎
2
𝑀𝑑 = 𝑈𝐶𝐵 − ( )𝑖
𝑓𝑚
where 𝑀𝑑 = median
𝑈𝐶𝐵 = upper class boundary of the median class
𝑛
= locator of the median class
2
𝑛 = total frequency
> 𝑐𝑓𝑎 = cumulative frequency just above the median class
𝑓𝑚 = frequency of the median class
𝑖 = class size
To illustrate how to find the median, study the following examples. We are going to use the
same frequency distributions found in the previous 2 examples to compare the results and to
distinguish the differences of the 2 methods.
15
Example 3. Find the median of the frequency distribution.
Frequency Distribution of of a 100-Item Math Proficiency Test
Solution:
Follow the steps in finding the median. Step 1 is shown in the following table.
𝑛 50
Scores 𝑓 > 𝑐𝑓 Step 2: = = 25. The locator is 25.
2 2
94 - 90 2 2 What is meant by 25?
89 – 85 2 4 • It means the median is the 25th
84 - 80 4 8 case/observation.
79- 75 7 15 What is meant by > 𝑐𝑓?
74 - 70 15 30 • For the class 79-75, there are at most
69 - 65 9 39 15 cases found within or above.
64 - 60 6 45 • For the class 74-70, there are at most
59 - 55 4 49 30 cases found within or above it.
54 - 50 1 50 Hence this is the class containing the
𝑛 = 50 25th case.
•
So, the median class is 74-70 (the highlighted class).
Step 3: 𝑈𝐶𝐵 = 74..5, 𝑖 = 5, 𝑓𝑚 = 15, > 𝑐𝑓𝑎 = 15
16
Solution:
Follow the steps in finding the median. Step 1 is shown in the following table.
Remarks:
1. Use the median when the extreme values (outliers) distort the mean. The median is not
affected by outliers
2. Use the median when the problem requires the exact midpoint of the distribution.
What’s More
1. A: 109, 120, 123, 112, 108, 117, 114, 119, 110, 115, 116, 122
Income 𝑓
826-850 3
801-825 6
776-800 11
751-775 12
726-750 9
701-725 4
676-700 3
651-675 1
626-650 1
17
What I Have Learned
What I Can Do
Take your log of your household’s daily consumption of electricity and water. Take the
median of these consumptions.
Lesson
3 THE MODE
What’s In
1. A: 67, 63, 68, 70, 69, 61, 60, 65, 72, 64, 66, 71
18
3. Frequency distribution of the waistlines of Grade 9 students
Waistline 𝑓
15-17 2
18-20 3
21-23 8
24-26 15
27-29 7
30-32 3
33-35 2
What’s New
Another measure of central tendency is presented here. The mode is the
most frequently appearing observation in the distribution. It is used when the scores are in
the nominal level. For ungrouped data, the mode does not require any computation.
Inspection is enough
What Is It
.
A. The Mode (For Ungrouped Data)
Given a distribution, the mode is the observation that appears most often. We use Mo
to denote the mode of a distribution. To illustrate the method in finding the mode, study the
following examples.
Example 2: 56, 53, 54, 56, 54, 57, 60, 58, 51, 52
19
B. The Mode (For Grouped Data)
There are two ways of finding the mode, using the lower class boundary or using the
upper class boundary, just like the median.
𝑓𝑀𝑜 −𝑓1
𝑀𝑜 = 𝐿𝐶𝐵 + ( )𝑖
2𝑓𝑀𝑜 −𝑓1 −𝑓2
where
𝑀𝑜 = mode
𝐿𝐶𝐵 = lower class boundary of the modal class
𝑓𝑀𝑜 = frequency of the modal class
𝑓1 = frequency just “above” the modal class
𝑓2 = frequency just “below” the modal class
i = class size
Remarks:
The modal class is the class interval with the highest frequency.
“Above“ means moving up the distribution
“Below” means moving down the distribution
To illustrate how to find the mode of the distribution using the lower class boundary,
study the following example.
20
𝑓1 = frequency just above the modal class
𝑓2 = frequency just below the modal class
i = class size
To illustrate how to find the mode of the distribution using the upper class boundary,
study the following example.
Example 3: Find the mode (using both methods) of the frequency distribution
21
What’s More
1. A: 109, 120, 123, 112, 108, 117, 114, 109, 110, 115, 116, 122
2. B: 1120, 1109, 1125, 1122, 1124, 1117, 1112, 1118, 1121, 1122
Income 𝑓
826-850 3
801-825 6
776-800 11
751-775 12
726-750 9
701-725 4
676-700 3
651-675 1
626-650 1
1. Given a distribution, the mode is the observation that appears most often.
2. As far as the mode is concerned, distributions can be classified as unimodal, bimodal,
trimodal, polymodal, or non-modal.
3. The formula in finding the mode of a grouped data is given by
22
What I Can Do
Take your log of your household’s daily consumption of electricity and water. Take the
mode of these consumptions.
Lesson
4 FRACTILES
What’s In
What’s New
Fractiles are special point measures in the distribution. The median is also
considered a special point measure since it divides the distribution into two. Other special
point measures are quartiles, deciles, and percentiles. These point measures are also called
quantiles.
23
What Is It
.
The basis in finding the different fractiles is the median. Just like the median, these
have locators, fractile class, lower class boundary (𝐿𝐶𝐵), cumulative frequency just below the
fractile class (< 𝑐𝑓𝑏 ), frequency of the fractile class(𝑓𝑓 ), and class size (i).
If the median divides the distribution into two (mid), quartiles divide the distribution
into 4 (quarter), deciles divide the distribution into 10 (deci), and percentiles divide the
distribution into 100 (centum/centa)
The notations used here are, 𝑄𝑘 for quartiles, 𝐷𝑘 for deciles and 𝑃𝑘 for percentiles
where
𝑘 = 1, 2, 3, 4 for quartiles
𝑘 = 1, 2, 3, …, 10 for deciles, and
𝑘 = 1, 2, 3, … , 100 for quartiles
The greatest difference in the fractiles lie in the locator. The locators are
𝑘𝑛 𝑘𝑛 𝑘𝑛
for quartiles, for deciles, and for percentiles where 𝑛 = total frequency. Once
4 10 100
the locator is identified, the other required quantities are not difficult to find.
Recall that in finding the median, you need to construct the < 𝑐𝑓 column. The same is
true with the fractiles, the < 𝑐𝑓 must be constructed. There is so much similarity in finding the
median and the fractiles.
24
Solution:
Follow the steps in finding the fractile. Step 1 is shown in the following table.
For 𝑫𝟗 :
𝑘𝑛 9(50)
Step 2: = = 45. The locator is 45. What is meant by 45?
10 10
• It means that 𝐷9 is the 45th observation.
What is meant by < 𝑐𝑓?
• For the class 84-80, there are at most 46 cases found within or below it. Hence this is
the class containing the 45th case. So, the decile class is 84-80 (class with green
highlight)
Step 3: 𝐿𝐶𝐵 = 79.5, 𝑖 = 5, 𝑓𝑓 = 4, < 𝑐𝑓𝑏 = 42
Steps 4 and 5 are shown below.
𝑘𝑛
−<𝑐𝑓𝑏 45−42
𝐷9 = 𝐿𝐶𝐵 + ( 10 ) 𝑖 = 79.5 + ( )5 = 79.5 + 3.75 = 83.25
𝑓𝑓 4
For 𝑷𝟔𝟐 :
𝑘𝑛 62(50)
Step 2: = = 31. The locator is 31. What is meant by 31?
100 100
• It means that 𝑃62 is the 31st observation.
What is meant by < 𝑐𝑓?
• For the class 74-70, there are at most 35 cases found within or below it. Hence this is
the class containing the 31st case. So, the decile class is 74-70 (class with blue
highlight)
Step 3: 𝐿𝐶𝐵 = 69.5, 𝑖 = 5, 𝑓𝑓 = 15, < 𝑐𝑓𝑏 = 20
Steps 4 and 5 are shown below.
𝑘𝑛
−<𝑐𝑓𝑏 31−20
100
𝐷9 = 𝐿𝐶𝐵 + ( ) 𝑖 = 69.5 + ( ) 5 = 69.5 + 3.67 = 73.17
𝑓𝑓 15
25
Example 2: Find 𝑄3 , 𝐷4 and 𝑃18 of the frequency distribution.
Solution:
Follow the steps in finding the fractile. Step 1 is shown in the following table.
For 𝑫𝟒 :
𝑘𝑛 2(48)
Step 2: = = 9.6. The locator is 9.6. The 9.6th case is found in the class 54 – 56
10 10
(Green hghlight)
Step 3: 𝐿𝐶𝐵 = 53.5, 𝑖 = 3, 𝑓𝑓 = 10, < 𝑐𝑓𝑏 = 7
Steps 4 and 5 are shown below.
𝑘𝑛
−<𝑐𝑓𝑏 9.6−7
𝐷4 = 𝐿𝐶𝐵 + ( 10 ) 𝑖 = 53.5 + ( )5 = 53.5 + 1.3 = 54.8
𝑓𝑓 10
For 𝑷𝟏𝟖 :
𝑘𝑛 18(48)
Step 2: = = 8.64. The locator is 8.64. The 8.64th case is contained in the class 54
100 100
– 56 (Green highlight)
Step 3: 𝐿𝐶𝐵 = 53.5, 𝑖 = 3, 𝑓𝑓 = 10, < 𝑐𝑓𝑏 = 7
Steps 4 and 5 are shown below.
𝑘𝑛
−<𝑐𝑓𝑏 8.64−7
𝑃18 = 𝐿𝐶𝐵 + (100 ) 𝑖 = 53.5 + ( ) 5 = 53.5 + 0.82 = 54.32
𝑓𝑓 10
26
What’s More
Assessment
Direction: Read the given questions carefully and answer each item correctly. Write the letter
of your choice on the space provided for.
_____1. In a dart game, Flor scored a triple 20, a double 19, and a single 18 using three
darts. What is her average per dart?
A. 38.67 B. 39 C. 19 D. 30
27
_____4. Concerning modes, which best describes a distribution with two peaks?
A. Unimodal B. Bimodal C. Trimodal D. Non-modal
220, 290, 280, 230, 250, 270, 240, 260, 250, 210, 250
_____13. Which measure of central tendency has the greatest number of observations?
A. Mode B. Median C. Mean D. Fractile
_____14. What is the mean of the distribution with a double 23 and a triple 25?
A. 24 B. 24. 2 C. 24.25 D. 24.5
For items 15 - 20, refer to the frequency distribution below.
Scores F
74 - 70 1
69 – 65 4
64 - 60 6
59- 55 9
54 - 50 15
49 - 45 7
44 - 40 4
39 - 35 2
34 - 30 2
28
_____15. What is the mean of the distribution?
A. 52.8 B. 52.83 C. 51.64 D. 52.36
29
Answer Key
30
References
Downie, N.M. & Heath, R.W. (1984). Basic Statistical Methods. Harper and Row Publishers,
Inc.
Hanneman, R.A., Kposowa, A.J., &Riddle, M.D. (2013). Basic Statistics for Social Research.
John Wiley and Sons, Inc.
Walpole, R.E., Myers, R.H., & Myers, S.H. (2000). Probability and Statistics for Engineers
and Scientists. Prentice Hall, Inc.
Webster, A. (1992). Applied Statistics for Business and Economics. Richard D. Irwin, Inc.
Ymas, S.E. Jr., Dayrit, B.C., & Turqueza, A.G. (2007). College Statistics. Ymas Publishing
House.
31
32