UFM QBM101 Mid Term Revision (Aug 2023)
2-hour paper
Closed book
40 MCQs with formulae sheet given
Organization of data: Frequency distribution, relative frequency, Histogram, Ogive, Stem and leaf display,
box plot
Measures of central tendency: Mean, Median and Mode: calculation and interpretation
Population variance and sample variance: Raw data and grouped data
Coefficient of variation
Measurement Scale
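Nominal scale: Labels only, with no magnitude or order
Eg Gender coded as Male = 0, Female = 1
Cannot say that female is better than male because 1 is more than 0
Ordinal scale: Has magnitude (order) but does NOT have equal sized intervals
Miss A - Champion 1
Miss B - First runner up 2
Miss C - Second runner up 3
A is more beautiful than B
B is more beautiful than C
A is more beautiful than C
Saying that the gap between A and C is twice the gap between A and B is wrong: ranks give order only, not the size of the differences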
Interval scale: Has magnitude and equal sized intervals but does NOT have a true zero
Eg Temperature in Degree Celsius
0 deg C
10 Deg C
20 Deg C
At 0 deg C there is still heat, so it is not a true zero; it is an arbitrary zero
Ratio scale: Has magnitude, equal sized intervals and a true zero
Nominal and Ordinal scale: Qualitative data. Use a non-parametric test
Interval and Ratio scale: Quantitative data. Use a parametric test
Skewness
Skewness describes the effect of extreme high values or extreme low values on the shape of a distribution. Skewness can be
determined by the following:
i If the data set has an extreme high value, it is said to be positively skewed or skewed to the right. If it has an
extreme low value, it is negatively skewed or skewed to the left.
Eg. Data set: 1, 2, 9 is positively skewed because of the extreme high value, 9
ii In the stem and leaf display/ histogram, if the tail on the right is longer, then it is positively skewed
and if the tail on the left is longer, then it is negatively skewed.
iii In the box plot:
Positively skewed if
- Right whisker is longer than the left whisker
- (Q3 – Q2) > (Q2 – Q1)
Negatively skewed if
- Left whisker is longer than the right whisker
- (Q2 – Q1) > (Q3 – Q2)
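The quartile rule above can be sketched in a few lines of Python. This is only an illustration: the function name `quartile_skew` and the quartile values passed to it are hypothetical, not taken from any question in these notes. The last lines check the 1, 2, 9 example, where the extreme high value pulls the mean above the median.

```python
from statistics import mean, median

def quartile_skew(q1, q2, q3):
    """Classify skewness by comparing (Q3 - Q2) with (Q2 - Q1)."""
    upper_gap = q3 - q2  # distance from the median up to Q3
    lower_gap = q2 - q1  # distance from Q1 up to the median
    if upper_gap > lower_gap:
        return "positively skewed"
    if lower_gap > upper_gap:
        return "negatively skewed"
    return "roughly symmetric"

# Hypothetical quartiles, chosen only to show both outcomes
print(quartile_skew(10, 14, 25))  # upper gap 11 > lower gap 4
print(quartile_skew(10, 21, 25))  # lower gap 11 > upper gap 4

# The data set 1, 2, 9 from the notes: the high value drags the mean up
example = [1, 2, 9]
print(mean(example), median(example))
```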
11. An observation is a:
A) graph observed for a data set
B) value of a variable for a single element
C) table prepared for a data set
D) sample observed from the population
27. An outlier influences which of the following summary measures the most?
A) mean B) median C) mode D) median and mode
28. Which of the following is the only measure that can be calculated for qualitative data?
A) mean B) range C) mode D) median
29. If a data set is right-skewed with one peak in the histogram, then which of the following is true?
A) the values of the mean, median, and mode are the same
B) the mean is greater than the median, which is greater than the mode
C) the mean and median are equal, but the mode is different
D) the mode is greater than the median, which is greater than the mean
30. The mean income of all MBA degree holders working in Los Angeles is $65,000 per year and the
standard deviation of their incomes is $8,000 per year. According to Chebyshev's theorem, the
percentage of MBA degree holders, working in Los Angeles, with an annual income of $33,000
to $97,000 is at least:
A) 75
B) 93.75
C) 88.89
D) 84.39
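As a worked check for question 30, Chebyshev's theorem says that at least (1 − 1/k²) × 100% of values lie within k standard deviations of the mean, for k > 1. The sketch below finds k from the stated interval and applies the bound; the function name `chebyshev_min_percent` is just an illustrative helper.

```python
def chebyshev_min_percent(k):
    """Minimum percentage of values within k standard deviations (k > 1)."""
    if k <= 1:
        raise ValueError("Chebyshev's theorem needs k > 1")
    return (1 - 1 / k ** 2) * 100

mean_income = 65_000
sd_income = 8_000
low, high = 33_000, 97_000

# The interval must be symmetric about the mean for one k to apply
k_low = (mean_income - low) / sd_income
k_high = (high - mean_income) / sd_income
assert k_low == k_high

print(k_high)                         # number of standard deviations
print(chebyshev_min_percent(k_high))  # minimum percentage in the interval
```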
Chapter 1
Consider the following six pairs of x and y values. Find the following values.
x y
8 10
11 16
15 20
5 7
20 28
21 21
4. The value of Σx²y is:
Ans: 27,712
5. The value of Σx² is:
Ans: 1,276
6. The value of Σy² is:
Ans: 2,030
7. The value of Σ(x − 2)²y is:
Ans: 21,752
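The four answers can be checked with a short Python sketch. It reads the four quantities as Σx²y, Σx², Σy² and Σ(x − 2)²y, an interpretation that reproduces the stated answers; the variable names are illustrative.

```python
# The six (x, y) pairs from the table
x = [8, 11, 15, 5, 20, 21]
y = [10, 16, 20, 7, 28, 21]

sum_x2y = sum(xi ** 2 * yi for xi, yi in zip(x, y))              # Q4
sum_x2 = sum(xi ** 2 for xi in x)                                # Q5
sum_y2 = sum(yi ** 2 for yi in y)                                # Q6
sum_xm2_sq_y = sum((xi - 2) ** 2 * yi for xi, yi in zip(x, y))   # Q7

print(sum_x2y, sum_x2, sum_y2, sum_xm2_sq_y)
```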
Chapter 2 & 3
Question 1
The weekly food expenditures (US $) of a random sample of 40 students are shown below:
21 30 42 50 62 71 83 93
83 73 62 50 44 32 21 23
32 44 51 64 75 76 66 53
45 34 24 28 35 47 55 66
55 47 36 47 56 57 57 58
Also given Σx = 2048 and Σx² = 117836
a. Calculate the sample mean, variance and standard deviation.
b. Construct the ordered stem and leaf display to sort the data.
c. Calculate the five-figure summary. Calculate the inner lower fence and the inner upper
fence and hence identify the presence of any outliers. Hence, construct the boxplot.
d. Fill in the values for the three columns in the frequency distribution below.
Weekly food expenditure (US$)   Frequency   Relative frequency   Cumulative frequency
20 and less than 30
30 and less than 40
40 and less than 50
50 and less than 60
60 and less than 70
70 and less than 80
80 and less than 90
90 and less than 100
Total
e. From the frequency distribution in part (d), calculate the mean and standard deviation.
f. Giving 3 reasons, are the weekly food expenditures positively or negatively skewed?
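For parts (a) and (d), a minimal Python sketch is given below as a way to check hand calculations. It assumes the shortcut formula s² = (Σx² − (Σx)²/n) / (n − 1) for the sample variance and the class limits of the form "lower and less than lower + 10" used in the table; the variable names are illustrative.

```python
from itertools import accumulate
from math import sqrt

# Weekly food expenditures (US $) of the 40 students, row by row
data = [
    21, 30, 42, 50, 62, 71, 83, 93,
    83, 73, 62, 50, 44, 32, 21, 23,
    32, 44, 51, 64, 75, 76, 66, 53,
    45, 34, 24, 28, 35, 47, 55, 66,
    55, 47, 36, 47, 56, 57, 57, 58,
]

n = len(data)
sum_x = sum(data)                  # should agree with the given sum of x
sum_x2 = sum(v * v for v in data)  # should agree with the given sum of x squared

# Part (a): sample mean, variance (shortcut formula) and standard deviation
mean = sum_x / n
variance = (sum_x2 - sum_x ** 2 / n) / (n - 1)
std_dev = sqrt(variance)

# Part (d): classes 20 to <30, 30 to <40, ..., 90 to <100
lower_limits = list(range(20, 100, 10))
freq = [sum(1 for v in data if lo <= v < lo + 10) for lo in lower_limits]
rel_freq = [f / n for f in freq]
cum_freq = list(accumulate(freq))

print(f"mean={mean:.2f} variance={variance:.2f} sd={std_dev:.2f}")
for lo, f, r, c in zip(lower_limits, freq, rel_freq, cum_freq):
    print(f"{lo} and less than {lo + 10}: {f:2d} {r:.3f} {c:2d}")
```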
Question 2
A statistician would like to study the distribution of monthly sales pattern of outlets for product X. He
selected a random sample of 30 outlets and recorded their monthly sales in $ million. The summary output
of the monthly sales of the 30 outlets is shown below. However, four values, marked a, b, c and d, are missing
from the summary output.
Summary output
Sales ($ Million)
Mean 29.77
Standard Error 2.75
Median 32
Mode 35
Standard Deviation 15.06
Sample Variance a
Kurtosis -0.91
Skewness d
Range b
Minimum 3
Maximum 52
Sum c
Count 30
First Quartile 19
Third Quartile 40
b. Find the values of the three measures of central tendency and interpret their meanings.
c. From the summary statistics, is the value of d positive or negative? Justify your answer.
d. Compute the interval 𝑥̅ ± 1.5𝑠, where 𝑥̅ and 𝑠 are the sample mean and standard deviation
respectively. According to Chebyshev’s theorem, at least what percentage of the monthly sales ($
million) is expected to lie within this interval?
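The missing entries and the Chebyshev interval can be sketched from the printed summary figures. This is a rough check only: it assumes variance = (standard deviation)², range = maximum − minimum and sum = mean × count, and because the printed mean and standard deviation are rounded, the results are approximate. The variable names are illustrative.

```python
# Figures copied from the summary output
mean, median, mode = 29.77, 32, 35
std_dev = 15.06
minimum, maximum = 3, 52
count = 30

sample_variance = std_dev ** 2   # a, approximate (s is rounded)
data_range = maximum - minimum   # b
total = mean * count             # c, approximate (mean is rounded)

# A mean below the median below the mode points to a longer left tail
left_tail_longer = mean < median < mode

# Part (d): interval mean +/- 1.5 s and the Chebyshev lower bound
k = 1.5
lower = mean - k * std_dev
upper = mean + k * std_dev
min_percent = (1 - 1 / k ** 2) * 100

print(f"a={sample_variance:.2f} b={data_range} c={total:.1f}")
print(f"interval=({lower:.2f}, {upper:.2f}) at least {min_percent:.2f}%")
```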
Question 3
The following table gives the frequency distribution of meal expenses ($) per month by 100 adults
selected from a city.
Question 4
The stem and leaf display below shows the daily spending ($) of a sample of 35 adults.
Stem | Leaf
1 | 12
2 | 234788999
3 | 0233467
4 | 123345
5 | 11345
6 | 0123
7 | 23
Legend: 1|1 means $11
a. Find the three measures of central tendency, first quartile and third quartile.
c. Construct a frequency distribution table of daily spending ($) using 10-19 as first class, 20-29 as
second class and so on.
d. Find the approximate value of the 60th percentile for the data.
e. Find the percentile rank for the adult who spends $29 per day.
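To work on Question 4 the display first has to be expanded into raw values. The sketch below does that and then reads off the central tendency measures and the class frequencies. It assumes the detached leaves 9 and 7 belong to stems 2 and 3 respectively, which gives the stated 35 observations; the dictionary `stem_leaf` and the other names are illustrative.

```python
from statistics import mean, median, multimode

# Stem -> leaves, assuming the stray 9 and 7 belong to stems 2 and 3
stem_leaf = {
    1: "12",
    2: "234788999",
    3: "0233467",
    4: "123345",
    5: "11345",
    6: "0123",
    7: "23",
}

# Legend: 1|1 means $11, so value = 10 * stem + leaf
values = sorted(
    10 * stem + int(leaf)
    for stem, leaves in stem_leaf.items()
    for leaf in leaves
)

n = len(values)
centre_mean = mean(values)
centre_median = median(values)
centre_modes = multimode(values)  # list, in case of ties

# Part (c): classes 10-19, 20-29, ... match the stems one to one
class_freq = {f"{10 * s}-{10 * s + 9}": len(l) for s, l in stem_leaf.items()}

print(n, round(centre_mean, 2), centre_median, centre_modes)
print(class_freq)
```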
Question 5
The work experiences (in years) of all the 14 employees of a company are
8 21 11 4 14 17 11 8 8 7 2 11 27 6
b. Find the first, second and third quartiles. (Ans: 7, 9.5, 14)
c. Find the approximate value of the 70th percentile for the data. (Ans: 11)
d. Find the approximate value of the 30th percentile for the data. (Ans: 7)
e. Find the percentile rank for the employee with 17 years of experience (Ans: 78.57)
f. Find the percentile rank for the employee with 8 years of experience (Ans: 28.57)
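The stated answers for Question 5 can be reproduced with the sketch below. The conventions are assumptions inferred from those answers rather than stated in the notes: quartiles are taken as the medians of the lower and upper halves of the ranked data, the kth percentile is the value of the (kn/100)th ranked term rounded to the nearest whole term, and the percentile rank is the percentage of values strictly less than the given value. The function names are illustrative.

```python
from statistics import median

years = [8, 21, 11, 4, 14, 17, 11, 8, 8, 7, 2, 11, 27, 6]
ranked = sorted(years)

def quartiles(ranked):
    """Q1, Q2, Q3 as medians of the lower half, whole set and upper half."""
    half = len(ranked) // 2
    lower = ranked[:half]
    upper = ranked[-half:]  # skips the middle value when n is odd
    return median(lower), median(ranked), median(upper)

def percentile(ranked, k):
    """Approximate kth percentile: value of the (k*n/100)th ranked term."""
    position = k * len(ranked) / 100
    term = max(1, int(position + 0.5))  # round to the nearest term, half up
    return ranked[term - 1]

def percentile_rank(ranked, value):
    """Percentage of observations strictly less than value."""
    below = sum(1 for v in ranked if v < value)
    return below / len(ranked) * 100

print(quartiles(ranked))
print(percentile(ranked, 70), percentile(ranked, 30))
print(round(percentile_rank(ranked, 17), 2), round(percentile_rank(ranked, 8), 2))
```

Other textbooks and software use different quartile and percentile conventions, so results from a spreadsheet or a statistics library may differ slightly from these.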