STA162 2023 02 Exam Paper
STA162 2023 02 Exam Paper
Semester 2 2023
Module name Statistics
Module Code STA162
NQF level 5
Date and Time 17 November 2023 – 09:00-11:30
Duration 2 Hours & 30 Minutes
Exam type Closed book
Marks 50
General instructions
1. Complete your personal information on the front cover of the STADIO Examination
Answer Booklet.
2. Ensure you number the answer booklets correctly, in the case where more than one
answer booklet is used, i.e., Book 1 of 2, etc.
3. Use only black or blue pen. Do not use a pencil to answer questions. Pencilled answers
will not be marked.
4. Answer all the questions unless otherwise instructed.
5. Read all questions carefully before attempting to answer.
6. Always number the answers and any sub-question answers the same as the question
numbers in the examination paper.
7. Rough work may be done at the back of the examination book only. All rough work
must be labelled as such.
8. By accepting this examination script, you agree to abide by the STADIO Examination
Rules and Regulations.
STA162
©STADIO [Statistics STA162] [Semester 2 - 2023]
Page 1 of 13
Equipment (Closed book)
1. No documents, notes, files, study guides, textbooks, or other materials will be allowed
into the examination.
2. Calculators are permitted.
3. No mobile devices or electronic equipment including smart watches, laptops, iPads,
Kindles, etc. will be allowed on your person or at your desk during the examination.
4. No borrowing or lending of any examination material and/or stationery will be
permitted.
5. A formula sheet is included at the end of the paper.
6. Additional blank paper will be provided for calculations.
Select the correct answer and write down only the sub-question number and next to it the
letter that represents the answer you have selected.
For example: 1. D
Question 1 (1 Mark)
A) Positively skewed
B) Negatively skewed
C) No Skewness
Question 2 (1 Mark)
Identify which sampling method is used if one member of a population is identified and asked
to identify other members of the target population.
A) Quota sampling
B) Snowball sampling
C) Judgement sampling
D) Convenience sampling
Question 3 (1 Mark)
If the correlation coefficient between two variables is -0,24. The relationship between the two
variables can best be described as a:
A Discovery call centre asks its callers to rate the service that they received on a scale of 1 to 5,
where 1 = very poor and 5 = excellent. The results will be used to determine the overall quality
of service that the call centre provides. Identify the category into which this falls.
A) Descriptive statistics
B) Statistical modelling
C) Inferential statistics
Question 5 (1 Mark)
A) Data type
B) Data source
C) Method of data collection
Question 6 (1 Mark)
Identify whether the statement below is true or false. The statement is based on the following
Venn diagram:
A) True
B) False
Choose which of the following is considered secondary data for a company such as Woolworths.
Question 8 (1 Mark)
Identify whether the statement below is true or false. The statement is based on the following
Venn diagram:
A) True
B) False
Question 9 (1 Mark)
The following table represents the preference of shoppers for cell-phone brands per month:
Cell-phone brand Number of shoppers’ preference
Samsung 440
Apple 310
Huawei 150
TOTAL 900
Suppose that a company makes 75 products, and must select any three (3) products to showcase
at a conference.
Identify the correct way to calculate how many different groupings of three (3) products they
could show.
75!
A)
3!(75−3)!
3!
B)
3!(75−3)!
75!
C)
75!(75−3)!
75!
D)
75!(75−3)!
Question 1 (6 Marks)
A retail store is conducting an experiment to determine how many customers enter the store
and purchase any product on a given day. One of the store managers estimates that the
probability of any customer entering the shop and purchasing any product is 0.40. A sample of
nine customers was used in the experiment for the month of November.
1.1 The probability that exactly two customers make a purchase (2)
1.2 The probability that none of the customers makes a purchase (2)
1.3 The probability that at most, one customer makes a purchase (2)
Question 2 (8 Marks)
Answer the following questions based on the given set of data extracted from a Financial
Management class test at STADIO (the test was out of 50 marks and a sample of 10 students
were selected):
Student Mark
A 25
B 40
C 35
D 18
E 22
F 50
G 30
H 15
I 22
J 34
The manager of a restaurant wants to analyse the amount customers spent on average on one
of the new items on their menu. Using a random sample, the sample mean spend was found to
be R220 with a sample standard deviation of R30.
3.1 Estimate the 95% confidence interval of the actual mean spend on the new menu item,
assuming that the sample comprised 36 customers. (4)
3.2 Estimate the 95% confidence interval of the actual mean spend on the new menu item,
assuming that the sample comprised 51 customers. (4)
Question 4 (3 Marks)
The table below shows a sample of the company sizes and the industries of 280 JSE-listed
companies.
Industry Company size Column total
Small Medium Large
Retail 48 15 27 90
Mining 30 95 65 190
Row total 78 110 92 280
Based on the above data, if a company is selected at random, determine the probability of
selecting the following companies. Show all calculations where necessary.
The South African Reserve Bank has increased the repurchase rate (repo rate), which ultimately
increased the prime lending interest rate and the cost of borrowing for clients applying for
credit.
You have been appointed as a bank analyst to find out whether there is a relationship between
the increasing interest rates and the number of new home-loan applications.
Use the historical data below to answer the questions that follow.
Prime Number of home-loan
lending interest rate applications
(%) (no. of clients)
5.5 5 110
7 2 090
7.5 1 367
8 1 427
9 1 597
10.25 1 701
10.5 799
11 850
11.75 920
12 670
12.25 512
14.5 798
1.1 Using simple linear regression analysis, calculate and formulate a simple linear regression
equation to identify the constant and the slope of the linear regression equation. (Your
answer must be in the format y = a + bx, where a represents the y-intecept/constant and
b the slope/gradient of the linear regression line.) (9)
1.2 Explain the relationship between the two variables (Prime lending interest rate and
Number of home-loan applications) based on the simple linear regression line that you
have calculated in Question 1.1. (1)
1.4 Explain the relationship between the two variables (Prime lending interest rate and
Number of home-loan applications) based on the correlation coefficient. (1)
where
𝑥𝑥𝑑𝑑 is the difference between pairs: 𝑥𝑥𝑑𝑑 = 𝑥𝑥1 − 𝑥𝑥2
∑ 𝑥𝑥𝑑𝑑
𝑥𝑥̅𝑑𝑑 is the average of the paired differences: 𝑥𝑥̅𝑑𝑑 =
𝑛𝑛
∑(𝑥𝑥𝑑𝑑 − 𝑥𝑥̅ 𝑑𝑑 )2
𝑠𝑠𝑑𝑑 is the standard deviation of the paired differences: 𝑠𝑠𝑑𝑑 = � 𝑛𝑛−1
𝑛𝑛 ∑ 𝑥𝑥𝑥𝑥 − ∑ 𝑥𝑥 ∑ 𝑦𝑦
𝑟𝑟 =
�[𝑛𝑛 ∑ 𝑥𝑥 2 − (∑ 𝑥𝑥)2 ] × [𝑛𝑛 ∑ 𝑦𝑦 2 − (∑ 𝑦𝑦)2 ]