SAS 1201 PROBABILITY & STATISTICS I
SAS 1201 PROBABILITY & STATISTICS I
Page 1 of 4
d) A student plays a game in which a fair six-sided die is tossed once. If the score is 1, 2 or 4,
she loses 10 sh. She wins x if the score is 3 or 5 and wins 2x if the score is 6.
i. Find the probability distribution or returns for this game [2 Marks]
ii. Find the mean and variance of return when x=10 [4 Marks]
e) In an examination of nine applicants for a clerical post, the marks obtained in accountancy
and statistics papers were as follows:
Applicant 1 2 3 4 5 6 7 8 9
Accounting test score 15 28 28 12 40 60 20 80 28
Statistics test score 40 30 39 30 20 11 30 60 40
Using the rank correlation coefficient with adjustment for ties, investigate whether there
is a relationship between the scores in the two tests. [6 Marks]
a) Briefly outline the significance of correlation analysis, and explain three types of correlation
[4 Marks]
b) Discuss the terms skewness and kurtosis and point out their role in analyzing a frequency
distribution. [3 Marks]
c) Compute the quartile and Pearson’s coefficients of skewness for the following data: 2, 10,
13, 13, 4, 12, 8, 6, 5, 9, 8, 13 [5 Marks]
d) The share price in shillings of a certain company is monitored over a nine month period. The
results are shown in the table below and some summary statistics given:
Time (years) 0 1 2 3 4 5 6 7 8
Price 100 131 183 247 330 454 601 819 1,095
yi xi ei i 0,1,2,...,8
Where; ei are independent normal random variables with mean zero and variance 2 .
i. Plot a scatter diagram of these data and comment on the appropriateness of the
chosen model [2 Marks]
Page 2 of 4
ii. Determine the fitted regression line in which price is modeled as the response
variable and time as the explanatory variable [3 Marks]
iii. Is there an association between the prices and time? [3 Marks]
Marks(more than) 0 10 20 30 40 50 60 70 80
Number of students 150 140 100 80 80 70 30 14 0
i. Organize the above data into a decumulated frequency distribution table with 8
classes [2 marks]
ii. Using the coding method with an assumed mean of 30, estimate the actual mean
and variance for these data [6 marks]
iii. Construct a cumulative frequency curve for the above data and use it to estimate
the interquartile range of the data [5 marks]
iv. Estimate the modal mark [2 marks]
b) In driving to work, Mary has to pass through three roundabouts. The probability that she
will have to stop at any of them is . Draw a tree diagram to represent the data; hence
determine the probability that on any one journey she will have to stop at:
i. None of the roundabouts [2 Marks]
ii. Any two of the roundabouts [3 Marks]
a) A circular board with 8 equal sectors labeled 1-8 is mounted on a spinner. When the board is
spun, the arrow of the spinner is equally likely to land in any of the sectors. The board is
spun once. The following events are defined based on the label on the sector in which the
arrow lands: A- An even number, B- A prime number, C- An odd number, D- A perfect
square, E- A number greater than 6
i. Enumerate the members of sets A, B and D, named according to the above events and
represent them in a well labeled Venn diagram [3 Marks]
ii. What type of events are A and C? Explain. [2 Marks]
Page 3 of 4
iii. Find the probability that the score will be a perfect square, a prime number and more
than 6 [3 Marks]
iv. Find the probability that the score will be both a prime number and an even number
[1 Mark]
v. Find the probability that the score will be A prime number or more than 6 or both
[2 Marks]
b) The data below represents the number of goals per match scored by Team A and Team B in a
number of matches played last season;
Team A: 0, 1, 3, 2, 1, 0, 1
Team B: 1, 0, 2, 3, 5, 1
Find the:
a) Determine the first four moments about the point A=25 of the following distribution; hence
investigate the skewness and the peakedness of the distribution [9 Marks]
b) The number of hours taken by 50 employees at a certain factory to complete a task was
recorded as follows;
41 43 52 62 72 67 65 77 49 50
53 55 66 69 60 72 79 40 41 70
70 71 68 59 84 80 78 39 37 80
49 50 49 55 64 70 61 51 63 66
81 75 30 44 45 44 43 48 58 56
i. Starting with the interval 30-39, construct a frequency distribution table for
the data [3 marks]
ii. Draw a cumulative frequency curve of the data and use it to estimate the
interquartile range [6 marks]
th
iii. How long did the 44 employee take to complete the task? [2 Marks]
Page 4 of 4