Stat 111 - Tutorial Set 2

This document provides a tutorial on statistics concepts and calculations. It includes 15 questions covering topics such as constructing frequency tables and histograms from data sets, calculating measures of central tendency and dispersion, determining percentiles and quartiles, and distinguishing between different types of data and random variables. Students are asked to perform statistical analyses on various data sets involving test scores, physical measurements, purchase amounts, and other values. The goal is to practice common descriptive statistics procedures.

Uploaded by

Damien Afari

UNIVERSITY OF GHANA

DEPARTMENT OF STATISTICS AND ACTUARIAL SCIENCE


STAT 111-TUTORIAL SET 2
1. Given the following grades of 50 students in an examination:
44 13 47 27 55 41 48 54 78 66
58 35 58 36 66 60 53 35 35 47
49 55 32 45 79 45 82 22 18 58
37 36 45 51 79 24 26 33 64 68
45 57 30 72 57 81 33 63 54 42
Construct:
(a) an array for the data;
(b) a stem and leaf table;
(c) a simple frequency table;
(d) a grouped frequency table and hence
i. Determine suitable class boundaries;
ii. Obtain class marks;
(e) a relative percentage frequency table;
(f) a cumulative frequency table;
(g) a relative cumulative frequency table.
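
One way to check a hand-built grouped frequency table for Question 1 is to tally the grades in Python. The class width of 10 and the first lower limit of 10 are assumptions made for this sketch; other class choices are equally valid and will give different tables.

```python
# Grades of the 50 students from Question 1.
grades = [44, 13, 47, 27, 55, 41, 48, 54, 78, 66,
          58, 35, 58, 36, 66, 60, 53, 35, 35, 47,
          49, 55, 32, 45, 79, 45, 82, 22, 18, 58,
          37, 36, 45, 51, 79, 24, 26, 33, 64, 68,
          45, 57, 30, 72, 57, 81, 33, 63, 54, 42]

width = 10  # assumed class width
start = 10  # assumed lower limit of the first class


def grouped_frequencies(data, start, width):
    """Count observations in the classes start..start+width-1, and so on."""
    n_classes = (max(data) - start) // width + 1
    counts = [0] * n_classes
    for x in data:
        counts[(x - start) // width] += 1
    return counts


counts = grouped_frequencies(grades, start, width)

cumulative = 0
for i, f in enumerate(counts):
    lower = start + i * width
    upper = lower + width - 1
    class_mark = (lower + upper) / 2
    cumulative += f
    # class limits, class mark, frequency, relative %, cumulative frequency
    print(f"{lower}-{upper}  {class_mark:5.1f}  {f:3d}  "
          f"{100 * f / len(grades):5.1f}%  {cumulative:3d}")
```

With integer grades, the class boundaries for limits such as 10-19 would be 9.5 and 19.5 under the usual half-unit convention.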

2. The following data are the weights of the security personnel of a company:
80 100 65 120 105 70 115 10 60 98
15 85 79 101 75 90 77 102 97 118
92 86 108 80 88 100 116 66
Compute
a) the range;
b) the interquartile range;
c) the quartile deviation;
d) the mean absolute deviation;
e) the population variance and hence the population standard deviation;
f) the sample variance and hence the sample standard deviation.
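
The measures in Question 2 can be cross-checked with Python's standard `statistics` module. This is only a sketch: textbooks differ on how quartiles are located, and the code below assumes the module's default ("exclusive") quartile method, so the interquartile range may differ slightly from a hand calculation that uses another convention.

```python
import statistics

# Weights of the 28 security personnel from Question 2.
weights = [80, 100, 65, 120, 105, 70, 115, 10, 60, 98,
           15, 85, 79, 101, 75, 90, 77, 102, 97, 118,
           92, 86, 108, 80, 88, 100, 116, 66]

n = len(weights)
data_range = max(weights) - min(weights)

# Quartiles (assumption: default "exclusive" method of statistics.quantiles).
q1, q2, q3 = statistics.quantiles(weights, n=4)
iqr = q3 - q1
quartile_deviation = iqr / 2

# Mean absolute deviation about the arithmetic mean.
mean = statistics.fmean(weights)
mad = sum(abs(x - mean) for x in weights) / n

# Population measures divide by n; sample measures divide by n - 1.
pop_var = statistics.pvariance(weights)
pop_sd = statistics.pstdev(weights)
samp_var = statistics.variance(weights)
samp_sd = statistics.stdev(weights)

print(f"range = {data_range}, IQR = {iqr:.2f}, QD = {quartile_deviation:.2f}")
print(f"MAD = {mad:.2f}")
print(f"population variance = {pop_var:.2f}, sd = {pop_sd:.2f}")
print(f"sample variance = {samp_var:.2f}, sd = {samp_sd:.2f}")
```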

STAT 111 Tutorial Set 2 Page 1 of 7


3. The first three moments of a distribution about the value 4 are -1.5, 17 and -30. Find the values of
the mean, standard deviation and the moment measure of skewness.
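
Question 3 rests on the standard conversion from moments about an arbitrary point to central moments. The sketch below applies those relations; the variable names are illustrative, and the skewness measure is assumed to be the third central moment divided by the cube of the standard deviation.

```python
import math

a = 4.0                        # point about which the moments are given
m1, m2, m3 = -1.5, 17.0, -30.0  # first three moments about a

mean = a + m1                          # mean = a + m1'
variance = m2 - m1 ** 2                # mu2 = m2' - (m1')^2
sd = math.sqrt(variance)
mu3 = m3 - 3 * m1 * m2 + 2 * m1 ** 3   # mu3 = m3' - 3 m1' m2' + 2 (m1')^3
skewness = mu3 / sd ** 3               # assumed moment measure of skewness

print(f"mean = {mean}, variance = {variance}, sd = {sd:.4f}")
print(f"mu3 = {mu3}, skewness = {skewness:.4f}")
```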

4. (i) Define an outlier.
(ii) Define the 25th, 50th, and 75th percentiles of a data set.
(iii) What is the interquartile range?
(iv) What are the hinges of a box plot?
(v) Using the data in Question 1, obtain the box-and-whisker plot. Describe the distribution.

5. Classify each of the following examples as nominal, ordinal, interval or ratio. Explain each of
your answers.
a) Height of students.
b) Brand of cars used by lecturers.
c) Time spent on solving a question.
d) Number of final year Sociology students.
e) Identity numbers of voters.
f) Temperature recorded on the first Sunday of January.
g) Distance between Accra and Lagos.
h) Evaluation of workers on a scale from 1 to 10 (1 is the lowest)

6. Define
a) Simple random sampling without replacement
b) Simple random sampling with replacement
Give two examples of situations where sampling has to be done without replacement.

7. The table below shows the number of male and female students who visited the University of Ghana
Computing Systems (UGCS) in the first three months of 2005.

          Month
Sex       January   February   March
Male      75        88         77
Female    65        92         100

a) Use the information to draw a pie chart.
b) Use the information to draw a bar chart.

8. (a) Calculate the arithmetic mean, harmonic mean and the geometric mean from the data
below:
5 38 158 950 1800 4000
(b) Which of these computed measures are appropriate for the data? Please explain your
answer.
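
The three means in Question 8 can be verified with the standard `statistics` module (`geometric_mean` and `harmonic_mean` need Python 3.8 or later). This is a sketch for checking arithmetic only; it does not settle part (b).

```python
import statistics

data = [5, 38, 158, 950, 1800, 4000]

am = statistics.mean(data)            # arithmetic mean
gm = statistics.geometric_mean(data)  # sixth root of the product
hm = statistics.harmonic_mean(data)   # n divided by the sum of reciprocals

print(f"AM = {am:.2f}, GM = {gm:.2f}, HM = {hm:.2f}")
```

For positive data that are not all equal, the results always satisfy HM < GM < AM, which is a quick sanity check on hand calculations.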

9. Given the following data:
44 13 47 27 55 41 48 54 78 66
58 35 58 36 66 60 53 35 35 47
49 55 32 45 79 45 82 22 18 58
37 36 45 51 79 24 26 33 64 68
45 57 30 72 57 81 33 63 54 42

a) From the raw data, compute the mean, the median and the mode.
b) Obtain a grouped frequency distribution to compute the mean, the median and the mode
and compare the results with those in Question (a) above.
c) From the raw data, compute the first quartile, the fourth decile and the seventy-fifth percentile.
In each case explain the significance of the answer.
d) From the grouped frequency distribution, compute the first quartile, the fourth decile and the
seventy-fifth percentile and compare the results with those in Question (c) above.

10. Suppose you have spent GHc 1,000 for 20 pearls in one shop, another GHc 1,000 for 38 pearls in
a second shop, and still another GHc 1,000 for 44 pearls in a third shop.

a) Calculate the arithmetic mean of the three shops' prices for a pearl.
b) Calculate the harmonic mean of the three shops' prices for a pearl.
c) Explain why the arithmetic mean may be incorrect but the harmonic mean may be
correct in this case.
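
A short numerical sketch for Question 10 compares both means with the overall price actually paid per pearl (total money spent divided by total pearls bought). The variable names are illustrative.

```python
import statistics

spend_per_shop = 1000        # GHc spent in each shop
pearls = [20, 38, 44]        # pearls bought in each shop

# Price per pearl in each shop.
prices = [spend_per_shop / k for k in pearls]

am = statistics.fmean(prices)
hm = statistics.harmonic_mean(prices)

# Overall average price: total money spent over total pearls bought.
overall = spend_per_shop * len(pearls) / sum(pearls)

print(f"prices per pearl: {[round(p, 2) for p in prices]}")
print(f"AM = {am:.2f}, HM = {hm:.2f}, overall = {overall:.2f}")
```

Because the same amount is spent in every shop, the overall price coincides with the harmonic mean of the three prices, while the arithmetic mean overstates it.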

11. Answer the following questions:
a) What is a random variable?
b) Distinguish between discrete and continuous random variables. Give an example each.
c) Distinguish between a probability mass function and a probability distribution function.

12. Hannah Properties is a real estate company which specializes in custom-home resale in Accra.
The following is a sample of the size (in hundreds of square feet) and price (in thousands of Ghana
Cedis) data for nine custom homes currently listed for sale.

Size 26 27 33 29 29 34 30 40 22
Price 290 305 325 327 356 411 488 554 246

a) Calculate, for each of the variables,
i. the arithmetic mean;
ii. the standard deviation.
b) Explain the importance of the coefficient of variation as a measure of relative dispersion.
Compute the coefficient of variation for the size and price of homes at Hannah Properties.
Comment on the results.
c) Compute and interpret the correlation coefficient between the size and price of the homes at
Hannah Properties.

13. The number of passengers carried by a boat during 25 trips is given below:

52 84 61 65 77 64 62 35 82 38 50
51 66 60 95 58 89 78 103 71 75 41

a) Construct a stem and leaf display with one-digit leaves.
b) Are there outliers in the given data? Provide an explanation for your answer.
c) Use the above information to obtain a modified Box-and-Whisker plot.

14. The durations of patients' stays in a hospital were organized into a frequency distribution. The mean
duration of stay was 28 days, the mode was 23 days and the median was 25 days. The standard
deviation was 4.2 days.

Find the coefficient of skewness. Is the distribution symmetrical, positively skewed or negatively
skewed? Give reasons to support your answer.
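
Question 14 does not say which coefficient of skewness is intended. The sketch below assumes Pearson's two coefficients, one based on the mode and one based on the median, and evaluates both from the summary figures given.

```python
mean, median, mode, sd = 28.0, 25.0, 23.0, 4.2  # days, from Question 14

# Pearson's first coefficient of skewness (mode-based).
sk_mode = (mean - mode) / sd

# Pearson's second coefficient of skewness (median-based).
sk_median = 3 * (mean - median) / sd

print(f"mode-based skewness   = {sk_mode:.4f}")
print(f"median-based skewness = {sk_median:.4f}")
```

Both values are positive, which agrees with the ordering mode < median < mean in the data.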

15. The data relating to the lengths of 60 leaves, measured in centimeters, are shown below:

Length(cm) Frequency
10 - < 15 3
15 - < 20 8
20 - < 25 12
25 - < 30 7
30 - < 35 15
35 - < 40 8
40 - < 45 5
45 - < 50 2
Total 60

a) Draw a histogram for the given data.
b) Draw a frequency polygon superimposed on the histogram.
c) Estimate the mean, median and mode for the data.
d) Estimate the semi-interquartile range (SIQR).
e) Estimate the variance and the standard deviation.
f) Estimate the coefficient of variation.
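
The grouped-data estimates in Question 15 can be sketched as follows. The code assumes class midpoints as class marks, linear interpolation within a class for quantiles, the usual interpolation formula for the mode (valid here because the modal class is not the first or last class), and the population (divide by n) form of the variance; a course that uses n - 1 will get a slightly larger variance.

```python
import math

lowers = [10, 15, 20, 25, 30, 35, 40, 45]  # lower class boundaries (cm)
width = 5
freqs = [3, 8, 12, 7, 15, 8, 5, 2]

n = sum(freqs)
mids = [lo + width / 2 for lo in lowers]

mean = sum(f * x for f, x in zip(freqs, mids)) / n
variance = sum(f * (x - mean) ** 2 for f, x in zip(freqs, mids)) / n  # assumed: divide by n
sd = math.sqrt(variance)
cv = 100 * sd / mean  # coefficient of variation, in percent


def grouped_quantile(p):
    """Interpolate the p-th quantile (0 < p < 1) inside its class."""
    target = p * n
    cum = 0
    for lo, f in zip(lowers, freqs):
        if cum + f >= target:
            return lo + (target - cum) / f * width
        cum += f
    raise ValueError("p must lie between 0 and 1")


median = grouped_quantile(0.5)
siqr = (grouped_quantile(0.75) - grouped_quantile(0.25)) / 2

# Mode by interpolation within the modal class.
i = freqs.index(max(freqs))
d1 = freqs[i] - freqs[i - 1]
d2 = freqs[i] - freqs[i + 1]
mode = lowers[i] + d1 / (d1 + d2) * width

print(f"mean = {mean:.2f}, median = {median:.2f}, mode = {mode:.2f}")
print(f"SIQR = {siqr:.2f}, variance = {variance:.2f}, sd = {sd:.2f}, CV = {cv:.1f}%")
```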

16. (a) A research organization selected a sample of 30 visitors to a prestigious shopping
mall. The data about the ages of the selected persons have been organized into the
following frequency table:

Age (in years) Number of visitors
18 to < 23 2
23 to < 28 7
28 to < 33 12
33 to < 38 6
38 to < 43 3
You are required to calculate the following:
(i) Range
(ii) Sample variance and sample standard deviation.
(iii) Coefficient of variation.

(b) The box plots below summarize the distribution of SAT verbal and math scores
among students at the Department of Statistics for the 2021/2022 academic year.

Describe and compare the distributions of the Math and Verbal scores of the
students based on the range, the five number summary statistics and any other
dispersion measures you consider appropriate.

17. The age and price data for a sample of 11 secondhand Nissan Sunny cars are
presented in the following table:
Age (years), X    Price (in GHc '000), Y
5 85
4 103
6 70
5 82
5 89
5 98
6 66
6 95
2 169
7 70
7 48

(i) Construct the scatter diagram for the data.
(ii) Calculate the Pearson’s product moment coefficient of correlation between
X and Y and comment on the result.
(iii) Calculate the Spearman’s rank coefficient of correlation between X and Y
and comment on the result.
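
Both correlation coefficients in Question 17 can be checked with a small self-contained sketch. Because the ages contain ties, the Spearman coefficient here is computed as the Pearson correlation of average ranks rather than with the shortcut formula based on squared rank differences, which is exact only when there are no ties. The helper names are illustrative.

```python
import math

ages = [5, 4, 6, 5, 5, 5, 6, 6, 2, 7, 7]               # X, years
prices = [85, 103, 70, 82, 89, 98, 66, 95, 169, 70, 48]  # Y, GHc '000


def pearson(x, y):
    """Pearson product moment correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """Rank from smallest (1) upward; tied values share the average rank."""
    ordered = sorted(values)
    return [ordered.index(v) + (ordered.count(v) + 1) / 2 for v in values]


r = pearson(ages, prices)
rs = pearson(average_ranks(ages), average_ranks(prices))

print(f"Pearson r = {r:.4f}")
print(f"Spearman rs = {rs:.4f}")
```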

18. A survey asked people how often they exceed speed limits. The data are then
categorized into the following contingency table of counts showing the relationship
between age group and response.

                  Exceed Limit
Age (in years)    Always    Not always    Total
Under 30          100       100           200
Over 30           40        160           200
Total             140       260           400

Compute the Phi (φ) contingency coefficient between age group and exceeding the speed
limit and comment appropriately.

19. Suppose a study of speeding violations and drivers who use cell phones whilst driving
produced the following data:

Cell phone usage    Driving
whilst driving      Speeding    Not speeding    Total
Yes                 25          280             305
No                  45          405             450
Total               70          685             755

Calculate:
(i) the chi-square statistic between speeding and cell phone usage whilst driving.
(ii) the Cramer’s V contingency coefficient between the two variables.
(iii) the Phi contingency coefficient between the two variables.
(iv) the Tschuprov’s T contingency coefficient between the two variables.
(v) the Pearson’s contingency coefficient between the two variables.
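
The association measures asked for in Question 19 all derive from the chi-square statistic. The sketch below uses the usual textbook definitions and assumes no continuity (Yates) correction; the function name and dictionary keys are illustrative. The same function can be reused for any other table of counts.

```python
import math


def contingency_measures(table):
    """Chi-square and derived association measures for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-square: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, c = len(row_totals), len(col_totals)
    return {
        "chi2": chi2,
        "phi": math.sqrt(chi2 / n),
        "cramers_v": math.sqrt(chi2 / (n * min(r - 1, c - 1))),
        "tschuprow_t": math.sqrt(chi2 / (n * math.sqrt((r - 1) * (c - 1)))),
        "pearson_c": math.sqrt(chi2 / (chi2 + n)),
    }


# Rows: cell phone use (Yes, No); columns: Speeding, Not speeding.
table = [[25, 280],
         [45, 405]]

result = contingency_measures(table)
for name, value in result.items():
    print(f"{name} = {value:.4f}")
```

For a 2 x 2 table, Phi, Cramer's V and Tschuprov's T coincide, since (r - 1)(c - 1) = 1.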

20. The table below shows the number of athletes who stretch before exercising and how
many had injuries within the past year:

             Injury last year
Stretches    Yes    No     Total
Yes          55     295    350
No           231    219    450
Total        286    514    800

Calculate:
(i) the chi-square statistic between stretching before exercising and injury.
(ii) the Cramer’s V contingency coefficient between the two variables.
(iii) the Phi contingency coefficient between the two variables.
(iv) the Tschuprov’s T contingency coefficient between the two variables.
(v) the Pearson’s contingency coefficient between the two variables.
