CambMATHS10 5.1-5.3 2ED Test 04D

Chapter 4: Single variable and bivariate statistics

Test D (40 marks)


Name: ____________________

Part A – Multiple-choice (10 marks)

1 Which of the following statements is true?

A Both have the same minimum value

B Both have the same range

C Both have the same number of outliers

D Both have the same interquartile range

E Both have the same mean

2 The median of the data displayed in this stem-and-leaf plot is:


Stem | Leaf
  0  | 3 4 5
  1  | 0 1 1 2
  2  | 4 5 6
  3  | 0 1 2

A 10 B 24 C 25

D 12 E 11
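
One way to check a median read from a stem-and-leaf plot is to expand the plot into raw values first. The sketch below assumes each stem is a tens digit and each leaf a units digit; the name `plot` is illustrative only and not part of the test.

```python
from statistics import median

# Stem -> leaves, copied from the plot (assumed: stem = tens, leaf = units).
plot = {0: [3, 4, 5], 1: [0, 1, 1, 2], 2: [4, 5, 6], 3: [0, 1, 2]}

# Expand every stem/leaf pair into a value and put the values in order.
values = sorted(10 * stem + leaf for stem, leaves in plot.items() for leaf in leaves)

# With an odd number of values the median is the single middle value.
middle = median(values)
print(len(values), middle)
```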

© Cambridge University Press 2019 1


3 Every Year 10 student will vote for a Year 10 representative to go on the student
council. This election process is called:

A A population B A sample C A census

D Discrete data E Categorical data

4 A doctor records each patient’s weight. This type of data is called:

A Categorical Ordinal B Categorical Nominal C Skewed

D Numerical Discrete E Numerical Continuous

5 A survey question asking about a person’s highest level of education has the
alternatives:
1. Primary school
2. Secondary school
3. TAFE
4. University
This type of data is called:

A Categorical Ordinal B Categorical Nominal C Skewed

D Numerical Discrete E Numerical Continuous

6 The equation of the trend line shown is:

A 11y + 2x = 72 B 11y + 2x = 36 C y = −2x + 14

D 2y + 11x = 72 E 2y − 11x = 36

Questions 7 and 8 refer to the following set of data: 3, 4, 7, 2, 2, 11

7 The mean and median respectively of the above set of data are:

A 2 and 4.8 B 4.8 and 3.5 C 3.5 and 2

D 3.5 and 4.8 E 4.8 and 2

8 The standard deviation of the above set of data is closest to:

A 2.0 B 3.2 C 5.0

D 7.9 E 3.5
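
The three statistics asked for in questions 7 and 8 can be checked with Python's standard `statistics` module. This is a minimal sketch, not a marking guide: the test does not say whether the standard deviation divides by n or by n − 1, so both versions are computed and the choice is left to the course convention.

```python
from statistics import mean, median, pstdev, stdev

data = [3, 4, 7, 2, 2, 11]

m = round(mean(data), 1)              # arithmetic mean
med = median(data)                    # middle of the ordered values
sd_population = round(pstdev(data), 1)  # divides by n
sd_sample = round(stdev(data), 1)       # divides by n - 1

print(m, med, sd_population, sd_sample)
```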

Questions 9 and 10 refer to the following information.

This back-to-back stem-and-leaf plot shows the distribution of two data sets where each
contains 12 data values. The means and standard deviations are also given.

  Set A    | Stem | Set B
1 3 3 4 5  |  0   | 4 8 9
    0 2 4  |  1   | 1
  2 5 7 8  |  2   | 4 5 9
           |  3   | 0 3 5 6
           |  4   | 2

Set A: x̄ = 12.8, σ = 10.2
Set B: x̄ = 23.8, σ = 12.7

9 Which statement justifies why the mean of set A is smaller than that of set B?

A The distribution of data in set B is symmetric.

B The data in set B are more spread out.

C The data in set B are more closely clustered together.

D There are several values in set B that are larger than those in set A.

E There are several values in set A that are smaller than those in set B.

10 Which statement justifies why the standard deviation of set A is smaller than that of
set B?

A The distribution of data in set B is symmetric.

B The data in set B are more spread out.

C The data in set B are more closely clustered together.

D There are several values in set B that are larger than those in set A.

E There are several values in set B that are smaller than those in set A.

Part B – Short-answer (19 marks)

1 At the end of the hockey season, the Penleigh Penguins’ star player Ricardo Gomez
wrote down the number of goals he had scored in each game this season. Here is the
raw data:

4, 3, 6, 4, 5, 6, 6, 3, 4, 0, 0, 2, 3, 3

a Classify this type of data.

______________________________________________________________________

b What was his most frequent score? What is this statistic called?

______________________________________________________________________

c Order his scores from lowest to highest

______________________________________________________________________

d What were the upper quartile and the lower quartile? State the IQR.

______________________________________________________________________

______________________________________________________________________

e If Gomez played another game, how many goals would we expect him to score?

______________________________________________________________________

(½ + ½ + ½ + 1 + ½ = 3 marks)

2 Summarise the following data set by finding:

18, 22, 19, 26, 27, 24, 25, 19, 42

a Minimum value b Maximum value

__________________ ___________________

c Median d Lower quartile

__________________ ___________________

e Upper quartile f IQR

__________________ ___________________

g Outlier(s) (state why)

______________________________________________________________________

(½ + ½ + ½ + ½ + ½ + ½ + 1 = 4 marks)
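
The summary asked for in question 2 can be sketched in Python as follows. Two conventions are assumed here because the test does not state them: quartiles are the medians of the lower and upper halves (leaving out the overall median when the count is odd), and an outlier lies more than 1.5 × IQR beyond a quartile. The function name `five_number_summary` is illustrative.

```python
from statistics import median

def five_number_summary(data):
    """Return (min, Q1, median, Q3, max).

    Assumed convention: Q1 and Q3 are the medians of the lower and upper
    halves, excluding the overall median when the count is odd.
    """
    xs = sorted(data)
    half = len(xs) // 2
    lower, upper = xs[:half], xs[-half:]
    return xs[0], median(lower), median(xs), median(upper), xs[-1]

data = [18, 22, 19, 26, 27, 24, 25, 19, 42]
lo, q1, med, q3, hi = five_number_summary(data)
iqr = q3 - q1

# Assumed rule: values beyond 1.5 x IQR from a quartile count as outliers.
fences = (q1 - 1.5 * iqr, q3 + 1.5 * iqr)
outliers = [x for x in data if x < fences[0] or x > fences[1]]

print(lo, q1, med, q3, hi, iqr, fences, outliers)
```

Other quartile conventions (for example, interpolated percentiles) can give slightly different values for Q1 and Q3, so the fences may shift accordingly.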

3 Consider the following box plot and answer the questions to follow:

a State the range for both plots

______________________________________________________________________

b State the IQR for both plots

______________________________________________________________________

c Explain why the outlier for the top box plot is considered an outlier.

______________________________________________________________________

d What value above 17 would be considered an outlier for the bottom box plot?

______________________________________________________________________

(4 × 1 = 4 marks)

4 Consider the time series plot below.

[Time series plot: axes labelled ‘Amount in $1000s’ and ‘Time in years’]

Jeoff placed an initial investment into a savings account at the end of year 1. He didn’t
make any other contributions and just left it in the high-interest savings account for
several years.

a How much was the initial investment?

______________________________________________________________________

b Describe the trend displayed in the graph

______________________________________________________________________

______________________________________________________________________

c What is the practical reason for the shape of this graph?

______________________________________________________________________

d From your answer in part c, explain what type of interest this is.

______________________________________________________________________

(½ + 1 + 1 + ½ = 3 marks)

5 Consider the following graph and answer the questions below.

[Scatter plot: axes labelled ‘Hours spent working’ and ‘Age’]

a Comment on the correlation between the two variables

______________________________________________________________________

______________________________________________________________________

b How can you obtain more accurate results?

______________________________________________________________________

(1 + 1 = 2 marks)

6 Two data sets, called ‘1’ and ‘2’, have means of x̄₁ and x̄₂ respectively. Does x̄₂ > x̄₁
indicate σ₂ > σ₁? Explain why or why not.

______________________________________________________________________

______________________________________________________________________

______________________________________________________________________

(2 marks)

7 Data sets A and B each have 15 data values and are very similar except for an outlier in
set A. Which measure of spread will not be affected by this outlier? Why?
i Range ii Interquartile range iii Standard deviation

______________________________________________________________________

______________________________________________________________________

(1 mark)

Part C – Extended-response (11 marks)

1 Consider the variables x and y and the corresponding bivariate data in the table below.

x |  1     2     3     4     5
y |  8.6   11    14.8  15.5  18


a Construct a scatter plot for the data on the given axes.

b What can you say about the correlation between x and y, giving values to two
decimal places?

______________________________________________________________________

______________________________________________________________________

c Calculate the standard deviation of y and comment on the result.

______________________________________________________________________

______________________________________________________________________

d Rule a line of best fit by eye and determine an approximate linear equation.

______________________________________________________________________

______________________________________________________________________

e Use your equation from part d to find y when x = 8.

______________________________________________________________________

______________________________________________________________________

(1 + 1 + 2 + 1 + 1 = 6 marks)
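
The numerical parts of question 1 can be checked with a short Python sketch. Some assumptions are made here: "correlation" is read as Pearson's correlation coefficient, both the n and n − 1 standard deviations are shown because the test does not say which is intended, and a least squares line stands in for the line fitted by eye in part d, so a hand-drawn line will give slightly different values.

```python
from math import sqrt
from statistics import mean, pstdev, stdev

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [8.6, 11.0, 14.8, 15.5, 18.0]

mx, my = mean(x), mean(y)
sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))  # co-deviation sum
sxx = sum((a - mx) ** 2 for a in x)
syy = sum((b - my) ** 2 for b in y)

r = sxy / sqrt(sxx * syy)      # Pearson correlation coefficient
slope = sxy / sxx              # least squares slope (stand-in for a fit by eye)
intercept = my - slope * mx
y_at_8 = intercept + slope * 8  # extrapolated value at x = 8

print(round(r, 2), round(pstdev(y), 2), round(stdev(y), 2))
print(round(slope, 2), round(intercept, 2), round(y_at_8, 2))
```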

2 Consider the data in the following table and use a graphics or CAS calculator or
software to help answer the following questions.

x |  1    2    3    4    5    6    7    8
y |  2.3  3.4  5    6    7.2  6.7  8    15


a Construct a scatter plot for the data on the calculator.

b Find the equation of the least squares regression line. Explain the mathematical
method used to calculate this type of regression line.

______________________________________________________________________

______________________________________________________________________

c Find the equation of the median–median regression line. Explain the
mathematical method used to calculate this type of regression line.

______________________________________________________________________

______________________________________________________________________

d Sketch the graph of the scatter plot.

e From the least squares regression line and the median–median regression line,
predict:

i the two values of y when x = 15

________________________________________________________________

________________________________________________________________

ii the two values of x when y = 40


________________________________________________________________

________________________________________________________________

f State the data value that makes the two regression lines different. Which type of
regression line is more accurate to use for predicting by extrapolation? Give a
reason for your answer.

______________________________________________________________________

______________________________________________________________________

______________________________________________________________________

(1(b) + 1(c) + 1(e) + 2(f) = 5 marks)
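
For readers without a graphics or CAS calculator, both regression lines in question 2 can be sketched in Python. The least squares line minimises the sum of squared vertical distances from the points to the line. The median–median version below follows a common calculator convention that is assumed here rather than taken from the test: the ordered points are split into three groups (3, 2 and 3 points for this data), the slope joins the median points of the outer groups, and the intercept is the average over the three median points. The function names are illustrative.

```python
from statistics import mean, median

def least_squares(x, y):
    """Slope and intercept minimising the sum of squared vertical residuals."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return slope, my - slope * mx

def median_median(x, y):
    """Slope and intercept of a median-median line (assumed grouping rule)."""
    pts = sorted(zip(x, y))
    k, rem = divmod(len(pts), 3)
    outer = k + (1 if rem == 2 else 0)  # outer groups take the extras when n % 3 == 2
    groups = [pts[:outer], pts[outer:len(pts) - outer], pts[len(pts) - outer:]]
    meds = [(median(p[0] for p in g), median(p[1] for p in g)) for g in groups]
    (x1, y1), (x2, y2), (x3, y3) = meds
    slope = (y3 - y1) / (x3 - x1)
    intercept = (y1 + y2 + y3 - slope * (x1 + x2 + x3)) / 3
    return slope, intercept

x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.3, 3.4, 5.0, 6.0, 7.2, 6.7, 8.0, 15.0]

ls = least_squares(x, y)
mm = median_median(x, y)
for name, (m, c) in (("least squares", ls), ("median-median", mm)):
    # Predictions for part e: y when x = 15, and x when y = 40.
    print(name, round(m, 2), round(c, 2), round(m * 15 + c, 1), round((40 - c) / m, 1))
```

Because medians ignore how far an extreme value lies from the rest, the median–median line responds much less to a single unusual point than the least squares line does, which is the contrast part f asks about.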
