Mathsgeeks: WWW - Mathsgeeks.Co - Uk
Mathsgeeks: WWW - Mathsgeeks.Co - Uk
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
1. As part of a statistics project, Gill collected data relating to the length of time, to
the nearest minute, spent by shoppers in a supermarket and the amount of money they
spent. Her data for a random sample of 10 shoppers are summarised in the table below,
where t represents time and £m the amount spent over £20.
t(minutes) £m
15 -3
23 17
5 -19
16 4
30 12
6 -9
32 27
23 6
35 20
27 6
Total =212 Total=61
(a) Write down the actual amount spent by the shopper who was in the supermarket for
15 minutes. (1)
Page 1 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd
MathsGeeks
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
c) Using
√
√
2. In a factory, machines A, B and C are all producing metal rods of the same length.
Machine A produces 35% of the rods, machine B produces 25% and the rest are produced
by machine C. Of their production of rods, machines A, B and C produce 3%, 6% and 5%
defective rods respectively.
Def=0.03 0.0105
0.3395
Not=0.97
A=0.35
Def=0.06 0.015
Def=0.05
5
0.02
C=0.4
5
Not=0.95 0.38
Page 2 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd
MathsGeeks
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
b) (i) From tree =0.0105
(ii) Add all =0.0455
defective=0.0105+0.015+0.02
c)
So it is 0.02/0.0455 =0.439=43.9%
x 1 2 3 4 5 6
P(X=x)
d) Using
( )
e) Using
Page 3 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd
MathsGeeks
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
4. Summarised below are the distances, to the nearest mile, travelled to work by a
random sample of 120 commuters.
The mid-point of each class was represented by x and its corresponding frequency by
f giving
∑ and ∑
(c) Estimate the mean and the standard deviation of this distribution. (3)
(e) State whether or not the value of your coefficient is consistent with your description in
part (a). Justify your answer. (2)
(f) State, with a reason, whether you should use the mean or the median to represent the
data in this distribution. (2)
(g) State the circumstance under which it would not matter whether you used the mean
or the median to represent a set of data. (1)
Page 4 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd
MathsGeeks
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
b) The median will be the 60th falls in the 20-29 class. The first 29 commuters
60th commuter take us to an average of 19.5 then we have to go
31/43 through the next class therefore
= 26.7 (1.d.p)
c) Using
∑
d)
g) If there is no skew or if the data is normally distributed the median will equal the
mean and it won’t make any difference which is used.
5. A teacher recorded, to the nearest hour, the time spent watching television during a
particular week by each child in a random sample. The times were summarised in a
grouped frequency table and represented by a histogram. One of the classes in the
grouped frequency distribution was 20–29 and its associated frequency was 9. On the
histogram the height of the rectangle representing that class was 3.6 cm and the width
was 2 cm.
(a) Give a reason to support the use of a histogram to represent these data. (1)
(b) Write down the underlying feature associated with each of the bars in a histogram. (1)
(c) Show that on this histogram each child was represented by 0.8 cm2. (3)
a) Histograms can be used for continuous data such as time and for frequency
grouped data as in the question.
b) The area of the histogram bar is proportional to the frequency of that class.
c)
Each child is therefore represented by
Page 5 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd
MathsGeeks
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
d) Total number = children.
6. (a) Give two reasons to justify the use of statistical models. (2)
It has been suggested that there are 7 stages involved in creating a statistical model. They
are summarised below, with stages 3, 4 and 7 missing.
Stage 1. The recognition of a real-world problem.
Stage 2. A statistical model is devised.
Stage 3.
Stage 4.
Stage 5. Comparisons are made against the devised model.
Stage 6. Statistical concepts are used to test how well the model describes the real-world
problem.
Stage 7.
(b) Write down the missing stages. (3)
a) Using statistical models of real world problems tend to be simpler, cheaper and
quicker to use. They can be more easily modified to new situations and allow us
to predict further outcomes.
(a) Find the probability that a student selected at random has an IQ less than 91. (4)
The probability that a randomly selected student has an IQ of at least 100 + k is 0.2090.
a)
91 is less than 100 and for normal distribution we need greater than 100. But the
probability of less than 91 is the same as the probability of greater than 109. Which is 1
minus probability of less than 109. Therefore
( )
Page 6 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd
MathsGeeks
www.mathsgeeks.co.uk
S1-Jan-Edexcel-2007
b) At least 100+k
means greater than
or equal to 100+k
Page 7 of 7
2007-Jan-S1-Edexcel
Copyright©2012 Prior Kain Ltd