Chapter 5. NorChapter 5. Normal Probability Distributions - Pdfmal Probability Distributions
Chapter 5. NorChapter 5. Normal Probability Distributions - Pdfmal Probability Distributions
109
P (class less than 50.3 minutes) = 0.5 (50.3 50) = 0.5 0.3 = 0.15
P(class between 50.5 minutes and 50.8 minutes) = 0.5 (50.8 - 50.5) = 0.5 0.3 = 0.15 P(class between 50.5 min and 51.8 min) = 0.5 (51.8 - 50.5) = ( 0.5 1.3) = 0.65
Using the Standard Normal Distribution. In Exercises 5-8, assume that voltages in a circuit vary between 6 volts and 12 volts, and voltages are spread evenly over the range of possibilities, so that there is a uniform distribution. Find the probability of the given range of voltage levels. 5. For a discrete probability distribution, P(x) =1. Since the values on the x axis range from 6 to 12, this is a range of 6.0. To get the closed area within the rectangle to be equal to 1, the height of the rectangle has to be 1/6 = 0.167 and these are placed adjacent to each other to cover all values in the full range of 6 to 12
1 1 (12 10) = 2 = 2 / 6 = 1 / 3 = 0.333 6 6 1 1 6. P (voltage less than 11 volts) = (11 6) = 5 = 5 / 6 = 0.833 6 6 1 1 (10 7) = 3 = 3 / 6 = 1 / 2 = 0.500 7. P (voltage between 7 and 10 volts) = 6 6 1 1 8. P(voltage between 6.5 and 8.0 volts) = (8 6.5) = 1.5 = 1.5 / 6 = 1 / 4 = 0.250 6 6
P(voltage greater than 10 volts) =
110
Using the Standard Normal Distribution. In Exercises 9-28, assume that the readings on scientific thermometers are normally distributed with a mean of 0C and a standard deviation of 1.00C. A thermometer is randomly selected and tested. In each case, draw a sketch, and find the probability of each reading in degrees Celsius. 9. Less than 0.25. The probability distribution of readings is a standard normal distribution because the readings are normally distributed with a mean of 0 and standard deviation of 1. We need to find the area below z= 0.25. From Table A-2, this is 0.4013. So, P(x < 0.25) = 0.4013.
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z=-0.25
10. Probability of a thermometer reading less than 2.75C, z= 2.75 Area below z of 2.75= 0.0030, P(x < 2.75) = 0.0030
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z=-2.75
111
11. Probability of a thermometer reading less than 0.25C, z= +0.25 Area below z of +0.25= 0.5987, P(x < +0.25) = 0.5987
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z=0.25
12. Probability of a thermometer reading less than 2.75C, z= +2.75 Area below z of +2.75= 0.9970, P(x < +2.75) = 0.9970
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z=2.75
112
13. Probability of a thermometer reading greater than 2.33C, z= +2.33 Area below z of +2.33= 0.9901, P(x > +2.33) = 1 0.9901 = 0.0099
z=2.33
14. Probability of a thermometer reading greater than 1.96C, z= +1.96 Area below z of +1.96= 0.9750, P(x > +1.96) = 1 0.9750 = 0.0250
z=1.96
113
15. Probability of a thermometer reading greater than 2.33C, z= 2.33 Area below z of 2.33= 0.0099, P(x > 2.33) = 1 0.0099= 0.9901
-4
3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
z=-2.33
1. 5
2.5
3.5
16. Probability of a thermometer reading greater than 1.96C, z= 1.96 Area below z of 1.96= 0.0250, P(x > 1.96) = 1 0.0250= 0.9750
z= -1.96
114
17. Probability of a thermometer reading between 0.5C and 1.5C, between z= +0.50 and z= +1.50, Area below z of +1.50= 0.9332 and area below z of +0.50= 0.6915 P(+0.50 < x< +1.50) = 0.9332 0.6915 = 0.2417
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z= 0.50
z =1.50
18. Probability of a thermometer reading between 1.5C and 2.5C, between z= +1.50 and z= +2.50, Area below z of +2.50= 0.9938 and area below z of +1.50= 0.9332 P(+1.50 < x < +2.50) = 0.9938 0.9332 = 0.0606
z =1.50
z= 2.50
115
19. Probability of a thermometer reading between 2.00C and 1.0C, z= 2.00 and z= 1.00 Area below z of 1.00 is 0.1587 and area below z of 2.00 is 0.0228 P(2.00 < x < 1.00)= 0.1587 0.0228= 0.1359
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z =-2.0
z= -1.0
20. Probability of a thermometer reading between 2.00C and 2.34C, z= +2.00 and z= +2.34 Area below z of +2.34 is 0.9904 and area below z of +2.00 is 0.9772 P(+2.00 < x < +2.34)= 0.9904 0.9772= 0.0132
z =2.0 z= 2.34
116
21. Probability of a thermometer reading between 2.67C and 1.28C, z= 2.67 and z= +1.28 Area below z of +1.28 is 0.8997 and area below z of 2.67 is 0.0038 P(2.67 < x < +1.28)= 0.8997 0.0038= 0.8959
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z=-2.67
z= 1.28
22. Probability of a thermometer reading between 1.18C and 2.15C, z= 1.18 and z= +2.15 Area below z of +2.15 is 0.9842 and area below z of 1.18 is 0.1190 P(1.18 < x < +2.15)= 0.9842 0.1190 = 0.8652
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z=-1.18
z= 2.15
117
23. Probability of a thermometer reading between 0.52C and 3.75C, z= 0.52 and z= +3.75 Area below z of +3.75 is 0.9999 and area below z of 0.52 is 0.3015 P(0.52 < x < +3.75)= 0.9999 0.3015 = 0.6984
z=-0.52
z= 3.75
24. Probability of a thermometer reading between 3.88C and 1.07C, z= 3.88 and z= +1.07 Area below z of +1.07 is 0.8577 and area below z of 3.88 is 0.0001 P(3.88 < x < +1.07)= 0.8577 0.0001 = 0.8576
Area= 0.8577-0.0001=0.8576
-4 -3.5 -3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5 3 3.5 4
z=--3.88
z= 1.07
118
25. Probability of a thermometer reading greater than 3.57C, z= +3.57 Area below z of +3.57=0.9999, P(x > +3.57) = 1 0.9999 = 0.0001
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z= 3.57
26. Probability of a thermometer reading less than -3.61C, z= 3.61 Area below z of 3.61= 0.0002, P(x < 3.61) = 0.0001
z= -3.61
119
27. Probability of a thermometer reading greater than 0C, z= 0.00 Area below z of 0.00= 0.5000, P(x > 0.00) =1 0.5000= 0.5000
z= 0
28. Probability of a thermometer reading less than 0C, z= 0.00 Area below z of 0.00= 0.5000, P(x < 0.00) = 0.5000
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z= 0
120
Basis for Empirical Rule. In Exercises 29-32, find the indicated area under the curve of the standard normal distribution, then convert it to a percentage and fill in the blank. The results form the basis for the empirical rule introduced in Section 2-5. 29. About 68.26% of the area is between z = 1 and z = +1 (or within one standard deviation of the mean). Since the area below z= 1.00 is 0.1587 the area between the mean and z= 1.00 is 0.5000 0.1587 = 0.3413, then the total area between z= 1.00 and z= +1.00 is 2 0.3413= 0.6826, converted to a percentage is 0.6826 100% 68.26% 30. About 95.44% of the area is between z= 2 and z= +2 (or within two standard deviation of the mean). Since the area below z= 2.00 is 0.0228 the area between the mean and z= 2.00 is 0.5000 0.0228 = 0.4772, then the total area between z= 2.00 and z= +2.00 is 2 0.4772= 0.9544, converted to a percentage is 0.9544 100%= 95.44% 31. About 99.74% of the area is between z= 3 and z = +3 (or within three standard deviation of the mean). Since the area below z= -3.00 is 0.0013 the area between the mean and z= 3.00 is 0.5000 0.0013 = 0.4987, then the total area between z= 3.00 and z= +3.00 is 2 0.4987= 0.9974, converted to a percentage is 0.9974 100%= 99.74% 32. About 99.98%of the area is between z= 3.5 and z = +3.5 (or within 3.5 standard deviation of the mean). Since the area below z= 3.50 is 0.0001 the area between the mean and z= -3.50 is 0.5000 0.0001 = 0.4999, then the total area between z= 3.50 and z= +3.50 is 2 0.4999= 0.9998, converted to a percentage is 0.9998 100%= 99.98% Finding Probability. In Exercises 33-36, assume that the readings on the thermometers are normally distributed with a mean of 0C and a standard deviation of 1.00C. Find the indicated probability, where z is the reading in degrees. 33. P (1.96 < z <1.96) = (Area below z= +1.960) (Area below z= 1.960) = 0.9750 0.0250 = 0.9500 34. P (z < 1.645) = Area below z= +1.645 = 0.9500 35. P (z > 2.575) = 1 (Area below z= 2.575) = 1 0.0050 = 0.9950 36. P (1.96< z < 2.33) = (Area below z= +2.33) (Area below z= +1.96) = 0.9901 0.9750= 0.0151 Finding Temperature Values. In Exercises 37-40, assume that the readings on the thermometers are normally distributed with a mean of 0C and a standard deviation of 1.00C. A thermometer is randomly selected and tested. In each case, draw a sketch, and find the temperature reading corresponding to the given information. 37. 0.90 in the body of the table corresponds to a z score of +1.28. So, the 90th percentile is the temperature reading of + (1.28 ) = 0 + (1.28 1.00) = 1.28C.
121
z= 1.28
38. 0.20 in the body of the table corresponds to a z score of 0.84. So, the 20th percentile is the temperature reading of + (0.84 ) = 0 + (0.84 1.00) = 0.84C.
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z= -0.84
122
39. 0.05 in the body of the table corresponds to a z score of 1.645. So, the 5th percentile is the temperature reading of + (1.645 ) = 0 + (1.645 1.00) = 1.645C.
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z= -1.645
40. 0.03 in the body of the table corresponds to a z score of 1.88. This is the lower cutoff point. 1 0.03= 0.97 in the body of the table corresponds to a z score of +1.88. This is the higher cutoff point. Thus, thermometers with reading lower than 1.88 C or higher than +1.88 C would be rejected and thermometers between 1.88 would not be rejected. In practice, values of 1.88 or +1.88 would probably be rejected in this case since it indicates the lowest and highest 3% would be rejected.
-4
-3.5
-3
-2.5
-2
-1.5
-1
-0.5
0.5
1.5
2.5
3.5
z= -1.88
z= -1.88
123
41. a. b. c. d.
e.
The percentage of data that are between one standard deviation from the mean corresponds to the area between 1.00z and +1.00z scores. This area is 68.26%. The percentage of data that are between 1.96 standard deviations from the mean corresponds to the area between 1.96z and +1.96z scores. This area is 95.00%. The percentage of data that are between three standard deviations from the mean corresponds to the area between 3.00z and +3.00z scores. This area is 99.74%. The percentage of data that are between one standard deviation below the mean and two standard deviations above the mean one corresponds to the area between 1.00z and +2.00z scores. This is 0.9772 0.1587 = 0.8185. This area is 81.85% The percentage of data that are more than two standard deviations away from the mean corresponds to 1 Area between 2.00z and +2.00z scores = 1 0.9544 = 0.0456 or 4.56%.
z=
Referring to Table A-2, z = +1.00 corresponds to an area of 0.8413, so P(IQ < 115) = 0.8413
2.
z=
Referring to Table A-2, z = +2.10 corresponds to an area of 0.9821, so P(IQ > 131.5) = 1 0.9821= 0.0179
124
z=
Referring to Table A-2, z = 0.67 corresponds to an area of 0.2514 and z = +0.67 corresponds to an area of 0.7486, so P(90 < IQ < 110) = 0.7486 0.2514 = 0.4972
Area= 0.2514
125
z=
Referring to Table A-2, z = +0.67 corresponds to an area of 0.7486 and z = +1.33 corresponds to an area of 0.9082, so P(110 < IQ <120) = 0.9082 0.7486= 0.1596
Area= 0.90820.7486=0.1596
Area= 0.7486
5.
We find 0.2 in the body of the table and find the corresponding z score The z score for a cumulative area of 0.20 = 0.84
Area= 0.20
6. We find 0.80 in the body of the table and find the corresponding z score. The z score for a cumulative area of 0.80 = +0.84
126
Area= 0.80
7.
The IQ score separating the top 15% from the others is the same score that separates the bottom (100 15) % from the others100 15 = 85. We find 0.85 in the body of the table and find the corresponding z score. The z score for a cumulative area of 0.85 = 1.04
The IQ score separating the top 15% from the others = 115.6
Area = 0.85
Area = 0.15
8. The IQ score separating the top 55% from the others is the same score that separates the bottom (100 55) % from the others.100 55 = 45. We find 0.45 in the body of the table and find the corresponding z score. The z score for a cumulative area of 0.45 = 0.13
The IQ score separating the top 55% from the others = 98.05
127
Area= 0.45
Area= 0.55
x(IQ) z
z=
b.
From Table A-2, P(Temperature < 100.6) = P (z< +3.87) = 0.9999. P(Temperature> +3.87) = P(z > +3.87) = 1 0.9999 = 0.0001 This corresponds to 0.01%. Yes, this percentage suggests that the cutoff of 100.6C is appropriate. Since we want 5% of the people to exceed the required temperature, we use (100 5)%to find the area to the left of the cutoff line first. This corresponds to an area 0.95. From Table A-2, this corresponds to a z score of +1.645.
Thus, 5% of the people will exceed 99.2C 10. Lengths of Pregnancies, = 268, = 15 a. x = 308. We are to find P(Pregnancy> 308days). We find P(Pregnancy < 308) and subtract it from 1.
z=
From the Table, P(z < 2.67) = P(pregnancy < 308) = 0.9962 P(pregnancy > 308) =1 0.9962 = 0.0038 This result shows that is highly unlikely for a pregnancy to last 308 days or more. Therefore it is more likely that her husband is not responsible for her pregnancy, but there is no proof one way or the other. b. If premature babies are in the lower 4%, we find the cutoff time for the area 0.04.
So, the length that separates premature babies from normal ones is 242 days. 11. Designing Helmets, = 6, = 1 To find the cutoff points for the smallest 2.5% and the largest 2.5%, we find the z scores for the areas 0.025 and (1 0.025) or 0.975. From the table, these are 1.96 and +1.96 respectively.
The minimum and maximum head breadths are 4 inches and 8 inches respectively.
The area for this z score is 0.7389. So the probability that a CD player will have a replacement time less than 8 years is 0.7389 b. We need to find the cutoff point for the upper 2%. So, we find the z score for an area of (10.02) or 0.98. This corresponds to z= + 2.05.
Therefore, the time length of the warranty should be 10 years. Heights of Women. In Exercises 13-16, assume that heights of women are normally distributed with a mean given by = 63.6 in. and a standard deviation given by = 2.5 in. (based on data from the National Health Survey). In each case, draw a graph. 13. Beanstalk Club Height Requirement
= 63.6, = 2.5, z =
This corresponds to a probability of 0.9948. So, 99.48% of the women have height < 70 in. Therefore (100 99.48) or 0.52% of the women meet the requirement of being at least 70in. in height.
Area = 0.9948
Area = 0.0052
) Height(in)
63.6
70 2.56
14. Height Requirement for Women Soldiers We need to find the z scores and areas for 58 in. and 80 in.
129
58 -2.24
63.6 0
80 z 6.56
15. Height Requirement for Rockettes We need to find the z scores and areas for 66.5 in. and 71.5 in.
z=
z=
The areas for these z scores are 0.8770 and 0.9992 respectively. The probability of being between these heights is 0.9992 0.8770 = 0.1222. The probability of meeting this new height is 0.1222. Only 12.22% of women meet this requirement. Yes, it seems that the height of the Rockettes is well above the mean.
Area = 0.1222
Area = 0.8770
) Height(in)
63.6 0
66.5 1.16
71.5 3.16
130
16. Height Requirement for Rockettes To find the cutoffs for the shortest 20% and the tallest 20%, we need to find to find the z scores corresponding to the areas 0.20 and (1 0.20) or 0.80. From the Table, these z values are 0.84 and +0.84. We then use the formula:
x = + ( z ) = 63.6 + (0.84 2.5) = 63.6 2.1 = 61.5 x = + ( z ) = 63.6 + (0.84 2.5) = 63.6 + 2.1 = 65.7
So, the new minimum and maximum allowable heights are 61.5 in. and 65.7 in. respectively.
61.5 -0.84
63.6 0
65.7 0.84
) Height(in)
17. Birth Weights, = 3420, = 495 To find the cutoff weights for the lightest 2% we need to find to find the z score corresponding to the area 0.02. From the Table, the z score is -2.05. We then use the formula: x = + (z ) = 3420 + (2.05 495) = 3420 1014.75 = 2405.25 . Therefore, the weight of 2405g separates the lightest 2% of American babies from the others.
Area = 0.02
2405 -2.05
3420 0
Weight(g) z
131
18. Birth Weights, = 3570, = 500 To find the cutoff weights for the lightest 2% we need to find to find the z scores corresponding to the areas 0.02. From the Table, this z is 2.05. We then use the formula: x = + ( z ) = 3570 + (2.05 500) = 3570 1025 = 2545 . Therefore, the weight of 2545g separates the lightest 2% of Norwegian babies from the others. This result is not very different from the result in Exercise 17. Its a difference of 140g.
Area= 0.02
2545 -2.05
3570 0
Weight(g) z
19. Units of Measurement, = 143, = 29 a. z scores are measured in units of number of standard deviations from the mean, but they do not possess the units of the original variable b. The mean will be 0, the standard deviation will be 1, and the distribution will be normal since the original distribution is normal. z scores have the same shape of distribution as does the original variable distribution; converting to z scores does not result in a normal distribution of z scores if the original distribution was not normally distributed c. After converting to kg., the distribution will be normal since the original distribution is normal, 1 lb= 0.4536 kg
z=
b.
So, P(IQ < 105) = 0.6293. Therefore, P(IQ >105) = 1 0.6293 = 0.3707 We will replace 105 with an interval of 104.5 and 105.5. Because we want the probability of a score greater than 105, we want the area bounded by the interval including the area to the right. We convert 104.5 to a z score
z=
c.
So P(IQ<104.5) =0.6179. Therefore P(IQ >104.5) = 1 0.6179=0.3821. P(IQ >105, adjusted for continuity) = 0.3821 The results from (a) and (b) are nearly the same. There is very little difference.
132
9 10 + 6 + 5 21 = = = 3 3
x = =
7.0 7.0
Sampling Distribution Sample Probability Mean, x 10.0 1/9 8.0 2/9 7.5 2/9 6.0 1/9 5.5 2/9 5.0 1/9
133
b. c. d.
The probability of each sample is 1/9. The distribution of sample means is bi-modal and somewhat flat. Mean of sample statistics=
x = 63 = 7.0
9 9
Yes, the mean of the sampling distribution is equal to the mean of the population of the three values. Yes, these means are always equal, but only if every possible sample is included.
6. Telemarketing Selecting samples with replacement, there will be 42= 16 equally likely samples. Sample Number 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Sum of Sample Means Mean of statistic values Population parameter a. Sample 1, 1 1, 11 1, 9 1, 3 11, 11 11, 1 11, 9 11, 3 9, 9 9, 1 9, 11 9, 3 3, 3 3, 1 3, 11 3, 9 Sample Mean,
x
1 6 5 2 11 6 10 7 9 5 10 6 3 2 7 6 96.0
c. Probability 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16
x =
=
x =
16
6.0
1 + 11 + 9 + 3 24 = = 4 4
6.0
Sampling Distribution Sample Mean, x 11 10 9 7 6 5 3 2 1 b. Probability 1/16 2/16 1/16 2/16 4/16 2/16 1/16 2/16 1/16
The sampling distribution is of the 16 sample means, each of which has a probability of occurring. It has one mode and it is symmetrical.
134
c. d.
x = 96 = 6.0
16 16
Yes, the mean of the sampling distribution is equal to the mean of the population of the four values. Yes, these means are always equal, but only if every possible sample is included.
7. Heights of L.A. Lakers Selecting samples with replacement, there will be 52= 25 equally likely samples. Sample Number 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 Sum of Sample Means Mean of statistic values Population parameter a. Sample 85, 85 85, 79 85, 82 85, 73 85, 78 79, 79 79, 85 79, 82 79, 73 79, 78 82, 82 82, 85 82, 79 82, 73 82, 78 73, 73 73, 85 73, 79 73, 82 73, 78 78, 78 78, 85 78, 79 78, 82 78, 73 Sample Mean, x 85.0 82.0 83.5 79.0 81.5 79.0 82.0 80.5 76.0 78.5 82.0 83.5 80.5 77.5 80.0 73.0 79.0 76.0 77.5 75.5 78.0 81.5 78.5 80.0 75.5 1985 79.4 79.4 Probability 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25
25 85 + 79 + 82 + 73 + 78 397 = = = 5 5
x = x = =
135
Sampling Distribution Sample Mean, x 85.0 83.5 82.0 81.5 80.5 80.0 79.0 78.5 78.0 77.5 76.0 75.5 73.0 b. c. d. Probability 1/25 2/25 3/25 2/25 2/25 2/25 3/25 2/25 1/25 2/25 2/25 2/25 1/25
The probability of each sample occurring is 1/25. The sampling distribution of means consists of the 25 sample means with their corresponding probabilities. It has more than one mode and it is not symmetrical. The means of the sampling distribution is
x 1985 = = 79.4 n 25
Yes, the mean of the sampling distribution is equal to the mean of the population of the five heights listed above. Yes, these means are always equal as long as every possible sample is included.
136
8.
Genetics, p(F)= 3/4= 0.75, q= 0.25 Selecting samples with replacement, there will be 42= 16 equally likely samples. M=Mike(male)=0, A=Anna(female)=1, B=Barbara(female)=1, C=Chris(female)=1 Sample Number a. Sample 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Sum of Sample Means Mean of statistic values Population parameter Sampling Distribution Sample Mean, x 0.0 0.5 1.0 Probability 1/16 6/16 9/16 M,M= 0, 0 M,A= 0, 1 M,B= 0, 1 M,C= 0, 1 A,A= 1, 1 A,M= 1, 0 A,B= 1, 1 A,C= 1, 1 B,B= 1, 1 B,M= 1, 0 B,A= 1, 1 B,C= 1, 1 C,C= 1, 1 C,M= 1, 0 C,A= 1, 1 C,B= 1, 1 Proportion of Females (Sample Mean) 0.0 0.5 0.5 0.5 1.0 0.5 1.0 1.0 1.0 0.5 1.0 1.0 1.0 0.5 1.0 1.0 12.0 0.75 0.75
Probability 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16 1/16
16 16 0 +1+1+1 3 = = = 4 4
x = x = 12 = =
b. The probability of each proportion is 1/16. The sampling distribution of proportions consists of the 16 sample proportions with their corresponding probabilities of 1/16. The distribution has one mode and is clearly not symmetrical. c. d. The mean of the sampling distribution is
x 12 = = 0.75 n 16
The mean of the sampling distribution is equal to the population proportion of females. Yes, the mean of the sampling distribution of proportions always equals the population proportion as long as every possible sample is included.
137
9.
Quality Control Selecting samples with replacement, there will be 52= 25 equally likely samples. D1= 1, D2= 1, A1= 0, A2=0, A3=0 Sample Number 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 Sum of Sample Means Mean of statistic values Population parameter a. Sample D1, D1= 1, 1 D1, D2= 1, 1 D1, A1= 1, 0 D1, A2= 1, 0 D1, A3= 1, 0 D2, D2= 1, 1 D2, D1= 1, 1 D2, A1= 1, 0 D2, A2= 1, 0 D2, A3= 1, 0 A1, A1= 0, 0 A1, A2= 0, 0 A1, A3= 0, 0 A1, D1= 0, 1 A1, D2= 0, 1 A2, A2= 0, 0 A2, A3= 0, 0 A2, D1= 0, 1 A2, D2= 0, 1 A2, A1= 0, 0 A3, A3= 0, 0 A3, D1= 0, 1 A3, D2= 0, 1 A3, A1= 0, 0 A3, A2= 0, 0 Sample Mean Probability 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25 1/25
x
1.0 1.0 0.5 0.5 0.5 1.0 1.0 0.5 0.5 0.5 0.0 0.0 0.0 0.5 0.5 0.0 0.0 0.5 0.5 0.0 0.0 0.5 0.5 0.0 0.0 10.0 0.40
25 25 1+1+ 0 + 0 + 0 2 = = = 5 5
x = x = 10 = =
0.40
Sampling Distribution Sample Probability Mean, x 0.0 9/25 0.5 12/25 1.0 4/25 b. The sampling distribution consists of the 25 proportions and their corresponding probabilities of 1/25 each. The sampling distribution has one mode, but it is not symmetrical.
138
c. d.
x 10 = = 0.40 n 25
Yes, the mean of the sampling distribution is equal to the population proportion of defects. Yes, the mean of the sampling distribution of proportions always equals the population proportion as long as every possible sample is included.
10. Women Senators a. From a random sample, these results were obtained: D, R, D, D, D. b. The proportion of democrats is 4/5= 0.80. c. The proportion from part b is a statistic because it is the proportion in a particular sample. d. No, the sample proportion (4/5 = 0.8) does not equal the population proportion (10/13 = 0.77) No random sample of size 5 can equal the population proportion because the proportions in the samples must be multiples of 0.2. The possibilities are: 0.0, 0.2, 0.4, 0.6, 0.8, 1.0. The population proportion (0.77) is not equal to any of these. e. If all possible samples of size 5 are listed, then the mean of all the sample proportions will be equal to population proportion. 11. Mean Absolute Deviation From Table 5-2, x= 1, 2, 5, = 2.67 Population Mean Absolute Deviation, see this formula in Section 2-5.
xx
n
Sample Number 1 2 3 4 5 6 7 8 9
a. Sample 1, 1 1, 2 1, 5 2, 1 2, 2 2, 5 5, 1 5, 2 5, 5
Sample Mean x 1.0 1.5 3.0 1.5 2.0 3.5 3.0 3.5 5.0
d =
( x1 x 2 ) 2
0.0 0.5 2.0 0.5 0.0 1.5 2.0 1.5 0.0
MAD = d =
d
n
8 = 0.89 9
Since MAD = 0.89 1.56 (the population absolute mean deviation) the mean absolute deviation is not a good estimate of the population mean absolute deviation.
139
12. Median as an Estimator Sample Number 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 Sample 1,1,1 1,1,2 1,1,5 1,2,1 1,5,1 1,2,5 1,5,2 1,2,2 1,5,5 2,2,2 2,2,1 2,2,5 2,1,2 2,5,2 2,1,5 2,5,1 2,1,1 2,5,5 5,5,5 5,5,1 5,5,2 5,1,5 5,2,5 5,1,2 5,2,1 5,1,1 5,2,2 Mean ( x ) 1.00 1.33 2.33 1.33 2.33 2.67 2.67 1.67 3.67 2.00 1.67 3.00 1.67 3.00 2.67 2.67 1.33 4.00 5.00 3.67 4.00 3.67 4.00 2.67 2.67 2.33 3.00 Median 1 1 1 1 1 2 2 2 5 2 2 2 2 2 2 2 1 5 5 5 5 5 5 2 2 1 2 Probability 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27 1/27
xx =
x = 72 = 2.7
n 27
x Mdn =
Mdn = 68 = 2.5
n 27
In this case, the mean of the sample means and the mean of the sample medians both are not equal to the population mean. Only the mean of the sample means is equal to the population mean. The mean of the medians is negatively biased. We conclude that the mean of the sample mean a better estimate of the population mean than the mean of the medians.
140
z=
b.
x n
167 172 5 = 0.17. From Table A-2, P(z < 0.17) = 0.4325. 29 29
There is a 0.4325 probability that an individual man will weigh less than 167 lb. P( x < 167)
z=
167 172 5 5 = = = 1.03 . From Table A-2, P(z < 1.03) = 0.1515. 29 29 4.833 6 36
167 lb.
There is a 0.1515 probability that a group of 36 men will have a mean weight less than 2. a. P(x > 180)
z=
180 172 8 = = +0.28. From Table A-2, P(z < +0.28) = 0.6103. 29 29
Therefore, P(z > +0.28) = 1 0.6103 = 0.3897. There is a 0.3897 probability that an individual man will weigh more than 180 lb. b. P( x > 180)
z=
x n
180 172 8 8 = = = +2.76 . From Table A-2, P(z < +2.76) = 0.9971. 29 29 2 .9 10 100
Therefore, P(z > 0.28) = 1 0.9971 = 0.0029. There is a 0.0029 probability that a group of 100 men will have a mean weight more than 180 lb. 3. a. P(170 < x < 175)
z=
b.
From Table A-2, P(z < 0.07) = 0.4721 and P (z < +0.10) = 0.5398. The difference is 0.5398 0.4721 = 0.0677. There is a 0.0677 probability that an individual man will weigh between 170 lb and 175 lb P(170 < x < 175)
z=
From Table A-2, P(z < 2.48) = 0.0066 and P(z < 0.24) = 0.4052.
141
b.
The difference is 0.4052 0.0066 = 0.3986. There is a 0.3986 probability that an individual man will weigh between 100 lb and 165 lb P(100 < x < 165)
z=
x n
b.
From Table A-2, P(z < 2.07) = 0.0192 Therefore P(z > 2.07) = 1 0.0192 = 0.9808. There is a 0.9808 probability that a group of 25 men will weigh more than 160 lb. The central limit theorem can be used in part (a) because the original distribution is a normal distribution and we assume the sampling distribution would be normal even though the sample size is less than 30. P(160 < x < 180)
6. a.
b.
From Table A-2, P (z < 0.83) = 0.2033 and P(z < +0.55) = 0.7088. The difference is 0.7088 0.2033 = 0.5055. There is a 0.5055 probability that a group of 4 men will have a mean weight between 160 lb and 180 lb. The central limit theorem can be used in part (a) because the original distribution is a normal distribution we assume the sampling distribution would be normal even though the sample size is less than 30.
z=
From Table A-2, P(z < 0.10) = 0.4602 and P(z < +2.34) = 0.9904. The difference is 0.9904 0.4602 = 0.5302. There is a 0.5302 probability that an individual woman will weigh between 140 lb and 211 lb. b. P(140 < x < 211)
142
c.
z=
There is a 0.5793 probability that an individual man will have a head breadth less than b.
6 .2 6 0 .2 0 .2 z= = = = = + 2 .0 . 1 1 0 .1 10 100 n
From Table A-2, P(z < +2.0) = 0.9772. There is a 0.9772 probability that a group of 100 men will have a mean head breadth less than 6.2 in. The results from (b) above are for a group of men. Since the helmets are to be used by one man alone at a time, the results of (a) are more appropriate for the production manager to use.
c.
z=
x n
b.
From Table A-2, P (z < +2.26) = 0.9881. Therefore, P (z > 2.26) = 1 0.9881 = 0.0119. The probability that the mean of the 2 men is greater than 16 in. is 0.0119. No, most riders will be able to fit since the probability of both riders having a mean hip breadth of greater than 16in. is very low.(0.0119). Yes, this design appears to be acceptable.
z=
x n
From Table A-2, P(z < +2.42) = 0.9922. Therefore, P(z > 2.42) = 1 0.9922 = 0.0078. The probability of getting 100 numbers with a mean greater than 0.57 is 0.0078. It would be unusual to generate 100 such numbers and get a mean of greater than 0.57. This is because the probability of this occurring is very low (0.0078). 11. Blood Pressure, = 114.8, = 13.1 a.
z=
From Table A-2, P(z < +1.92) = 0.9726.Therefore, P(z > +1.92) = 1 0.9726 = 0.0274. There is a 0.0274 probability that an individual woman will have a systolic blood pressure greater than 140.
143
b.
z=
x n
c. d.
From Table A-2, P(z < +3.85)= 0.9999. Therefore, P(z > 3.85) = 1 0.9999= 0.0001. There is a 0.0001 probability that a group of 4 women will have a mean systolic blood pressure greater than 140. The central limit theorem can be used in part (b) because the original distribution is a normal distribution, even though the sample size is less than 30. No. Although the mean result for the 4 women is less than 140, the individual values could be above or below 140 due to sampling variability.
z=
x n
0.882 0.941 = 1.19. From Table A-2, P(z < 1.19) = 0.1170. 0.313 40
b.
There is a 0.1170 probability of randomly selecting 40 cigarettes with a mean of 0.882 g or less. Based on the results, the amount of nicotine seems to be lower. This is because it is very unlikely to select a group of 40 cigarettes with a mean nicotine level of less than 0.882 if the mean and standard deviation have not changed. Therefore, it is likely that these values have changed as the company claims.
13. Elevator Design, = 172, = 29, n= 16, P = 0.975 We first find the z score for the area P= 0.975 from the body of table A-2.This corresponds to z = +1.96. We then use the formula:
29 29 x = + z = 172 + 1.96 * = 172 + 1.96 4.0 = 172 + (1.96 7.25) = . 16 n 172 + 14.21 = 186.21 To get the total value for 16 men, 186.21 16 = 2979.4. This is the maximum total allowable weight if we
want a 0.975 probability of this weight not being exceeded with 16 men. 14. Seating Design, = 14.4, = 1, n = 18, P= 0.975 a. We first find the z score for the area P= 0.975 from the body of table A-2. This +1.96. We then use the formula:
corresponds to z =
1 1 x = + z = 14.4 + 1.96 = 14.4 + 1.96 4.243 = 14.4 + 1.96 0.236 = . n 18 14.4 + 0.46 = 14.86 To get the total value for 18men, = 14.86 18 = 267.48 in. This is the minimum length of the bench if we
b. want a 0.975 probability that it will fit the combined hips of 18 men. Using the result in (a) would be wrong because we actually want to build a bench for 18 male college football player are most probably bigger in size than normal men.
15. Correcting for a Finite Population, = 143, = 29, N=120, n = 8 a. If we do not want to exceed this limit, we need to find the probability of the 8 of them having a total weight less than 1300 lb. A total capacity of 1300 lb for the 8 women means 1300/8 = 162.5 lb per woman on average.
N n 29 120 8 29 112 N 1 120 1 2.828 119 n 8 19.5 19.5 19.5 = = = 1.96 10.25 0.941 10.25 0.970 9.94
From Table A-2, P(z < +1.96) = 0.975.
z=
162.5 143
19.5
144
b.
The probability of their total weight not exceeding 1300lb = 0.9750. We first find the z score for the area P= 0.9900 from the body of Table A-2. This corresponds to z = +2.33. We then use the formula:
N n 29 120 8 29 = 143 + 2.33 = 143 + 2.34 0.941 = N 1 120 1 2.828 n 8 143 + 2.34 10.25 * 0.970 = 143 + 23.18 = 166.18 To get the total value for 8 women, = 166.18 8 = 1329 lb. This is the maximum allowable weight of x = + z
passengers in the elevator if we want a 0.99 probability that the elevator will not be overloaded. 16. Population Parameters, 2, 3, 6, 8, 11, 18 a.
x 2 + 3 + 6 + 8 + 11 + 18 48 = = = 8 .0 6 6 N
x x (x )2
2
2 -6 36
3 -5 25
6 -2 4
8 0 0
11 3 9
18 10 100
x= 48 (x )= 0 (x )2= 174
=
b.
( x ) 174 = = 5.385 N 6
Sample Number 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Samples (without replacement) 2, 3 2, 6 2, 8 2, 11 2, 18 3, 6 3, 8 3, 11 3, 18 6, 8 6,11 6, 18 8, 11 8, 18 11, 18 = Sample mean, x 2.5 4.0 5.0 6.5 10.0 4.5 5.5 7.0 10.5 7.0 8.5 12.0 9.5 13.0 14.5 120.00
x x
-5.5 -4.0 -3.0 -1.5 2.0 -3.5 -2.5 -1.0 2.5 -1.0 0.5 4.0 1.5 5.0 6.5 0.0
(x
x )2
30.25 16.00 9.00 2.25 4.00 12.25 6.25 1.00 6.25 1.00 0.25 16.00 2.25 25.00 42.25 174.00
c. d.
x =
x = 120 = 8.0
nx 15
Mean and standard deviation, See part (c). for the mean of 8.0 Standard deviation of sample means,
x =
e.
By comparing the result in part (a) with the result in part (c), we see that x
= =8
145
= np = 14 0.5 = 7 x
= npq = 14 0.5 0.5 = 3.5 = 1.871 8.5 7 1.5 = = 0.80 1.871 1.871 x 9.5 7 2.5 z= = = = 1.34 1.871 1.871 z= =
146
z = 0.80 corresponds to a probability of 0.7881 z = 1.34 corresponds to a probability of 0.9099 P(9) from Normal Approximation= 0.9099 0.7881 = 0.1218 (very good approximation to 0.122) 10. a. b. n = 12, p = 0.8, From Table A-1, P (7) = 0.053
np = 12 0 .8 = 9 .6 , nq = 12 0 .2 = 2 .4
nq = 2.4 which is <5. Therefore, the normal approximation is not justified 11. a. b. n = 15, p = 0.9, From Table A-1, P (14) = 0.343, P(15)= 0.206 P (at least 14) = P (14 or more)= P(14) +P(15) = 0.343 + 0.206= 0.549 Normal approximation
np = 15 0 .9 = 13 .5 , nq = 15 0 .1 = 1.5
nq = 1.5 which is < 5. Therefore the normal approximation is not justified 12. a. b. n = 13, p = 0.4, From Table A-1, P(2)=0.0452, P(1)= 0.011, P(0)= 0.001 P (fewer than 3) = P(2) + P(1) + P(0)= 0.045 + 0.011 + 0.001= 0.057 Normal approximation np= 13 0.4= 5.2, nq= 13 0.6= 7.8 (both 5, normal approximation is justified)
= np = 13 0.4 = 5.2
= npq = 13 0.4 0.6 = 3.12 = 1.766 z= x 2.5 5.2 2.7 = = = 1.53 1.766 1.766
P(x < 3), finding Pc(x < 2.5) P(z < 1.53) = 0.0630 (relatively close approximation to 0.057) 13. Probability of More Than 55 Girls, np= 50, nq= 50 (both 5, normal distribution justified)
n = 100 , p = 0 .5 npq =
= np = 100 0 .5 = 50 = z = 100 0 . 5 0 . 5 = 25 = 5 x 55 .5 50 5 .5 = = = + 1 .1 5 5
x= 55, finding Pc(x > 55.5) P (girls > 55) = P (z > +1.1) = 1 0.8643 = 0.1357 No, since P(girls > 55) is greater than 0.05, it is not unusual to get more than 55 girls out of 100 births. 14. Probability of at Least 65 Girls, np= 50, nq= 50 (both 5, normal distribution justified)
n = 100 , p = 0 .5 = np = 100 0 .5 = 50
= z=
npq = 100 0 . 5 0 . 5 =
25 = 5
x 64 .5 50 14 . 5 = = = + 2 .9 5 5
P(x < 65), finding Pc(x < 64.5) P (girls 65) = P (z > +2.9) = 1 0.9981 = 0.0019 Yes, since P(girls 65) is less than 0.05, it is unusual that there would be 65 or more girls out of 100 births.
Chapter 5: Normal Probability Distributions 15. Probability of at Least Passing, np= 50, nq= 50 (both 5, normal distribution justified)
147
= z=
25 = 5
P(x 60), finding Pc(x > 59.5) P (score 60) = P (z > +1.9) = 1 0.9713 = 0.0287 No, since P(score 60) is less than 0.05, it is unusual to get a score of at least 60 by guessing 16. Multiple-Choice Test, np= 5, nq= 20 (both 5, normal distribution justified)
x 59 .5 50 9 .5 = = = + 1.9 5 5
n = 25, p = 0.2 (one out of 5 options is correct) = np = 25 0.2 = 5 = npq = 25 0.2 0.8 = 4 = 2 z= x 2.5 5 2.5 = = = 1.25 2 2 z=
P(3 < x < 10), finding Pc(2.5 < x < 10.5) P (z< 1.25) = 0.1056, P (z < +2.75) = 0.9970 P (-1.25 < z < +2.75) = 0.9970 0.1056 = 0.8914 17. Mendels Hybridization Experiment, np= 145, nq= 435 (both 5, normal distribution
n = 580 , p = 0.25
justified)
= np = 580 0.25 = 145 = npq = 580 0.25 0.75 = 108.75 = 10.43 z= x 151.5 145 6 .5 = = = +0.62 10.43 10.43
P(x 152), finding Pc(x > 151.5) P (z at least 0.62) = P(z > +0.62) = 1 0.7324 = 0.2676 No, there is no evidence that the Mendelian rate of 25% is wrong because it is not unusual to get 152 yellow pods out of 580 seedlings, p= 0.2676 18. Cholesterol-Reducing Drug, np= 16.4, nq= 846.6 (both 5, normal distribution justified)
= z=
npq = 863 0 .019 0 .981 = 16 .09 = 4 .01 x- 18 .5 16 .397 2 .103 = = = + 0 .52 4 .01 4 .01
P(x 19), finding Pc(x > 18.5) P(z at least 0.52) = P (z > +0.52) = 1 0.6985 = 0.3015 It is not unusual to have 19 people with flu symptoms (P= 0.3015). Therefore, the flu symptoms are probably not due to taking the drug.
148
19. Probability of at Least 50 Color-Blind Men, np= 54, nq= 546 (both 5, normal distribution justified)
n = 600, p = 0.09,
= np = 600 0.09 = 54 = npq = 600 0.09 0.91 = 49.14 = 7.01 x - 49.5 54 4.5 z= = = = 0.64 7.01 7.01
P(x 50), finding Pc(x > 49.5) P(z at least 0.64) = P(z > 0.64) = 1 0.2611 = 0.7389 It is quite likely to have 50 color blind men among this group of 600 men (P= 0.7389). However, the researchers cannot be very confident since there is still quite some chance of not getting up to 50 men. 20. Cell Phones and Brain Cancer, np= 143, nq= 419,952 (both 5, normal distribution justified)
= z=
P(x 135), finding Pc(x < 135.5) P (z < 0.61) = 0.2709 It is not unusual to have 135 or fewer cases of brain cancer in the population (P= 0.2709). Therefore, the media reports that cell phones cause brain cancer are not supported by the evidence. 21. Identifying Gender Discrimination, np= 31, nq= 31 (both 5, normal distribution justified)
n = 62 , p = 0 .5 npq =
P(x 21), finding Pc(x < 21.5) P (z < 2.41) = 0.0080 It is unusual to have 21 female employees out of 62 new employees being hired assuming no gender discrimination. (P= 0.0080) These results support the charge of gender discrimination taking place. 22. Blood Group, np= 180, nq= 220 (both 5, normal distribution justified)
n = 400 , p = 0.45
= np = 400 0.45 = 180 = npq = 400 0.45 0.55 = 99 = 9.95 z= x 176.5 180 3.5 = = = 0.35 9.95 9.95
P(x 177), finding P(x > 176.5) P(z < 0.35)= 0.3632, P(z > 0.35) = 1 0.3632= 0.6368 It is not unusual to have at least 177 Group O donors in this group of 400 people. The pool may be sufficient, however this pool may not be sufficient because the probability is not high (P = 0.6368).
Chapter 5: Normal Probability Distributions 23. Acceptance Sampling, np= 5, nq= 45 (both 5, normal distribution justified)
149
n = 50 , p = 0 .1 npq =
P(x 2), finding P(x > 1.5) P(z < 1.65)= 0.0495, P(z > 1.65) = 1 0.0495= 0.9505 Yes, this plan would detect defects at the 10% level about 95% of the time. 24. Car Crashes, np= 170, nq= 330 (both 5, normal distribution justified)
n = 500, p = 0.34
= np = 500 0.34 = 170 = npq = 500 0.34 0.66 = 112.2 = 10.59 x 199.5 170 29.5 z= = = = +2.79 10.59 10.59
P(x 200), finding P(x > 199.5) P(z < +2.79)= 0.9974, P(z > +2.79) = 1 0.9974 = 0.0026 The probability of having 40 %( 200) of 500 men having accidents is very low (p< 0.05) when the true probability is 0.34. Therefore, the claim that the accident rate in New York City is higher than 34% is supported by the evidence in this result. 25. Cloning Survey, np= 506, nq= 506 (both 5, normal distribution justified)
n = 1012 , p = 0.5
= np = 1012 0.5 = 506 = npq = 1012 0.5 0.5 = 253 = 15.91 z= x 900.5 506 394.5 = = = +24.80 15.91 15.91
P(x 900), finding P(x > 900.5) P (z < +24.80) 0.9999, P(z > +24.80) = 1 0.9999 = 0.0001 The probability of having 89% (900) of 1012 people in a sample assuming a general probability of 0.5 is very low. Yes, this evidence supports the claim that a majority of people are opposed to cloning
150
4. The data are normally distributed since the data plot dots are very close to a straight line that follows the normal quantile plot that is expected if the data are normally distributed. Determining Normality. In Exercises 5-8, refer to the indicated data set and determine whether the requirement of a normal distribution is satisfied. Assume that this requirement is loose in the sense that the population distribution need not be exactly normal, but it must be a distribution that is basically symmetric with only one mode. 5. BMI, Data Set 1 in Appendix B
12
10
Frequency
BMIMales
The histogram above shows a distribution with one mode, relatively symmetrical, and bell-shaped. It can be said to approximate a normal distribution.
151
6.
20
15
Frequency
10
HeadCircMales
The histogram above shows a distribution with one mode, relatively symmetrical, and bell-shaped, except for two values in the lower part of the range. While this distribution is not perfectly symmetrical it could be considered to be approximately normal.
152
7.
Water Conductivity
12.5
10.0
Frequency
7.5
5.0
2.5
WaterConductivity
The histogram above shows a distribution with one mode. However, the distribution is not symmetrical and bell-shaped so it would not be considered to be approximately normal.
153
8.
10
Frequency
PoplarTreeHgt
The histogram above shows a distribution with one mode. However, the distribution is not symmetrical or bellshaped so it would not be considered to be approximately normal.
154
Generating Normal Quantile Plots. In Exercises 9-12, use the data from the indicated exercise in this section. Use a TI-83/84Plus calculator or software (such as SPSS, SAS, STATDISK, Minitab. or Excel) capable of generating normal quantile plots (or normal probability plots). Generate the graph, then determine whether the data appear to come from a normally distributed population. NOTE: The following Normal Quantile Plots, except those in Exercises 15 and 16 were generated by SPSS. When using the SPSS option for standardized or z scores, both axes are put into z score units, not just the Yaxis. 9. From Exercise 5
Normal Q-Q Plot of BMIMales
-2
-4 -4 -2 0 2 4
The BMI data from Exercise 5 seems to come from a normal distribution. Most of the points are very close to the straight line.
155
-2
-4 -4 -2 0 2
The head circumference data from Exercise 6 seems to come from a normal distribution. Most of the points, except for two of them, are very close to the line.
156
-2
-4 -4 -2 0 2 4
The data on the conductivity variable are not normally distributed. The points depart quite a bit from the straight line.
157
-2
-4 -4 -2 0 2 4
This tree height data distribution is not normal. The points are not close to the line. Also, there are some obvious outliers seen in the plot. 13. Comparing Data Sets
Normal Q-Q Plot of HgtWomen
Normal Q-Q Plot of CholestWomen
4
4
-2
-2
-4
-4 -3 -2 -1 0 1 2 3 4
The distribution for height appears to be normal, but the distribution for cholesterol does not appear to be normal. This could be because cholesterol levels depend on diet and many other human behaviors in different ways that do not yield normally distributed results while height is a more natural variable less influenced by human behaviors.
158
-2
-2
-4
-4 -4 -2 0 2 4
Systolic blood pressure does not appear to have a distribution that approximates a normal distribution, but the distribution of elbow breadth could approximate a normal distribution. This could be because systolic blood pressure levels depend on diet and other human behaviors that do not yield normally distributed results while elbow breadth is a more natural variable less influenced by human behaviors. Constructing Normal Quantile Plots. In Exercises 15 and 16, use the given data values and identify the corresponding z scores that are used for a normal quantile plot, then construct the normal quantile plot and determine whether the data appear to be from a population with a normal distribution. 15. Heights of L.A. Lakers Sorting the data by order gives us 73, 78, 79, 82, 85 n = 5, 1/2n, 3/2n, 5/2n, 7/2n, 9/2n = 0.1, 0.3, 0.5, 0.7, 0.9 Corresponding z scores, using Table A-2 for these areas are: 1.28, 0.52, 0.00, +0.52, and +1.28 We now pair the sorted heights with their corresponding z scores: (73, 1.28) (78, 0.52) (79, 0) (82, +0.52) (85, +1.28) We plot these (x,y) coordinates to get the normal quantile plot.
159
1.5
0.5
-0.5
-1
-1.5
-2
70
74
76
82
84
86
This distribution looks like it approximates a normal distribution. 16. Monitoring Lead in Air Sorting the data by order gives us 0.42, 0.48, 0.73, 1.10, 1.10, 5.40 n= 6, 1/2n, 3/2n, 5/2n, 7/2n, 9/2n, 11/2n = 0.083, 0.167, 0.417, 0.583, 0.750, 0.917 Corresponding z scores by using Table A-2 for these areas are: 1.38, 0.67, 0.21, +0.21, +0.67 and +1.39 We now pair the sorted heights with their corresponding z scores: (0.42, -1.38) (0.48,-0.67) (0.73, 0.21) (1.10, 0.21) (1.10, 0.67) (5.40, 1.39) We plot these (x,y) coordinates to get the normal quantile plot.
160
1.5
0.5
-0.5
-1
-1.5
-2
The distribution of the data clearly is not normal. 17. Using Standard Scores No, the transformation to z scores involves subtracting a constant and dividing by a constant, so the plot of the (x,z,) points will always be a straight line, regardless of the nature of the distribution.
161
-1
-1
-2 -2 -1 0 1 2 3
-2 -2 -1 0 1 2
The above distribution on the left is clearly not normal. However, the distribution on the right, after the log (x + 1) transformation is much closer to being a normal distribution. This illustrates that at times a transformation can provide a distribution much closer to a normal distribution than the original distribution has.
Review Exercises
1. High Cholesterol Levels, = 178.1, = 40.7 a. P(x > 260)
z=
b.
P(x > 260)= P(z > + 2.01), Using Table A-2, P(z < +2.01)= 0.9778 P(z > +2.01)= 1 P(z +2.01)= 1 0.9778= 0.0222 P(170 < x < 200)
z=
z=
c.
P(z < +0.54)= 0.7054, P(z < -0.20)= 0.4207 P(170 < x < 200)= P(-0.20 < z < +0.54)= 0.7054 0.4207= 0.2847 P(170 < x < 200), with n= 9
170 178.1 8.1 8 .1 = = = 0.60 40.7 40.7 13.57 3 9 n . x 200 178.1 21.9 21.9 z= = = = = +1.61 40.7 40.7 13.57 3 n 9 z= =
From Table A-2, P (z < 0.60)= 0.2743 and P(z < +1.61)= 0.9463. The difference is 0.9463 0.2743= 0.6720. There is a 0.6720 probability that a group of 9 men will have a mean cholesterol level between 170 mg/dL and 200 mg/dL The top 3% is equivalent to bottom 97%. From Table A-2, the area 0.97 corresponds to a z score of +1.88
d.
162
z=
b.
P( z < 2.46) = 0.0069. Therefore, 0.69% of babies are in at risk category. If the Chicago hospital has 900 births, we expect 0.69 % of the 900 to be at risk 6.21 babies would be at risk. Lowest 2%. From Table A-2, the area 0.02 corresponds to a z score of 2.05
c.
z=
x- n
d.
From Table A-2, P(z < 2.26) = 0.9881.Therefore, P(z >2.26) = 1 0.9881 = 0.0119. The probability that 16 newborn babies will have mean weight greater than 3700 is 0.0119. P(3300 < x < 3700) with n= 49
3300 3420 120 120 = = = 1.70 495 495 70.71 7 n 49 280 280 x - 3700 3420 z= = = = = +3.96 495 495 70.71 7 49 n z= =
From Table A-2, P(z < 1.70) = 0.0446, and, P(z < +3.96) = 0.9999. P(3300 < x < 3700)= P(z < +3.96) P(z < 1.70)= 0.9999 0.0446 = 0.9553. There is a 0.9553 probability that a group of 49 babies will have a mean birth weight between 3300 g and 3700 g. 3. Blue Genes, since np= 25 and nq= 75, both > 5, use of normal approximation to a binomial distribution, with continuity correction, is justified P(x 19), find Pc(x < 19.5)
x-
n = 100, p = 0.25
= np = 100 0.25 = 25 = npq = 100 0.25 0.75 = 18.75 = 4.33 x 19.5 25 5.5 = = = 1.27 z= 4.33 4.33
From Table A-2, the area below a z score of 1.27 is 0.1020. Since P= 0.1020 > 0.05, it would not be considered to be unusual to have 19 or fewer offspring with blue eyes out of 100 births. 4. Marine Corps Height Requirements for Men, = 69, = 2.8 a. P(64 < x < 78)
z=
64 69 5 = = 1.79 2 .8 2 .8
z=
78 69 9 = = +3.21 2 .8 2 .8
From Table A-2, the area below a z score of 1.79 is 0.0367 and for a z score of +3.21 is 0.9993. P(64 < x < 78)= P(z < +3.21) P(z < 1.79)= 0.9993 0.0367= 0.9626 Therefore 96% of men meet this requirement so not many men (only about 3.7%) are denied entry into the Marines because of their height.
163
b.
The shortest 2% corresponds to an area of 0.02 which corresponds to a z score of 2.05. The tallest 2% corresponds to an area of 0.98 which corresponds to a z score of +2.05
c.
The new minimum and maximum heights would be 63.3 in. and 74.7 in. P( x > 68) with n= 64
z=
x n
68 69 1 1 = = = 2.86 2 .8 2 .8 0.35 8 64
The area below a z score of 2.86 is 0.0021. P(z > 2.86) = 1 0.0021= 0.9979 The probability of randomly drawing a sample of 64 with a mean height greater than 68 in. is 0.9979. 5. Sampling Distributions a. With a sample size of 100, which is considered a large sample size, we would expect the distribution of sample means to be normally distributed regardless of the shape of distribution from which the samples are drawn. The basis for making this claim is the Central Limit Theorem. b. The standard deviation of the sample means is referred to as the standard error of the mean. If = 512 and samples are of size, n= 100, it is found as:
x =
c.
512 100
512 = 51.2 10
With a sample size of 1200, which is considered a very large sample size, we would expect the distribution of sample proportions from x/n to be normally distributed even though the original distribution is a binomial distribution. The basis for making this claim is the Central Limit Theorem.
6. Gender Discrimination, n= 20, p= 0.30, q= 0.70 np= 6, nq= 14 (since both 5, a normal distribution approximation is justified)
= np = 20 0.30 = 6
P( x 2), Pc ( x < 2.5)
From Table A-2, P(z < 1.71)= 0.0436 The probability of selecting two or fewer women by chance is 0.0436. Since P= 0.0436 < 0.05, this outcome would be considered unusual. Either this is an unusual event or something else may have happened such as discrimination.
164
7. Testing for Normality, From Data Set 6 in Appendix B, Bear Neck Size From the graphs below, the distribution is approximately normal. The histogram, with a normal distribution superimposed on it, has one mode and is roughly bell-shaped and the normal quantile plot has most of the points on the straight line.
10
Frequency
-2
-4 -4 -2 0 2 4
BearNeckSize
8. Testing for Normality, From Data Set 12 in Appendix B, Pre-Exercise with No Stress From the graphs below, the distribution is approximately normal. The histogram, with a normal distribution superimposed on it, has one mode is roughly bell-shaped and the normal quantile plot has most of the points on the straight line.
Frequency
-1
-2 -2 -1 0 1 2
PreExrcsSystBP
165
x=
b. c. d.
x = 67 + 66 + 59 + 62 + 63 + 66 + 66 + 55 = 504 = 63.0
n 8 8
Since there are a even number of scores, the median is the middle point between the two middle, Median, ~ x = (63+66)/2 = 64.5 The mode is the number that occurs the most frequent = 66 (occurs 3 times) Standard deviation
x = 504
s =
2
= 31876
=
n x ( x )
2
n(n 1)
s = s 2 = 17.714 = 4.21
e. f. g. h. i.
z=
6 of the 8 numbers are greater than 59, 6/8= 0.75 or 75% Assuming a normal distribution, the area below a z score of 0.95, P(z < 0.95)= 0.1711 P (z > 0.95) = 1 0.1711= 0.8289. This corresponds to 82.89% This data set is ratio level of measurement since there are equal intervals of measurement and there is a natural staring point at zero. The exact un-rounded distances are continuous data that can be any value on the continuum.
2. Left-Handedness, p= 0.10 a. This is a binomial distribution with p= 0.1. Probability of 3 out of a sample of 3 being left handed P(L1) = 0.1, P(L2)= 0.1, P(L3)= 0.1 P(all three are L)= P(L1) P(L2) P(L3)= 0.13= 0.001 b. P(at least 1 person left-handed)= 1 P(no lefthanders)= 1 P( N1) P(N2) P(N3) = 1 (0.9 0.9 0.9)= 1 0.729= 0.271 c. The sample size of 3 is too small, np= 0.3 < 5, np 5 is not satisfied. d. In a group of 50 people, the mean number of left handed people would be = np = 50 0.1 = 5.0 e. f. Standard deviation, P(x > 8)
z=
Area below a z score of 1.42 is 0.9207.Therefore, P(x > 8) = 1 0.9207= 0.0793. Since P= 0.0793 > 0.05, it would not be considered an unusual result to get 8 lefthanders out of 50 subjects.