Lab 8 - Sampling Techniques 1
The absolute value of the test statistic is 4.5644, which is greater than the critical
value of 1.6449. Hence, at the 0.05 significance level, we reject the claim that
the mean lifetime of a light bulb is above 10,000 hours.
Alternative Comparison: P-Value
• The p-value is the probability, computed under the assumption that the null
hypothesis is true, of observing a test statistic at least as extreme as the one
actually obtained. It is used as an alternative to rejection points: it gives the
smallest level of significance at which the null hypothesis would be rejected.
A smaller p-value means that there is stronger evidence in favour of the
alternative hypothesis.
P-Value (cont.)
• The p-value approach to hypothesis testing uses the calculated probability to
determine whether there is evidence to reject the null hypothesis. The null
hypothesis, also known as the conjecture, is the initial claim about a
population parameter. The alternative hypothesis states how the
population parameter differs from the value stated in the conjecture.
In practice, the significance level is stated in advance, and the null
hypothesis is rejected when the p-value falls below it.
P-Value Comparison
• Instead of using the critical value, we apply the pnorm function to
compute the lower tail p-value of the test statistic. As it turns out to be
less than the .05 significance level, we reject the null hypothesis.
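A minimal sketch of this step in R. Only the absolute value 4.5644 is quoted for the light bulb test statistic, so the negative sign used here is an assumption based on the test being lower-tailed.

```r
z <- -4.5644        # assumed sign: lower-tail test statistic
alpha <- .05        # significance level

pval <- pnorm(z)    # lower-tail p-value, P(Z <= z)
pval < alpha        # TRUE means the null hypothesis is rejected
```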
Problem
• Suppose the food label on a cookie bag states that there is at most 2
grams of saturated fat in a single cookie. In a sample of 35 cookies, it
is found that the mean amount of saturated fat per cookie is 2.1 grams.
Assume that the population standard deviation is 0.25 grams. At .05
significance level, can we reject the claim on food label?
R - Code
• The null hypothesis is that μ ≤ 2 (tested at the boundary μ = 2). We begin
by computing the test statistic.
• The alternative hypothesis is that the mean is greater than 2.
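The test statistic for the cookie problem can be sketched in R as follows. The variable names are illustrative; the numbers are those given in the problem statement.

```r
xbar  <- 2.1     # sample mean (grams)
mu0   <- 2       # hypothesized value from the food label
sigma <- 0.25    # population standard deviation
n     <- 35      # sample size

# z statistic for a mean with known population standard deviation
z <- (xbar - mu0) / (sigma / sqrt(n))
z                # approximately 2.3664
```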
Critical Value Comparison
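A short sketch of the critical value computation, assuming an upper-tail test at the .05 significance level; `z.alpha` is an illustrative name.

```r
alpha <- .05
z.alpha <- qnorm(1 - alpha)   # upper-tail critical value of the standard normal
z.alpha                       # approximately 1.6449
```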
Interpretation
• The test statistic 2.3664 is greater than the critical value of 1.6449.
Hence, at .05 significance level, we reject the claim that there is at
most 2 grams of saturated fat in a cookie.
Problem
• Suppose the mean weight of King Penguins found in an Antarctic
colony last year was 15.4 kg. In a sample of 35 penguins taken at the same time
this year in the same colony, the mean penguin weight is 14.6 kg. Assume
the population standard deviation is 2.5 kg. At .05 significance level,
can we reject the null hypothesis that the mean penguin weight does
not differ from last year?
R – Code
• The null hypothesis is that μ = 15.4.
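A minimal sketch of the test statistic for the penguin problem, using the values from the problem statement; the variable names are illustrative.

```r
xbar  <- 14.6    # sample mean (kg)
mu0   <- 15.4    # hypothesized value (last year's mean)
sigma <- 2.5     # population standard deviation
n     <- 35      # sample size

z <- (xbar - mu0) / (sigma / sqrt(n))
z                # approximately -1.8931
```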
Comparison using p-value
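Because the test is two-tailed, the p-value is twice the tail probability beyond the test statistic. A sketch under that assumption, recomputing the statistic so the block is self-contained:

```r
# Test statistic for the penguin problem
z <- (14.6 - 15.4) / (2.5 / sqrt(35))

# Two-tailed p-value: double the lower-tail area beyond -|z|
pval <- 2 * pnorm(-abs(z))
pval             # approximately 0.0583
```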
Since the p-value turns out to be greater than the .05 significance level, we do
not reject the null hypothesis that μ = 15.4.
Test of Population Proportion
• The null hypothesis of the population proportion can be expressed in
terms of a hypothesized value p0 of the true proportion p, for example
H0: p = p0 for a two-tailed test.
Thus, we are 95% confident that the percent of eighth-graders who performed at or above the basic
level in mathematics in 2011 is between 0.14% and 5.86% higher than in 2009.
Problem 3
• Helmet use among recreational alpine skiers and snowboarders is generally low. A
study from Norway examined whether helmet use reduces the risk of head injury. The
study compared helmet use among skiers and snowboarders who were injured
with that of a control group of skiers and snowboarders who were
uninjured. 96 of 578 people with head injuries used a helmet, and 656 of 2992 people in
the uninjured group used a helmet. Is helmet use lower among skiers and snowboarders
who had head injuries?
• Let p1 be the proportion of helmet use among injured skiers and snowboarders.
Let p2 be the proportion of helmet use among uninjured skiers and snowboarders.
H0: p1 = p2 against H1: p1 < p2
The p-value = 0.0021 < 0.01, so we have strong evidence that helmet use is lower among skiers
and snowboarders who had head injuries compared to uninjured skiers and snowboarders.
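The quoted p-value can be reproduced with base R's `prop.test`. This is a sketch, not necessarily the original lab code: switching off the continuity correction is an assumption, chosen because it matches the quoted value of 0.0021.

```r
x <- c(96, 656)      # helmet users: injured group, uninjured group
n <- c(578, 2992)    # group sizes

# One-sided test of H1: p1 < p2
# Assumption: no continuity correction (correct = FALSE)
res <- prop.test(x, n, alternative = "less", correct = FALSE)
res$p.value          # approximately 0.0021
```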
Problem 4
• A survey is taken two times over the course of two weeks. The pollsters wish to see if there
is a difference in the results as there has been a new advertising campaign run. Here is the
data
             Week1  Week2
Favorable      45     56
Unfavorable    35     47
H0: P1 = P2
H1: P1 ≠ P2 (two-sided)
R - Code
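A minimal sketch using base R's `prop.test`, assuming its defaults (two-sided alternative, continuity correction applied); the variable names are illustrative.

```r
favorable <- c(45, 56)             # favorable responses in week 1 and week 2
total     <- c(45 + 35, 56 + 47)   # total responses in each week

res <- prop.test(favorable, total) # two-sided test of equal proportions
res$p.value                        # approximately 0.9172
```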
We observe that the p-value is 0.9172, so we do not reject the null hypothesis that P1 = P2.
Two Mean Test
The following data show the heights of individuals from two different countries, with population
variances of 5 and 8.5 respectively. Is there a significant difference between the average heights
of the two groups?
A: 175 168 168 190 156 181 182 175 174 179
B: 185 169 173 173 188 186 175 174 179 180
R – Code
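A sketch of a two-sample z-test in base R; `two_mean_z` is a hypothetical helper, not a library function. It assumes the stated values 5 and 8.5 are used as population standard deviations, which is the reading under which the p-value exceeds .05. Read literally as variances, the same data give a p-value below .05.

```r
a <- c(175, 168, 168, 190, 156, 181, 182, 175, 174, 179)
b <- c(185, 169, 173, 173, 188, 186, 175, 174, 179, 180)

# z statistic for the difference of two means with known population variances
two_mean_z <- function(x, y, var_x, var_y) {
  (mean(x) - mean(y)) / sqrt(var_x / length(x) + var_y / length(y))
}

# Assumption: 5 and 8.5 are standard deviations, so the variances are their squares
z <- two_mean_z(a, b, 5^2, 8.5^2)
pval <- 2 * pnorm(-abs(z))   # two-tailed p-value
pval
```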
P-value comparison
Since the p-value turns out to be greater than the .05 significance level, we do not reject the
null hypothesis.
Practice Problems on Large Samples
• In a sample of 1000 people in Maharashtra, 540 are rice eaters and the rest are
wheat eaters. Can we assume that both rice and wheat are equally popular in this
state at the 1% level of significance?
• A particular brand of tires claims that its deluxe tire averages at least 50,000 miles
before it needs to be replaced. From past studies of this tire, the standard deviation
is known to be 8000. A survey of owners of that tire design is conducted. From the
28 tires surveyed, the average lifespan was 46,500 miles with a standard deviation
of 9800 miles. Do the data support the claim at the 5% level?
Practice Problems (cont.)
• In large city A, 20 per cent of a random sample of 900 school children
had defective eyesight. In large city B, 15 per cent of a random sample of
1600 school children had the same defect. Is the difference between the
two proportions significant? Obtain 95% confidence limits for the difference
in the population proportions.
• A cigarette manufacturing firm claims that its brand A cigarettes outsell
its brand B by 8%. If it is found that 42 out of a sample of 200 smokers prefer
brand A and 18 out of another random sample of 100 smokers prefer
brand B, test whether the 8% difference is a valid claim.
Practice Problems (cont.)
• The average number of sick days an employee takes per year is believed to be about 10.
Members of a personnel department do not believe this figure. They randomly survey 8
employees. The number of sick days they took for the past year are as follows: 12; 4; 15; 3;
11; 8; 6; 8. Let X = the number of sick days they took for the past year. Should the
personnel team believe that the average number is about 10?
• In 1955, Life Magazine reported that the 25 year-old mother of three worked [on average]
an 80 hour week. Recently, many groups have been studying whether or not the women's
movement has, in fact, resulted in an increase in the average work week for women
(combining employment and at-home work). Suppose a study was done to determine if
the average work week has increased. 81 women were surveyed with the following results.
The sample average was 83; the sample standard deviation was 10. Does it appear that the
average work week has increased for women at the 5% level?
Practice Problems (cont.)
• A sample of 100 tyres is taken from a lot. The mean life of the tyres is
found to be 39,350 kilometers with a standard deviation of 3,260.
Could the sample come from a population with a mean life of 40,000
kilometers?
• The mean lifetime of a sample of 400 fluorescent light bulbs
produced by a company is found to be 1,570 hours with a standard
deviation of 150 hours. Test the hypothesis that the mean lifetime of
bulbs is 1,600 hours against the alternative hypothesis that it is greater
than 1,600 hours at the 1% and 5% levels of significance.