Q4 STATISTICS AND PROBABILITY Final
Q4 STATISTICS AND PROBABILITY Final
BRIEF INTRODUCTION:
As we enter the fourth and final quarter of the school year, I would like to congratulate you in
advance. I hope your enthusiasm and passion for learning is still there despite our new mode of
learning amidst this COVID- 19 outbreak. In our Statistic and Probability class, we will focus on
Hypothesis Testing. In our topic, your prior knowledge in your previous lesson will still be used for
you to be able to cope with the given tasks/ activities. The tasks that you will need to accomplish
include problem solving sheets, essays, and brochure which will further enhance your mind. I hope
you will enjoy and learn a lot from our lessons!
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://www.youtube.com/watch?v=VK-rnA3-41c
https://www.youtube.com/watch?v=-FtlH4svqx4
Opening Prayer
Dear God, today we will be having our lesson. Help us remember all
that we have studied and learned. Give us a calm mind so we can be
able to think clearly and strengthen us to the rest. We promise to do our
best for the greater Glory of your Son Jesus Christ. Amen.
1
TOPIC: UNDERSTANDING HYPOTHESIS TESTING
_________
INTRODUCTION:
One of our ultimate goals of every nation is to produce professions who will contribute to
scientific knowledge through research. In research investigation, hypothesis testing is a vital
procedure. It is making a decision whether to accept or reject a statement or the assumption about
some parameter in any research problem. From the result of the correct decision making, conclusions
are drawn in which facts are generated, and thus can contribution to a particular body of knowledge in
the fields of education, business, medicine, commerce, economics, and many others.
What I know What I Want to Find How can I Learn What I have Learned
Out More
2
CONTENT DISCUSSION:
In this lesson, we will study the terminologies related to testing of hypothesis, how to calculate
the probabilities of committing a type I and type II error, hypothesis testing is about a population
mean when the variance is unknown, and hypothesis testing concerning proportions. Hypothesis
testing, the focal point of this unit, brings to light the role of research in discovering new knowledge
and breakthroughs in different fields of discipline.
***Notice that the null hypothesis is expressed through the use of the “equal” symbol while the
alternative hypothesis ix expressed by “not total” symbol because the claim/conjecture does not
specify any direction.
3
TYPE OF TEST
- A statistical test may either be directional (one-tailed) or non-directional (two-tailed). We can
determine whether a test is directional or non-directional by looking at how the alternative
hypothesis is expressed.
DIRECTIONAL TEST
o A test of any statistical hypothesis where the alternative hypothesis is expressed, using
less than (<) or greater than (>) is called directional test since the critical or rejection
region lies entirely in one tail of the sampling distribution.
NONDIRECTIONAL TEST
o A test of any statistical hypothesis where the alternative hypothesis is written with not
equal (≠) is called a nondirectional test or two-tailed test since there is no assertion
made on the direction of the difference. The rejection region is split into two equal
parts, one in each tail of the sampling distribution.
TYPES OF ERROR
In decision making, we sometimes make a wrong decision. Likewise, when we test a hypothesis,
there is a possibility that we shall also commit an error of accepting or rejecting the hypothesis. There
are two types of errors: TYPE I error and the TYPE II error
TYPE I error- occurs when we reject the null hypothesis when it is true, it is also called
alpha error (α error ¿
TYPE II error- occurs when we accept the null hypothesis when it is false. It is also called
the beta error ( β error ¿
4
LEVEL OF SIGNIFICANCE
The probability of committing Type I error is called the level of significance. It is denoted by the
Greek letterα . Thus, the value of α tells us the probability of making an error in rejecting the null
hypothesis when it is actually true. The choice for the value of the significance level is determined by
the researcher. This depends on the risk or degree of confidence the researcher is willing to take in
committing Type I error. The commonly used levels of significance are 0.05 and 0.01. The level of
significance should be set before testing the hypothesis.
Example:
A 0.01 level of significance means the researcher is willing to take 1% error in making a
decision. It also implies that he is 99% confident that he will make the right decision. Likewise a 0.05
level of confidence means that the researcher is willing to take 5% error in making a decision. It also
implies that he is 95% confident that he will make a right decision.
Example 1:
Claim: The average weekly allowance of college students is less than P 1500.
Ho: The average weekly allowance of college students is P 1500.
Ha: The average weekly allowance of college students is less than P 1500.
This is directional test or one-tailed test. More specifically, this is a left-tailed test because the less
than symbol was used in expressing the alternative hypothesis. Thus the critical region or the
rejection region lies entirely in the left tail of the sampling distribution.
Example 2:
Claim: The average weekly allowance of college students is greater than P 1500.
Ho: The average weekly allowance of college students is P 1500.
5
Ha: The average weekly allowance of college students is greater than P 1500.
This is also a directional test or one-tailed test. More specifically, this is a right-tailed test because
the “greater than” symbol was used in expressing the alternative hypothesis. Thus the critical region
or the rejection region lies entirely at the right tail of the sampling distribution.
DIRECTIONAL TEST
o A test of any statistical hypothesis where the alternative hypothesis is expressed, using
less than (<) or greater than (>) is called directional test since the critical or rejection
region lies entirely in one tail of the sampling distribution.
NONDIRECTIONAL TEST
o A test of any statistical hypothesis where the alternative hypothesis is written with not
equal (≠) is called a nondirectional test or two-tailed test since there is no assertion
made on the direction of the difference. The rejection region is split into two equal
parts, one in each tail of the sampling distribution.
Closing Prayer
Dear God, thank you for the lesson today, it’s great to find out new things about your world.
It’s fun trying new skills and learning to read and write. Help us remember all we have learnt
today. We look forward to tomorrow and all the wonderful things we will be taught. Amen.
6
WEEK 2: COMPARING THE SAMPLE MEAN AND THE POPULATION MEAN
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://www.youtube.com/watch?v=mpkizZxqvM0
Opening Prayer
Lord God, bless our family today, keep us free from this COVID 19 virus. Stay with us always
Lord.
What I know What I Want to Find How can I Learn What I have Learned
Out More
7
PROCESS QUESTIONS/ FOCUS QUESTIONS:
Below are the key guide questions that you should remember as you perform all the activities in this
lesson. You should be able to answer them at the end of the week.
1. How do you compare mean of the sample means and the mean of the population in a large
sample size?
2. How do you find the population mean from the sample mean?
3. What test can be used to test the difference between two sample means when the population
variance is known?
CONTENT DISCUSSION:
Population mean is the mean of sample values collected. Population mean is the mean of all the
values in the population. If the sample is random and sample size is large then the sample mean
would be a good estimate of the population mean.
INTRODUCTION:
In many instances when we are testing the hypotheses, we want to find out if the mean
difference is statistically significant or not. The z-test can be used for this purpose. Generally, the z-
test is used when the following conditions are satisfied.
1. The population standard deviation is known or given.
2. The population standard deviation is unknown but the sample size is sufficiently large,(greater
than or equal to 30). In this case, we use the sample standard deviation (s) to replace the
population standard deviation.
The z-test of One-Sample Mean
- The z test of one sample mean is used when we want to test if the sample mean x differs
significantly from a population mean or the hypothesized mean µ. To illustrate, suppose that
the mean of the population is known to be µ0. If we take a random sample size n from this
population and obtain a sample x which is somewhat different from µ 0 , so there is a reason to
believe that there is significant difference between µ0 and x .
- The z-test of one sample mean is used to compare sample mean and population mean. We use
the z-score formula for the sample means because we are dealing with sample means. Thus ,
by the Central Limit Theorem we have
x−μ
z=
σx
σ
Where σ x =
√n
The formula can be written as
(x−μ) √ n
z=
σ
Where
x=mean of the sample
µ= mean of the population
n=sample size
σ = standard deviation of the population
8
Afterwards, we compare the test value of z against the critical value z by drawing the rejection region
in the sampling distribution. The table shows the critical values of z for the given type of test and
level of significance.
EXAMPLE:
A new drug on the market is claimed by its manufacturer to reduce overweight women by 4.70 kg
per month with standard deviation of 0.91kg. Thirty - women chosen at random have reported losing
an average of 4.55 kg within a month. Does this data support the claim of manufacturer at 0.05 level
of significance?
Solution:
Step 1: Ho: The average weight loss per month using a new drug is 4.55 kg ( μ=4.55 ¿
Ha: The average weight loss per month using a new drug is not 4.55 kg μ ≠ 4.55 ¿
Step 2: Type of test
Two-tailed or non-directional test
α =0.05
Critical value is z= ± 1.96
Step 3: Compute the test value, using the formula or test statistic
Given:
x=4.55
μ=4.70
n=30
σ =0.91
Substitute the given values in the formula.
(x−μ) √ n
z=
σ
( 4.55−4.70) √ 30
z=
0.91
TOPIC: COMPARING THE SAMPLE MEAN AND THE POPULAION MEAN IN A SMALL
SAMPLE SIZE
In using the z-test of one sample mean, we have made an assumption that the population
standard deviation is known or given. However, if σ is not known, we can still use the z-test by
replacing σ by s(sample standard deviation) provided that n≥ 30. What if σ is not known and n< 30?
We shall use the t-test of one sample mean for this situation. In chapter 4, we have already discussed
the t-distribution and how to get the critical values in the t-table.
When sample size is less than 30 and the population standard population is unknown, we use the
following formula to compare the sample mean and the population mean and the population mean.
x−μ ( x−μ ) √ n
z= ∨t=
s s
√n
9
Example:
The director of secretarial school believes that its graduates can type more than 75 words per minute.
A random sample of 12 graduates has been found to have an average of 77.2 words per minute with
standard deviation of 7.9 words per minute in a typing test. Using the 0.05 level of significance, test
the claim of the director.
Step 1: Formulate the null and alternative hypotheses
Ho: The average number of words that graduates can type is 75 words per minute
( μ=75¿
Ha: The average number of words that graduates can type is greater than 75 words
per minute(μ>75)
Step 2: Type of test
One -tailed or directional test
α =0.05
df=12-1=11 is t=1.796
Step 3: Compute the test value, using the formula or test statistic
Given:
x=77.2
μ=75
n=12
s=7.9
( x−μ ) √ n
t=
s
(77.2−75) √ 12
z=
7.9
1. How do you compare mean of the sample means and the mean of the population in a large sample
size?
The sample mean is mainly used to estimate the population mean when population mean is
not known as they have the same expected value. Sample mean implies the mean of the sample
derived from the whole population randomly. Population mean is nothing but the average of the
entire group.
2. How do you find the population mean from the sample mean?
With a sample mean you can get an estimate of the population mean. According to the central
limit theorem, the sample means of random samples of size n approximately follow a normal
distribution.
3. What test can be used to test the difference between two sample means when the population
variance is known?
Z- Test can be used to test the difference between two sample means when the population
variance is known?
10
When sample size is less than 30 and the population standard population is unknown, we use the
following formula to compare the sample mean and the population mean and the population mean.
x−μ ( x−μ ) √ n
z= ∨t=
s s
√n
Closing Prayer
We thank you for setting us at tasks which demand our best efforts, and for leading us to
accomplishments which satisfy and delight us.
11
WEEK 3: TESTING A MEAN WHEN POPULATION VARIOUS IS KNOWN AND UNKNOWN
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://www.youtube.com/watch?v=BdeuCflLPQI
Opening Prayer
Lord, you shaped my life from nothing! You created me and saved me, and now I give You all
my trust and praise today. Continue to shape my life today according to your will! Amen.
INTRODUCTION:
Hypothesis testing is a procedure used by statisticians to determine whether or not to reject a
statement about a population.
Suppose that we are taking samples from a population that is normally distributed with respect
to some random variable X. What can we say about the sampling distribution of the sample mean?
What I know What I Want to Find How can I Learn What I have Learned
Out More
EXERCISE 1.
Directions: State the null and the alternative hypothesis of the given problem.
1. The Head of the P. E. Department of a certain high school claims that the mean height of
Grade 7 students is 163 cm. The mean height of 50 randomly selected grade 7 students is
161cm. Using 0.01 significance level, can it be concluded that the mean height of Grade 7
students is different from 163 cm as claimed by the Head of P.E department?
A.
B.
CONTENT DISCUSSION:
In this “Hypothesis Test for a Population Mean,” we will look at the four steps of hypothesis test
as they relate to a claim about a population mean. In this lesson, the value of the population means is
not the focus of inference. Instead, it develops tools for determining the relationship between known
and unknown population means.
Afterwards, we compare the test value of z against the critical value z by drawing the rejection region
in the sampling distribution. The table shows the critical values of z for the given type of test and
level of significance.
Table A:
TYPE OF TEST LEVEL OF SIGNIFICANCE
α=0.01 α=0.05
ONE-TAILED ± 2.33 ± 1.65
TWO-TAILED ± 2.58 ± 1.96
EXAMPLE:
A new drug on the market is claimed by its manufacturer to reduce overweight women by 4.70 kg
per month with standard deviation of 0.91kg. Thirty - women chosen at random have reported losing
an average of 4.55 kg within a month. Does this data support the claim of manufacturer at 0.05 level
of significance?
Solution:
Step 1: Ho: The average weight loss per month using a new drug is 4.55 kg ( μ=4.55 ¿
Ha: The average weight loss per month using a new drug is not 4.55 kg μ ≠ 4.55 ¿
Step 2: Type of test
Two-tailed or non-directional test
α =0.05
Critical value is z= ± 1.96
Step 3: Compute the test value, using the formula or test statistic
Given:
x=4.55
μ=4.70
n=30
σ =0.91
Substitute the given values in the formula.
(x−μ) √ n
z=
σ
( 4.55−4.70) √ 30
z=
0.91
14
TOPIC: COMPARING THE SAMPLE MEAN AND THE POPULATION MEAN IN A
SMALL SAMPLE SIZE
In using the z-test of one sample mean, we have made an assumption that the population
standard deviation is known or given. However, if σ is not known, we can still use the z-test by
replacing σ by s(sample standard deviation) provided that n≥ 30. What if σ is not known and n< 30?
We shall use the t-test of one sample mean for this situation. In chapter 4, we have already discussed
the t-distribution and how to get the critical values in the t-table.
When sample size is less than 30 and the population standard population is unknown, we use the
following formula to compare the sample mean and the population mean and the population mean.
x−μ ( x−μ ) √ n
z= ∨t=
s s
√n
Example:
The director of secretarial school believes that its graduates can type more than 75 words per minute.
A random sample of 12 graduates has been found to have an average of 77.2 words per minute with
standard deviation of 7.9 words per minute in a typing test. Using the 0.05 level of significance, test
the claim of the director.
Step 1: Formulate the null and alternative hypotheses
Ho: The average number of words that graduates can type is 75 words per minute
( μ=75¿
Ha: The average number of words that graduates can type is greater than 75 words
per minute( μ>75)
Step 2: Type of test
One -tailed or directional test
α =0.05
df=12-1=11 is t=1.796
Step 3: Compute the test value, using the formula or test statistic
Given:
x=77.2
μ=75
n=12
s=7.9
( x−μ ) √ n
t=
s
(77.2−75) √ 12
z=
7.9
2. When a hypothesis test is to be conducted regarding a population variance the test statistic will be?
15
When a hypothesis test is to be conducted regarding a population variance, the test statistic
will be: X2 value from the chi- square distribution. If we are interested in performing a one- tailed,
upper- tail hypothesis test about a population variance where the level of significance.
Table A:
TYPE OF TEST LEVEL OF SIGNIFICANCE
α=0.01 α=0.05
ONE-TAILED ± 2.33 ± 1.65
TWO-TAILED ± 2.58 ± 1.96
Closing Prayer
Father of Justice, you gifted us with your creation so that we can participate with it through
the works of our hands, heart, feet and intellect. Guide us as you did with your son Jesus
Christ, in our vocation may we build community and not hinder it. Amen.
16
WEEK 4: TEST POPULATION MEAN
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://www.youtube.com/watch?v=Hag0GW9B6EY
Opening Prayer
TALK with God, no breath is lost. WALK with God, no strength is lost. WAIT for God, no time
is lost. TRUST in God, you will never be lost. Amen.
INTRODUCTION:
In this section, we discuss statistical tests concerning means: the z- test. The following is the
general formula used for the computation of the test statistic.
Statistic −Parameter
Test Statistic=
Standard Error
17
When doing a hypothesis testing for a mean, the observed value refers to the sample mean
while the expected value is the value of the population mean when the null hypothesis is assumed to
be true.
What I know What I Want to Find How can I Learn What I have Learned
Out More
CONTENT DISCUSSION:
This lesson encompasses the most essential learning competencies for Test Population
Mean- Computes for the test statistic value (population mean), draws conclusion about the
population mean based on the test-statistic value and the rejection region, solves problems
involving test of hypothesis on the population mean. It is designed to develop and enhance your
knowledge, skills and abilities in solving mathematical problem.
Example:
1. A researcher reports that the average salary of company managers is more than P 63 000. A sample
of 35 company managers has a mean salary of P 65 700. At α =0.01 , test the claim that the company
managers earn more than P 63 000 a month. The standard deviation of the population is P 5,250.
Solution:
18
Given:
x=P 65 700
μ=P 63 000
σ =P 5250
n=35
α = 0.01
Step 3: The critical z value is 2.326
One-tailed test
(x−μ) √ n
z=
σ
(65 700−63000) √ 35
¿
5250
z= 3.043
Step 5: Conclusion:
Since we reject the null hypothesis, we can conclude that there is enough evidence
to support the claim that the monthly salary of company managers is more than P
63 000.
Example:
1. A sociologist believes that it cost more than P 90 000 to raise a child from birth to age one. A
random sample of 49 families, each with a child is selected to see if this figure is correct. The
average expenses for these families reveal a mean of P 92, 000 with standard deviation of P 4500.
Based on these sample data, can it be concluded that the sociologist is correct in his claim? Use 0.05
level of significance
Solution:
19
α = 0.05
Critical value z= ± 1.65
One tailed test
Step 3: The critical z value is 2.326
One-tailed test
(x−μ) √ n
z=
σ
Reject the null hypothesis since it falls within the rejection region, therefore accept
the alternative hypothesis.
Step 5: Conclusion:
There is a significant difference between the sample mean and the population
mean. Thus, the sociologist is correct in claiming that the cost of raising a child
from birth to age one is more than P 90 000.
Below are the key guide questions that you should remember as you perform all the activities in this
lesson. You should be able to answer them at the end of the week.
Statistic −Parameter
Test Statistic=
Standard Error
Closing Prayer
20
Lord, give us the courage to shout out from the mountaintops against those who violate your
goodness by exploiting the work of others. Let us work to be signs of hope for those who seek
decency and fairness. Amen.
MODULE 5: HYPOTHESIS TESTING CONCERNING PROPORTIONS
The learner demonstrates understanding of key concepts of tests of
Content Standard hypotheses on the population mean and proportion
The learner is able to perform appropriate tests of hypotheses involving
Performance Standard the population mean and population proportion to make inferences in
real-life problems in different disciplines.
Formulates the appropriate null and alternative hypotheses on
Most Essential Learning the population proportion
Competencies (MELCs) Identifies the appropriate form of the test-statistic when the
Central Limit Theorem is to be used.
21st Century Learning
Critical Thinking
Skills
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://courses.lumenlearning.com/wmopen-concepts-statistics/chapter/
hypothesis-test-for-a-population-proportion-3-of-3/
Opening Prayer
I pray because I believe in the power of prayer. I pray because I have no doubt that God has
answered my prayers. I pray because God woke me up and allowed me to see another day. No
matter what others think or say, I will pray. Amen.
INTRODUCTION:
Now that some general concepts of hypothesis testing have been introduced, we are ready to
turn our attention to the development of procedures using sample information to decide between the
null and the alternative hypotheses.
21
Below is the Learning Targets/ Specific Objective:
1. Formulates the appropriate null and alternative hypotheses on the population proportion
2. Identifies the appropriate form of the test-statistic when the Central Limit Theorem is to be
used.
What I know What I Want to Find How can I Learn What I have Learned
Out More
CONTENT DISCUSSION:
This lesson was made in order for us to learn how to determine if a proportion from a sample
differs significantly from a proportion from a population. We will show you in this lesson how to
compare sample proportion and population proportion using a test.
In our previous lesson, you have compared sample mean and population. There are some
instances wherein we want to compare are proportions. In this lesson we shall learn how to determine
if a proportion from a sample differs significantly from a proportion from a population.
To compare sample proportion and population proportion, we use the z-test for one-sample
mean proportion. The test statistic for this test is
p− p o
z=
√ p0 (1−p 0)
n
Where
p= sample proportion
p0= population proportion
n=sample size
x
p=
n
x= number of successes
Table A:
TYPE OF TEST LEVEL OF SIGNIFICANCE
α=0.01 α=0.05
ONE-TAILED ± 2.33 ± 1.65
TWO-TAILED ± 2.58 ± 1.96
Example:
22
It has been claimed that less than 60% of all purchases of a certain kind of computer program will call
the manufacturer’s hotline with a month purchase. If 55 out of 100 software purchasers selected at
random call the hotline within a month of purchase, test the claim at 0.05 level of significance.
Solution:
Step 1: Formulate the null and alternative hypotheses
Ho: The proportion of purchasers that will call the manufacturer’s hotline within one
month of purchase is 60% or 0.60 ( po= 0.60)
Ha: The proportion of purchasers that will call the manufacturer’s hotline within one
month of purchase is less than 60% or 0.60 ( po < 0.60)
To compare sample proportion and population proportion, we use the z-test for one-sample mean
proportion. The test statistic for this test is
p− p o
z=
√ p0 (1−p 0)
n
Where:
p= sample proportion
p0= population proportion
n=sample size
x
p=
n
x= number of successes
Example:
It has been claimed that less than 60% of all purchases of a certain kind of computer program will call
the manufacturer’s hotline with a month purchase. If 55 out of 100 software purchasers selected at
random call the hotline within a month of purchase, test the claim at 0.05 level of significance.
Solution:
Step 1: Formulate the null and alternative hypotheses
Ho: The proportion of purchasers that will call the manufacturer’s hotline within one
month of purchase is 60% or 0.60 ( po= 0.60)
Ha: The proportion of purchasers that will call the manufacturer’s hotline within one
month of purchase is less than 60% or 0.60 ( po < 0.60)
Step 2: Type of test:
The test is one tailed (left-tailed)
Critical value: with the use of Table A value of z at 0.05 level, one-tailed test is z=-
1.65
Rejection region:
23
Step 3: Compute the test value
Given:
po=0.60
n=100
55
p= =0.55
100
p− p o
z=
√ p0 (1−p 0)
n
0.55−0.60
z=
√
0.60(1−0.60)
z=-1.02
100
Step 4: Decision:
Accept the null hypothesis because the computed value or the test value falls outside
the rejection region.
Step 5: Conclusion:
There is no sufficient evidence to conclude that the proportion of purchasers that will
call the manufacturer’s hotline within one month of purchase is less than 60%. Thus
the claim is false or incorrect.
Example:
A recent survey done by the Philippine Housing Authority found that 35% of the population
owns their homes. In a random sample of 240 heads of households, 78 responded that they owned
their homes. At the 0.01 level of significance, that that indicate a difference from the national
proportion?
Solution:
Ho: The proportion of the population who owns their homes is 0.35. (p0=0.35)
Ha: The proportion of the population who owns their homes is not 0.35. (p0 ≠ 0.35)
p− p o
z=
√ p0 (1−p 0)
n
0.325−0.35
z=
√ 0.35(1−0.35)
240
z= -0.812
Step 4: Decision
Do not reject the null hypothesis, since the test value falls outside the critical region,
as shown in the figure above.
Step 5: Conclusion:
Since we fail to reject the null hypothesis, we can conclude that there is not enough
evidence to reject the claim that 35% of the Filipinos owned their homes
Let’s Summarize
In this section, we looked at the four steps of a hypothesis test as they relate to a claim about a
population proportion.
25
Step 2: Collect the data.
Since the hypothesis test is based on probability, random selection or assignment is essential in data
production. Additionally, we need to check whether the sample proportion can be np ≥ 10 and n(1
− p) ≥ 10.
Determine the test statistic which is the z-score for the sample proportion.
Use the test statistic, together with the alternative hypothesis to determine the P-value. You can
use a standard normal table (or Z-table) or technology (such as the simulations on the second
page of this topic) to find the P-value.
If the alternative hypothesis is greater than, the P-value is the area to the right of the test
statistic. If the alternative hypothesis is less than, the P-value is the area to the left of the test
statistic. If the alternative hypothesis is not equal to, the P-value is equal to double the tail area
beyond the test statistic.
A small P-value says the data is unlikely to occur if the null is true. If the P-value is less than or
equal to the significance level, we reject the null hypothesis and accept the alternative
hypothesis instead.
If the P-value is greater than the significance level, we say we “fail to reject” the null
hypothesis. We never say that we “accept” the null hypothesis. We just say that we don’t have
enough evidence to reject it. This is equivalent to saying we don’t have enough evidence to
support the alternative hypothesis.
We write the conclusion in the context of the research question. Our conclusion is usually a
statement about the alternative hypothesis (we accept Ha or fail to accept Ha) and should
include the P-value.
REVISED KNOWLEDGE: Actual answer to the process questions/ focus questions
Below are the key guide questions that you should remember as you perform all the activities in this
lesson. You should be able to answer them at the end of the week.
To test a hypothesis for a proportion, you need to follow the four steps of a hypothesis test as
they relate to a claim about a population proportion (determine the hypotheses, collect the data, assess
the evidence, and give the conclusion.
2. What are the four steps of a hypothesis test as they relate to a claim about a population proportion?
The four steps of a hypothesis test as they relate to a claim about a population proportion
1. Collect the data, assess the evidence
2. Give the conclusion
3. Assess the evidence,
4. Give the conclusion.
26
p− p o
z=
√ p0 (1−p 0)
n
Where
p= sample proportion
p0= population proportion
n=sample size
x
p=
n
x= number of successes
Closing Prayer
Thank you, Lord for this day, protect and guide us always Lord. Make us worthy of you’re
your blessings Lord. Amen.
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://www.youtube.com/watch?v=qC9_mohleao
Opening Prayer
Father, Lord of new beginnings, thank you for letting me wake up and breathe in the
common air, I offer you all my works and my plans, all what my hands will make and
27
do. Protect me, Lord, from all danger, as well as my loved ones who will also pursue
their own thing today. Keep me Lord, under your care. Be with me today, Lord
TOPIC: UNDERSTANDING CORRELATION ANALYSIS
Why do most students who are good in Mathematics also perform well in physics? Why does blood
pressure go with age? Why do students with high IQ have good academics performances? These
questions have something to do with relationships between two variables.
Analyze and Explore
So far, we have analyzed data involving only a single variable---for instance, the grades of students,
the weights of grocery products and the length of rods. These data are called UNIVARIATE DATA
because they involve a single variable only. In this lesson you shall analyze data involving two
variables. Data that involve two variables are called BIVARIATE DATA.
The analysis of bivariate data involves describing the relationship between two variables. The process
or procedure of describing the relationship between two variables is called CORRELATION
ANALYSIS.
What I know What I Want to Find How can I Learn What I have Learned
Out More
CONTENT DISCUSSION:
In this section we will first discuss correlation analysis, which is used to quantify the association
between two continuous variables (e.g. between an independent and a dependent variable or between
two independent variables). in correlation analysis, we estimate a sample correlation coefficient, more
specifically the Pearson Product Moment correlation coefficient.
Example:
A company with six branches provides free coffee to its employees. A manager is interested to find
out if there is a relationship between the number of cups of coffee provided and the number of
employees in the offices. The table below shows that data needed. Determine if there is a relationship
between the number of employees and the number of cups of coffee.
Notice that the points on the scatter plot do not lie on the one line. However, the points closely follow
a straight line. This line is called TREND LINE.
The relationship between two variables is described in terms of strength and direction.
In terms of direction, the relationship between two variables may be positive, negative or zero.
29
The next illutration show the different types of relationship described in terms of direction and
strength.
30
REVISED KNOWLEDGE: Actual answer to the process questions/ focus questions
31
1. What is correlation analysis?
Correlation analysis is statistical method that is used to discover if there is a relationship
between two variables/ datasets, and how strong that relationship maybe.
A correlation of r= 0.9 suggests a strong, positive association between two variables, whereas
correlation of r=-0.2 suggest a weak, negative association. A correlation close to zero suggests no
linear association between two continuous variables.
Closing Prayer
Thank You Lord for the gift of life to serve and love you Lord. Keep us away from harm.
Amen.
REFERENCES:
A. Printed:
Bataller, Ramil T., (2016), Math Activated: Engage Yourself and our world:
Statistics and Probability (Senior High School), Antonio Arnaiz cor. Chino Roces
Avenues, Makati City, Philippines, Don Bosco Press.
Canlapan, Raymond B., (2016), DIWA Senior High School Series: Statistics and
Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
Albay, E. M., (2019), DIWA Senior High School Series (Second Edition):
Statistics and Probability, 4/F SEDCCO 1 Bldg.DIWA Learning Systems, Inc.
B. Online: https://www.youtube.com/watch?v=atLZNGsTN6k
https://www.youtube.com/watch?v=BXXtkYOqAfM
Opening Prayer
God of grace, we open our hearts, minds and souls to worship you. Thank you that
today we dwell in your kingdom and live in your presence. Thank you that as we
gather together, we join with all Christians across the world to glorify your holy name.
Amen.
INTRODUCTION:
The Pearson correlation coefficient is used to measure the strength of a linear association
between two variables, where the value r =1 means a perfect positive correlation and the value r = -1
means a perfect negative correlation.
What I know What I Want to Find How can I Learn What I have Learned
Out More
33
Skills I expect to use:
CONTENT DISCUSSION:
To describe the relationship between two variables, we can compute the correlation coefficient
(r ). The correlation coefficient is a number between -1 and 1 that describes both the strength and the
direction of correlation. In symbol we write -1 ≤ r ≤ 1.
Value of r Interpretation
r=1 Perfect positive correlation
r=0 No correlation/ zero correlation
r=-1 Perfect negative correlation
Correlation Scale
Value of r Interpretation
±1.00 Perfect correlation; perfect relationship
±0.81 to ±0.99 Very high correlation
±0.61 to ±0.80 High correlation
±0.41 to ±0.60 Moderately high correlation
±0.21 to ±0.40 Low correlation
±0.01 to ±0.20 Negligible correlation
0.00 No correlation, no relationship
Notice that the correlation scales gives only strength of correlation. The direction of correlation is
determined by the sign of the correlation coefficient?
Value or r Interpretation
-0.45 Moderately high negative correlation
0.66 High positive correlation
0.83 Very high positive correlation
-0.35 low negative correlation
0.58 Moderately high positive correlation
34
To compute the correlation coefficient, we use the Pearson Product-Moment Correlation (PPMC)
coefficient. The following formula gives the Pearson Product-Moment Correlation (PPMC)
coefficient.
r=
∑ (x−x )( y− y )
√ ∑ ( x−x)2 ∑ ( y − y)2
Where X=value of the variable X
Y= value of the variable Y
X = mean of the variable X
Y = mean of the variable Y
The following examples illustrate the computation of the Pearson Product- Moment Correlation
(PPMC) coefficient.
Example:
A store manager wishes to find out whether there is a relationship between the age of the
employees and the number of sick days they incur each year. The data for the sample are shown.
Calculate the correlation coefficient ( r ) and describe the relationship in terms of strength and
direction.
Employee A B C D E F
Age (X) 18 26 39 48 53 58
Days (Y) 16 12 9 5 6 2
Step 1:
Employee X Y
A 18 16
B 26 12
C 39 9
D 48 5
E 53 6
F 58 2
∑ X=242 ∑ Y =50
X=
∑ X = 242 =40.33
n 6
Y=
∑ y = 50 =8.33
n 6
Step 2:
Subtract X from each value of X. Label this as −X . Subtract Y from each value of Y. Label this as
Y −Y
Employee X Y X −X Y −Y
A 18 16 -22.33 7.67
B 26 12 -14.33 3.67
C 39 9 -1.33 0.67
D 48 5 7.67 -3.33
E 53 6 12.67 -2.33
F 58 2 17.67 -6.33
∑ X=242 ∑ Y =50
35
Step 3
Employee X Y X −X Y −Y (X −X )
2
(Y −Y )
2
Step 4
Employee X Y X −X Y −Y ( X −X )2 (Y −Y )2 ( X −X )( Y −Y
)
A 18 16 -22.33 7.67 498.63 58.83 -171.27
B 26 12 -14.33 3.67 205.35 13.47 -52.59
C 39 9 -1.33 0.67 1.77 0.45 -0.09
D 48 5 7.67 -3.33 58.83 11.09 -25.54
E 53 6 12.67 -2.33 160.53 5.43 -29.52
F 58 2 17.67 -6.33 312.23 40.07 -111.85
∑ X=242 ∑ Y =50 ∑(X −X )2= ∑
2
∑ (X −X )(
1236.84 (Y −Y ) =129.34 Y −Y ) = -390.56
Step 5:
r=
∑ (x−x )( y− y )
√ ∑ (x−x)2 ∑ ( y − y)2
−390.56
r=
√ ∑ 1236.84∗129.34
r =−0.98
36
Step 6:
Using the correlation scale, we interpret the obtained value r = -0.98 as very high negative
correlation. This implies that there is a very high negative correlation between the age of employees
and the number of sick days. This means that older employees tend to have more number of sick
days.
Value of r Interpretation
r=1 Perfect positive correlation
r=0 No correlation/ zero correlation
r=-1 Perfect negative correlation
To compute the correlation coefficient, we use the Pearson Product-Moment Correlation (PPMC)
coefficient. The following formula gives the Pearson Product-Moment Correlation (PPMC)
coefficient.
r=
∑ (x−x )( y− y )
√ ∑ ( x−x)2 ∑ ( y − y)2
Where X=value of the variable X
Y= value of the variable Y
X = mean of the variable X
Y = mean of the variable Y
Closing Prayer
Without a dream, we reach nothing. Without love, we feel nothing, And, without God, we are
nothing. Amen.
37
38