0% found this document useful (0 votes)

36 views

Lecture III

Cours de d'analyse de données

Uploaded by

djimenedivin

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views

Lecture III

Cours de d'analyse de données

Uploaded by

djimenedivin

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Lecture III

Fundamentals of Hypothesis
Testing: One-Sample Tests
Prof. Dr. Innocent NM.
Tel: 677540384/653754070
Email: [email protected]

1
2
In this Chapter:

• The concept of hypothesis

• Null and Alternative Hypotheses
• Steps in hypothesis testing
• Test Statistic
• P-Value
• Significance Level
• Power and Sample Size
3
The concept of hypothesis

4
Null Hypothesis: H0

The null hypothesis (denoted by H0) is a statement that the value of a

population parameter (such as proportion, mean, or standard
deviation) is equal to some claimed value
We test the null hypothesis directly. Either reject H0 or fail to reject H0 .

Alternative Hypothesis: H1
The alternative hypothesis (denoted by H1 or Ha or HA) is the
statement that the parameter has a value that somehow differs from the
null hypothesis. It is a statement of what we believe is true if our
sample data cause us to reject the null hypothesis

The symbolic form of the alternative hypothesis must use one of these
5
symbols: ≠, <, >
Critical Region, Critical Value, Test Statistic
The critical region (or rejection region) is the set of all values of the
test statistic that cause us to reject the null hypothesis. For example, see
the red-shaded region in the previous figure

Significance Level
The significance level (denoted by α) is the probability that the test
statistic will fall in the critical region when the null hypothesis is
actually true.

Critical Value
 A critical value is any value that separates the critical region (where we reject the
null hypothesis) from the values of the test statistic that do not lead to rejection of
the null hypothesis.

 The critical values depend on the nature of the null hypothesis, the sampling
distribution that applies, and the significance level α. 6
Two-tailed, Right-tailed, Left-tailed Tests

The tails in a distribution are the extreme regions bounded by critical

values.

7
Right-tailed Test

8
Left-tailed Test

9
Hypothesis Testing
The hypothesis test is used to evaluate the results
from a research study in which
1. A sample is selected from the population.
2. The treatment is administered to the sample.
3. After treatment, the individuals in the
sample are measured.

10
Hypothesis Testing (cont.)
• The purpose of the hypothesis test is to decide
between two explanations:
1. The difference between the sample and
the population can be explained by sampling
error (there does not appear to be a
treatment effect)
2. The difference between the sample and
the population is too large to be
explained by sampling error (there does
appear to be a treatment effect).

11
Steps in hypothesis testing

Testing a hypothesis about the mean of a population:

Step 1:
Data: determine variable, sample size (n), sample mean(Xbar) ,
population standard deviation or sample standard deviation if is
unknown

Step 2.
Assumptions :We have two cases:
 Case1: Population is normally or approximately normally
distributed with known or unknown variance (sample size n may
be small or large),
 Case 2: Population is not normal with known or unknown variance
(n is large i.e. n≥30).
12
… Steps in hypothesis testing

Step 3
For a given level of significance, α.
State the null hypothesis, H0
 always states that the treatment has no effect (no
change, no difference). According to the null
hypothesis, the population mean after treatment is the
same is it was before treatment.
 The α level establishes a criterion, or "cut-off", for
making a decision about the null hypothesis.

13
Step 4

Locate the critical region. The critical region

consists of outcomes that are very unlikely to
occur if the null hypothesis is true. That is, the
critical region is defined by sample means that are
almost impossible to obtain if the treatment has no
effect. The phrase “almost impossible” means
that these samples have a probability (p) that is
less than the alpha level.

14
Step 5

Compute the test statistic. The test statistic

(in this chapter a z-score) forms a ratio
comparing the obtained difference between the
sample mean and the hypothesized population
mean versus the amount of difference we
would expect without any treatment effect (the
standard error).

16
P-Values in Hypothesis Testing

p-value <α, reject H0 17

Step 6
 A large value for the test statistic shows that the obtained mean
difference is more than would be expected if there is no
treatment effect.

 If it is large enough to be in the critical region, we conclude

that the difference is significant or that the treatment has a
significant effect. In this case we reject the null hypothesis.
If the mean difference is relatively small, then the test statistic
will have a low value.

 In this case, we conclude that the evidence from the sample is

not sufficient, and the decision is fail to reject the null
hypothesis.
18
Step 7
Decision Criterion
Traditional method:
 Reject H0 if the test statistic falls within the critical region.
 Fail to reject H0 if the test statistic does not fall within the critical region

An Alternative Decision Rule using the p - value

 If the p-value is less than or equal to α ,we reject the null hypothesis (p ≤ α)
 If the p-value is greater than α ,we do not reject the null hypothesis (p > α)

Confidence Intervals:
Because a confidence interval estimate of a population parameter contains the likely
values of that parameter, reject a claim that the population parameter has a value
that is not included in the confidence interval

19
…P-Value
The P-value (or p-value or probability value) is the probability of
getting a value of the test statistic that is at least as extreme as the one
representing the sample data, assuming that the null hypothesis is true.
The null hypothesis is rejected if the P-value is very small, such as 0.05
or less

20
…P-Value

21
…P-Value

22
…P-Value

Example:
1. There, we had Ho: μ = 300 vs Ha: μ > 300 and Z = 1.18 What is the p-value for
this result?
2. Repeat (1) if μ ≠ 300 and Z = 1.18

23
…P-value
• The P-value answer the question: What is the
probability of the observed test statistic or one more
extreme when H0 is true?

For Ha: μ > μ0  P = Pr(Z > zstat) = right-tail beyond zstat

For Ha: μ < μ0  P = Pr(Z < zstat) = left tail beyond zstat
For Ha: μ ¹ μ0  P = 2 × one-tailed P-value

24
Interpretation
Conventions*
P > 0.10  non-significant evidence against H0
0.05 < P  0.10  marginally significant evidence
0.01 < P  0.05  significant evidence against H0
P  0.01  highly significant evidence against H0

Examples
P =.27  non-significant evidence against H0
P =.01  highly significant evidence against H0
* It is unwise to draw firm borders for “significance”
25
Summary

26
The case where sigma is unknown

27
Illustrative Example: “Body Weight”

• The problem: In the 1970s, 20–29 year old men in the U.S.
had a mean μ body weight of 170 pounds. Standard deviation
σ was 40 pounds. We test whether mean body weight in the
population now differs.

28
Inference on the Mean of a Population,
Variance Known

Assumptions

29
Inference on the Mean of a Population,
Variance Known
Hypothesis Testing on the Mean

We wish to test:

The test statistic is:

30
Inference on the Mean of a Population,
Variance Known
Hypothesis Testing on the Mean

Reject H0 if the observed value of the test statistic

z0 is either:
or
Fail to reject H0 if

31
Inference on the Mean of a Population,
Variance Known
Hypothesis Testing on the Mean

32
Inference on the Mean of a Population,
Variance Known
Hypothesis Testing on the Mean

33
Note about Forming Your Own Claims (Hypotheses)

If you are conducting a study and want to use a hypothesis test to

support your claim, the claim must be worded so that it becomes the
alternative hypothesis

Test Statistic
a value used in making a decision about the null hypothesis, and is
found by converting the sample statistic to a score with the assumption
that the null hypothesis is true

34
… Test Statistic

35
Example:
A survey of n = 880 randomly selected adult drivers showed that 56%
(or p = 0.56) of those respondents admitted to running red lights. Find
the value of the test statistic for the claim that the majority of all adult
drivers admit to running

Solution

H0: p = 0.5

H1: p > 0.5.

36
z statistic
• For the illustrative example, μ0 = 170
• We know σ = 40
• Take an SRS of n = 64. Therefore
 40
SE x   5
n 64
• If we found a sample mean of 173, then
x   0 173  170
zstat    0.60
SE x 5

37
P(Z>3.56) = 1 – 0.9999 = 0.0001.
38
39
Homework

1. Researchers are interested in the mean age of a certain population.

A random sample of 10 individuals drawn from the population of
interest has a mean of 27.

 Assuming that the population is approximately normally distributed

with variance 20,can we conclude that the mean is different from 30
years ? (α=0.05) .
 If the p - value is 0.0340 how can we use it in making a decision?

40
2. An inventor has developed a new, energy-efficient lawn mower engine. He claims
that the engine will run continuously for more than 5 hours (300 minutes) on a single
gallon of regular gasoline. (The leading brand lawnmower engine runs for 300
minutes on 1 gallon of gasoline.).
From his stock of engines, the inventor selects a simple random sample of 50
engines for testing. The engines run for an average of 305 minutes. The true standard
deviation σ is known and is equal to 30 minutes, and the run times of the engines are
normally distributed. Test hypothesis that the mean run time is more than 300
minutes. Use a 0.05 level of significance.

3. Example: The Brinell scale is a measure of how hard a material is. An engineer
hypothesizes that the mean Brinell score of all subcritically annealed ductile iron
pieces is not equal to 170. The engineer measured the Brinell score of 25 pieces of
this type of iron and calculated the sample mean to be 174.52 and the sample standard
deviation to be 10.31. Perform a hypothesis test that the true average Brinell score is
not equal to 170, as well as the corresponding confidence interval. Set alpha = 0.01

41
Errors in Hypothesis Tests
• Just because the sample mean (following treatment) is
different from the original population mean does not
necessarily indicate that the treatment has caused a change.

• You should recall that there usually is some discrepancy

between a sample mean and the population mean simply as a
result of sampling error.

42
Errors in Hypothesis Tests (cont.)

• Because the hypothesis test relies on sample data, and

because sample data are not completely reliable, there
is always the risk that misleading data will cause the
hypothesis test to reach a wrong conclusion.
• Two types of error are possible.

43
Type I Errors

• A Type I error occurs when the sample data appear to show a treatment effect
when, in fact, there is none.

• In this case the researcher will reject the null hypothesis and falsely conclude that
the treatment has an effect.

• Type I errors are caused by unusual, unrepresentative samples. Just by chance the
researcher selects an extreme sample with the result that the sample falls in the
critical region even though the treatment has no effect.

• The hypothesis test is structured so that Type I errors are very unlikely;
specifically, the probability of a Type I error is equal to the alpha level.

44
Type II Errors
• A Type II error occurs when the sample does not appear to
have been affected by the treatment when, in fact, the
treatment does have an effect.

• In this case, the researcher will fail to reject the null hypothesis
and falsely conclude that the treatment does not have an effect.

• Type II errors are commonly the result of a very small

treatment effect. Although the treatment does have an effect,
it is not large enough to show up in the research study.

45
Measuring Effect Size
• A hypothesis test evaluates the statistical
significance of the results from a research study.
• That is, the test determines whether or not it is
likely that the obtained sample mean occurred
without any contribution from a treatment effect.
• The hypothesis test is influenced not only by the
size of the treatment effect but also by the size
of the sample.
• Thus, even a very small effect can be significant
if it is observed in a very large sample.
46
Measuring Effect Size
• Because a significant effect does not necessarily
mean a large effect, it is recommended that the
hypothesis test be accompanied by a measure
of the effect size.
• We use Cohen=s d as a standardized measure
of effect size.
• Much like a z-score, Cohen=s d measures the
size of the mean difference in terms of the
standard deviation.

47
Power
• β ≡ probability of a Type II error
β = Pr(retain H0 | H0 false)
(the “|” is read as “given”)

• 1 – β  “Power” ≡ probability of avoiding a

Type II error
1– β = Pr(reject H0 | H0 false)
Power of a z test
 | 0   a | n 
1      z1   


2  
where
• Φ(z) represent the cumulative probability
of Standard Normal Z
• μ0 represent the population mean under
the null hypothesis
• μa represents the population mean under
the alternative hypothesis
Calculating Power: Example
A study of n = 16 retains H0: μ = 170 at α = 0.05
(two-sided); σ is 40. What was the power of test’s
conditions to identify a population mean of 190?

 |    | n 
1      z1   0 a 
  
 
2

 | 170  190 | 16 
   1.96  
 40 
 
  0.04 
 0.5160 50
Sample Size Requirements
Sample size for one-sample z test:

n
2

 z1   z1 
2

2


2
where
1 – β ≡ desired power
α ≡ desired significance level (two-sided)
σ ≡ population standard deviation
Δ = μ0 – μa ≡ the difference worth detecting
51
Example: Sample Size
Requirement
How large a sample is needed for a one-sample z
test with 90% power and α = 0.05 (two-tailed)
when σ = 40? Let H0: μ = 170 and Ha: μ = 190
(thus, Δ = μ0 − μa = 170 – 190 = −20)

n
2

 z1   z1 
2

2


40 (1.28  1.96)
2 2
 41.99
2
 20 2

Round up to 42 to ensure adequate power.

Green Finance & Stock Performance
No ratings yet
Green Finance & Stock Performance
28 pages
Statistics For Dummies
From Everand
Statistics For Dummies
Deborah J. Rumsey
4/5 (27)
Analyzing and Interpreting Quantitative Data
No ratings yet
Analyzing and Interpreting Quantitative Data
19 pages
Hypothesis Testing - New
No ratings yet
Hypothesis Testing - New
43 pages
Session 11 Handouts
No ratings yet
Session 11 Handouts
37 pages
Hypothesis Test
No ratings yet
Hypothesis Test
57 pages
Testing of Hypothesis Stastistics
No ratings yet
Testing of Hypothesis Stastistics
93 pages
Statistical Estimation
No ratings yet
Statistical Estimation
37 pages
Buss. Stat - Chapter 3 Hypothesis Testing 2
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing 2
26 pages
Buss. Stat - Chapter 3 Hypothesis Testing 2
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing 2
26 pages
CH 9 (Notes)
100% (2)
CH 9 (Notes)
23 pages
Huypothesis Testing Final Notes 2020 - 2021
No ratings yet
Huypothesis Testing Final Notes 2020 - 2021
33 pages
Module-7A
No ratings yet
Module-7A
9 pages
Lecture 3 of Computational Statistics
No ratings yet
Lecture 3 of Computational Statistics
32 pages
Statbis Inferential Statistics
No ratings yet
Statbis Inferential Statistics
27 pages
Statbis Inferential Statistics
No ratings yet
Statbis Inferential Statistics
27 pages
T test
No ratings yet
T test
29 pages
Testing of Hypothesis Hypothesis
No ratings yet
Testing of Hypothesis Hypothesis
32 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
60 pages
Stat new summary
No ratings yet
Stat new summary
4 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
45 pages
Testing of Hypothesis
No ratings yet
Testing of Hypothesis
34 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
38 pages
Buss. Stat - Chapter 3 Hypothesis Testing
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing
10 pages
Buss. Stat - Chapter 3 Hypothesis Testing
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing
10 pages
Hypothesis Testing For One Population Parameter - Samples
100% (1)
Hypothesis Testing For One Population Parameter - Samples
68 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
69 pages
Testing of Hypothesis
No ratings yet
Testing of Hypothesis
34 pages
Hypothesis-power-analysis
No ratings yet
Hypothesis-power-analysis
37 pages
Lecture 7 Introduction to testing of hypothesis
No ratings yet
Lecture 7 Introduction to testing of hypothesis
16 pages
Basic Stat
No ratings yet
Basic Stat
35 pages
Statistical Test of Hypotheses
No ratings yet
Statistical Test of Hypotheses
36 pages
Unit 4-2 Testing of Hypothesis
No ratings yet
Unit 4-2 Testing of Hypothesis
34 pages
Hypothesis
No ratings yet
Hypothesis
11 pages
6 RM - Basics of Testing of Hypothesis
No ratings yet
6 RM - Basics of Testing of Hypothesis
33 pages
Lecture 09
No ratings yet
Lecture 09
48 pages
2 Haypothesis Test
No ratings yet
2 Haypothesis Test
75 pages
Sonek Inferential Analysis
No ratings yet
Sonek Inferential Analysis
25 pages
10 InferentialStatistics
No ratings yet
10 InferentialStatistics
124 pages
AP ECON 2500 Session 8
No ratings yet
AP ECON 2500 Session 8
32 pages
Unit 3 (Hypothesis Testing)
No ratings yet
Unit 3 (Hypothesis Testing)
40 pages
Hypothesis Testing I
No ratings yet
Hypothesis Testing I
26 pages
Test of Significance
No ratings yet
Test of Significance
45 pages
Hypothesis Testing: Null & Alternative Hypotheses Level of Significance
No ratings yet
Hypothesis Testing: Null & Alternative Hypotheses Level of Significance
31 pages
Lecture Notes ON Parametric & Non-Parametric Tests FOR Social Scientists/ Participants OF Research Metodology Workshop Bbau, Lucknow
No ratings yet
Lecture Notes ON Parametric & Non-Parametric Tests FOR Social Scientists/ Participants OF Research Metodology Workshop Bbau, Lucknow
19 pages
ES 209 Lecture Notes Week 14
No ratings yet
ES 209 Lecture Notes Week 14
22 pages
Testing Hypothesis One Sample
No ratings yet
Testing Hypothesis One Sample
7 pages
ASHRM_P9
No ratings yet
ASHRM_P9
51 pages
Hypothesis Testing (1)
No ratings yet
Hypothesis Testing (1)
7 pages
Testing of Hypothesis
67% (3)
Testing of Hypothesis
37 pages
de 6 Dca 404 C
No ratings yet
de 6 Dca 404 C
31 pages
Biostateunit 3
No ratings yet
Biostateunit 3
102 pages
Hypothesis Testing For The Mean
No ratings yet
Hypothesis Testing For The Mean
10 pages
Introduction To Hypotheses Testing
No ratings yet
Introduction To Hypotheses Testing
31 pages
What Is Hypothesis Testing
100% (1)
What Is Hypothesis Testing
32 pages
Developing Null and Alternative Hypotheses Type I and Type II Errors Population Mean: Known Population Mean: Unknown Population Proportion
No ratings yet
Developing Null and Alternative Hypotheses Type I and Type II Errors Population Mean: Known Population Mean: Unknown Population Proportion
48 pages
Module 7 - MAMW100 Hypothesis Testing New
No ratings yet
Module 7 - MAMW100 Hypothesis Testing New
6 pages
Tests of Significance
No ratings yet
Tests of Significance
13 pages
Testing of hypothesis Lec 1
No ratings yet
Testing of hypothesis Lec 1
20 pages
PSNM - Ch. 3
No ratings yet
PSNM - Ch. 3
32 pages
7 Step
No ratings yet
7 Step
5 pages
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Statistical Hypothesis
No ratings yet
Statistical Hypothesis
70 pages
Stats-Proj Group 2
0% (1)
Stats-Proj Group 2
53 pages
Z-Test of Group 10 (042024)
No ratings yet
Z-Test of Group 10 (042024)
34 pages
FBR & IT Applications: Compiled and Presented by DR - Deepak Joshi For Academic Use Only
No ratings yet
FBR & IT Applications: Compiled and Presented by DR - Deepak Joshi For Academic Use Only
77 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
105 pages
1-Z-Test For Mean PDF
0% (1)
1-Z-Test For Mean PDF
20 pages
Core - 11 - Statistics-and-Probability - q4 - CLAS1 - Hypothesis - Testing - v1.2 - JOSEPH AURELLO
No ratings yet
Core - 11 - Statistics-and-Probability - q4 - CLAS1 - Hypothesis - Testing - v1.2 - JOSEPH AURELLO
12 pages
Chapter 4 5 Usefulness of AIS For Effective OP
No ratings yet
Chapter 4 5 Usefulness of AIS For Effective OP
22 pages
Sta - Chap 11
No ratings yet
Sta - Chap 11
5 pages
Introduction To Longitudinal Analysis Using SPSS - 2012
No ratings yet
Introduction To Longitudinal Analysis Using SPSS - 2012
66 pages
Q4 CLAS2 Statistics-And-Probability The Parameter
100% (1)
Q4 CLAS2 Statistics-And-Probability The Parameter
19 pages
Hypothesis Testing
100% (3)
Hypothesis Testing
23 pages
"The Importance of Electricity Consumption in Economic Growth: The Example of African Nations" by Slim Mahfoudh and Mohamed Ben Amar
No ratings yet
"The Importance of Electricity Consumption in Economic Growth: The Example of African Nations" by Slim Mahfoudh and Mohamed Ben Amar
13 pages
Lipa City National High School
No ratings yet
Lipa City National High School
30 pages
PDF (Ebook PDF) Social Statistics For A Diverse Society 7th Edition Download
100% (9)
PDF (Ebook PDF) Social Statistics For A Diverse Society 7th Edition Download
51 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
59 pages
Matching Hypothesis Coursework
100% (2)
Matching Hypothesis Coursework
8 pages
Instant download Essentials of Modern Business Statistics with Microsoft Excel 8th Edition David R. Anderson pdf all chapter
100% (2)
Instant download Essentials of Modern Business Statistics with Microsoft Excel 8th Edition David R. Anderson pdf all chapter
41 pages
A Study On Customer Satisfaction Towards Coco-Cola Company, Chennai
100% (1)
A Study On Customer Satisfaction Towards Coco-Cola Company, Chennai
5 pages
Statistics and Probablity
No ratings yet
Statistics and Probablity
20 pages
2020-2021 EDA 101 Lectures (Sampling Distribution and Point Estimates - Test of Hypothesis For Single Population)
No ratings yet
2020-2021 EDA 101 Lectures (Sampling Distribution and Point Estimates - Test of Hypothesis For Single Population)
55 pages
Carver - The Case Against Statistical Significance Testing
No ratings yet
Carver - The Case Against Statistical Significance Testing
18 pages
Data Analysis for the Social Sciences Integrating Theory and Practice 1st Edition Douglas Bors - The ebook with rich content is ready for you to download
100% (1)
Data Analysis for the Social Sciences Integrating Theory and Practice 1st Edition Douglas Bors - The ebook with rich content is ready for you to download
78 pages
Biostatistics and Orthodontics
50% (2)
Biostatistics and Orthodontics
72 pages
Hypothesis Test - 1
No ratings yet
Hypothesis Test - 1
27 pages
AICcmodavg
No ratings yet
AICcmodavg
22 pages
Sampling PDF
No ratings yet
Sampling PDF
117 pages
Chapter 13: Introduction To Analysis of Variance
No ratings yet
Chapter 13: Introduction To Analysis of Variance
26 pages

Lecture III

Uploaded by

Lecture III

Uploaded by

Lecture III

• The concept of hypothesis

The null hypothesis (denoted by H0) is a statement that the value of a

The tails in a distribution are the extreme regions bounded by critical

Testing a hypothesis about the mean of a population:

Locate the critical region. The critical region

Compute the test statistic. The test statistic

p-value <α, reject H0 17

 If it is large enough to be in the critical region, we conclude

 In this case, we conclude that the evidence from the sample is

An Alternative Decision Rule using the p - value

For Ha: μ > μ0  P = Pr(Z > zstat) = right-tail beyond zstat

The test statistic is:

Reject H0 if the observed value of the test statistic

If you are conducting a study and want to use a hypothesis test to

H1: p > 0.5.

1. Researchers are interested in the mean age of a certain population.

 Assuming that the population is approximately normally distributed

• You should recall that there usually is some discrepancy

• Because the hypothesis test relies on sample data, and

• Type II errors are commonly the result of a very small

• 1 – β  “Power” ≡ probability of avoiding a

Round up to 42 to ensure adequate power.

You might also like