0% found this document useful (0 votes)

13 views18 pages

W8 Hypothesis Testing

Uploaded by

Thu Phương

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views18 pages

W8 Hypothesis Testing

Uploaded by

Thu Phương

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

BIG DATA

Week 8 – Hypothesis testing

July 2024
Inferential statistics-
Hypothesis testing
2
TODAY’S OBJECTIVES

We will be dealing with:

1. Null/Alternative hypotheses
2. Test statistics
3. P-values
4. Chi-squared test

3
HYPOTHESIS TESTING

Our objective is to choose between these two opposite statements about the population,
where these statements are known as hypotheses. By convention these are denoted by
𝐻0 & 𝐻1 .

Null hypothesis, 𝐻0 Alternative hypothesis, 𝐻1

A modified process does not produce a A modified process does produce a
higher yield than the standard process. higher yield than the standard process.
There is no association between smoking There is an association between smoking
and lung cancer incidence and lung cancer incidence
A person is not gifted with Extra Sensory A person is gifted with Extra Sensory
Perception. Perception.
The average level of lead in the blood in a The average level of lead in the blood
particular environment of people in a particular environment of
is as high as 3.0. people is lower than 3.0.

There is no difference in the mean of the There is a difference in the mean of the
exam score between Class A and Class B exam score between Class A and Class B
4
HYPOTHESIS TESTING : EXAMPLE

Example : A coin is tossed 100 times in order to decide whether or not it is a fair coin.
𝐻0 : Prob of getting a head = prob of getting a tail = ½.
𝐻1 : Prob of getting a head ≠ prob of getting a tail.

We observe 98 heads and 2 tails. Common sense tells us that this is (very!) strong
evidence that the coin is biased toward heads.

More formally, we reject the null hypothesis that the coin is fair as the chance of obtaining 98
heads with a fair coin is extremely low - the observation, 98 heads, is inconsistent with the coin
being fair since we would have expected (approximately) 50 heads and the observation
significantly departed from this expectation.

Choosing between competing hypotheses requires us to conduct a statistical test,

which use sample data.

5
TEST STATISTIC

In ‘classical testing’, we always assume the null hypothesis is true by performing the test conditional on
𝐻0 being true. That is, 𝐻0 is our ‘working hypothesis' which we hold to be true until we obtain significant
evidence against it.

A test statistic is the formal mechanism used to evaluate the support

given to 𝐻0 by sample data.

Note:
• Different test statistics are used for testing different forms of hypotheses.
• Observed (or measured) test statistic is a value calculated from the sample data.

6
SIGNIFICANCE LEVELS

Definition: Significance level 𝛼 % is defined by the probability when rejecting 𝐻0 when it is true (i.e. a false
positive). It is sometimes referred to as type I error.

We control for the probability of committing a Type I error by setting the value for 𝛼.

Interpretation: If we perform a test at the 5% significance level, say, then we are basing our decision on a
procedure which gives us a 5% probability of making a Type I error when 𝐻0 is true.

It is common to use significance level 1% , 5% and 10%.

7
If a test is performed at the 10% significance level, what
does it imply?
A. There is a 10% probability of rejecting a true null hypothesis
B. There is a 10% probability of accepting a false null hypothesis
What does the significance level measure?

A. Type II error
B. Beta level
C. Type I error
D. Alpha level
P-VALUES

One way to make decision in hypothesis testing is based on p-value

Definition: p-value is the probability of obtaining test result at least as extreme as observed test statistic
under the assumption that the null hypothesis is true.

10
PROCEDURE OF HYPOTHESIS TESTING (P-VALUE APPROACH)

Define the Calculate observed

Set significance
hypotheses 𝐻0 vs test statistic and p-
level
𝐻1 . value.

Decide whether or
Draw conclusions
not to reject 𝐻0

11
CHI-SQUARED 𝝌𝟐
TEST

This Photo by Unknown Author is licensed under CC BY-NC

CHI-SQUARED 𝜒 2 TEST

A chi-squared 𝜒 2 test is a statistical hypothesis test used in the analysis of contingency tables when the
sample sizes are large.

Commonly used for Categorical data

This type of test tests the null hypothesis that two factors (or attributes) are not associated, against the
alternative hypothesis that they are associated. That is,

• 𝐻0 : There is no association between Factor 1 and Factor 2. (or Factor 1 and Factor 2 are independent)
• 𝐻1 : There is an association between Factor 1 and Factor 2. (or Factor 1 and Factor 2 are dependent)
CHI-SQUARED 𝜒 2 TEST: EXAMPLE
Example: The following cross-tabulation shows data on the 3,593 people who applied to graduate study at
the University X. Dung classified the applicants according to their sex, and whether or not they were
admitted to the university.
Admitted
Sex No Yes Total
Male 1,180 686 1,866
Female 1,259 468 1,727
Total 2,439 1,154 3,593

Perform a hypothesis testing using 𝜒 2 test at 5% level of significance.

• 𝐻0 : There is no association between decision of admission and gender.

• 𝐻1 : There is an association between decision of admission and gender.
CHI-SQUARED 𝜒 2 TEST: JASP EXAMPLE

INPUT DATA
CHI-SQUARED 𝜒 2 TEST: JASP EXAMPLE

Observed test statistic

p-value < 5%

Decision: Reject 𝐇𝟎 , there is sufficient evidence to suggest there is an association between decision of admission and gender.
CHI-SQUARED 𝜒 2 TEST: EXAMPLE
Example: Linh, however, decides to take another look at the statistics. She adds one more piece of data, the
department to which each person applied, and creates cross-tabulations separately for each department
(which are labelled A, B and C).
Admitted
Department Sex No Yes Total
A Male 207 353 560
Female 8 17 25
B Male 484 258 724
Female 635 346 968
C Male 489 75 564
Female 616 105 734
Total Total 2,439 1,154 3,593

Perform a hypothesis testing using 𝜒 2 test at 5% level of significance.

• 𝐻0 : There is no association between decision of admission and gender in department A/B/C.

• 𝐻1 : There is an association between decision of admission and gender in department A/B/C. .
CHI-SQUARED 𝜒 2 TEST: JASP EXAMPLE

Observed test statistic

p-value > 5%
Decision
p-value > 5%
???
p-value > 5%

Download Data Analysis Techniques for Physical Scientists 1st Edition Claude A. Pruneau ebook All Chapters PDF
100% (1)
Download Data Analysis Techniques for Physical Scientists 1st Edition Claude A. Pruneau ebook All Chapters PDF
55 pages
Level of Significance
No ratings yet
Level of Significance
4 pages
Trend Analysis PDF
No ratings yet
Trend Analysis PDF
15 pages
Instant Access to Experimental Design with Applications in Management Engineering and the Sciences Paul D. Berger ebook Full Chapters
100% (1)
Instant Access to Experimental Design with Applications in Management Engineering and the Sciences Paul D. Berger ebook Full Chapters
55 pages
T Test Chi-Square Test
No ratings yet
T Test Chi-Square Test
26 pages
A Proof of The Black and Scholes Formula: Claudio Pacati May 30, 2012
100% (1)
A Proof of The Black and Scholes Formula: Claudio Pacati May 30, 2012
3 pages
Inferential Hypothesis Testing
100% (1)
Inferential Hypothesis Testing
108 pages
Chapter 2 Hypothesis Testing
100% (7)
Chapter 2 Hypothesis Testing
43 pages
4.2 Hypothesis Testing
No ratings yet
4.2 Hypothesis Testing
49 pages
AEE 302 Note - Unit 3
No ratings yet
AEE 302 Note - Unit 3
4 pages
3.0-Discrete Probability Distributions
No ratings yet
3.0-Discrete Probability Distributions
131 pages
Confidence, Prediction, and Tolerance Intervals: Engineering Experimental Design Valerie L. Young
No ratings yet
Confidence, Prediction, and Tolerance Intervals: Engineering Experimental Design Valerie L. Young
18 pages
MA2201 - Evaluation Criteria For Students
No ratings yet
MA2201 - Evaluation Criteria For Students
2 pages
OERprobability 2020
No ratings yet
OERprobability 2020
247 pages
Statistics Cheat Sheet
No ratings yet
Statistics Cheat Sheet
5 pages
Hypothesis Test
83% (6)
Hypothesis Test
15 pages
Statistics - Hypothesis Testing - Britannica
No ratings yet
Statistics - Hypothesis Testing - Britannica
8 pages
(9)HT_mean
No ratings yet
(9)HT_mean
46 pages
Inferential Statistics FWACP_035611
No ratings yet
Inferential Statistics FWACP_035611
54 pages
Tugas Rutin 2 PEMODELAN
No ratings yet
Tugas Rutin 2 PEMODELAN
3 pages
Chapter 3
No ratings yet
Chapter 3
72 pages
W4 - Data Collection - Sampling
No ratings yet
W4 - Data Collection - Sampling
55 pages
AL3451 - QUESTION BANK
100% (1)
AL3451 - QUESTION BANK
12 pages
Hypothesis Testing and Chi - Square
No ratings yet
Hypothesis Testing and Chi - Square
35 pages
Week 2 - Research Topic, Research Question, and Research Objectives
No ratings yet
Week 2 - Research Topic, Research Question, and Research Objectives
43 pages
Test of Significance
No ratings yet
Test of Significance
20 pages
Test of Significance
No ratings yet
Test of Significance
45 pages
Week 3 Module in Stat 4th Quarter
No ratings yet
Week 3 Module in Stat 4th Quarter
7 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
11 pages
Global Supply Chains and Transportation
No ratings yet
Global Supply Chains and Transportation
50 pages
CFA 2
No ratings yet
CFA 2
3 pages
Edu Exam P Sample Sol
No ratings yet
Edu Exam P Sample Sol
133 pages
J. K.Shah Classes Regression Analysis
No ratings yet
J. K.Shah Classes Regression Analysis
21 pages
HydroGOF Indices de Eficiencia en R
No ratings yet
HydroGOF Indices de Eficiencia en R
76 pages
Unit 4 Testing of hypothesis
No ratings yet
Unit 4 Testing of hypothesis
60 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
86 pages
23.-Scaling-Techniques
No ratings yet
23.-Scaling-Techniques
30 pages
Chi Square Lab Report
100% (2)
Chi Square Lab Report
5 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
20 pages
8. Hypothesis Testing (1)
No ratings yet
8. Hypothesis Testing (1)
62 pages
Computational Data Science - Unit 4
No ratings yet
Computational Data Science - Unit 4
18 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
10 pages
AP STAISTICS – FINAL EXAM REVISION - Grade 12
No ratings yet
AP STAISTICS – FINAL EXAM REVISION - Grade 12
28 pages
DVM (1)
No ratings yet
DVM (1)
32 pages
Ch-4 Testing of Hypothesis
No ratings yet
Ch-4 Testing of Hypothesis
32 pages
Quantitative Methods For Management: Session - 10
No ratings yet
Quantitative Methods For Management: Session - 10
95 pages
L7-Hypothesis Testing
No ratings yet
L7-Hypothesis Testing
44 pages
CH 1
No ratings yet
CH 1
7 pages
Point Estimation and Interval Estimation: Learning Objectives
No ratings yet
Point Estimation and Interval Estimation: Learning Objectives
58 pages
Test of Hypothesis For 2020
100% (1)
Test of Hypothesis For 2020
62 pages
1. Introduction to Hypotheses testing
No ratings yet
1. Introduction to Hypotheses testing
7 pages
Model Terbenar
No ratings yet
Model Terbenar
16 pages
Hypothesis_Testing (updated)
No ratings yet
Hypothesis_Testing (updated)
13 pages
Stat - Hypothesis Testing
No ratings yet
Stat - Hypothesis Testing
34 pages
BRM UNIT 4
No ratings yet
BRM UNIT 4
20 pages
What is Hypothesis Testing in Statistics Types a…
No ratings yet
What is Hypothesis Testing in Statistics Types a…
2 pages
Chapter 4 Probability - MPH
No ratings yet
Chapter 4 Probability - MPH
100 pages
6 - Stat - Discrete Probability Distributions 2024
No ratings yet
6 - Stat - Discrete Probability Distributions 2024
31 pages
DMDA Unit-5 notes (2) (1)
No ratings yet
DMDA Unit-5 notes (2) (1)
35 pages
7.Hypothesis testing and Sample size determination
No ratings yet
7.Hypothesis testing and Sample size determination
60 pages
Testing of Hypothesis_Note
No ratings yet
Testing of Hypothesis_Note
6 pages
Hypothesis Testing Notes
No ratings yet
Hypothesis Testing Notes
7 pages
Accuracy Is The Closeness of A Measured Value To The True - For Example, The Measured Density of Water Has Become More Accurate With Improved Experimental Design, Technique, and Equipment
No ratings yet
Accuracy Is The Closeness of A Measured Value To The True - For Example, The Measured Density of Water Has Become More Accurate With Improved Experimental Design, Technique, and Equipment
15 pages
Unit 3 (Hypothesis Testing)
No ratings yet
Unit 3 (Hypothesis Testing)
40 pages
Hypothesis Testing For One Population Parameter - Samples
100% (1)
Hypothesis Testing For One Population Parameter - Samples
68 pages
HYPOTHESIS TESTING AND ESTIMATION
No ratings yet
HYPOTHESIS TESTING AND ESTIMATION
7 pages
Statistics-and-Probability 4Q SLM1
No ratings yet
Statistics-and-Probability 4Q SLM1
11 pages
Unit 4 Statistical Testing and Modeling in r
No ratings yet
Unit 4 Statistical Testing and Modeling in r
25 pages
Chapter II
No ratings yet
Chapter II
19 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
10 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
8 pages
6 Inferential Statistics VI - May 12 2014
No ratings yet
6 Inferential Statistics VI - May 12 2014
40 pages
Topic 3. HYPOTHESIS AND ITS LOGIC PROCESS (2015-17)
No ratings yet
Topic 3. HYPOTHESIS AND ITS LOGIC PROCESS (2015-17)
24 pages
Chapter No. 08 Fundamental Sampling Distributions and Data Descriptions - 02 (Presentation)
No ratings yet
Chapter No. 08 Fundamental Sampling Distributions and Data Descriptions - 02 (Presentation)
91 pages
Quantitative Methods For Management: Session - 10
No ratings yet
Quantitative Methods For Management: Session - 10
95 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
21 pages
Testing in Statistics
No ratings yet
Testing in Statistics
22 pages
Business Statistics by S P Gupta 1
No ratings yet
Business Statistics by S P Gupta 1
18 pages
Stat Activity 2
No ratings yet
Stat Activity 2
4 pages
de 6 Dca 404 C
No ratings yet
de 6 Dca 404 C
31 pages
Week 1 To 3 Lectures Q A
No ratings yet
Week 1 To 3 Lectures Q A
16 pages
Raja Daniyal (0000242740) 8614 - Assignment 1
No ratings yet
Raja Daniyal (0000242740) 8614 - Assignment 1
30 pages
Lesson 7: Standard Scores (Z)
No ratings yet
Lesson 7: Standard Scores (Z)
14 pages
Mathematics Applications and Interpretation Paper 3 TZ2 HL
No ratings yet
Mathematics Applications and Interpretation Paper 3 TZ2 HL
9 pages
Chapter Five Hypothesis Testing
No ratings yet
Chapter Five Hypothesis Testing
50 pages
Hypothesis
No ratings yet
Hypothesis
11 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
54 pages
04 Hypothesis Testing IITB PDF
No ratings yet
04 Hypothesis Testing IITB PDF
33 pages
Statistical Estimation
No ratings yet
Statistical Estimation
37 pages
Statistical Test of Hypotheses
No ratings yet
Statistical Test of Hypotheses
36 pages
The Poisson Distribution
No ratings yet
The Poisson Distribution
13 pages
Statistics For Dummies
From Everand
Statistics For Dummies
Deborah J. Rumsey
4/5 (27)
Statistics Essentials For Dummies
From Everand
Statistics Essentials For Dummies
Deborah J. Rumsey
3.5/5 (25)
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet

W8 Hypothesis Testing

Uploaded by

W8 Hypothesis Testing

Uploaded by

BIG DATA

Week 8 – Hypothesis testing

We will be dealing with:

Null hypothesis, 𝐻0 Alternative hypothesis, 𝐻1

Choosing between competing hypotheses requires us to conduct a statistical test,

A test statistic is the formal mechanism used to evaluate the support

It is common to use significance level 1% , 5% and 10%.

One way to make decision in hypothesis testing is based on p-value

Define the Calculate observed

This Photo by Unknown Author is licensed under CC BY-NC

Commonly used for Categorical data

Perform a hypothesis testing using 𝜒 2 test at 5% level of significance.

• 𝐻0 : There is no association between decision of admission and gender.

Observed test statistic

Perform a hypothesis testing using 𝜒 2 test at 5% level of significance.

• 𝐻0 : There is no association between decision of admission and gender in department A/B/C.

Observed test statistic

You might also like