Lecture 3
Lecture 3
Hypothesis:
• In statistics, a hypothesis is a conclusion about population parameter which may or may
not be true.
Null Hypothesis, (H0): This is a statement of no effect, no difference, or no association. It
represents the status quo or the default assumption. (It is the hypothesis to be tested or a
statement you want to reject/accept.)
Example: The average age of a university student is 22.
H0 = μ=22 years (Null hypothesis always contains the equality!)
Alternative Hypothesis, (HA or H1): This is a statement that contradicts the null hypothesis and
represents the effect, difference, or association you want to test. (usually your own opinion)
Example: The average age of university students is not equal to 22.
H1 = μ ≠ 22 (Alternative hypothesis never contains the equality!)
For example: According to Glassdoor, Average Salary of Data Scientist = Rs 113,000.
So, H0= μ= Rs 113,000
H1 = Salary≶$113,000
Hypothesis Testing
• Hypothesis testing is a fundamental concept in statistics used to make inferences about a
population based on a sample of data.
• It involves comparing a null hypothesis (H0) and an alternative hypothesis (Ha) to
determine whether there is enough evidence to reject the null hypothesis in favor of the
alternative hypothesis. Or
• Fail to reject the null hypothesis.
• When testing, there are two decisions that can be made: to accept the null hypothesis or
to reject the null hypothesis.
Statistical Errors:
Type I and Type II Errors:
• When we make a conclusion from a statistical test there are two types of errors that we
could make.
• They are called type 1 and type 2 errors.
• Type I error: Rejecting the null hypothesis H0 when it is true is defined as a type I error.
• Type II error: Failing to reject the null hypothesis when it is false is defined as a type II
error.
• The probability of committing Type I error (False positive) is equal to the significance level
(α).
• The probability of committing Type II error (False negative) is equal to the beta (β).
Hypothesis Test for mean when Sigma is Known:
• When we know about the population variance, we can use the z-scores for testing our
hypothesis.
• After performing z statistics, we will get a numerical value.
• Based on this value, we will decide whether to accept the null hypothesis or reject it.
• So, we need to define a critical value which will act as a threshold for the test value.
• Critical Value is the cut off value between Acceptance Zone and Rejection Zone.
• We compare our test score to the critical value.
Critical Value for Left Tailed Test:
• If the test is left tailed, the critical region, with an area equal to α will be on the left side of
the mean.
• In the left-tailed test, our rejection region consists of values that are smaller than our critical
value.
• So, If the test value is less than our critical value, we will reject the null hypothesis