0% found this document useful (0 votes)
19 views

Hypothesis Testing

Hypothesis testing is a statistical method used to make inferences about population data by testing assumptions through null and alternative hypotheses. It involves various tests, including Z-tests and T-tests, to analyze differences between means and relationships between variables, as well as chi-square tests for categorical data. The p-value plays a crucial role in determining the statistical significance of the results, guiding whether to reject the null hypothesis.

Uploaded by

ishmeetkaur1815
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views

Hypothesis Testing

Hypothesis testing is a statistical method used to make inferences about population data by testing assumptions through null and alternative hypotheses. It involves various tests, including Z-tests and T-tests, to analyze differences between means and relationships between variables, as well as chi-square tests for categorical data. The p-value plays a crucial role in determining the statistical significance of the results, guiding whether to reject the null hypothesis.

Uploaded by

ishmeetkaur1815
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Hypothesis Testing

Hypothesis Testing

Hypothesis testing is a tool for making statistical inferences about the population data. It
is an analysis tool that tests assumptions and determines how likely something is within
a given standard of accuracy. Hypothesis testing provides a way to verify whether the
results of an experiment are valid.

A null hypothesis and an alternative hypothesis are set up before performing the
hypothesis testing. This helps to arrive at a conclusion regarding the sample obtained
from the population. In this article, we will learn more about hypothesis testing, its
types, steps to perform the testing, and associated examples.

What is Hypothesis Testing in Statistics?

Hypothesis testing uses sample data from the population to draw useful conclusions
regarding the population probability distribution. It tests an assumption made about the
data using different types of hypothesis testing methodologies. The hypothesis testing
Hypothesis Testing
Hypothesis testing is a formal procedure for using statistics to investigate our ideas about the
world. Scientists most commonly use it to test specific predictions derived from theories,
referred to as hypotheses.

It's crucial to restate your initial research hypothesis as a Null and Alternate hypothesis so that
you can test it mathematically.

A Null Hypothesis expresses the opposite of what a researcher or experimenter predicts or


anticipates. It essentially defines the statement that no exact or actual relationship exists
between the variables. It is denoted by H0.

An Alternate Hypothesis makes a statement that suggests or advises an investigator or


researcher about a possible result or outcome, commonly denoted by H1.

Suppose you want to test if there is a relationship between gender and weight. Based on your
knowledge, you hypothesize that males are heavier than females on average.

In this case, the Null hypothesis will be that males are not heavier than females, and the
Alternate hypothesis will be that males are heavier than females.
What Is p-Value?

The p-value is known as the probability value. P-values are used in hypothesis
testing to help determine whether the null hypothesis should be rejected. The
lower the p-value, the more likely the null hypothesis will be rejected.

•If the p-value is less than 0.05, the result is statistically significant. In this case,
you reject the null hypothesis favoring the alternative hypothesis.

•If the p-value is greater than 0.05, then the result is not statistically significant
and hence doesn't reject the null hypothesis.
Definition of Z-Test
Z-test is the statistical test used to analyze whether two population means are different or not when the
variances are known, and the sample size is large.
The z-test is based on the normal distribution.
The assumptions for Z-test are:
•All observations are independent.
•The size of the sample should be more than 30.
•The Z distribution is normal when the mean is 0, and the variance is 1.
The test statistic is defined by:
Example

Let's say that the mean score of students in a class is greater than 70 with a standard deviation of 10. If a sample of 50
students was selected with a mean score of 80, calculate the Z-value to check if there is enough evidence to support
this claim at a 0.05 significance level.

Solution:

Here, the sample size is 50 and we know the standard deviation. This is a case of a right-tailed one-sample z test.

The Null hypothesis is the mean score is 70

The Alternative hypothesis is mean score is greater than 70

From the z-table, the critical value at alpha = 0.05 is 1.645

Xbar = 80

μ = 70

n = 50

σ = 10

Substituting the values in the formula, you will get the Z value to be equal to 7.09.

Since 7.09 > 1.645 thus, the null hypothesis is rejected and there is enough to support that the mean of the class is
greater than 70.
Definition Of T-Test
A T-test is a parametric test applied to identify how the average of two data sets
differs when variance is not given.

When the sample size is small, and the population standard deviation is unknown, the
T-test is used in conjunction with the t-distribution. The degree of freedom
significantly impacts the shape of a t-distribution. The number of independent
observations in a given set of observations is the degree of freedom.

Hypothesis Testing t Test


The t test is another method of hypothesis testing that is used for a small sample
size (n < 30). It is also used to compare the sample mean and population mean.
However, the population standard deviation is not known. Instead, the sample
standard deviation is known. The mean of two samples can also be compared
using the t test.
•One sample: t = ¯¯¯x−μs√nx¯−μsn.
•Two samples: t = (¯¯¯¯¯x1−¯¯¯¯¯x2)−(μ1−μ2)√s21n1+s22n2(x1¯−x2¯)−
(μ1−μ2)s12n1+s22n2.
There are the following assumptions taken for the T-Test:
•All the data points are independent.
•The sample size is very small.
•The sample size should be taken and recorded accurately.
The chi-square test is a hypothesis testing method used to analyze categorical data,
determining if observed frequencies significantly differ from expected frequencies, and
to assess relationships between variables. It's commonly used to test for independence
between two variables or to assess how well a distribution fits observed data.

1. What is a Chi-Square Test?


•Purpose:
The chi-square test is a statistical test used to analyze categorical (or nominal) data to determine if there's a
statistically significant relationship between two variables or if observed data fits a hypothesized distribution.
•How it works:
It compares observed frequencies (the actual counts in each category) with expected frequencies (the counts
you would expect if there were no relationship or if the hypothesized distribution were true).
•Key Concepts:
•Categorical Data: Data that can be sorted into categories (e.g., gender, yes/no, colors).
•Observed Frequencies: The actual counts or frequencies you observe in your data.
•Expected Frequencies: The frequencies you would expect to see if there were no relationship or if the
hypothesized distribution were true.
•Chi-Square Statistic: A value that summarizes the difference between observed and expected
frequencies, calculated using the formula: (Observed - Expected)² / Expected.
•Degrees of Freedom: A value that reflects the number of independent pieces of information used to
calculate the chi-square statistic.
•P-value: The probability of observing a chi-square statistic as extreme as, or more extreme than, the one
calculated, assuming the null hypothesis is true.
When to Use a Chi-Square Test
•Categorical Data: Use when you have categorical data (nominal or ordinal) and want to
analyze relationships or fit to a distribution.
•Large Sample Sizes: The chi-square test is generally appropriate for large sample
sizes.
•Independence: Use the test of independence to determine if two variables are related.
•Goodness-of-Fit: Use the goodness-of-fit test to determine if observed data fits a
hypothesized distribution.

You might also like