Statisticaltests
Statisticaltests
net/publication/353828687
CITATIONS READS
0 7,275
2 authors:
All content following this page was uploaded by Huma Parveen on 11 August 2021.
Pre-requisites
Objectives
Keywords t-test; z-test; Anova; chi square test
1. Introduction
Mathematical science dealing with the collection, analysis, interpretation, and
presentation of numerical data in order to draw appropriate conclusionsis called
statistics (Trajkovski 2016).It is an independent branch and its use is highly prevalent
in all the fields of knowledge. Methods such as parametric and non-parametric tests
are used in statistics. Statistics is used both in scientific and non-scientific way to
make appropriate decisions and conclusions based on the data.
Statistics which are based on the normal distribution of the data are called parametric
statistics. T, z, and f are examples of parametric statisticaltests. Those statistical
tests, which are not based on normal distribution of the data, are called non-
parametric statistics or distribution free tests.Chi-square test,Spearman’s rank
correlation coefficient, Mann-Whitney-U test, Kruskal-Wallis analysis of variance etc
are some of the examples of non parametric tests. Furthermore, parametric tests are
more powerful statistical tests than non-parametric tests. It is always recommended
to use parametric statistical tests than non-parametric test. However, non-parametric
statistical tests are only used when the assumptions of the parametric tests are not
met or fulfilled.
Trajkovski, (2016) in his research paper mentioned that wrong statistical tests are
used by the researchers. He reported that most of the researchers use parametric
test which does not fulfill the assumptions of parametric tests and vice versa. Now
days, the availability of the different types of statistical software’s like SPSS, AMOS,
MATLAB, R software, etc. makes performing of the statistical test easy but selection
of the appropriate statistical test is still a problem. A systematic step-by-step
approach is the best way to decide how to analyze the data. Hence it is
recommended to follow the below mentioned steps before analyzing the data.
3. T-test
A statistical test that offers an opportunity to compare between two group means is
called t-test. For example, if we want to compare between male and femaleon health
issues or if we want to compare between rural and urban people on the same, t-test
is the appropriate statistical test to measure the difference between these
groups.Other statistical tests include an z-test, chi-square test and analysis of
variance. T-test is used when we have smaller group of data while as z-test is used
when we greater group of data (>30).It verifies, if the difference between two means
is larger than would be expected by chance. Common types of t-tests which are
frequently used are dependent sample t-test and independent sample t-test.
Independent sample t-test is used when we have to make a comparison between two
sample meanswho’s means are not dependent on each other. Unlike dependent
sample t-test (where the participants are meaningfully related with each other), two
separate groups of participants participate in the study. One of the commonly used t-
test is the independent sample t-test, where each groups are completely independent
of each other. Comparision between male and female, Urban & rural areas is the
simple examples of independent sample t-test. If we want to make a comparision
between male and females on mental health, an independent sample t-test is the
appropriate statistics. In this test, the sample of men should not be related to the
smaple of females, and the should not be any kind of overlape between the two
groups, i.e the groups should be independent of each other.
4. Z-test
Like t-test, z-test is a statistical test that offers an opportunity between the two groups
but unlike t-test, where the variance is unknown and sample size is small, in z-test,
there is known variance and sample sizelarger.For example, if we want to compare
between male and female on mental health, z-test is the appropriate statistical test to
measure the difference between thegroups. The condition is that, the data should be
greater than thirty and normally distributed, and the standard deviation should be
known. Besides that while conducting a z-test, the null and alternative hypotheses,
alpha and z-score should be stated.
While as the t-test and z-tests are used when we have to compare the differences
between two groups. The problem arises when we have to compare more than two
groups. In such a situation when we to compare among three or more than three
groups, Analysis of variance (ANOVA) is an appropriate statistics. It is nothing, but
an extension of t-test and z-test. It is better to say the technique as analysis of
means rather than variance as inference about the means are made by analyzing the
variances. This test is used to test general rather than specific differences among
means. While performing analysis of variance, we get two variances, between group
variance and within group variance. Difference between the means is called between
group variance and difference within the means is called within group variance.
If the difference between the two group variances (i.e. between and within groups) is
significant i.e., between group have large variance as compared to within group, we
reject the null hypothesis and conclude that our experimental manipulation had a real
effect. If the difference between the variance is not significant, we accept the null
hypothesis and conclude that experimental manipulation didn’t have real effect.
Unlike t-test, where we get the t value, In analysis of variance, we get the Fratio. The
F ratio is computed by dividing the between groups variance estimate by the within
group variance.
There are different types of ANOVAs such as (one way ANOVA randomized, one
way ANOVA repeated, two way ANOVA randomized, two way ANOVA repeated and
factorial ANOVA)
A one way ANOVA is used whenever we have one independent variable with three
or more than three levels. If separate groups of subjects participate in each
condition/level, in a between subject design, a randomizedANOVA is the appropriate
test. However if the same subjects have participated in each condition, in a within
subject design, a repeated measure ANOVA is the appropriate test. While as a two
way ANOVA is used whenever two independent variables are manipulated and all
the combinations of levels of each of the two variables are used.
When we have one continuous dependent and two or more categorical independent
variables, the factorial anova is the appropriate statistics. Besides mean difference,
factorial anova provides the main as well as the interaction effect.
For example, I want to know the difference between the boys and girls on academic
performance in school. Further, I also want to know the difference between the
children’s coming from rural and urban areas on academic performance. In this
example, the academic performance is the dependent variable, and gender of the
children and area of location (rural/urban) are two independent variables. This is
known as 2 (gender)× 2 (location) factorial analyses.
Z-test is used to compare the mean of two groups, with large sample size whether
population standard deviation is known or not. While as t-test is used to compare
between the two groups when the population standard deviation is not known. An F-
test is an extension of t-test and z-test and is used to compare mean score of more
than two groups and population variance of any sample size.
7. Chi-Square Test
Test which is used to determine the difference between expected frequencies and
observed frequencies in one or more than one categories is called chi-square test. It
is one of the most commonly used non-parametric tests used by social science
researchers.The Chi square test is a statistical testwhich measures the association
between twocategorical variables (Ugona & Walker 1995).
It is a computationally simple statistical test which is used to examine independence
across two categorical variables.
a. Goodness of fit
A statistical model which is used to describes how well it fits the observations. It is
used to compare the observed values with the expected value. In this test, the data is
first divided into intervals and then points that fall in the intervals are compared with
the expected points in each interval.
b. Measure of Independence
One of the most useful statistics for testing the hypothesis, when the data are
nominal is the chi-square test of independence. It is used to determine the
significance of association between the two variables (categorical variables) when
the sample size is large. This test is used when we have to compare two nominal or
categorical variables and we want to know the difference in terms of proportion
between the variables. For example, chi-square test of independence is used to
determine whether gender (Male, Female) is related to voting preference (Congress,
BJP, Independent etc.).
8. Summary
In this paper, we have described the basics of a parametric tests (t-test, z-test and
ANOVA) and non parametric tests (chi square test). We hope that information
provided has clarified the difference between parametric and non parametric
statistics. In addition, we hope that information provided has clarified the difference
between independent sample t-test and dependent sample t-test; one way ANOVA,
two ways ANOVA, factorial ANOVA; and chi-square test. The paper finally describes
the assumptions of all the above mentioned statistical tests.
References
1. Hulsizer, M.R., & Woolf, M. L. (2009). A Guide to Teaching Statistics:
Innovations and Best Practices, United Kindom:A John Wiley & Sons, Ltd.
2. Kothari, C. R. (2007). Quantitative techniques. New Delhi, UBS Publishers