Hypothesis Test Errors

• Hypothesis testing is performed with the objective of obtaining a p-value in order to quantify the evidence against the null hypothesis H0.
• H0 is rejected in favor of the alternative, HA, if the p-value is less than a predefined significance level α, which is conventionally 0.05 or 0.01.
• As touched upon, this approach is justifiably criticized since the choice of α is essentially arbitrary:
• a decision to reject or retain H0 can change depending solely upon the α value, as the short sketch after this list illustrates.
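A minimal sketch of this sensitivity in R, using an illustrative p-value chosen purely for demonstration:

# The same p-value gives different decisions under different alpha levels
p <- 0.03
p < 0.05   # TRUE: reject H0 at alpha = 0.05
p < 0.01   # FALSE: retain H0 at alpha = 0.01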
Type I Error
• A Type I error, also known as a false positive or alpha error, occurs when a
null hypothesis is rejected when it is actually true.
• In statistical hypothesis testing, the null hypothesis (H0) often represents a
baseline assumption, such as no effect or no difference between groups.
• The probability of committing a Type I error, i.e. of incorrectly rejecting H0 when it is true, is given by the significance level (alpha).
• In R programming, you can calculate the Type I error rate (also known as
the alpha level or false positive rate) by simulating data and comparing the
proportion of false positives to the total number of tests conducted.
# Set the parameters
alpha <- 0.05
sample_size <- 30
num_simulations <- 10000

# Set the seed for reproducibility
set.seed(123)

# Initialize the counter for false positives
false_positives <- 0

# Perform the simulations
for (i in 1:num_simulations) {
  # Generate two samples from the same normal
  # distribution (null hypothesis is true)
  sample1 <- rnorm(sample_size, mean = 0, sd = 1)
  sample2 <- rnorm(sample_size, mean = 0, sd = 1)
  # Conduct a t-test
  test_result <- t.test(sample1, sample2)
  # Check if the p-value is less than the alpha level
  if (test_result$p.value < alpha) {
    false_positives <- false_positives + 1
  }
}

# Calculate the Type I error rate
type1_error_rate <- false_positives / num_simulations

# Print the Type I error rate
cat("Type I Error Rate:", type1_error_rate)
Output:
> cat("Type I Error Rate:", type1_error_rate)
Type I Error Rate: 0.0481
• In this example, we run 10,000 simulations in which we draw two samples from the same normal distribution and conduct a t-test for each pair of samples.
• We count the number of times we reject the null hypothesis when it is true (false positives) and divide it by the total number of simulations to estimate the Type I error rate.
Bonferroni Correction
• When several hypothesis tests are conducted, you can curb the
multiple testing problem with respect to committing a Type I error by
using the Bonferroni correction.
• The Bonferroni correction suggests that when performing a total of N
independent hypothesis tests, each under a significance level of α,
you should instead use αB = α/N for any interpretation of statistical
significance.
• The Bonferroni and other corrective measures were developed in an attempt to formalize remedies against committing a Type I error across multiple tests; a short sketch of the correction follows.
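A minimal sketch of the correction in R, using an illustrative vector of p-values (the values are made up for demonstration):

# Illustrative p-values from N = 4 independent tests
p_values <- c(0.001, 0.020, 0.045, 0.300)
alpha <- 0.05
N <- length(p_values)
alpha_B <- alpha / N          # Bonferroni-adjusted significance level
p_values < alpha_B            # which tests remain significant under alpha_B
# Equivalently, inflate the p-values and compare against the original alpha:
p.adjust(p_values, method = "bonferroni")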
Type II error
• Type II error, also known as a false negative, occurs when you fail to
reject the null hypothesis when it’s actually false.
• In hypothesis testing, this error is denoted as β (beta).
• To calculate Type II error in R, you need to know the effect size
(difference between the null and alternative hypotheses), sample size,
standard deviation, and the desired significance level (alpha).
# Load the pwr package for the power calculation
library(pwr)

effect_size <- 0.5   # The difference between null and alternative hypotheses
sample_size <- 100   # The number of observations in each group
sd <- 15             # The standard deviation
alpha <- 0.05        # The significance level

# Calculate the power of the test
pwr_result <- pwr.t.test(n = sample_size,
                         d = effect_size / sd,  # standardized effect size (Cohen's d)
                         sig.level = alpha,
                         type = "two.sample",
                         alternative = "two.sided")

# Type II error is 1 minus the power
type_II_error <- 1 - pwr_result$power

# Print Type II Error
print(type_II_error)
In this example, we are using the pwr package to calculate the power of the test, and then subtracting it from 1 to obtain the Type II error (β). Remember to adapt the parameters to your specific problem.

Output:
> print(type_II_error)
[1] 0.9436737
Analysis of Variance
• ANOVA is a statistical test for estimating how a quantitative dependent variable changes according to the levels of one or more categorical independent variables.
• ANOVA tests whether there is a difference in the means of the groups at each level of the independent variable.
• The null hypothesis (H0) of the ANOVA is that there is no difference in the group means, and the alternative hypothesis (Ha) is that the means differ from one another.
Types of ANOVA test
• One-way ANOVA: a one-way ANOVA is employed when there is a single categorical independent variable (also known as a factor) and a single continuous dependent variable.
• It seeks to ascertain whether there are any notable variations in the dependent variable’s means across the levels of the independent variable.

• Two-way ANOVA: when there are two categorical independent variables (factors) and one continuous dependent variable, a two-way ANOVA is used as an extension of one-way ANOVA.
• It lets you evaluate both the direct impact of each independent variable on the dependent variable and how the two factors interact with one another.
The Dataset (crop.data)

• Here ‘density’, ‘block’, and ‘fertilizer’ are listed as categorical variables with the number of observations at each level (i.e. 48 observations at density 1 and 48 observations at density 2).
• ‘Yield’ should be a quantitative variable with a numeric summary (minimum, median, mean, maximum); a sketch of reading the data in this form follows.
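A minimal sketch of loading and checking the data; the file name crop.data.csv and the column order are assumptions, not given in the original:

# Assumed file name and column order; adjust to your copy of the data
crop.data <- read.csv("crop.data.csv",
                      colClasses = c("factor", "factor", "factor", "numeric"))
# density, block, and fertilizer are read as factors; yield stays numeric
summary(crop.data)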
Performing ANOVA
one.way <- aov(yield ~ fertilizer, data = crop.data)
summary(one.way)
The model summary first lists the independent variables being tested in the model (in this case we have only
one, ‘fertilizer’) and the model residuals (‘Residual’).
All of the variation that is not explained by the independent variables is called residual variance.
• The rest of the values in the output table describe the independent variable and the residuals:
• The Df column displays the degrees of freedom for the independent variable (the number of levels in the variable minus 1), and the degrees of freedom for the residuals (the total number of observations minus the number of group means being estimated, i.e. N − k for a one-way ANOVA with k levels).
• The Sum Sq column displays the sum of squares (also known as the total variation between
the group means and the overall mean).
• The Mean Sq column is the mean of the sum of squares, calculated by dividing the sum of
squares by the degrees of freedom for each parameter.
• The F value column is the test statistic from the F test. This is the mean square of each
independent variable divided by the mean square of the residuals. The larger the F value, the
more likely it is that the variation caused by the independent variable is real and not due to
chance.
• The Pr(>F) column is the p value of the F statistic. This shows how likely it is that the F value
calculated from the test would have occurred if the null hypothesis of no difference among
group means were true.
• The p value of the fertilizer variable is low (p < 0.001), so it appears that the type of fertilizer
used has a real impact on the final crop yield.
Two-way ANOVA
• In the two-way ANOVA example, we are modeling crop yield as a function of
type of fertilizer and planting density.
• First we use aov() to run the model, then we use summary() to print the
summary of the model.
two.way <- aov(yield ~ fertilizer + density, data = crop.data)
summary(two.way)
• Adding planting density to the model seems to have made the model better:
• It reduced the residual variance (the residual sum of squares went from 35.89 to 30.765), and both
planting density and fertilizer are statistically significant (p-values < 0.001).
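The model above is additive. As a further sketch (not part of the original example), you could also test whether fertilizer and planting density interact, as mentioned in the two-way ANOVA description earlier:

# Assumes the same crop.data data frame as above; fertilizer * density
# expands to fertilizer + density + fertilizer:density
interaction.way <- aov(yield ~ fertilizer * density, data = crop.data)
summary(interaction.way)   # the fertilizer:density row tests the interaction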
Kruskal-Wallis test
• A Kruskal-Wallis test is used to determine whether or not there is a statistically significant
difference between the medians of three or more independent groups.
• This test is the nonparametric equivalent of the one-way ANOVA and is typically used when the
normality assumption is violated.
• The Kruskal-Wallis test does not assume normality in the data and is much less sensitive to
outliers than the one-way ANOVA.
• Here are a couple of examples of when you might conduct a Kruskal-Wallis test:
• Example 1: Comparing Study Techniques
• You randomly split up a class of 90 students into three groups of 30. Each group uses a different
studying technique for one month to prepare for an exam.
• At the end of the month, all of the students take the same exam. You want to know whether or
not the studying technique has an impact on exam scores.
• From previous studies you know that the distributions of exam scores for these three studying techniques are not normally distributed, so you conduct a Kruskal-Wallis test to determine whether there is a statistically significant difference between the median scores of the three groups (see the R sketch below).
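A minimal sketch of this test in R, with simulated exam scores since the original data are not given (group labels and distribution parameters are illustrative):

# Simulated, skewed (non-normal) exam scores for three studying techniques
set.seed(42)
scores <- c(rexp(30, rate = 1/75),   # technique A
            rexp(30, rate = 1/80),   # technique B
            rexp(30, rate = 1/70))   # technique C
technique <- factor(rep(c("A", "B", "C"), each = 30))

# Kruskal-Wallis rank sum test on the three groups
kruskal.test(scores ~ technique)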
