
Week 10: ENV 445 and ENV 645: Tree Regression Lab

Correlation and Simple Linear Regression

# clear the workspace


rm(list=ls())

In this homework, you will work through the examples from class to make sure you understand them, and you
will do a regression analysis of a dataset that we'll collect on Burnaby Mountain.
# example starts on slide 9 from class
# enter the data
x=c(1,2,2,3)
y=c(1,2,3,6)

1. Use the correlation function in R, ‘cor.test()’ to determine the sample’s correlation coefficient r.
2. Is ‘cor.test(x,y)’ equal to ‘cor.test(y,x)’?
3. Assuming that y is dependent on the value of x, what percent of the variance of y can be explained by
x?
4. Calculate the covariance of x, y.
5. How is the covariance related to the correlation coefficient between 2 variables?
6. Create a correlation and covariance matrix for x and y. (Slide 13).
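As one possible sketch (not the official solution), questions 1–6 can be checked in R roughly like this, using the x and y vectors entered above:

```r
# enter the data from the slides
x <- c(1, 2, 2, 3)
y <- c(1, 2, 3, 6)

# 1-2. cor.test() gives the sample correlation r and a p-value;
#      correlation is symmetric, so the order of arguments doesn't matter
ct_xy <- cor.test(x, y)
ct_yx <- cor.test(y, x)
ct_xy$estimate   # the sample correlation coefficient r

# 3. the proportion of variance of y explained by x is r^2
r2 <- unname(ct_xy$estimate)^2

# 4-5. covariance, and its relation to r: r = cov(x, y) / (sd(x) * sd(y))
cov(x, y)
cov(x, y) / (sd(x) * sd(y))   # equals cor(x, y)

# 6. correlation and covariance matrices (slide 13)
cor(cbind(x, y))
cov(cbind(x, y))
```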

Another example of correlation: Relating time spent reading to time spent watching tv.

# Read in the data (this is an example from your text)


read = c(1,2,2,2,3,4,4,5,5,6)
tv = c(90,95,85,80,75,70,75,60,65,50)

Suppose x is the predictor variable, and y is the response variable. You are interested in testing if the amount
of reading a person does is related to the amount of tv they watch.
7. What is the parameter of interest?
8. What are HO and HA ?
9. Using the R function ‘cor.test’ calculate the correlation coefficient between read and tv. Is it significant
at α = 0.05?
10. On slide 24, I've pasted the output from R of a regression that includes the t-stat ('t value'), which is the
'Estimate'/'Std. Error'. Calculate the test statistic for this hypothesis test on the regression model (lm(y ∼ x))
using the formula from class (bottom of slide 24). Note that I used the negative of the 'Estimate'
as it's easier to use the left side of the distribution.
11. Why did we multiply by 2 to get the same p-value as R has in the regression outputs?
12. What is the degrees of freedom you used?
13. Is this test statistic in the rejection region? What do you conclude from this statistical test?
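One way to sketch questions 9–13 (a check, not the official solution) is to compare the hand-computed t statistic and two-sided p-value against what `cor.test` reports:

```r
# the reading vs tv data from above
read <- c(1, 2, 2, 2, 3, 4, 4, 5, 5, 6)
tv   <- c(90, 95, 85, 80, 75, 70, 75, 60, 65, 50)

ct <- cor.test(read, tv)
r  <- unname(ct$estimate)
n  <- length(read)

# test statistic by hand: t = r * sqrt(n - 2) / sqrt(1 - r^2), with df = n - 2
t_manual <- r * sqrt(n - 2) / sqrt(1 - r^2)

# two-sided p-value: multiply the one-tail probability by 2
p_manual <- 2 * pt(-abs(t_manual), df = n - 2)

c(t_manual, unname(ct$statistic))   # these should match
c(p_manual, ct$p.value)             # these should match
```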

Simple Linear Regression

Generate some random data (junk food eaten vs immune cell marker data)

Let’s assume x is the number of units of junk food eaten, and y is some marker of immune cell response that
indicates cancer. Here, we are interested in whether there is a significant relationship between units of junk food
eaten and a marker for cancer, as the news suggests. (Keep in mind this isn't a real study – we used R to
randomly generate the data!).
a=1 # intercept
b=1.2 # slope
x=0:20
set.seed(0); y=a+b*x + rnorm(length(x),0,4)
head(data.frame(x,y))

## x y
## 1 0 6.0518171
## 2 1 0.8950666
## 3 2 8.7191971
## 4 3 9.6897173
## 5 4 7.4585657
## 6 5 0.8401998
# Let's plot the data (x,y) we generated
par(mfrow=c(1,1), oma=c(4,4,1,1), mar=c(0,0,0,0))
plot(x,y, col='orangered', pch=16, axes=F)
#lines(x, a+b*x, col='blue')
axis(1)
axis(2, las=2)
box()
mtext(side=1, 'x', line=3)
mtext(side=2, 'y', line=3)

[Figure: scatterplot of the generated (x, y) data.]

14. Fit a linear regression using the function ‘lm()’ and summary(lm()) where x is the covariate, and y is
the response variable. Plot the data and add the predicted least squares line to the plot of the data.
15. What are the estimated parameters β̂0 (intercept) and β̂1 (slope) of the model?
16. Calculate the fitted value, ŷ when x=12 using β̂0 and β̂1 from the last question.
# fit regression model
model=lm(y~x)
# add the regression line to the plot
plot(x,y, col='orangered', pch=16, axes=F)
axis(1)
axis(2, las=2)
box()
abline(model)
[Figure: scatterplot of (x, y) with the fitted least-squares line added.]
17. Calculate the ‘residual’ of point (x=12, y=10.8).
18. Plot the residuals of this linear model. Do you see any heteroskedasticity?
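As a sketch for questions 16–17 (assuming the same seeded data and fitted model from above), the fitted value at x = 12 and the residual of the point (x = 12, y = 10.8) can be computed directly from the estimated coefficients:

```r
# regenerate the seeded data and refit the model, so this chunk is self-contained
a <- 1; b <- 1.2
x <- 0:20
set.seed(0); y <- a + b * x + rnorm(length(x), 0, 4)
model <- lm(y ~ x)

# fitted value at x = 12: y_hat = b0_hat + b1_hat * 12
y_hat_12 <- unname(coef(model)[1] + coef(model)[2] * 12)
# the same value via predict()
predict(model, newdata = data.frame(x = 12))

# residual of the point (x = 12, y = 10.8): observed minus fitted
resid_12 <- 10.8 - y_hat_12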
# what are the residuals (use R function)
resids.model = residuals(model)
# what are the residuals (write your own code)
resids.manual = y-(coef(model)[1] + coef(model)[2]*x )
# check
head(data.frame(resids.model,resids.manual))

## resids.model resids.manual
## 1 2.833055 2.833055
## 2 -3.296870 -3.296870
## 3 3.554086 3.554086
## 4 3.551432 3.551432
## 5 0.347106 0.347106
## 6 -7.244434 -7.244434

# are the residuals homoscedastic?
plot(resids.model)
abline(h=0, lty='dotted')
[Figure: residuals plotted against index, with a dotted horizontal line at zero.]
19. You can also use R to extract the confidence intervals for the two parameters of interest β0 and β1 . Use
the built-in function 'confint' to calculate the confidence intervals of the intercept and slope coefficients.
20. Use 'summary(model)' to extract the standard errors for these coefficients, and use the correct
t critical value to verify that R's function 'confint' is correct.
## 2.5 % 97.5 %
## (Intercept) -0.1562490 6.593773
## x 0.6844764 1.261873
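A sketch of question 20 (again a check, not the official solution): rebuild the intervals from the 'Estimate' and 'Std. Error' columns of `summary(model)` and the appropriate t critical value, then compare against `confint`:

```r
# regenerate the seeded data and refit the model, so this chunk is self-contained
a <- 1; b <- 1.2
x <- 0:20
set.seed(0); y <- a + b * x + rnorm(length(x), 0, 4)
model <- lm(y ~ x)

ci_r  <- confint(model)                       # R's built-in intervals
s     <- summary(model)$coefficients          # Estimate and Std. Error columns
tcrit <- qt(0.975, df = df.residual(model))   # df = n - 2 = 19 here

# estimate +/- t_crit * SE, for both the intercept and the slope
ci_manual <- cbind(s[, "Estimate"] - tcrit * s[, "Std. Error"],
                   s[, "Estimate"] + tcrit * s[, "Std. Error"])
all.equal(unname(ci_r), unname(ci_manual))    # TRUE
```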
# compute the fitted values and their standard errors for the confidence bands
ci.model = predict(model, se.fit=TRUE)
plot(x,y, col='orangered', pch=16, axes=F)
# add the regression line to the plot
abline(model)
# upper 95% CI (qt(.975, df=19) is the positive t critical value; df = n - 2)
lines(x, ci.model$fit + qt(.975, df=19) * ci.model$se.fit, col='blue', lty='dotted')
# lower 95% CI
lines(x, ci.model$fit - qt(.975, df=19) * ci.model$se.fit, col='blue', lty='dotted')
axis(1)
axis(2, las=2)
box()
[Figure: scatterplot of (x, y) with the fitted line and dotted 95% confidence bands.]

Class dataset from Burnaby Mountain Forest

21. What are the assumptions of a regression analysis?


22. What are any assumptions that are specific to the class dataset?
23. State the population parameter of interest, define in context.
24. State Hypotheses HO , HA
25. Calculate any relevant test statistics
26. Determine the p-values.
27. Interpret the regression statistics.
• Is the intercept different from zero?
• Is there a positive or negative relationship between the explanatory variable and the response
variable?
28. Do you accept or reject your null hypotheses?
29. What can you conclude about the 'truth' of the population?

Due Next Thursday March 28th at 2:30 pm

knitr::knit_exit()
