0% found this document useful (0 votes)
17 views14 pages

STAT501 Online FinalExam Fall2024

The document is a final exam for a statistics course, containing multiple questions related to statistical analysis, including ANOVA, regression, hypothesis testing, and confidence intervals. It emphasizes the importance of academic honesty and outlines the exam format, allowing for a cheat sheet and calculator. Each question requires detailed statistical reasoning and calculations based on provided data sets.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views14 pages

STAT501 Online FinalExam Fall2024

The document is a final exam for a statistics course, containing multiple questions related to statistical analysis, including ANOVA, regression, hypothesis testing, and confidence intervals. It emphasizes the importance of academic honesty and outlines the exam format, allowing for a cheat sheet and calculator. Each question requires detailed statistical reasoning and calculations based on provided data sets.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

STAT 501online Final Exam Fall 2024

NAME:

Question 1 …………………
…………..
Question 2……………… …………

Question 3………………………………………
Question 4……………………………..

Question 5……………………………………..

Show all your work.

Good luck!! Thanks for your hard work this semester.

Anything you want to tell me:

Academic Honesty : Please don’t collaborate together or search on the Internet.


The exam is a closed book and closed notes exam but you are allowed 2 pages (two-sided) of a cheat sheet and a
calculator for the exam.
Question 1
Researchers studying salmon in the Pacific Northwest collected a data set with the diameter of growth rings for first year
freshwater growth (hundredths of an inch) and type of salmon (1= Alaskan females, 2= Alaskan males, 3= Canadian
females, 4= Canadian males) along with some other variables for a random sample of salmon. The researchers want to
know if there are differences in the average diameter of these growth rings for the four different types of salmon. They
decide to run an ANOVA to address that research question. Some preliminary analysis and partial ANOVA R output is
shown below.

Df Sum Sq Mean Sq F value Pr(>F)


Type ?? 38591 12864 ?? <2.2e-16 Type Mean SD n
Residuals ?? 28338 1 96.58 18.34 26
2 100.33 13.49 24
3 139.54 17.01 26
4 135.21 19.24 24

a.(Circle one) This ANOVA is balanced unbalanced.

b. What does the preliminary analysis reveal to you?

c. In order for this ANOVA to be valid, you need to check the following assumptions (state and check them):
d. Determine the missing values for the degrees of freedom and the test statistic.

Df Sum Sq Mean Sq F value Pr(>F)


Type ?? 38591 12864 ?? <2.2e-16 Type Mean SD n
Residuals ?? 28338 ?? 1 96.58 18.34 26
2 100.33 13.49 24
3 139.54 17.01 26
4 135.21 19.24 24

Type df = F value=

Residuals df= MSE =

e. The researchers want to know if there are differences in the average diameter of these growth rings for the four different
types of salmon. Conduct the test:

Null hypothesis: Alternative Hypothesis :


Test Statistic:
Distribution of Test Stat:
p-value (show how to calculate):

Conclusion (apply to the problem):

f. What is your best estimate of the common population variance?

g. Compute a 95% confidence interval for μ1 - μ2


1= Alaskan females, 2= Alaskan males (you don’t need to finish calculations , but put into the formula all numbers )

g. Are multiple comparisons appropriate? Explain. If yes, summarize the results.


Question 2
Some coastal studies examine the impact of
erosion and major storm events on the location of
coastal shorelines. An article from 2018 in the
Journal of Coastal Research uses a simple linear
regression model to examine the relationship
between year and deviation from the shoreline
location in 1850 for a particular location at
Fenwick Island, Delaware in meters.

a. Describe the relationship between Year and


Deviation from the 1850 shoreline
location based on the scatterplot.

Use the R output provided to address all the questions that follow.
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 326.03421 51.24873 6.362 2.63e-06 ***
Year -0.24088 0.02609 -9.233 7.69e-09 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 2.488 on 21 degrees of freedom


Multiple R-squared: 0.8023, Adjusted R-squared: 0.7929
F-statistic: 85.24 on 1 and 21 DF, p-value: 7.694e-09

b.What is the equation of the regression line for predicting deviation from 1850 based on year?

c. What is the value of the sample correlation coefficient? Interpret the correlation coefficient.

(i) r=
(ii) Interpretation of r :

d. What is the value of R-squared ?Interpret the R-squared value.

(i)R-squared =
(ii)Interpretation of R-squared:

(iii) How well you think this model fits the data?
e State and check the assumptions .

f. How well does the regression appear to fit? Explain, using 2 statistics in your justification.
Test 1:

(i)Null hypothesis: Alternative Hypothesis:

(ii)Test Statistic (value)

(iii)Distribution of Test Stat (name, parameters)

(iv )p-value (state how to calculate and then use the output):

(v)Conclusion:
Test 2:

(vi)Null hypothesis: Alternative Hypothesis:

(vii)Test Statistic (value)

(viii) Distribution of Test Stat (name, parameters)

(ix) p-value (state how to calculate and then use the output)

(x)Conclusion:

g. Obtain a 95% confidence interval for the population slope. (write the formula and put numbers , you don’t need to
finish calculations.

h. Compute a 95% Prediction Interval for an individual response when year is 2023. Assume the mean value of deviation
is -150.(write the formula and put numbers , you don’t need to finish calculations.
Question 3
Dartmouth Distribution Warehouse makes deliveries of a large number of products to its customers. To keep its
customers happy and satisfied, the company's policy is to deliver on time at least 87% of all the orders it receives
from its customers. The quality control inspector at the company quite often takes samples of orders delivered and
checks to see whether this policy is maintained. A recent sample of 100 orders taken by this inspector showed that
83 of them were delivered on time.
Using a 1% significance level, can you conclude that the company's policy is maintained?

a) State the null and alternative hypothesis for this test.

b) State and calculate the value of the test statistic.

c) Use the critical value approach to test your hypothesis using Į = 0.01.

Rejection region:

Decision : Reject Ho Do not reject Ho (circle your answer)

Apply to the problem:

d) Use the p- YDOXHDSSURDFKWRWHVW\RXUK\SRWKHVLVXVLQJĮ 1.

p-value :

Decision : Reject Ho Do not reject Ho (circle your answer)

e) Calculate an appropriate confidence interval (two sided or one sided) .


Explain how it can be used to test the hypotheses.

CI interval calculations :

How to use to test the hypotheses:


f) What kind of error could you make: Type I error or Type II error?

g) Describe this error and consequences of making this error (in one sentence or two ,don’t give me definitions,
just apply to the problem )

h) Describe the power of the test . How would you calculate ? You don’t need to give the exact number, just show
the steps and roughly evaluate.

i)How could you increase the power of the test?

j) Just circle the correct answers

If a hypothesis is not rejected at the 0.05 level of significance, it:


a) will not be rejected at the 0.01 level True False We cannot tell
b) Will not be rejected at the 0.1 level True False We can not tell
Question 4
Investigators wanted to explore the relationship between SAT performance (an achievement test) and
first year college GPA. They recorded SAT Verbal and Math scores for 219 students entering college,
and waited for their first years in college to end to study their GPA. In the meantime, perhaps you can
answer a related question. Do students tend to have higher scores on the Math component of the SAT
relative to the Verbal component? Both tests are scored from 0-800.
a. What parameter should be used to address the researcher’s question? Pd P1  P 2
Explain your choice in one sentence.

b. State the hypotheses you would test (be sure to define your order of subtraction) to address the question.

c. Assume all conditions necessary for your procedure check out. The following output was generatedby
someone who was not sure which procedure to run. Use the appropriate output to address the questions
that follow.
t.test(GPA$SATM,GPA$SATV)
data: GPA$SATM and GPA$SATV
t = 3.8505, df = 431.46, p-value = 0.0001357
alternative hypothesis: true difference in means is not equal to 0
sample estimates:
mean of SATM mean of SATV
634.2922 605.0685

t.test(GPA$SATdiff) Note: SATdiff=SATM-SATV


One Sample t-test
data: GPA$SATdiff
t = 5.582, df = 218, p-value = 7.026e-08 alternative
hypothesis: true mean is not equal to 0sample
estimates:
mean of SATdiff
29.22374
Clearly state the numeric value of the test statistic for your procedure . Then interpret the test statistic.

Null hypothesis: Alternative Hypothesis:

Test Statistic:

Distribution of Test Stat (name, parameters)

p-value (how to calculate and then use the output):

Interpretation of p-value:

Conclusion (apply to the problem):

g. What type of error might you have made? Type I Type II No Error
Question 5
Researchers using the wolf skulls data set are interested in differences between sexes and regions in terms of some
other measured variables (zygomatic width, crown length of a molar, interorbital width, palatal length, postpalatal
length), all recorded in mm. For each research question below, state whether a hypothesis test or confidence interval,
or some other analysis (regression, ANOVA, chi-square) is appropriate. If you choose hypothesis test or CI,
you need to specify the parameter (s) and what kind of test you are going to use ( for instance :
if this is a hypothesis test, how would look Ho and Ha.If you are using Chi-square test , what Ho , Ha will look like
or if you use t or z test, if you compare two means , what kind of t-test you will use .........

a. Is there a difference between sexes in terms of zygomatic width?

b. Does crown length of a molar increase as palatal length increases?

c. How much difference is there between the regions in terms of interorbital width?

d. Considering the 2 sexes and 2 regions as really four groups of wolves, what are the differences in palatal length across
the groups?

e. How much longer is palatal length than postpalatal length for the wolves?
Table IV Standard Normal Distribution Table C19

Table IV Standard Normal Distribution Table

The entries in this table give the


cumulative area under the standard
normal curve to the left of z with the
values of z equal to 0 or negative. z 0 z

z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
-3.4 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0002
-3.3 .0005 .0005 .0005 .0004 .0004 .0004 .0004 .0004 .0004 .0003
-3.2 .0007 .0007 .0006 .0006 .0006 .0006 .0006 .0005 .0005 .0005
-3.1 .0010 .0009 .0009 .0009 .0008 .0008 .0008 .0008 .0007 .0007
-3.0 .0013 .0013 .0013 .0012 .0012 .0011 .0011 .0011 .0010 .0010

-2.9 .0019 .0018 .0018 .0017 .0016 .0016 .0015 .0015 .0014 .0014
-2.8 .0026 .0025 .0024 .0023 .0023 .0022 .0021 .0021 .0020 .0019
-2.7 .0035 .0034 .0033 .0032 .0031 .0030 .0029 .0028 .0027 .0026
-2.6 .0047 .0045 .0044 .0043 .0041 .0040 .0039 .0038 .0037 .0036
-2.5 .0062 .0060 .0059 .0057 .0055 .0054 .0052 .0051 .0049 .0048

-2.4 .0082 .0080 .0078 .0075 .0073 .0071 .0069 .0068 .0066 .0064
-2.3 .0107 .0104 .0102 .0099 .0096 .0094 .0091 .0089 .0087 .0084
-2.2 .0139 .0136 .0132 .0129 .0125 .0122 .0119 .0116 .0113 .0110
-2.1 .0179 .0174 .0170 .0166 .0162 .0158 .0154 .0150 .0146 .0143
-2.0 .0228 .0222 .0217 .0212 .0207 .0202 .0197 .0192 .0188 .0183

-1.9 .0287 .0281 .0274 .0268 .0262 .0256 .0250 .0244 .0239 .0233
-1.8 .0359 .0351 .0344 .0336 .0329 .0322 .0314 .0307 .0301 .0294
-1.7 .0446 .0436 .0427 .0418 .0409 .0401 .0392 .0384 .0375 .0367
-1.6 .0548 .0537 .0526 .0516 .0505 .0495 .0485 .0475 .0465 .0455
-1.5 .0668 .0655 .0643 .0630 .0618 .0606 .0594 .0582 .0571 .0559

-1.4 .0808 .0793 .0778 .0764 .0749 .0735 .0721 .0708 .0694 .0681
-1.3 .0968 .0951 .0934 .0918 .0901 .0885 .0869 .0853 .0838 .0823
-1.2 .1151 .1131 .1112 .1093 .1075 .1056 .1038 .1020 .1003 .0985
-1.1 .1357 .1335 .1314 .1292 .1271 .1251 .1230 .1210 .1190 .1170
-1.0 .1587 .1562 .1539 .1515 .1492 .1469 .1446 .1423 .1401 .1379

-0.9 .1841 .1814 .1788 .1762 .1736 .1711 .1685 .1660 .1635 .1611
-0.8 .2119 .2090 .2061 .2033 .2005 .1977 .1949 .1922 .1894 .1867
-0.7 .2420 .2389 .2358 .2327 .2296 .2266 .2236 .2206 .2177 .2148
-0.6 .2743 .2709 .2676 .2643 .2611 .2578 .2546 .2514 .2483 .2451
-0.5 .3085 .3050 .3015 .2981 .2946 .2912 .2877 .2843 .2810 .2776

-0.4 .3446 .3409 .3372 .3336 .3300 .3264 .3228 .3192 .3156 .3121
-0.3 .3821 .3783 .3745 .3707 .3669 .3632 .3594 .3557 .3520 .3483
-0.2 .4207 .4168 .4129 .4090 .4052 .4013 .3974 .3936 .3897 .3859
-0.1 .4602 .4562 .4522 .4483 .4443 .4404 .4364 .4325 .4286 .4247
0.0 .5000 .4960 .4920 .4880 .4840 .4801 .4761 .4721 .4681 .4641
C20 Appendix C Statistical Tables

Table IV Standard Normal Distribution Table (continued)

The entries in this table give the


cumulative area under the standard
normal curve to the left of z with the
values of z equal to 0 or positive. 0 z z

z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09

0.0 .5000 .5040 .5080 .5120 .5160 .5199 .5239 .5279 .5319 .5359

0.1 .5398 .5438 .5478 .5517 .5557 .5596 .5636 .5675 .5714 .5753
0.2 .5793 .5832 .5871 .5910 .5948 .5987 .6026 .6064 .6103 .6141
0.3 .6179 .6217 .6255 .6293 .6331 .6368 .6406 .6443 .6480 .6517
0.4 .6554 .6591 .6628 .6664 .6700 .6736 .6772 .6808 .6844 .6879
0.5 .6915 .6950 .6985 .7019 .7054 .7088 .7123 .7157 .7190 .7224

0.6 .7257 .7291 .7324 .7357 .7389 .7422 .7454 .7486 .7517 .7549
0.7 .7580 .7611 .7642 .7673 .7704 .7734 .7764 .7794 .7823 .7852
0.8 .7881 .7910 .7939 .7967 .7995 .8023 .8051 .8078 .8106 .8133
0.9 .8159 .8186 .8212 .8238 .8264 .8289 .8315 .8340 .8365 .8389
1.0 .8413 .8438 .8461 .8485 .8508 .8531 .8554 .8577 .8599 .8621

1.1 .8643 .8665 .8686 .8708 .8729 .8749 .8770 .8790 .8810 .8830
1.2 .8849 .8869 .8888 .8907 .8925 .8944 .8962 .8980 .8997 .9015
1.3 .9032 .9049 .9066 .9082 .9099 .9115 .9131 .9147 .9162 .9177
1.4 .9192 .9207 .9222 .9236 .9251 .9265 .9279 .9292 .9306 .9319
1.5 .9332 .9345 .9357 .9370 .9382 .9394 .9406 .9418 .9429 .9441

1.6 .9452 .9463 .9474 .9484 .9495 .9505 .9515 .9525 .9535 .9545
1.7 .9554 .9564 .9573 .9582 .9591 .9599 .9608 .9616 .9625 .9633
1.8 .9641 .9649 .9656 .9664 .9671 .9678 .9686 .9693 .9699 .9706
1.9 .9713 .9719 .9726 .9732 .9738 .9744 .9750 .9756 .9761 .9767
2.0 .9772 .9778 .9783 .9788 .9793 .9798 .9803 .9808 .9812 .9817

2.1 .9821 .9826 .9830 .9834 .9838 .9842 .9846 .9850 .9854 .9857
2.2 .9861 .9864 .9868 .9871 .9875 .9878 .9881 .9884 .9887 .9890
2.3 .9893 .9896 .9898 .9901 .9904 .9906 .9909 .9911 .9913 .9916
2.4 .9918 .9920 .9922 .9925 .9927 .9929 .9931 .9932 .9934 .9936
2.5 .9938 .9940 .9941 .9943 .9945 .9946 .9948 .9949 .9951 .9952

2.6 .9953 .9955 .9956 .9957 .9959 .9960 .9961 .9962 .9963 .9964
2.7 .9965 .9966 .9967 .9968 .9969 .9970 .9971 .9972 .9973 .9974
2.8 .9974 .9975 .9976 .9977 .9977 .9978 .9979 .9979 .9980 .9981
2.9 .9981 .9982 .9982 .9983 .9984 .9984 .9985 .9985 .9986 .9986
3.0 .9987 .9987 .9987 .9988 .9988 .9989 .9989 .9989 .9990 .9990

3.1 .9990 .9991 .9991 .9991 .9992 .9992 .9992 .9992 .9993 .9993
3.2 .9993 .9993 .9994 .9994 .9994 .9994 .9994 .9995 .9995 .9995
3.3 .9995 .9995 .9995 .9996 .9996 .9996 .9996 .9996 .9996 .9997
3.4 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9998
Table V The t Distribution Table C21

Table V The t Distribution Table

The entries in this table give the critical values


of t for the specified number of degrees
of freedom and areas in the right tail.
0 t

Area in the Right Tail Under the t Distribution Curve

df .10 .05 .025 .01 .005 .001

1 3.078 6.314 12.706 31.821 63.657 318.309


2 1.886 2.920 4.303 6.965 9.925 22.327
3 1.638 2.353 3.182 4.541 5.841 10.215
4 1.533 2.132 2.776 3.747 4.604 7.173
5 1.476 2.015 2.571 3.365 4.032 5.893
6 1.440 1.943 2.447 3.143 3.707 5.208
7 1.415 1.895 2.365 2.998 3.499 4.785
8 1.397 1.860 2.306 2.896 3.355 4.501
9 1.383 1.833 2.262 2.821 3.250 4.297
10 1.372 1.812 2.228 2.764 3.169 4.144
11 1.363 1.796 2.201 2.718 3.106 4.025
12 1.356 1.782 2.179 2.681 3.055 3.930
13 1.350 1.771 2.160 2.650 3.012 3.852
14 1.345 1.761 2.145 2.624 2.977 3.787
15 1.341 1.753 2.131 2.602 2.947 3.733
16 1.337 1.746 2.120 2.583 2.921 3.686
17 1.333 1.740 2.110 2.567 2.898 3.646
18 1.330 1.734 2.101 2.552 2.878 3.610
19 1.328 1.729 2.093 2.539 2.861 3.579
20 1.325 1.725 2.086 2.528 2.845 3.552
21 1.323 1.721 2.080 2.518 2.831 3.527
22 1.321 1.717 2.074 2.508 2.819 3.505
23 1.319 1.714 2.069 2.500 2.807 3.485
24 1.318 1.711 2.064 2.492 2.797 3.467
25 1.316 1.708 2.060 2.485 2.787 3.450
26 1.315 1.706 2.056 2.479 2.779 3.435
27 1.314 1.703 2.052 2.473 2.771 3.421
28 1.313 1.701 2.048 2.467 2.763 3.408
29 1.311 1.699 2.045 2.462 2.756 3.396
30 1.310 1.697 2.042 2.457 2.750 3.385
31 1.309 1.696 2.040 2.453 2.744 3.375
32 1.309 1.694 2.037 2.449 2.738 3.365
33 1.308 1.692 2.035 2.445 2.733 3.356
34 1.307 1.691 2.032 2.441 2.728 3.348
35 1.306 1.690 2.030 2.438 2.724 3.340
C22 Appendix C Statistical Tables

Table V The t Distribution Table (continued)


Area in the Right Tail Under the t Distribution Curve

df .10 .05 .025 .01 .005 .001

36 1.306 1.688 2.028 2.434 2.719 3.333


37 1.305 1.687 2.026 2.431 2.715 3.326
38 1.304 1.686 2.024 2.429 2.712 3.319
39 1.304 1.685 2.023 2.426 2.708 3.313
40 1.303 1.684 2.021 2.423 2.704 3.307
41 1.303 1.683 2.020 2.421 2.701 3.301
42 1.302 1.682 2.018 2.418 2.698 3.296
43 1.302 1.681 2.017 2.416 2.695 3.291
44 1.301 1.680 2.015 2.414 2.692 3.286
45 1.301 1.679 2.014 2.412 2.690 3.281
46 1.300 1.679 2.013 2.410 2.687 3.277
47 1.300 1.678 2.012 2.408 2.685 3.273
48 1.299 1.677 2.011 2.407 2.682 3.269
49 1.299 1.677 2.010 2.405 2.680 3.265
50 1.299 1.676 2.009 2.403 2.678 3.261
51 1.298 1.675 2.008 2.402 2.676 3.258
52 1.298 1.675 2.007 2.400 2.674 3.255
53 1.298 1.674 2.006 2.399 2.672 3.251
54 1.297 1.674 2.005 2.397 2.670 3.248
55 1.297 1.673 2.004 2.396 2.668 3.245
56 1.297 1.673 2.003 2.395 2.667 3.242
57 1.297 1.672 2.002 2.394 2.665 3.239
58 1.296 1.672 2.002 2.392 2.663 3.237
59 1.296 1.671 2.001 2.391 2.662 3.234
60 1.296 1.671 2.000 2.390 2.660 3.232
61 1.296 1.670 2.000 2.389 2.659 3.229
62 1.295 1.670 1.999 2.388 2.657 3.227
63 1.295 1.669 1.998 2.387 2.656 3.225
64 1.295 1.669 1.998 2.386 2.655 3.223
65 1.295 1.669 1.997 2.385 2.654 3.220
66 1.295 1.668 1.997 2.384 2.652 3.218
67 1.294 1.668 1.996 2.383 2.651 3.216
68 1.294 1.668 1.995 2.382 2.650 3.214
69 1.294 1.667 1.995 2.382 2.649 3.213
70 1.294 1.667 1.994 2.381 2.648 3.211
71 1.294 1.667 1.994 2.380 2.647 3.209
72 1.293 1.666 1.993 2.379 2.646 3.207
73 1.293 1.666 1.993 2.379 2.645 3.206
74 1.293 1.666 1.993 2.378 2.644 3.204
75 1.293 1.665 1.992 2.377 2.643 3.202

00 1.282 1.645 1.960 2.326 2.576 3.090

You might also like