Problem Set 6

1) The document contains a problem set with multiple choice and analytical questions regarding regression analysis and hypothesis testing. 2) It includes regression output from 3 models examining the relationship between costs and output. The output is used to answer questions about sample sizes, R-squared values, and effects of variables in each model. 3) Questions also address hypothesis testing, model selection, issues like multicollinearity, and pooling time series data across periods.

Uploaded by

Sila Kapsata

Problem Set 6

Multiple Choice Questions

1. The critical value in the F-distribution depends on the degrees of freedom in the
numerator and denominator. How do you find the degrees of freedom in the
numerator?
(a) It is the number of observations minus the number of coefficients estimated
(N − K)
(b) It is the number of hypotheses being tested simultaneously (J)
(c) It is the number of coefficients being estimated (K)
(d) It is the number of observations minus the number of hypotheses tested (N − J)
2. The critical value in the F-distribution depends on the degrees of freedom in the
numerator and denominator. How do you find the degrees of freedom in the
denominator?
(a) It is the number of observations minus the number of coefficients estimated
(N − K)
(b) It is the number of hypotheses being tested simultaneously (J)
(c) It is the number of coefficients being estimated (K)
(d) It is the number of observations minus the number of hypotheses tested (N − J)
3. When performing an F-test with the null hypothesis H0: β1 = β2 = 0, what is
the alternative hypothesis?
(a) β1 ≠ 0 and β2 ≠ 0
(b) β1 ≠ 0 or β2 ≠ 0
(c) (β1 ≠ 0 and β2 = 0) or (β1 = 0 and β2 ≠ 0)
(d) β1 = β2 ≠ 0
4. How does omitting a relevant variable from a regression model affect the estimated
coefficients of the other variables in the model?
(a) they are biased downward and have smaller standard errors
(b) they are biased upward and have larger standard errors
(c) they are biased and the bias can be negative or positive
(d) they are unbiased but have larger standard errors
5. How does including an irrelevant variable in a regression model affect the estimated
coefficients of the other variables in the model?
(a) they are biased downward and have smaller standard errors
(b) they are biased upward and have larger standard errors
(c) they are biased and the bias can be negative or positive
(d) they are unbiased but have larger standard errors
6. Which of the following measures is NOT used to evaluate model specification?
(a) The adjusted R²
(b) Akaike Information Criterion
(c) Bayesian Information Criterion
(d) Jarque-Bera test
7. When are the R² and adjusted R² equal?
(a) When the model is correctly specified
(b) When K = 1
(c) When the error terms are normally distributed
(d) When an unrestricted model is estimated
8. When highly collinear variables are included in an econometric model, the coefficient
estimates are
(a) biased downward and have smaller standard errors
(b) biased upward and have larger standard errors
(c) biased and the bias can be negative or positive
(d) unbiased but have larger standard errors
9. When a set of variables with perfect collinearity is included in an econometric
model, the coefficient estimates are
(a) undefined
(b) unbiased
(c) biased upward
(d) biased, but the direction is unclear
10. If your regression results show a high R², a high adjusted R² and a significant F-test, but low
t-values for the coefficients, what is the most likely cause?
(a) omitted relevant variables
(b) irrelevant variables have been included
(c) multicollinearity
(d) heteroskedasticity

Analytical Questions

11. Past EXAM Question

The following output is taken from OLS regressions of three different models that
try to establish the effect of output (measured in kilograms) on total costs
(measured in £). The first model regresses the level of costs (costs) on the level of
output (output). The second model regresses the natural log of costs (log_cost)
on the natural log of output (log_output). The third model regresses the level
of costs on the level of output, the square of output (output_sq) and the cube of
output (output_cub). Some of the regression output has been hidden.
Model 1
reg costs output

      Source |       SS       df       MS              Number of obs =
-------------+------------------------------           F(  1,    58) =  662.73
       Model |  733.336303     1  733.336303           Prob > F      =  0.0000
    Residual |  97.3749935    58  1.10653402           R-squared     =  0.8828
-------------+------------------------------           Adj R-squared =  0.8814
       Total |  830.711297    59  9.33383479           Root MSE      =  1.0519
------------------------------------------------------------------------------
       costs |      Coef.   Std. Err.        t   P>|t|    [95% Conf. Interval]
-------------+----------------------------------------------------------------
      output |   .5000000    .0250000               0.000
       _cons |   .6501553    .1677777     3.88   0.000    .3167323    .9835782
------------------------------------------------------------------------------

Model 2
reg log_cost log_output

      Source |       SS       df       MS              Number of obs =
-------------+------------------------------           F(  1,    58) =  185.50
       Model |                 1                       Prob > F      =  0.0000
    Residual |  10.0000000    58  .113636360           R-squared     =
-------------+------------------------------           Adj R-squared =  0.9155
       Total |  100.000000    59  1.69491530           Root MSE      =  1.3019
------------------------------------------------------------------------------
    log_cost |      Coef.   Std. Err.        t   P>|t|    [95% Conf. Interval]
-------------+----------------------------------------------------------------
  log_output |   .6000000    .0272426    22.39   0.000    .5556884    .6639662
       _cons |  -2.447097    .1569509   -15.59   0.000   -2.759004    -2.13519
------------------------------------------------------------------------------

Model 3
reg costs output output_sq output_cub

      Source |       SS       df       MS              Number of obs =
-------------+------------------------------           F(   ,      ) =
       Model |  855.000000     3   285.00000           Prob > F      =  0.0000
    Residual |  95.0000000    56  1.69642860           R-squared     =  0.9000
-------------+------------------------------           Adj R-squared =  0.8806
       Total |  950.000000    59  16.1016950           Root MSE      =  4.0126
------------------------------------------------------------------------------
       costs |      Coef.   Std. Err.        t   P>|t|    [95% Conf. Interval]
-------------+----------------------------------------------------------------
      output |   .8000000    .2000000     4.00   0.000
   output_sq |  -0.003000    0.003000    -1.00   0.290   -9.07e-06    2.74e-06
  output_cub |   0.000001    0.000001     0.95   0.347   -1.36e-09    3.83e-09
       _cons |   .4343922    .2503542     1.74   0.086   -.0632955    .9320799
------------------------------------------------------------------------------

(a) Find the sample size in model 1
(b) Calculate the R² value in model 2
(c) Interpret the estimated effect of output on costs in each model
(d) Test the hypothesis that the variable output has some explanatory power in
model 1 (use the 5% significance level for your test and the nearest critical
value in the Table for the relevant degrees of freedom)
(e) Calculate the F test of goodness of fit of the model as a whole in model 3
(f) Explain, briefly, how the adjusted R² helps with model selection. Use this to
help you choose whether you prefer model 1, 2 or 3
(g) The regression output in Model 3 suggests the presence of what issue that
arises in many multiple regression models? Give reasons for your answer.
(h) Why might we worry if the OLS residuals are not normally distributed?
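
As a hedged illustration for part (e), the F statistic for the model as a whole can be computed either from the sums of squares or from R². The sketch below assumes the usual definition, F = (Model SS / model df) / (Residual SS / residual df), and uses the figures printed in the Model 3 output; the function names are illustrative and not part of the problem set.

```python
def f_from_sums_of_squares(model_ss, resid_ss, df_model, df_resid):
    """Overall F statistic: (model SS / model df) / (residual SS / residual df)."""
    return (model_ss / df_model) / (resid_ss / df_resid)


def f_from_r_squared(r2, df_model, df_resid):
    """Equivalent form that uses R-squared instead of the sums of squares."""
    return (r2 / df_model) / ((1.0 - r2) / df_resid)


# Model 3 as printed: Model SS = 855 (3 df), Residual SS = 95 (56 df), R-squared = 0.9000.
f_ss = f_from_sums_of_squares(855.0, 95.0, 3, 56)
f_r2 = f_from_r_squared(0.9000, 3, 56)
print(round(f_ss, 2), round(f_r2, 2))
```

Both routes should agree up to rounding, which is a quick consistency check on a partly hidden output table.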

12. You have time series data for the period 1935-2000. You are given an estimate
of the effects of income (measured in £billion) and interest rates (measured in
percentage points) on aggregate consumption expenditure (measured in £billion).

Ĉons = 10.00 + 0.90 Income − 6.00 IntRate      TSS = 70   ESS = 10
      (1.00)  (0.45)        (2.00)

You then split the data into two periods and run two separate regressions.

For the period 1935-1970:

Ĉons = 6.00 + 0.95 Income − 2.00 IntRate      TSS = 30   ESS = 10
      (1.00) (0.40)        (1.00)

For the period 1971-2000:

Ĉons = 14.00 + 0.85 Income − 10.00 IntRate      TSS = 20   ESS = 10
      (1.00)  (0.50)        (4.00)

Test the hypothesis that the data could be pooled across both time periods and
estimated as a single equation.
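
A pooling hypothesis of this kind is commonly examined with a Chow test. The sketch below is a minimal illustration under several assumptions that the question does not state explicitly: that ESS denotes the explained sum of squares (so RSS = TSS − ESS), that the data are annual (66 observations in total), and that K = 3 coefficients are estimated in each regression. The function name `chow_f` is illustrative.

```python
def chow_f(rss_pooled, rss_1, rss_2, n_total, k):
    """Chow F statistic with k restrictions and n_total - 2k denominator df."""
    rss_unrestricted = rss_1 + rss_2
    numerator = (rss_pooled - rss_unrestricted) / k
    denominator = rss_unrestricted / (n_total - 2 * k)
    return numerator / denominator


# RSS = TSS - ESS, assuming ESS is the explained sum of squares.
rss_pooled = 70 - 10   # 1935-2000
rss_early = 30 - 10    # 1935-1970
rss_late = 20 - 10     # 1971-2000
n_total = 2000 - 1935 + 1  # assumes one observation per year

f_stat = chow_f(rss_pooled, rss_early, rss_late, n_total, k=3)
print(f_stat)  # to be compared with the critical value of F(3, n_total - 6)
```

If ESS were instead the error sum of squares, the inputs would change, so the interpretation of the reported figures should be settled before relying on the number printed.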
13. Consider the model Y = β0 + β1 X + u
(a) What is the formula for the Ordinary Least Squares estimate of β1?
(b) Under what conditions will Ordinary Least Squares produce an unbiased and
efficient estimate of β1?
(c) Prove that the Ordinary Least Squares estimate of β1 is unbiased.
14. A researcher is interested in how the proportion of the household budget spent on
transportation (WTRANS) depends on total household expenditure (measured in logs,
LOGEXP), the age of the household head (AGE) and the number of children
in the household (NUMKIDS). The researcher produces the following table of
estimates:
                     WTRANS
Log expenditure      0.0414
                    (0.0071)
Age of HH head      -0.0001
                    (0.0004)
No. of children     -0.0130
                    (0.0055)
Constant            -0.0315
                    (0.0322)
R²                   0.0247
N                    1,519
Standard errors reported in parentheses

(a) What was the theoretical model the researcher took to the data?
(b) Write down the estimated model
(c) Interpret the estimates
(d) Are there any variables you would exclude from the model? Why, or why
not?
(e) Predict the proportion of a budget that will be spent on transportation for a
one-child household when total expenditure and age are set at their sample
means (98.7 and 36 respectively)
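
A prediction of this kind is the fitted equation evaluated at chosen values of the regressors. The sketch below assumes that 98.7 is the mean of total expenditure in levels, so its natural log enters the equation; if 98.7 were already a log value, the `math.log` call would be dropped. The dictionary keys and function name are illustrative.

```python
import math

# Coefficients copied from the table of estimates.
COEF = {"const": -0.0315, "logexp": 0.0414, "age": -0.0001, "numkids": -0.0130}


def predict_wtrans(total_exp, age, numkids):
    """Fitted share of the budget spent on transportation."""
    return (COEF["const"]
            + COEF["logexp"] * math.log(total_exp)  # assumes total_exp is in levels
            + COEF["age"] * age
            + COEF["numkids"] * numkids)


share = predict_wtrans(total_exp=98.7, age=36, numkids=1)
print(f"{share:.4f}")
```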

Practical Questions

15. When estimating wage equations we expect that young, inexperienced workers will
have relatively low wages and that with additional experience their wages will rise,
but then begin to decline after middle age, as the worker nears retirement. This
life-cycle pattern of wages can be captured by introducing experience and the square
of experience to explain the level of wages.
Consider the theoretical model

Wage = β0 + β1 Exper + β2 Exper² + β3 Educ + u        (1)

(a) What is the marginal effect of experience on wages?

(b) What signs do you expect for each of the coefficients β1 and β2 and why?
(c) After how many years of experience do wages start to decline?
(d) Open the dataset cpseduc.dta (we used this dataset previously in Problem
Set 4)
i. Estimate a simple regression model of wages on years of experience
ii. Estimate a second model where you also include years of education
iii. Estimate the full theoretical model in (1) and interpret the estimates.
Are the estimates consistent with your expectations?
iv. Export the estimates from all three models in a single table
• Exporting the table requires the outreg2 command. (Recall that, if
necessary, you can download the command using ssc install outreg2.)
• After estimating each model you need to save the estimates in STATA's
internal memory. Then tell STATA to put the estimates together in
one table with the outreg2 command. The syntax will look something
like:
    * "est sto" is the abbreviated form of "estimates store"
    reg ...
    estimates store model1
    reg ...
    est sto model2
    reg ...
    est sto model3
    outreg2 [model1 model2 model3] using datapath\Table1, replace word
v. Compare the coefficient on experience between the simple regression
model and the second model. What happens? Why? What does this
tell you about the correlation between experience and education?
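
For parts (a) and (c), differentiating model (1) with respect to experience gives a marginal effect of β1 + 2 β2 Exper, which equals zero at Exper = −β1 / (2 β2). A minimal sketch of that arithmetic follows; the coefficient values are hypothetical and are not estimates from cpseduc.dta.

```python
def marginal_effect(b1, b2, exper):
    """d Wage / d Exper for Wage = b0 + b1*Exper + b2*Exper^2 + b3*Educ + u."""
    return b1 + 2.0 * b2 * exper


def turning_point(b1, b2):
    """Experience level at which the marginal effect equals zero."""
    return -b1 / (2.0 * b2)


# Hypothetical coefficients for illustration only (b1 > 0, b2 < 0).
b1, b2 = 0.40, -0.008
peak = turning_point(b1, b2)
print(peak, marginal_effect(b1, b2, peak))
```

With b1 positive and b2 negative the turning point is a maximum, which matches the rise-then-decline pattern described in the question.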

16. The file cocaine.dta available on Moodle contains 56 observations on variables
related to sales of cocaine powder in northeastern California over the period 1984-
1991. The data are a subset of those used in the study

Caulkins, J.P. and R. Padman (1993) “Quantity Discounts and Quality Premia
for Illicit Drugs” Journal of the American Statistical Association, 88, 748-757

The variables are:

• PRICE = price per gram in dollars for a cocaine sale
• QUANT = number of grams of cocaine in a given sale
• QUAL = quality of the cocaine expressed as a percentage of purity
• TREND = a time variable with 1984=1 up to 1991=8
Consider the regression model

PRICE = β0 + β1 QUANT + β2 QUAL + β3 TREND + u

(a) What signs would you expect for the coefficients β1, β2 and β3? Explain.
(b) Estimate the model in STATA and interpret the coefficient estimates. Do the
signs of the coefficients conform to your expectations?
(c) What proportion of the variation in cocaine prices is explained jointly by
variation in quantity, quality and time?
17. Use the data cpseduc.dta to estimate the following wage equation:

ln(Wage) = β0 + β1 Educ + β2 Exper + β3 Hrswk + u

(a) Interpret the regression output.

(b) Test the hypothesis that an extra year of education increases the wage rate
by 10%
(c) Re-estimate the model with the additional variables EDUC × EXPER,
EDUC² and EXPER². Interpret the regression output.
(d) Estimate the marginal effects ∂ln(Wage)/∂EDUC for a woman with 16 years of
education and 2 years of experience and for a woman with 12 years of education
and 2 years of experience. What can you say about the marginal effect of
education for women as education increases?
(e) Estimate the marginal effects ∂ln(Wage)/∂EDUC for a man with 16 years of education
and 2 years of experience and for a man with 12 years of education and 2
years of experience. What can you say about the marginal effect of education
for men as education increases?
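
Two pieces of arithmetic recur in this question: the t statistic for a hypothesised coefficient value (part b) and the marginal effect of education once the squared and interaction terms are included (parts d and e). The sketch below assumes the extended model contains Educ, Educ² and Educ × Exper, so that ∂ln(Wage)/∂Educ = b_educ + 2 b_educ_sq Educ + b_interact Exper. All numbers and names are hypothetical and are not estimates from cpseduc.dta.

```python
def t_statistic(estimate, hypothesised, std_err):
    """t ratio for H0: coefficient equals the hypothesised value."""
    return (estimate - hypothesised) / std_err


def educ_marginal_effect(b_educ, b_educ_sq, b_interact, educ, exper):
    """d ln(Wage) / d Educ when Educ, Educ^2 and Educ*Exper are in the model."""
    return b_educ + 2.0 * b_educ_sq * educ + b_interact * exper


# Hypothetical values for illustration only.
t = t_statistic(estimate=0.12, hypothesised=0.10, std_err=0.01)
me_16 = educ_marginal_effect(0.05, 0.002, -0.001, educ=16, exper=2)
me_12 = educ_marginal_effect(0.05, 0.002, -0.001, educ=12, exper=2)
print(round(t, 3), round(me_16, 4), round(me_12, 4))
```

Terms that do not involve education, such as Hrswk or Exper², drop out of the derivative, which is why they do not appear in the second function.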
