
ECA5103 ASSIGNMENT 1-SEC1

Maesha Armeen

A0210255W

29.09.2019

Answer 1

(a) Under what assumptions will the OLS estimate of β1 provide an unbiased estimate of β1?
Are these assumptions realistic?

a) The OLS estimate of β1 will be an unbiased and consistent estimate of β1 under assumptions MLR1-4, which are as
follows:

MLR1: Linear in parameters. The model must be linear in its parameters, meaning that the price and the
unit size of the house are related through an equation that is linear in the coefficients. The researcher can
apply transformations such as logs or exponentials when running the specified regression, but linearity in
the parameters of the original regression must hold.

MLR2: Random sampling. The housing data must be a random sample from the population. This allows
us to make unbiased inferences; otherwise, a flawed conclusion would be drawn from an unrepresentative
portion of the population.

MLR3: No perfect collinearity. No independent variable may take the same value for all observations
(i.e. be constant), and no independent variable may be an exact linear function of the others.

MLR4: Zero conditional mean. The explanatory variables must contain no information about the
mean of the error term, i.e. E(u | x) = 0; the explanatory variables must be exogenous.

There are two further assumptions: MLR5 (homoskedasticity), the variance of the error term must be
constant conditional on the explanatory variables; and MLR6 (normality), the error term is normally
distributed and independent of the explanatory variables. Under MLR1-5, OLS is BLUE (best linear
unbiased estimator). However, the coefficient is unbiased if it satisfies just MLR1-4.

These assumptions are not always realistic; however, it is important to make them in order to infer a
causal effect. In real life they are unlikely to hold fully: unit size may not affect price linearly in the
specified form, researchers often have no details about the randomness of the sample data, and the
explanatory variables are likely to hold some information about the residuals or other omitted determinants of price.

(b) If these assumptions are violated, will the OLS estimate of β1 over- or under-estimate the
impact of size on price? Explain your answer.

Violations of these assumptions may bias the results or render them worthless. In this case, omitting a
relevant variable (such as the number of bedrooms) from the regression violates the zero conditional mean
assumption and produces omitted variable bias. β1 is likely to be overestimated, since the number of
bedrooms both raises price and is positively correlated with unit size.
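The direction of this omitted variable bias can be sketched numerically (a Python illustration only; the 0.8 loading of bedrooms on size and all coefficients are hypothetical). When the omitted bedrooms variable has a positive effect on price and is positively correlated with size, the short regression of price on size alone overstates the size coefficient:

```python
import random

random.seed(0)

def ols_slope(x, y):
    """Simple-regression OLS slope: cov(x, y) / var(x)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return (sum((a - mx) * (b - my) for a, b in zip(x, y))
            / sum((a - mx) ** 2 for a in x))

n = 5000
size = [random.gauss(0, 1) for _ in range(n)]
# bedrooms positively correlated with size (hypothetical 0.8 loading)
bedrooms = [0.8 * s + random.gauss(0, 0.5) for s in size]
u = [random.gauss(0, 1) for _ in range(n)]
# hypothetical true model: both size and bedrooms raise price
price = [1 + 2.0 * s + 1.5 * b + e for s, b, e in zip(size, bedrooms, u)]

short_slope = ols_slope(size, price)   # bedrooms omitted from the regression
print(round(short_slope, 1))           # well above the true 2.0 (about 3.2)
```

The short-regression slope picks up roughly 2.0 + 1.5 × 0.8 = 3.2: the true effect plus the omitted variable's effect times its association with size, which is why a positive correlation with a positively-signed omitted variable produces overestimation.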
Answer 2

(a) Rename variables R0000300 R0000500 R0618300 to birth_month, birth_day, and afqt.

. rename R0000300 birth_month

. rename R0000500 birth_day

. rename R0618300 afqt

(b) Use recode 1) to convert invalid values into missing and 2) to recode sex, which is currently
defined as = 1 for male, = 2 for female, into = 0 for male and = 1 for female, and rename
the variable female.

i) After checking all the variables with tab for negative values, I found that birth_month and birth_day
have no negative values; thus, recoding them is not required.

. recode afqt (-4/-3=.)

(afqt: 808 changes made)

. recode ind04(-5/-3=.)

(ind04: 6052 changes made)

. recode wage04(-5/-4=.)

(wage04: 10002 changes made)

. recode incwg04 (-5/-1=.)

(incwg04: 5417 changes made)

. recode edu(-5=.)

(edu: 5025 changes made)

. recode age04(-5=.)

(age04: 5025 changes made)

ii)

. recode sex(1=0)

(sex: 6403 changes made)

. recode sex(2=1)

(sex: 6283 changes made)

. rename sex female


(c) Generate the summary statistics for AFQT female wage04 edu age04 and reports the results

in Table 1 of the provided template.

. summarize afqt female wage04 edu age04

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
        afqt |     11,878    40.95193    28.75716          1         99
      female |     12,686    .4952704    .4999973          0          1
      wage04 |      2,684     3522610     3965777         -3   3.01e+07
         edu |      7,661    13.23026    2.518906          0         20
       age04 |      7,661    43.15703    2.256123         27         48

Table 1:

Variable          Obs        Mean       Min        Max
afqt           11,878    40.95193         1         99
female         12,686     0.49527         0          1
wage04          2,684     3522610        -3   3.01E+07
edu             7,661    13.23026         0         20
age04           7,661    43.15703        27         48

(d) Create a new variable birthq that contains information on a person's birth quarter. For
example, if a person was born in May, then his birthq will have the value of 2.

. gen birthq=birth_month

. recode birthq(1/3=1)

(birthq: 2019 changes made)

. recode birthq(4/6=2)

(birthq: 3038 changes made)

. recode birthq(7/9=3)

(birthq: 3562 changes made)

. recode birthq(10/12=4)

(birthq: 2941 changes made)

. tab birthq

     birthq |      Freq.     Percent        Cum.
------------+-----------------------------------
          1 |      3,145       24.79       24.79
          2 |      3,038       23.95       48.74
          3 |      3,562       28.08       76.82
          4 |      2,941       23.18      100.00
------------+-----------------------------------
      Total |     12,686      100.00
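The recode above maps months 1-3 to quarter 1, 4-6 to quarter 2, 7-9 to quarter 3, and 10-12 to quarter 4. The same mapping can be written compactly (a Python sketch for illustration, not part of the Stata log):

```python
# month-to-quarter mapping used for birthq: 1-3 -> 1, 4-6 -> 2, 7-9 -> 3, 10-12 -> 4
def birth_quarter(month):
    return (month - 1) // 3 + 1

# e.g. a person born in May (month 5) gets birthq = 2
print([birth_quarter(m) for m in (1, 5, 9, 12)])  # [1, 2, 3, 4]
```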


(e) Test whether people born in the last quarter are less educated than people born in other

quarters.

. mean(edu) if birthq ==4

Mean estimation Number of obs = 1,804

Mean Std. Err. [95% Conf. Interval]

edu 13.15521 .05824 13.04099 13.26944

. mean (edu) if birthq==3|birthq==2|birthq==1

Mean estimation Number of obs = 5,857

Mean Std. Err. [95% Conf. Interval]

edu 13.25337 .0330904 13.1885 13.31824

Comparing the two tables, the mean education of people born in the fourth quarter (13.155) is lower than
the mean for the first three quarters (13.253). The point estimates therefore suggest that people born in
the last quarter are slightly less educated, although the difference is small and the confidence intervals
overlap, so a formal test (e.g. a two-sample t-test) would be needed to establish significance.

(f) Plot the histogram of wage04 and log(wage04). (You can use the command histogram and
you need to create a new variable lw04 = log(wage04)).

. gen lw04=log(wage04)

(10,037 missing values generated)

. hist wage04

(bin=34, start=-3, width=884705.97)


. hist lw04

(bin=34, start=7.6797137, width=.28057818)

(g) Explain why researchers prefer to use log wage rather than wage in the regression.

Researchers prefer to use log wage instead of wage as they care about percentage changes in wages
rather than absolute changes. Moreover, using log enables them to normalize the data.
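The percentage-change interpretation can be checked numerically (a Python sketch; the wage figures are made up): for small changes, the difference in logs approximates the proportional change.

```python
import math

# a change in log wage approximates a percentage change in wage
old_wage, new_wage = 100.0, 110.0
pct_change = (new_wage - old_wage) / old_wage          # exactly 0.10
log_change = math.log(new_wage) - math.log(old_wage)   # about 0.0953
print(round(pct_change, 4), round(log_change, 4))
```

This is why a coefficient in a log-wage regression is read as an approximate percentage effect; the approximation is tighter the smaller the change.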

(h) Generate potential years of experience using exp = age - edu - 5.

. gen exp=age04-edu-5

(5,025 missing values generated)

(i) Regress log wage on education, female, a quadratic function of potential years of experience.
Report the regression results in column 1 of Table 2.

. gen exp2=(exp)^2

(5,025 missing values generated)

. reg lw04 edu female exp2

      Source |       SS           df       MS      Number of obs   =     2,649
-------------+----------------------------------   F(3, 2645)      =     27.82
       Model |  429.167305         3  143.055768   Prob > F        =    0.0000
    Residual |  13600.3953     2,645  5.14192638   R-squared       =    0.0306
-------------+----------------------------------   Adj R-squared   =    0.0295
       Total |  14029.5626     2,648  5.29817318   Root MSE        =    2.2676

------------------------------------------------------------------------------
        lw04 |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         edu |   .0849158   .0293516     2.89   0.004     .0273613    .1424702
      female |  -.8019071   .0895686    -8.95   0.000    -.9775387   -.6262755
        exp2 |   .0003259   .0003809     0.86   0.392     -.000421    .0010729
       _cons |   12.78836   .5632276    22.71   0.000     11.68395    13.89277
------------------------------------------------------------------------------
Table 2

lw04              (1)        (2)        (3)
edu           0.0849158
female       -0.8019071
exp2          0.0003259
_cons          12.78836
Observations      2,649
R-squared        0.0306

(j) Regress log wage on education, female, a quadratic function of potential years of experience, and
AFQT scores. Report the regression results in column 2 of Table 2.

. reg lw04 edu female exp2 afqt

      Source |       SS           df       MS      Number of obs   =     2,541
-------------+----------------------------------   F(4, 2536)      =     21.77
       Model |  446.949788         4  111.737447   Prob > F        =    0.0000
    Residual |  13016.7549     2,536  5.13278979   R-squared       =    0.0332
-------------+----------------------------------   Adj R-squared   =    0.0317
       Total |  13463.7047     2,540  5.30067114   Root MSE        =    2.2656

------------------------------------------------------------------------------
        lw04 |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         edu |   .1160641   .0342978     3.38   0.001     .0488095    .1833187
      female |  -.8276587   .0915236    -9.04   0.000    -1.007127   -.6481901
        exp2 |   .0004232    .000393     1.08   0.282    -.0003474    .0011938
        afqt |  -.0030298   .0020755    -1.46   0.144    -.0070995      .00104
       _cons |   12.45203   .6019054    20.69   0.000     11.27175    13.63231
------------------------------------------------------------------------------

Table 2

lw04              (1)          (2)        (3)
edu           0.0849158    0.1160641
female       -0.8019071   -0.8276587
exp2          0.0003259    0.0004232
afqt                      -0.0030298
_cons          12.78836     12.45203
Observations      2,649        2,541
R-squared        0.0306       0.0332

(k) Comment on the difference in the coefficients on education between these two columns.

The coefficient on edu increases from 0.0849 to 0.116 after afqt is added to the regression. This suggests
that the coefficient in the first regression was biased downward by an omitted variable: afqt is positively
correlated with education but carries a negative partial coefficient here, so leaving it out pushed the edu
coefficient down. Controlling for AFQT scores therefore reduces the omitted variable bias.
