Student - Dummy Variable Issue
Student - Dummy Variable Issue
c) Which type of problem may arise if we do not follow dummy variable trap
properly?
d) Consider a simple model with one continuous variable (x) and one dummy
(d). What is the difference between two models if d = 0 or 1 for comparing
the alternatives?
e) Among different types of measurement scales and data set, which one/s are
best fitted for dummy variable?
1
a. D=1 for male, 0 for female
b. D=1 for female, 2 for male
c. D=1 for female, -1 for male.
Requirement: a) Interpret the preceding regression model for each dummy
assignment. Is one method preferable to another? Justify your answer.
b) What is the impact of dummy on intercept & slope coefficient in regression?
4. Question for Discussion No. 9.16
To study the rate of growth of population in Belize over the period 1970-1992,
Mukherjee et al. estimated the following models:
a) In Model I, what is the rate of growth of Belize’s population over the sample
period?
b) Are the population growth rates statistically different pre- and post-1978? How
do you know?
c) What are the cautionary issues that analysts should consider whilst
using dummy variable?
5. Questions for discussion
In his study on the labor hours spent by the FDIC (Federal Deposit Insurance
Corporation) on 91 bank examinations, R. J. Miller estimated the following function*:
ln 𝑌̂̂ = 2.41 + 0.3674 ln X1 + 0.2217 ln X2 + 0.0803 ln X3
(0.0477) (0.0628) (0.0287)
−0.1755D1 + 0.2799D2 + 0.5634D3 − 0.2572D4
(0.2905) (0.1044) (0.1657) (0.0787)
R2 = 0.766
where Y = FDIC examiner labor hours
X1 = total assets of bank
2
X2 = total number of offices in bank
X3 = ratio of classified loans to total loans for bank
D1 = 1 if management rating was “good”
D2 = 1 if management rating was “fair”
D3 = 1 if management rating was “satisfactory”
D4 = 1 if examination was conducted jointly with the state
The figures in parentheses are the estimated standard errors.
a. Based on functional form, which type of regression it is? Interpret the results of the
model.
b. Is there any problem in interpreting the dummy variables in this
model since Y is in the log form?. How would you interpret the
dummy coefficients? Please consider the suggestion made by
Halvorsen and Palmquist to provide your response.