Business Analytics Question Paper Solution
Business Analytics Question Paper Solution
Q1(A):What do you mean by the multiple linear regression model? Write a stochastic
equation of a multiple regression model where the dependent variable is 'debt
payments' and the independent variables are (a) income, and (b) unemployment.
Where:
---
The table provides the output for the linear regression model where "Debt Payments" is
the dependent variable, and "Income" and "Unemployment" are the independent
variables.
- **Intercept**: The value of debt payments when income and unemployment are both
zero is approximately 198.9956, but it is not statistically significant (p = 0.215).
- **Income**: The coefficient for income is 10.5122, which means for every unit
increase in income, debt payments increase by approximately 10.51 units. This variable
is highly significant (p < 0.001).
- **Adjusted R-squared (0.7312)**: After adjusting for the number of predictors, 73.12%
of the variability in debt payments is explained by the model.
- **F-statistic (35 on 2 and 23 DF, p-value: 1.054e-07)**: The model is statistically
significant overall, meaning the independent variables jointly influence the dependent
variable.
---
Q1(C): *If income is Rs. 80,000 and the unemployment rate is 7.5%, how much will be
the debt payments?*
---
---
Q2(B): How will you test for these assumptions using graphical methods?
1. Linearity: Check residual plots. The plot of residuals vs. predicted values should
show no pattern.
3. **Homoscedasticity**: Residuals vs. fitted values plot should show constant spread.
---
1. **Remove highly correlated predictors**: Identify and remove one of the correlated
variables.
---
### **Q3(A)**: *Discuss the use of (a) Quadratic Regression Model, (b) Log-Log Model,
(c) Semi-Log Model, and (d) Exponential Model.*
2. Log-Log Model: Both the dependent and independent variables are transformed
using logarithms. It is used when the relationship between the variables follows a
power law.
4. **Exponential Model**: This model is used when the growth rate of the dependent
variable is proportional to its current value, often represented by:
Question:
p-values:
- Intercept: 0.000
(a) Interpret the value of the elasticity of demand for cigarettes with respect to price.
- The price elasticity of demand is given by the coefficient of ln(P), which is -1.25.
- This implies that a 1% increase in the price of cigarettes leads to a 1.25% decrease in
cigarette consumption, holding income constant.
- Since the coefficient is negative, cigarettes are a normal good, and the demand
decreases as the price increases.
(b) Interpret the value of the income elasticity of demand for cigarettes.
- The income elasticity of demand is given by the coefficient of ln(I), which is 0.18.
- Since this is a small positive number, it suggests that cigarettes are a normal good but
not highly sensitive to changes in income.
(c) At the 5% significance level, is the price elasticity of demand statistically significant?
- The p-value for the price elasticity is 0.005, which is less than 0.05.
- It may be surprising that income does not significantly affect cigarette consumption,
but this could be due to various factors such as cigarettes being considered a necessity
by certain consumers, regardless of income changes.
(e) Write the R-code for estimating the above regression model:
```R
summary(model)
```
Q4(A)
- The ‘dummy variable trap’ occurs when two or more dummy variables are highly
correlated, typically when a full set of dummy variables is used to represent all
categories, causing multicollinearity. For example, if both “male” and “female” are
used as dummy variables, one can predict the other. This results in perfect
multicollinearity, making the regression model unstable.
Q4(B)
An analyst interviews 50 employees to determine whether males are paid more than
females, on average, holding experience constant. The regression equation is:
- Intercept: 0.000
- Experience: 0.000
- Male: 0.000
Where:
(b) Do you find the difference in the salary of male and female significant, holding years
of experience constant?
- The coefficient for the dummy variable Male is 13.9240 with a p-value of 0.000, which
is less than 0.05.
- This means that the difference in salary between males and females is statistically
significant, holding years of experience constant.
- Specifically, males earn approximately $13.92 more per hour than females with the
same level of experience.
Q4(C)
- The model is bounded between 0 and 1, meaning the predicted probabilities never go
below 0 or above 1.