Simple Linear Regression & Correlation Chapter No 14...
Simple Linear Regression & Correlation Chapter No 14...
Chapter # 14
is given by
Where “a” and “b” are the point estimates of the intercept and slop of the population
The residuals of the i-th point from the estimated regression line is
When we look for a relationship is the expected value of Y for a given value of x, the
population regression function is given by
= f(x)
Where f(x) is a function of independent variable x and may be of the linear,
quadratic, exponential or any other form.
Where
= the slop of the population regression curve also known as population regression
coefficient
Let Let
are are
n observed value of a random variable Y, with n observed value of a random variable X, with
their associated ‘x’ values, then estimated their associated ‘y’ values, then estimated
regression line “Y on X” is regression line “X on Y” is
is minimum. The resulting estimates are is minimum. The resulting estimates are
Regression coefficients
Y on X( ) X on Y( )
x – intercept( ) x – intercept ( )
That is.
That is
(3) The regression line always passes through the point of means ( ), the center of
gravity of the observed data.
(4) The sum of the observed values equals the sum of the fitted values
That is
i. The mean of the fitted values is the same as the mean of the observed
values
That is
Correlation:-
Correlation is measure of the degree of linear association between tow variables.
OR
Correlation measures the strength of a relationship between variables.
OR
The degree or strength of relationship (interdependence) between the variables is called
correlation.
Correlation coefficient:-
The numerical value of correlation is called correlation coefficient. It is denoted by ‘r’.
OR
A measure that expresses the extent to which two variables are related.
OR
A descriptive measure of the degree of linear relationship between ‘X’ and ‘Y’ is called
correlation coefficient.
Correlation Analysis:-
A group of techniques to measure the strength of the association between two variables.
Positive Correlation:-
When the values of two variables move in the same direction so that an increase or
decrease in the value of one variable is associated with an increase or decrease in the value
of the other variable, correlation is said to be positive.
OR
If both the variables are moving in same direction (increase or decrease), then it is said to
be positive or direct correlation.
Example:-
Ages and heights of children.
That is
(2) Correlation coefficient is the covariance of values of the two variables X and Y
That is
That is
That is
Thus
(5) Correlation coefficient is zero when one of the variables X and Y is constant.
a. The value of ‘r’ remains unchanged if constants are added to or subtracted from the
values of the variables or if the values of the variables are multiplied or divided by
b. But the value of ‘r’ changes in sign only if the values of the variables are multiplied
1) What is regression?
Ans. It’s a technique to investigate the relationship between variables for prediction and
estimation.
2) Define independent variable (Regressor).
Ans. The variable that forms the basis of estimation or prediction is called ind4ependent
variable
Example: - Distance can be predicted from amount of petrol, so petrol is
independent Variable.
Note: - Independent variable is also called explanatory variable, predictor or
nonrandom variable
3) What is dependent variable?
Ans. The variable whose resulting value depends upon the value of the independent variable
is called independent variable.
Example: - distance can be predicted for amount of petrol, so distances is
dependent variable.
Note: - Dependent variable is also called explained variable, predicted or
random variable.
4) Differentiate between functional relationships.
Ans. In functional relationship the value of dependent variable can be uniquely determined
form the value of the independent variable.
Ans. It measures the nature of relationship between variables. It is denoted by and given
as
The sum of the observed values equals to the sum of fitted values i.e.
Where
41) What is the relationship between correlation coefficient and regression line
Ans. The correlation coefficient ‘r’ is the slop of the regression lines for standard scores.
Ans.
Ans.
Note: - Regression coefficient is independent of change of origin
Ans.
(i)
(ii)
Multiplying (i) by 5 and subtracting (ii)
55)
The line of best fit was “x” is the number of long – distance calls
made during a month and “y” is the total telephone cost for a month. In terms of
number of long distance calls and cost, explain the meaning of the y – intercept
1420 and the slope 76.8.
Ans. When no long –distance are made there is still the monthly phone charge of Rs. 1420
and Rs. 76.8 is the rate at which the total phone bill will increase for each additional
long distance call.
63) What does the value or “r” indicates?
Ans. The value of the sample linear correlation coefficient indicates the degree and direction
of linear relationship between variables.
64) Describe the significance of the algebraic sign and the magnitude of “r”
Ans.
1.0
The linear correlation coefficient ‘r’ may assume
Strong
values from “–1” through zero to “+1”. 0.8
When ‘r’ is close to zero, we conclude that there 0.8
is little or no correlation between the variables. Moderate
0.5
Positive values of ‘r’ suggest that the two
0.5
variables are positively linearly correlated
Negative values of ‘r’ suggest that the two variables 0 Week
are negatively linearly correlated
0.5
As the calculated value of ‘r’ changes from ‘0’ toward
0.5
either “+1.0” or “-1.0”, it indicates an increasingly
Moderate
stronger linear correlation between the two variables. 0.8
Following figure shows a useful way to describe the 0.8
strength of linear relationship based on “r” Strong
1.0
66) Why do the regression coefficients and the linear correlation coefficients ‘r’
have the same sign?
are the same and determine the sign of and ‘r’ and the denominators are always
positive.
67) Is the following statement correct? Explain why or why not. A correlation
coefficient of zero implies that there is no relationship exists between the two
variables understudy.
Ans. The value of ‘r’ is a measure of extent to which X and Y are linearly related. A value of
‘r’ close to zero does not rule out any strong relationship between X and Y; there could
still be a strong relationship but one that is not linear. Therefore, we should not
conclude that there is no relationship whatsoever just because ‘r’ is near to zero.
68) Difference between correlation and regression?
Ans.
In correlation analysis we use both variables as random.
In regression analysis we take one variable as fixed and the other as random.
Correlation coefficient measures degree of linear relationship between two variables.
Whereas the objective of regression is to study the “nature of relationship” between the
variables.
so that we will be able to predict the values of one variable on basis of on another
69) What is the purpose of simple linear correlation
Ans. The purpose of simple linear correlation is to determine whether or not two or not tow
variables are related.
70) What are the simple linear correlation measures?
Ans. The simple linear correlation measures the strength or closeness of linear relationships
between two variables.
Each question has possible four answers. Choose the correct answer and encircle it.
1) A _____ relation between two variables is a relation where the value of the dependent
variable I uniquely determined from the value of the independent variables
a) Fractional b) Functional c) Cubic d) Squares
2) The study of relationship between variables for prediction and estimation
a) Regression b) Time Series c) Correlation d) All
3) A functional relation is expressed by a formula
a) Mathematical b) Chemical c) Biological d) Empirical
4) The variable that forms the basis of estimation or prediction
a) Regressand b) Regressor c) Dependent Variable d) Explained Variable
5) If x is an independent variable and y is a dependent variable then the functional
relation is of the form
a) b) c) d)
6) If y is an independent variable and x is a dependent variable then the functional
relation is of the form
a) b) c) d)
7) If ‘t’ is an independent variable and ‘s’ is a dependent variable then the functional
relation is of the form
a) b) c) d)
8) If ‘s’ is an independent variable and ‘t’ is a dependent variable then the functional
relation is of the form
a) b) c) d)
9) For a given value of ‘x’, the functional relation ______ gives the corresponding value
of ‘y’
a) b) c) d)
10) The variable that forms the basis of estimation or prediction is called the
a) Regressor b) Regressand c) Regression Curve d) None of these
11) The variable whose resulting value depends upon the selected value of the
independent variable is called the
a) α, β b) c) d) All
33) The least squares regression line minimizes
a) Calculation mistakes b) SSE c) Spelling mistakes d) None of these
34) The change in the dependent variable Y corresponding to a unit change in the
independent variable ‘x’ is measured by
a) b) c) d) None of these
35) Regression coefficient is independent of
a) Origin b) Scale c) Both ‘a’ & ‘b’ d) Unit of Measurement
36) Two regression coefficient always have
a) Opposite Signs b) Same Signs c) No Sign d) None the these
41) For
a) b) c) d)
42) The regression line between X and Y always passes through
47) If then
a) -0.84 b) 0.84 c) 0.42 d) None of these
a) b) c) d)
51) The regression lines coincide if
a) b) c) d)
a) b) c) d)
53) If two regression coefficients are ‘0.8 & ‘0.2’ then value of coefficient of correlation is
a) 0.16 b) -0.16 c) 0.40 d) -0.40
54) The correlation coefficient ‘r’ is the _______ mean of the two regression coefficients
a) Geometric b) Arithmetic c) Harmonic d) None of These
55) The correlation coefficient of two variables would be
a) -1.95 b) 1.95 c) 0.95 d) None of these
56) ‘r = 0’ indicates that the two variables are linearly
a) Independent b) Dependent c) Related d) None of These