Correlation & Regression
Correlation & Regression
SEM
Module No.4
CORRELATION AND REGRESSION ANALYSIS
MEANING OF CORRELATION:
between two or more variables. Correlation analysis is a statistical technique to study the degree and
DEFINATION OF CORRELATION:
“When the relationship is of a quantitative nature, the appropriate statistical tool for discovering and
measuring the relationship and expressing it in a brief formula is known as correlation.”
“If two or more quantities vary in sympathy so that movements in one tend to be accompanied by
corresponding movements in others, then they are said to be correlated.”
ANURADHA.S
ISBR College Page 1
Ch:4 – Correlation & Regression BBA 3
SEM
TYPES OF CORRELATION:
Correlation is described or classified in several different ways. Three of the most important are:
Whether correlation is positive (direct) or negative (inversa) would depend upon the direction of change
of the variable.
Positive Correlation: If both the variables vary in the same direction, correlation is said to be positive. It
means if one variable is increasing, the other on an average is also increasing or if one variable is
decreasing, the other on an average is also deceasing, then the correlation is said to be positive
correlation.
For example: The correlation between heights and weights of a group of persons is a positive
Correlation.
Negative Correlation: If both the variables vary in opposite direction, the correlation is said to be
negative. If means if one variable increases, but the other variable decreases or if one variable decreases,
but the other variable increases, then the correlation is said to be negative correlation.
For example: The correlation between the price of a product and its demand is a negative correlation.
Zero Correlation: If the existence of variable is absence & their is no relationship between the variables
then, it is said to be zero correlation. It means a change in value of one variable doesn’t influence or
change the value of other variable.
For example: The correlation between weight of person and intelligence is a zero or no correlation.
ANURADHA.S
ISBR College Page 2
Ch:4 – Correlation & Regression BBA 3
SEM
The distinction between simple, partial and multiple correlations are based upon the number of variables
studied.
Simple Correlation: When only two variables are studied, it is a case of simple correlation.
For example: when one studies relationship between the marks secured by student and the attendance of
student in class.
Partial Correlation: In case of partial correlation one studies three or more variables but considers only
two variables to be influencing each other and the effect of other influencing variables being held
constant.
For example: The relationship between student marks and attendance, the other variable influencing
Such as effective teaching of teacher, use of teaching aid like computer, smart board etc are assumed to
be constant
Multiple Correlation : When the study is with three or more variables it is a multiple correlation.
For example: If study covers the relationship between student marks, attendance of students, effectiveness
of teacher, use of teaching aids etc, it is a case of multiple correlation.
Depending upon the constancy of the ratio of change between the variables, the Correlation may be
Linear or Non-linear Correlation.
Linear Correlation: If the amount of change in one variable bears a constant ratio to the amount of
change in the other variable, then correlation is said to be linear line.
For example: If it is assumed that, to produce one unit of finished product we need 10 units of raw
materials, then subsequently to produce 2 units of finished product we need double of the one unit.
Raw material : X 10 20 30 40 50 60
Finished Product : Y 2 4 6 8 10 12
Non-linear Correlation: If the amount of change in one variable does not bear a constant ratio to the
amount of change to the other variable, then correlation is said to be non-linear.
For example: If we double the amount of advertisement expenditure, then sales volume would not
necessarily be doubled.
ANURADHA.S
ISBR College Page 3
Ch:4 – Correlation & Regression BBA 3
SEM
Advertisement Expenses : X 10 20 30 40 50
60
Sales Volume : Y 2 14 16 8 10
22
Quantification of the relationship between variables is very essential to take the benefit of study of
correlation. For this, we find there are various methods of measurement of correlation, which can be
represented as given below:
Karl Pearson’s method of calculating coefficient of correlation is based on the covariance of the
two variables in a series. This method is widely used in practice and the coefficient of correlation is
denoted by the symbol “r”.
ANURADHA.S
ISBR College Page 4
Ch:4 – Correlation & Regression BBA 3
SEM
If the two variables under study are X and Y, the following formula suggested by Karl Pearson can be
used for measuring the degree of relationship of correlation.
1. The coefficient of correlation always lies between – 1 to +1, symbolically it can written as – 1 ≤ r ≤ 1.
ANURADHA.S
ISBR College Page 5
Ch:4 – Correlation & Regression BBA 3
SEM
3. The coefficient of correlation is a pure number and is independent of the units of measurement. It
means if X represent say height in inches and Y represent say weights in kgs, then the correlation
coefficient will be neither in inches nor in kgs but only a pure number.
1. This research is limited in its nature. It can only determine the relationship between two
variables not more than two variables.
2. It doesn’t prove the cause and effect between the variables. It means it doesn’t indicate which
variable becomes the cause of the statistical pattern.
3. You cannot have control over the variables. It only allows you to observe or spot the variables
and their statistical patterns.
4. The information you get from this research is limited. It only studies the relationship of
variables but does not find the cause.
ANURADHA.S
ISBR College Page 6
Ch:4 – Correlation & Regression BBA 3
SEM
The correlation coefficient between these allotted two series of ranks is popularly called as “Spearman’s
Rank Correlation” and denoted by “R”
ANURADHA.S
ISBR College Page 7
Ch:4 – Correlation & Regression BBA 3
SEM
REGRESSION:
MEANING:
Regression analysis is a statistical tool to study the nature and extent of functional
relationship between two or more variables and to estimate (or predict) the unknown values of dependent
variable from the known values of independent variable.
The variable that forms the basis for predicting another variable is known as the
Independent Variable and the variable that is predicted is known as dependent variable.
1. It provides estimates of values of the dependent variables from values of independent variables.
2. It is used to obtain a measure of the error involved in using the regression line as a basis for estimation.
3. With the help of regression analysis, we can obtain a measure of degree of association or correlation
that exists between the two variables.
4. It is highly valuable tool in economies and business research, since most of the problems of the
economic analysis are based on cause and effect relationship.
Regression lines and Regression equations are used synonymously. Regression equations are algebraic
expression of the regression lines. Let us consider two variables X & Y. If Y depends on x, then the result
comes in the form of simple regression. If we take the case of two variable X and Y, we shall have two
regression lines as the regression line of X on Y and regression line of Y on X
Simple linear regression model we have the following two regression lines:
ANURADHA.S
ISBR College Page 8
Ch:4 – Correlation & Regression BBA 3
SEM
1. Regression line of Y on X: This line gives the probable value of Y (Dependent variable) for any given
value of X (Independent variable).
: Y = a + bX
2. Regression line of X on Y: This line gives the probable value of X (Dependent variable) for any given
value of Y (Independent variable).
: X = a + by
REGRESSION COEFFICIENT:
The quantity “b” in the regression equation is called as the regression coefficient or slope coefficient.
Since there are two regression equations, therefore, we have two regression coefficients.
R = √ bxy ∗ byx
ANURADHA.S
ISBR College Page 9
Ch:4 – Correlation & Regression BBA 3
SEM
2. If one of the regression coefficients is greater than unity, the other must be less than unity, since the
value of the coefficient of correlation cannot exceed unity.
3. Both the regression coefficient will have the same sign. i.e. they will be either positive or negative.
4. The coefficient of correlation will have the same sign as that of regression coefficient, i.e. if regression
coefficient have a negative sign, “r” will also have negative sign and if the regression coefficient have a
positive sign, “r” would also be positive.
For example, if bxy = -0.2 and byx = -0.8 then r = - √0.2 ∗ 0.8 = – 0.4
5. The average value of the two regression coefficient would be greater than the value of coefficient of
correlation. In symbol (bxy + byx) / 2 > r.
For example, if bxy = 0.8 and byx = 0.4 then average of the two values = (0.8 + 0.4) / 2 = 0.6 and
ANURADHA.S
ISBR College Page 10