Pearson's Correlation
Pearson's Correlation
One of the statistical concepts that is most related to this type of analysis is the correlation
coefficient. The correlation coefficient is the unit of measurement used to calculate the intensity
in the linear relationship between the variables involved in a correlation analysis, this is easily
identifiable since it is represented with the symbol r and is usually a value without units which
is located between 1 and -1. In other words, correlation examines the relationship between pairs
of scores. The greater the association between the two variables, the more accurate we can
predict the relationship between the variables. Correlation between two variables can be either
a positive correlation, a negative correlation, or no correlation. Following are a few methods
to calculate correlation
1. Pearson’s product moment (r)
2. Rank-order correlations (Spearman’s ‘p’ or ‘rho’)
3. Biserial correlation
4. Tetrachoric Correlation
5. Multiple correlation
• Positive correlation: A positive correlation between two variables means both the variables
move in the same direction. An increase in one variable leads to an increase in the other variable
and vice versa. For example, happiness leads to increase psychological well-being. Positive
correlation is denoted by + (plus)
• Negative correlation: A negative correlation between two variables means that the variables
move in opposite directions. An increase in one variable leads to a decrease in the other variable
and vice versa. For example, stress decreases performance. Negative correlation is denoted by
– (minus)
• Weak/Zero correlation: No correlation exists when one variable does not affect the other. For
example, there is no correlation between the number of years of school a person has attended
and the letters in his/her name.
1
Following are the interpretation
Formula:
For example: Following are the marks of 5 students obtained in test X and test Y. Determine
whether correlation exists between marks on test X and test Y
A B C D E
Test x 50 60 30 40 20
Test y 40 60 40 30 30
2
STEP 2: subtract test score of test Y minus Mean (e.g., 40 - 40 = 0)
STEP 3: Square the values obtained in X to get X2 values (e.g., 10 * 10 = 100)
STEP 4: Square the values obtained in Y to get Y2 values (e.g., 0 * 10 = 0)
STEP 5: Multiple the values obtained in X with Y to get XY values (e.g., 10 * 0 = 0)
STEP 6: Apply Person Correlation formula and Substitute the values.
STEP 7:
STEP 9: Interpretation: The obtained score of .77 indicates that there is a high
correlation found Test X and Test Y.
A) B)
score in score in
Students English Maths sl.no Stock A Stock B
A 40 78 1 45 9
B 21 70
2 50 8
C 25 60
3 53 8
D 31 55
E 38 80 4 58 7
F 47 66 5 60 5
C) D)
3
Answers
A r = 0.35
B r = - 0.90
C r = 0.064
D r = -2.19