Lesson 11 Pearsons R
Lesson 11 Pearsons R
Lesson 11
Correlation (Pearson’s r)
INTRODUCTION:
OBJECTIVES:
1
Correlation
To answer the third question, you must ascertain what type of relationship
exists. There are two types of relationships: simple and multiple. In a simple
relationship, there are two variables—an independent variable, also called
an explanatory variable or a predictor variable, and a dependent variable, also
called a response variable. A simple relationship analysis is called simple
regression, and there is one independent variable that is used to predict the
dependent variable. For example, a manager may wish to see whether the
number of years the salespeople have been working for the company has
anything to do with the amount of sales they make. This type of study involves a
simple relationship, since there are only two variables—years of experience and
amount of sales.
2
student’s GPA, and the student’s high school background. This type of study
involves several variables.
The correlation coefficient computed from the sample data measures the
strength and direction of a linear relationship between two variables. The symbol
for the sample correlation coefficient is 𝑟. The symbol for the population
correlation coefficient is 𝜌 (Greek letter rho).
3
The graphs in figure below show the relationship between the correlation
coefficients and their corresponding scatter plots. Notice that as the value of the
correlation coefficient increases from 0 𝑡𝑜 + 1 (parts a, b, and c), data values
become closer to an increasingly stronger relationship. As the value of the
correlation coefficient decreases from 0 𝑡𝑜 − 1 (parts d, e, and f), the data values
also become closer to a straight line. Again this suggests a stronger relationship.
Rounding Rule for the Correlation Coefficient Round the value of 𝑟 to three
decimal places. The formula looks somewhat complicated, but using a table to
compute the values, as shown in example 1, makes it somewhat easier to
determine the value of 𝑟. There are no units associated with 𝑟, and the value of
𝑟 will remain unchanged if the 𝑥 and 𝑦 values are switched.
4
Example 1: Car Rental Companies. Compute the correlation coefficient for
car rental companies in the United States for a recent year.
Solution:
Step 2: Find the values of 𝑥𝑦, 𝑥 2 , and 𝑦 2 and place these values in the
corresponding columns of the table.
5
Step 3: Substitute in the formula and solve for 𝑟.
(6)(682.77) − (153.8)(18.7)
= = 𝟎. 𝟗𝟖𝟐
√[(6)(5859.26) − (153.8)2 ][(6)(80.67) − (18.7)2 ]
Solution:
Step 2: Find the values of 𝑥𝑦, 𝑥 2 , and 𝑦 2 ; place these values in the
corresponding columns of the table.
6
Step 3: Substitute in the formula and solve for 𝑟.
(7)(3745) − (57)(511)
= = −𝟎. 𝟗𝟒𝟒
√[(7)(579) − (57)2 ][(7)(38,993) − (511)2 ]
7
𝑯 2 72
𝑰 1 48
Solution:
Step 2: Find the values of 𝑥𝑦, 𝑥 2 , and 𝑦 2 ; place these values in the
corresponding columns of the table.
(9)(1520) − (36)(370)
= = 𝟎. 𝟎𝟔𝟕
√[(9)(232) − (36)2 ][(9)(19,236) − (370)2 ]
The value of r indicates a very weak positive relationship between the variables.
8
possibilities when 𝑟 is not equal to zero: either the value of 𝑟 is high enough to
conclude that there is a significant linear relationship between the variables, or
the value of 𝑟 is due to chance.
When the null hypothesis is rejected at a specific level, it means that there
is a significant difference between the value of 𝑟 and 0. When the null hypothesis
is not rejected, it means that the value of 𝑟 is not significantly different from 0
(zero) and is probably due to chance.
9
Example 4: Using PPMC Table, test the significance of the correlation coefficient
𝑟 = 0.982, obtained in Example 1, at 𝛼 = 0.01.
Solution:
𝐻0 : 𝜌 = 0 and 𝐻1 : 𝜌 ≠ 0
Since the sample size is 6 and 𝛼 = 0.01, the critical value obtained from
PPMC Table is 0.917. For a significant relationship, a value of 𝑟 greater than
+0.917 or less than -0.917 is needed. Since 𝑟 = 0.982, the null hypothesis is
rejected. Hence, there is enough evidence to say that there is a significant linear
relationship between the variables. See figure.
10
2. Calories and Cholesterol. The number of calories and the number of
milligrams of cholesterol for a random sample of fast-food chicken
sandwiches from seven restaurants are shown here. Is there a relationship
between the variables?
11
Critical Values Pearson Product Moment Correlation Coefficient (PPMC)
12