Correlation Analyses
Correlation Analyses
Track all changes, then work with you to bring about scholarly writing.
Is there a statistically significant relationship between age, as measured in years, and height,
measured in inches?
Is there a relationship between temperature, measured in degrees Fahrenheit, and ice cream
sales, measured by income?
Is there a relationship between job satisfaction, as measured by the JSS, and income,
measured in dollars?
Assumptions
For the Pearson r correlation, both variables should be normally distributed (normally
distributed variables have a bell-shaped curve). Other assumptions include linearity
and homoscedasticity. Linearity assumes a straight line relationship between each of the two
variables and homoscedasticity assumes that data is equally distributed about the regression
line.
Key Terms
Effect size: Cohen’s standard may be used to evaluate the correlation coefficient to
determine the strength of the relationship, or the effect size. Correlation coefficients between
.10 and .29 represent a small association, coefficients between .30 and .49 represent a
medium association, and coefficients of .50 and above represent a large association or
relationship.
Continuous data: Data that is interval or ratio level. This type of data possesses the
properties of magnitude and equal intervals between adjacent units. Equal intervals between
adjacent units means that there are equal amounts of the variable being measured between
adjacent units on the scale. An example would be age. An increase in age from 21 to 22
would be the same as an increase in age from 60 to 61.
Kendall rank correlation: Kendall rank correlation is a non-parametric test that measures the
strength of dependence between two variables. If we consider two samples, a and b, where
each sample size is n, we know that the total number of pairings with a b is n(n-1)/2. The
following formula is used to calculate the value of Kendall rank correlation:
Key Terms
Spearman rank correlation: Spearman rank correlation is a non-parametric test that is used
to measure the degree of association between two variables. The Spearman rank correlation
test does not carry any assumptions about the distribution of the data and is the appropriate
correlation analysis when the variables are measured on a scale that is at least ordinal.
Is there a statistically significant relationship between horse’s finishing position a race and
horse’s age?
Assumptions
The assumptions of the Spearman correlation are that data must be at least ordinal and the
scores on one variable must be monotonically related to the other variable.
Effect size: Cohen’s standard may be used to evaluate the correlation coefficient to
determine the strength of the relationship, or the effect size. Correlation coefficients between
.10 and .29 represent a small association, coefficients between .30 and .49 represent a
medium association, and coefficients of .50 and above represent a large association or
relationship.
Ordinal data: In an ordinal scale, the levels of a variable are ordered such that one level can
be considered higher/lower than another. However, the magnitude of the difference between
levels is not necessarily known. An example would be rank ordering levels of education. A
graduate degree is higher than a bachelor’s degree, and a bachelor’s degree is higher than a
high school diploma. However, we cannot quantify how much higher a graduate degree is
compared to a bachelor’s degree. We also cannot say that the difference in education
between a graduate degree and a bachelor’s degree is the same as the difference between a
bachelor’s degree and a high school diploma.
Correlation Resources:
Algina, J., & Keselman, H. J. (1999). Comparing squared multiple correlation coefficients:
Examination of a confidence interval and a test significance. Psychological Methods, 4(1),
76-83.
Coffman, D. L., Maydeu-Olivares, A., Arnau, J. (2008). Asymptotic distribution free interval
estimation: For an intraclass correlation coefficient with applications to longitudinal
data. Methodology, 4(1), 4-9.
Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple
regression/correlation analysis for the behavioral sciences. (3rd ed.). Mahwah, NJ:
Lawrence Erlbaum Associates. View
Hatch, J. P., Hearne, E. M., & Clark, G. M. (1982). A method of testing for serial correlation
in univariate repeated-measures analysis of variance. Behavior Research Methods &
Instrumentation, 14(5), 497-498.
Kendall, M. G., & Gibbons, J. D. (1990). Rank Correlation Methods (5th ed.). London:
Edward Arnold. View
Krijnen, W. P. (2004). Positive loadings and factor correlations from positive covariance
matrices. Psychometrika, 69(4), 655-660.
Shieh, G. (2006). Exact interval estimation, power calculation, and sample size determination
in normal correlation analysis. Psychometrika, 71(3), 529-540.
Stauffer, J. M., & Mendoza, J. L. (2001). The proper sequence for correcting correlation
coefficients for range restriction and unreliability. Psychometrika, 66(1), 63-68.
Related Pages:
Learn More
Dissertation Consulting
Webinars
Free Dissertation Resources
Blog
About Us
Sitemap
Call Us727-442-4290
Email [email protected]
Contact Us
© Statistics Solutions 2021