Recall From Last Time: Section 8 Scatter Plots and Linear Regression
Recall From Last Time: Section 8 Scatter Plots and Linear Regression
4/17/2019 Wednesday
Recall From Last Time: Section 8 Scatter Plots and Linear Regression
1. Scatter Plots
3. Line of best fit( or trend line, regression line): is a straight line that best represents the data on a scatter
plot.
This line may pass through some of the points, none of the points, or all of the points.
4. Linear correlation coefficient (r): measures the strength of the linear relationship between paired x and y
values.
-1 ≤ r ≤ 1 The sign indicates if the variables are negatively correlated or positively correlated.
r is very sensitive to outliers.
#2 Straight Enough Condition - confirm that scatter plot displays straight line pattern - correlation
coefficient r is for LINEAR Correlation only!
#3 Outlier Condition - Outliers are removed if they are known to be errors. Outliers are data values that
deviate significantly from the other data values.
Calculate r - pretty nasty calculations so usually done with a calculator, round r to 3 decimal places.
***Otherwise use the mean value of the response variable ý as the best predicted value***
Residual Plots
Residuals y− ^y should have no pattern. A “best fit” line will have some data above and some below.
Graph of (x, y -^y ) should also have no pattern. This indicates that the line of best fit is a good approximation
for the data.
A residual plot with a non-random pattern (either right-side up or inverted U-shape) indicates that the line of
best fit is not a good approximation for the data; a non-linear estimation would be a better fit.
Examples here -