Probability To Correlation1
Chapter 1
LESSON 3
Sampling Designs Review
Probability Sampling
Simple Random Sampling
Stratified Sampling
Systematic Sampling
Cluster Sampling
Non-Probability Sampling
Convenience Sampling
Purposive Sampling
Quota Sampling
Snowballing Sampling
Probability and Normal
Distribution
• A probability can be expressed as a
proportion from 0 to 1 or as a
percentage from 0% to 100%.
Unconditional Probability
P(characteristic) = # persons with characteristic / N
Conditional Probability
P(characteristic | subgroup) = # persons in the subgroup with characteristic / # persons in the subgroup
Example: Unconditional Probability
P(characteristic) = # persons with characteristic / N
Consider the data below, which show the number of children 5-10 years of age
with gadget addiction who are seeking medical care.

         Age (years)
         5     6     7     8     9     10    Total
Boys     431   380   500   411   421   417   2560
Girls    409   512   413   435   460   501   2730
Total    840   892   913   846   881   918   5290

Example: Conditional Probability
P(characteristic | subgroup) = # persons in the subgroup with characteristic / # persons in the subgroup
P(9 years old | girl) = 460 / 2730 ≈ 0.168
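The two formulas can be checked against the table with a short script. This is only a sketch: the helper names `unconditional` and `conditional` are made up for illustration, and the counts are copied from the table above.

```python
# Counts copied from the table (children aged 5-10 seeking medical care).
ages = [5, 6, 7, 8, 9, 10]
boys = [431, 380, 500, 411, 421, 417]
girls = [409, 512, 413, 435, 460, 501]

n_total = sum(boys) + sum(girls)  # all children in the table


def unconditional(count, n=n_total):
    """P(characteristic) = # persons with characteristic / N."""
    return count / n


def conditional(count_in_subgroup, subgroup_size):
    """P(characteristic | subgroup) = # in subgroup with characteristic / subgroup size."""
    return count_in_subgroup / subgroup_size


idx9 = ages.index(9)
# Unconditional: 9-year-olds out of all children.
p_nine = unconditional(boys[idx9] + girls[idx9])
# Conditional: 9-year-olds among girls only.
p_nine_given_girl = conditional(girls[idx9], sum(girls))

print(f"P(9 years old)        = {p_nine:.4f}")
print(f"P(9 years old | girl) = {p_nine_given_girl:.4f}")
```

The only difference between the two calls is the denominator: the whole sample for the unconditional probability, the subgroup total for the conditional one.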
Normal Distribution
Characteristics of a Normal Distribution
1. Symmetric and unimodal
2. Asymptotic: the tails approach the horizontal axis but never touch it
3. The mean, median, and mode are all equal
Example:
You join a crab-catching contest. The sizes
of the crabs are normally distributed with a
mean of 16 cm and a standard deviation of 4 cm.
What is the chance of catching crabs that are
smaller than 8 cm? What is the chance of winning
a prize if the prize is offered for any crab over
24 cm? What is the chance of catching crabs
between 16 cm and 24 cm?
Solution:
1. Draw a picture of the normal distribution and convert each size to a
z-score, z = (x - mean) / standard deviation = (x - 16) / 4.
Thus,
Problem 1: P(x<8) = P(z<-2)
Problem 2: P(x>24) = P(z>2)
Problem 3: P(16<x<24) = P(0<z<2)
2. Calculate the probabilities.
Problem 1: P(x<8) = P(z<-2) = 0.0228
The probability of getting
crabs smaller than 8 cm is 2.28%.
Problem 2: Calculate the probability of getting crabs larger
than 24 cm.
• This probability is found by subtracting the area to the
left of 24 cm from the whole area under the curve, which
is 1.
In symbols we have: P(x>24) = P(z>2) = 1 - P(z<2)
Note that: P(z<2) = 0.9772
Therefore: P(z>2) = 1 - 0.9772 = 0.0228
This means that the probability of getting crabs bigger than 24 cm is
2.28%, which is also the probability of winning a prize.
Problem 3: P(16<x<24) = P(0<z<2) = P(z<2) - P(z<0) = 0.9772 - 0.5000 = 0.4772
The probability of catching crabs with sizes
between 16 cm and 24 cm is 47.72%.
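The three crab probabilities can be reproduced without a z-table. The sketch below assumes Python 3.8 or later, where the standard library provides `statistics.NormalDist`; the variable names are illustrative only.

```python
from statistics import NormalDist

# Crab sizes: mean 16 cm, standard deviation 4 cm (from the example).
crab = NormalDist(mu=16, sigma=4)


def z_score(x, mu=16, sigma=4):
    """Standardize a size x: z = (x - mu) / sigma."""
    return (x - mu) / sigma


# Problem 1: area to the left of 8 cm.
p_less_8 = crab.cdf(8)
# Problem 2: whole area (1) minus the area to the left of 24 cm.
p_over_24 = 1 - crab.cdf(24)
# Problem 3: area between 16 cm and 24 cm.
p_16_to_24 = crab.cdf(24) - crab.cdf(16)

print(f"z(8) = {z_score(8)}, z(24) = {z_score(24)}")
print(f"P(x < 8)       = {p_less_8:.4f}")
print(f"P(x > 24)      = {p_over_24:.4f}")
print(f"P(16 < x < 24) = {p_16_to_24:.4f}")
```

Because the normal curve is symmetric, the areas below 8 cm and above 24 cm come out equal: both sizes are two standard deviations from the mean.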
Regression Analysis and
Correlation Analysis
[Figure: scatter plot of Weight (y) against Height (x), with heights of 60, 62, 64, 66, and 68 inches marked]
How much would an adult female weigh if she were 5 feet (60 inches) tall?
She could weigh varying amounts; in other words, there is a distribution of
weights for adult females who are 5 feet tall. This distribution is normally
distributed.
What would you expect for other heights?
Where would you expect the TRUE LSRL (least-squares regression line) to be?
Regression Analysis
Regression Analysis – a statistical method that makes use of the relationship between two or
more quantitative variables.
Dependent or Response Variable (Y) – the variable whose values can be explained with knowledge
of the values of the other variable, called the Independent or Explanatory Variable (X).
The two main purposes of regression
The line of ‘best fit’ is the line for which the differences
between the actual values of Y and the values of Y predicted by
the regression line, squared and summed over every X, give the
smallest possible sum.
Estimation of Parameters
b is the regression coefficient or the slope of the regression line
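A minimal sketch of how the slope and intercept are commonly estimated by least squares. Several things here are assumptions: the notation Ŷ = a + bX with intercept a is not taken from the slides, the function names are illustrative, and the five data points are made up because the lecture's own data table is not reproduced in this text.

```python
def fit_line(xs, ys):
    """Least-squares estimates for the line y-hat = a + b*x (assumed notation)."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    # Slope (regression coefficient).
    b = (n * sxy - sx * sy) / (n * sxx - sx ** 2)
    # Intercept: the fitted line passes through (mean of x, mean of y).
    a = sy / n - b * sx / n
    return a, b


def sse(xs, ys, a, b):
    """Sum of squared differences between actual and predicted y."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


# Hypothetical data, for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

a, b = fit_line(xs, ys)
print(f"y-hat = {a:.2f} + {b:.2f} x")
print(f"SSE at the fitted line: {sse(xs, ys, a, b):.2f}")
# Nudging either estimate away from the fit increases the SSE.
print(sse(xs, ys, a + 0.1, b) > sse(xs, ys, a, b))
print(sse(xs, ys, a, b + 0.1) > sse(xs, ys, a, b))
```

The last two lines illustrate the 'best fit' property: any other intercept or slope gives a larger sum of squared differences.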
Solutions
The regression equation for the data is:
The correlation coefficient measures the strength of the linear relationship between two variables
X and Y.
Sample Correlation Coefficient
The formula:
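The formula itself is not reproduced in this text. A standard computational form of the sample (Pearson) correlation coefficient is given below as a reference sketch; it is assumed, not confirmed, to be the form used in the lecture. Here n is the number of (x, y) pairs.

```latex
r = \frac{n\sum_{i=1}^{n} x_i y_i - \left(\sum_{i=1}^{n} x_i\right)\left(\sum_{i=1}^{n} y_i\right)}
         {\sqrt{\left[n\sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2\right]
                \left[n\sum_{i=1}^{n} y_i^2 - \left(\sum_{i=1}^{n} y_i\right)^2\right]}}
```

The value of r always lies between -1 and +1; values near either end indicate a strong linear relationship, and values near 0 a weak one.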
Scatter Plot Examples
[Figure: scatter plots of y against x contrasting linear relationships with curvilinear relationships]
[Figure: scatter plots of y against x contrasting strong relationships with weak relationships]
[Figure: scatter plot of y against x showing no relationship]
Examples of Approximate r Values
[Figure: five scatter plots of y against x with r = -1, r = -0.6, r = 0, r = +0.3, and r = +1]
Note:
Data columns: Student, Aptitude Test Score, Statistics Grade
$\sum_{i=1}^{n} x_i y_i = 30500$, $\sum_{i=1}^{n} x_i = 390$, $\sum_{i=1}^{n} y_i = 385$, $\sum_{i=1}^{n} x_i^2 = 31150$, $\sum_{i=1}^{n} y_i^2 = 30275$
$r^2 = 0.9801 = 98.01\%$
The relationship between students' Statistics grades and aptitude test scores is strong.
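A sketch of how r and r² can be computed from the same five sums used in the note. The number of students is not stated in this text, so the lecture's result is not recomputed here; the function name `pearson_r` and the small dataset are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Sample correlation coefficient computed from the five sums."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    syy = sum(y * y for y in ys)
    numerator = n * sxy - sx * sy
    denominator = sqrt((n * sxx - sx ** 2) * (n * syy - sy ** 2))
    return numerator / denominator


# Hypothetical data, for illustration only (not the lecture's students).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

r = pearson_r(xs, ys)
r_squared = r ** 2
print(f"r   = {r:.4f}")
print(f"r^2 = {r_squared:.4f}")
```

With the lecture's data, an r² of 0.9801 corresponds to r = 0.99, which is why the relationship is described as strong.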
Thank you for listening