0% found this document useful (0 votes)
20 views38 pages

Probability To Correlation1

This document provides an overview of key concepts in probability and statistics including: 1) Different sampling designs such as simple random sampling, stratified sampling, and cluster sampling. 2) Characteristics of probability distributions like the normal distribution and how to calculate probabilities using the normal curve. 3) How to perform regression analysis to establish relationships between variables and use regression equations to predict dependent variable values from independent variables. Correlation analysis is also introduced to measure the strength of relationships.

Uploaded by

Symonette Octura
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views38 pages

Probability To Correlation1

This document provides an overview of key concepts in probability and statistics including: 1) Different sampling designs such as simple random sampling, stratified sampling, and cluster sampling. 2) Characteristics of probability distributions like the normal distribution and how to calculate probabilities using the normal curve. 3) How to perform regression analysis to establish relationships between variables and use regression equations to predict dependent variable values from independent variables. Correlation analysis is also introduced to measure the strength of relationships.

Uploaded by

Symonette Octura
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 38

Section 2

Chapter 1
LESSON 3
Sampling Designs Review
Probability Sampling
 Simple Random Sampling
 Stratified Sampling
 Systematic Sampling
 Cluster Sampling
Non-Probability Sampling
 Convenience Sampling
 Purposive Sampling
 Quota Sampling
 Snowballing Sampling
Probability and Normal
Distribution

• is the chance or likelihood of


an event to happen.

• It can be expressed as
proportions from 0 to 1 or
percentages from 0% to 100%.
Unconditional Probability
P(characteristic) = # persons with characteristic / N

Conditional Probability
P (Characteristic) = # persons with characteristic / N
Example: Unconditional Probability
P(characteristic) = # persons with characteristic / N
Consider the data below that
shows the number of gadget
addictiveness among children 5-
10 years of age who are seeking
medical care.

Age (years)
5 6 7 8 9 10 Total
Conditional Probability
Boys 431 380 500 411 421 417 2560 P (Characteristic) = # persons with characteristic / N
Girls 409 512 413 435 460 501 2730 460
𝑃 ( 9 𝑦𝑒𝑎𝑟 𝑜𝑙𝑑 ! 𝑔𝑖𝑟𝑙 )= =0.169
2730
Total 840 892 913 846 881 918 5290
s
Normal Distribution
Characteristics of a Normal Distribution
1.Symmetric unimodal,
2.asymptotic,
3.the mean, median, and mode are all equal.
Example:
You join in a crab catching contest. The sizes
of the crabs is normally distributed with a
mean of 16 cm and a standard deviation of 4.
What is the chance of catching crabs that are
less than 8 cm? What is the chance of winning
a prize if the prize is offered for any crabs over
24 cm? What is the chance of catching crabs
between 16cm and 24 cm?
Solution:
1. Draw a picture of the normal distribution.

2. Locate the problems in the graph.


3. Translate each problem into probability notation
Problem 1: P(x<8)
Problem 2: P(x>24)
Problem 3: P(16<x<24)
4. Change the x values into z scores.
When x is 8
z-score =
when x is 24
z-score =

thus,
Problem 1: P(x<8) = P(z<-2)
Problem 2: P(x>24) =
P(z>2)
Problem 3: P(16<x<24) =
P(0<z<2)
1.Calculate probabilities
Problem 1: P(x<8)=P(z<-2)=0.0228
The probability of getting
crabs smaller than 8cm is 2.28%.
2. Calculate the probability of getting crabs larger
than 24 cm.
• This problem is calculated by subtracting the
lesser than part of the area. Thus, the whole area
is 1 minus to the left of 24 cm
In symbols we have:

Note that:

Therefore:

This means that the probability of getting crabs bigger than 24cm is
2.28%
Problem 3: P(16<x<24)=P(0<z<2)
The probability of catching crabs with sizes
between 16cm and 24cm is 47.72% which is also the
probability of winning a prize.
Another Example
1.)

2.)

3.)
Another Example
1.)

2.)

3.)
Regression Analysis and
Correlation Analysis
y    x
What would
you
How expect
much
This for other
distribution
Shewould
could an adult
weigh
Weight
varyingheights?
is normally
femaleamounts
weigh
–distributed.
in if
other
shewords,
were 5
60 62 64 66 68 there is a
feet (60
distribution of
60 62 64 66 68
inches)
weights tall?
for adult
60 62 64 66 68 Where
females whowould
are
60 62 64 66 68 you
5 feetexpect
tall. the
Height
TRUE LSRL
to be?
Regression Analysis
Regression Analysis – is a statistical method that makes use of the relationship between two or
more quantitative variables.

Dependent or Response Variable (Y) – can be explained with the knowledge of the values of the
other variable, called the Independent or Explanatory Variable (X).
Regression Analysis
The two main purpose of regression

-To measure relationship between two or more variables.

-To predict or estimate values of dependent variable from the known


values of independent variables.
Regression Analysis
The term ‘simple’ in simple linear regression, is used when we
consider only one independent variable.

The line with ‘best fit’ is that line such that when the differences
between the actual values of Y and the predicted values of Y
based on the regression line for each X are squared and
summed, the sum is minimum.
Estimation of Parameters
b is the regression coefficient or the slope of the regression line

a is the constant of regression or the y-intercept of the regression line


Example
Last year, five randomly selected students took a math aptitude test before
they began their statistics course. The Statistics Department has a questions.

What linear regression equation best predicts statistics performance, based


on math aptitude scores?

If a student made an 80 on the aptitude test, what grade would we expect


her to make in statistics?
Solution
Aptitude Test Score Statistics Grade

Student

1 95 85 8075 9025 7225

2 85 95 8075 7225 9025

3 80 70 5600 6400 4900

4 70 65 4550 4900 4225

5 60 70 4200 3600 4900

Sum 390 385 30500 31150 30275


Solutions
Note that:
5 5 5 5 5

∑ 𝑥 𝑖 𝑦 𝑖 =30500 ∑ 𝑥 𝑖 =390 ∑ 𝑦 𝑖=385 ∑ 𝑥2𝑖 =31150 ∑ 𝑦 2𝑖 =3 0275


𝑖 =1 𝑖 =1 𝑖=1 𝑖 =1 𝑖 =1

Thus,
Solutions
The regression equation for the data is:

If a student made an 80 on the aptitude test, her expected grade in


statistics would be:
Correlation Analysis
Correlation Analysis attempts to measure the strength of the
relationship between two random variables by means of a single
number called a correlation coefficient.

Correlation Coefficient measures the strength of the linear relationship between two variables
X and Y.
Sample Correlation Coefficient
The formula:
Scatter Plot Examples
Linear relationships Curvilinear relationships
y y

x x

y y

x x
Scatter Plot Examples
Strong relationships Weak relationships

y y

x x

y y

x x
Scatter Plot Examples
No relationship
y

x
Examples of Approximate r Values
y y y

x x x
r = -1 r = -.6 r=0
y y

x x
r = +.3 r = +1
Note:
Aptitude Test Score Statistics Grade

Student

1 95 85 8075 9025 7225

2 85 95 8075 7225 9025

3 80 70 5600 6400 4900

4 70 65 4550 4900 4225

5 60 70 4200 3600 4900

Sum 390 385 30500 31150 30275


Let’s Compute
Based on our previous example on student’s aptitude and statistics grade, the computed r
would be:
5 5 5 5 5

∑ 𝑖 𝑖
𝑥 𝑦 =30500 ∑ 𝑖
𝑥 =390 ∑ 𝑖
𝑦 =385 ∑ 𝑖
𝑥2
=31150 ∑ 𝑖 =3 0275
𝑦 2

𝑖 =1 𝑖 =1 𝑖=1 𝑖 =1 𝑖 =1

2
The relationship between student’s Statistics 𝑟 =.9801 ≈ 98.01 %
grade and aptitude is strong.
Thank you for listening

You might also like