0% found this document useful (0 votes)
12 views

Stat 2

Statistics
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views

Stat 2

Statistics
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 71

NORMAL

DISTRIBUTION
CURVE
The normal distribution is most important
probability distribution in statistics because it fits many
natural phenomena. For example, heights, blood pressure,
measurement error, and IQ scores follow the normal
distribution. It is also known as the Gaussian distribution
and the bell curve.
It is a probability function that describes how the
values of a variable are distributed. It is a symmetric
distribution where most of the observations cluster
around the central peak and the probabilities for values
further away from the mean taper off equally in both
directions.
Some Properties
of a Normal
Probability
Distribution.
1. The curve of the distribution is a bell-shaped.
2. The curve is symmetrical about the mean. This means if we will cut
the curve about the mean, we will have balanced proportions of
the halves. Specifically we say that one is a reflection of the other.
Meaning, the qualities exhibited by one are the same qualities
exhibited by the other.
3. The width of the curve is determined by the standard deviation of
the distribution.
For instance, in a given distribution with mean equals 95 and
standard deviation equals 3, the width of the curve will be 3 numbers
away from each other, which is determined from the given standard
deviation.
4. The curve extends indefinitely approaching the x-axis but never
touching it. Thus, the curve is asymptotic to the line.
5. The area of the region under the curve is 1. It represents the
probability or percentage or proportion associated with the specific sets
of measurement values.

This means that for every specific measurement value, there


corresponds exactly one probability/percentage/proportion value which
describes a particular area of the region under the normal curve.
A specific proportion of the area of the region under the curve can
be calculated manually using the formula

Where:
Y = the height of the curve at a particular value of X
X = any score in the distribution
σ = the standard deviation of the population
μ = the population mean
π = 3.1416
𝑒 = 2.7183
Area Under the Normal
Curve Given a z-value
1. Express the given z-value into a three-digit number.
2. Using the z-Table, find the first 2 digits on the first column.
3. Find the third digit on the first row on the right.
4. Read the area for probability at the intersection of the row (first 2
digit number) and column (third digit number). The value observed
at the intersection indicates the area of the given z-value.
1. Find the area that corresponds to z=2. We can assume that
2 = 2.00.

The area at the intersection is 0.4772.


2. Find the area that corresponds to z=1.45.
The area you just learned also indicates the probability of the
given distribution between any two specific z-values under the curve.
IDENTIFYING
REGIONS
OF AREAS
UNDER THE
NORMAL CURVE
Example 1
Find the area of the region between z= 1 and z = 3.

The area of the region described by the point z = 1 indicates the area
from z = 0 to z = 1. Using the z-Table, it has corresponding area
of 0.3413.
So with z = 3, it describes the area of the region from z = 0 to z = 3,
with corresponding area of 0.4987 (using the z-Table).
The problem states that we need to find the area of the region
between the given two z-values

We need to subtract the area that corresponds to z = 3 to the


area
that corresponds to z = 1. Thus, we have
0.4987 – 0.3413 = 0.1574
Therefore, the area between z = 1 and z = 3 is 0.1574.
Example 2
Find the area of the region between z = 1
and z =-1.
The area of the region described by the point z = 1 indicates the area
from z = 0 to z = 1. Using the z-Table, it has corresponding area
of 0.3413.
So with z = -1, it describes the area of the region from z = 0 to z = -1,
with corresponding area of 0.3413. (using the z-Table).
The problem states that we need to find the area of the region
between the given two z-values

We need to add the area that corresponds to z = 1 to the area


that
corresponds to z = -1. Thus, we have
0.3413 + 0.3413 = 0.6826
Therefore, the area between z = 1 and z = -1 is 0.6826.
Example 3
Find the area of the region between z = 2 and z = -1.5.

The area of the region described by the point z = 2 indicates


the area from z = 0 to z = 2. Using the z-Table, it has corresponding
area of 0.4772.
The area of the region described by the point z = 2 indicates
the area from z = 0 to z = -1.5. Using the z-Table, it has
corresponding area of 0.4332.
Thus, we have
0.4772 + 0.4332= 0.9104
Therefore, the area between z = 2 and z = -1.5 is 0.9104.
DETERMINING
PROBABILITIES
Probability notations are commonly used to express a
lengthy idea into symbols concerning the normal curve.

The following are the most common probability notations used


in studying concepts on the normal curve.

P(a < z < b) this notation represents the idea stating the
probability that the z-value is between a and b

P(z> a) this notation represents the idea stating the


probability that the z-value is above a
P(z< a) this notation represents the idea stating the
probability that the z-value is below a where a and b are z-
score values.
Let us familiarize some of the terms involved in using notations.

NEGATIVE P (z < a) NEGATIVE P (z < a)


“less than z” “greater than z”
“to the left of z” “to the right of z”
“below z” “above z”
“lower than” “more than z”
“under z” “at least z”

With all these in mind, you are now ready to solve for the
proportion of the area specified z-score value.
PRACTICE!!!
SOLVE FOR THE FOLLOWING PROBABILITTIES

1. P(-2 < z <1)


2. P(z > 3)
3. P(z < -2)
4. P(z >-1.5)
5. P(z = -1)
1. P(-2 < z <1)

1. P(-2 < z <1) = 0.4772 + 0.3413

1. P(-2 < z <1) = 0.8285


2. P(z > 3)

2. P(z > 3) = 0.5000 – 0.4987

2. P(z > 3) = 0.0013


3. P(z < -2)

3. P(z < -2) = 0.5000 - 0.4772

3. P(z < -2) = 0.0228


4. P(z >-1.5)

4. P(z > -1.5) = 0.5000 + 0.4332

4. P(z > -1.5) = 0.9332


5. P(z = -1)

5. P(z = -1) = 0.3413


FINDING
PERCENTILES
UNDER THE
NORMAL CURVE
A percentile is a measure used in statistics indicating the
value below which a given percentage of observations in a
group of observations fall. It is a measure of relative standing as
it measures the relationship of a measurement of the rest of the
data.

For instance, you were told by your teacher that your


Mathematics grade for the quarter is 92 and your grade is at the
88th percentile. This means 88% of your classmates scored lower
than 92 and 21% scored more than 92 in the subject. This
explains that 88th percentile is located at a point such that 88% of
the total population lies below that point and 12% lies above.
PERCENTILES UNDER THE NORMAL CURVE
EXAMPLE 1:
Find the 86th percentile of the normal curve. The P86
means locating an area before or below the point. We want to
know what the z-value is at this point.
STEP 1
Express the given percentage as probability.
86% is the same as 0.8600

0.8600 = 0.5000 + 0.3600


STEP 2
Refer to the table of areas under the normal curve.
Locate the area of the probability

0.8600 = 0.5000 + 0.3600


Z 1.08 = 0.3599 Z 1.09 = 0.3621

STEP 3:
Locate 1.08 on the curve and draw a line through under
the curve
EXAMPLE 2:
Find the 95th percentile of the normal curve. The P95
means locating an area before or below the point. We want to
know what the z-value is at this point.

95% is the same as 0.9500


0.8600 = 0.5000 + 0.4500

Z 1.64 = 0.4495 Z 1.65 = 0.4505


STEP 2
Refer t the table of areas under the normal curve.
Locate the area of the probability

0.8600 = 0.5000 + 0.3600


Locate 1.64 on the curve and draw a line through under
the curve
EXAMPLE 3:
Find the 32nd percentile of the normal curve. The P32
means locating an area before or below the point. We want to
know what the z-value is at this point.

32% is the same as 0.3200


1- 0.3200 = 0.6800 = 0.5000 + 0.1800

Z -0.47 = 0.1808 Z -0.46 = 0.1772


Locate -0.46 on the curve and draw a line through under
the curve
EXAMPLE 3:
Find the middle 40th percentile of the normal curve. This
means locating an area 40% on the middle. We want to know
what the z-values are that encloses this percentile.
40% is the same as 0.4000
Divide .4000 into two. One half represents the area on the left
and the other the are to the right of the middle

0.4000/2 = 0.2000

Z 0.52 = 0.1985 Z 0.53 = 0.2019

Z -0.52 = 0.1985 Z -0.53 = 0.2019


Locate -0.52 and 0.52 on the curve and draw a line
through under the curve
UNDERSTANDING
THE
Z-SCORES
For any population, the mean and the standard
deviation are fixed. This gives the way to understand
that for every raw score X, there corresponds an
exactly one z-score value, and vice versa. Therefore, if
we wish to find the percentage associated with X, we
must find its matched z-value using the z formula.
The z formula is given by:

X−µ
𝑧=
𝜎

Where:
X = raw score
µ = population mean
𝜎 = population standard deviation
1. In Mrs. Shirley’s science class, a student gained a
score of 46. What is the z-value of his score if test result has
population mean 45 with standard deviation of 2? Show it on
the normal distribution curve.
Given: Find:
X = 46 𝑧 =?
µ = 45
𝜎=2
Solution:
X−µ
𝑧=
𝜎
46−45
𝑧= = 0.5
2
2. Leslie got a score of 68 in an examination she
participated. What is the corresponding z-value of her score if
the result has mean of 75 and σ = 5. Show it on the normal
distribution curve.
Given: Find:
X = 68 𝑧 =?
µ = 75
𝜎=5
Solution:
X−µ
𝑧=
𝜎 -1.4
68−75
𝑧= = -1.4 60 65 70 75 80 85
5 68
3. Given μ = 110 and σ = 10 in a college entrance
examination, find the corresponding z-value of the scores
gained by the following students.

1. X = 98
2. X = 85
3. X = 120

Given: Find:
X = 98, 65, and 120 𝑧 =?
µ = 110
𝜎 = 10
Solution:
X−µ
𝑧=
𝜎 -2.5 -1.2 1

For 98 98
80 85 90 100 110 120 130
98−110
𝑧= 10
= -1.2
For 85
85−110
𝑧= = -2.5
10
For 120
120−110
𝑧= =1
10
DETERMINING
PROBABILITIES OF
EVENTS IN A GIVEN
DATA SAMPLE
In a normally distributed data, we can determine
the probabilities of a given event by finding the
corresponding z value representing the event and
getting its area under the curve.
1. In Mrs. Shirley’s science class, the test result has a
population mean 45 with standard deviation of 5? What is the
probability of selecting a student with (a) a score less that 30
(b) what score represents the 90th percentile, (c) middle 10%
of the score shall be between what scores?
Given: Find:
X < 30 (a) 𝑃 𝑥 < 30
µ = 45 (b) X @ P90
𝜎=5 (c) X @ Pmiddle10
Solution:
X−µ
𝑧=
𝜎
For (a)
30 35 40 45 50 55
30−45
𝑧= 5
= -3.00
P(z<-3.00) = 0.5000 – z3.00
From the Z-Table
Z-3.00 = 0.4987 P(z<-3.00) = 0.5000 – 0.4987

P(z<-3.00) = 0.0013
For (b)
X−µ
To find this score we 𝑧=
should first find the value of 𝜎
z corresponding to the 90th
X−45
percentile. 1.28 =
5

(1.28)5 = 𝑥 − 45

𝑥 = (1.28)5 + 45
30 35 40 45 50 55

𝑥 = 6.4 + 45
From the table we
can see that @ z = 1.28
𝑥 = 51.4
the are is 0.3997
For (c)
From the table we
To find this score we
can see that @ z = ±0.13
should first find the value of
the are is 0.0517
z corresponding to the
middle 10th percentile.

30 35 40 45 50 55

this means locating an


area 0.0500 to both left and
right of from the middle
To the right of the middle To the left of the middle

X−µ X−µ
𝑧= 𝑧=
𝜎 𝜎
X−45 X−45
0.13 = -0.13 =
5 5

(0.13)5 = 𝑥 − 45 (-0.13)5 = 𝑥 − 45

𝑥 = (0.13)5 + 45 𝑥 = (−0.13)5 + 45

𝑥 = .65 + 45 𝑥 = −.65 + 45

𝑥 = 45.65 𝑥 = 44.35
1. Suppose in certain dog shelter with a 100 puppies the
average puppy weighs 11.2 pounds and a sample standard
deviation of 2.78 pounds. What is the probability of picking a
puppy (a) what weighs more than 13 pounds? (b) what score
represents the 65th percentile, (c) middle 20% of the weight shall
be between what weights?
Given: Find:
X > 13 lbs (a) 𝑃 𝑥 > 13
µ = 11.2 (b) X @ P65
𝜎 = 2.78 (c) X @ Pmiddle20
Solution:
X−µ
𝑧=
𝜎
For (a)
2.86 5.64 8.42 11.2 13.98 16.76
13−11.2
𝑧= 2.78
= 0.65
P(z>0.65) = 0.5000 - z0.65
From the Z-Table
Z0.65 = 0. 2422 P(z>0.65) = 0.5000 - 0.2422

P(z>0.65) = 0.2578
For (b)
X−µ
To find this score we 𝑧=
should first find the value of 𝜎
z corresponding to the 65th
X−11.2
percentile. 0.39 =
2.78

(0.39)2.78 = 𝑥 − 11.2

𝑥 = (0.39)2.78 + 11.2
30 35 40 45 50 55

𝑥 = 1.08 + 11.2
From the table we
can see that @ z = 0.39
𝑥 = 12.28
the are is 0.1517
For (c)
From the table we
To find this score we
can see that @ z = ±0.25
should first find the value of
the are is 0.0987
z corresponding to the
middle 20th percentile.

30 35 40 45 50 55

this means locating an


area 0.100 to both left and
right of from the middle
To the right of the middle To the left of the middle

X−µ X−µ
𝑧= 𝑧=
𝜎 𝜎
X−11.2 X−11.2
0.25 = -0.25 =
2.78 2.78

(0.25)2.78 = 𝑥 − 11.2 (-0.25)2.78 = 𝑥 − 11.2

𝑥 = (0.25)2.78 + 11.2 𝑥 = (−0.25)2.78 + 11.2

𝑥 = 0.7 + 11.2 𝑥 = −0.7 + 11.2

𝑥 = 11.9 𝑥 = 10.5
For now, we will not be solving these types of problem but maybe we will tackle this in
hypothesis testing. For this particular problem z-test can be performed.

Suppose the average puppy weighs 10 pounds. A


sample of 10 puppies yields a sample mean of 13.2
pounds and a sample standard deviation of 2.78 pounds.
Assume the population standard deviation is known to be
5.25. You want to test whether the sample mean differs
from the population mean of 10 pounds at a 5 percent
level of significance using a two-tailed test. Calculate the
test statistic for the sample data.

You might also like