Lesson# 4 Measure of Dispersion: Department of Statistics FC College University, Lahore
Lesson# 4 Measure of Dispersion: Department of Statistics FC College University, Lahore
measure of Dispersion
Department of Statistics
FC College University, Lahore
1
You have learnt various measures of central tendency.
Measures of central tendency help us to represent the entire mass
Preview of the data by a single value.
Can a central tendency describe the data fully and adequately?
In order to understand it, let us consider an example ,
2
Here we observe that in both groups the mean of the data arethe
Example same namely, 65
I. In group A, the observations are much more scattered from the
mean
II. In group B, almost all the observations are concentrated around
The daily income the mean
of the workers in
two factories are : Thus, there arises a need to differentiate between the groups. We need
some other measures which conern with the measure of scatteredness
Factory A: 35 45 50 65 70 90 100 (or spread).
Fcatory B: 60. 65 65 65 65 65 70
To do this, we study what is known as measures of dispersion
3
The term dispersion is used commonly to mean scatter, deviation,
fluctuation, spread or variability of data.
Meaning of The degree to which the individual values of the variate scatter
away from the average or central value, is called dispersion.
Dispersion Dispersion is used to denote a lack of uniformity in the item
values of a given variable.
Dispersion/
Variation
Everywhere
Measures of Dispersion
Absolute
Relative dispersion
dispersion
9
ABSOLUTE DISPERSION RELATIVE DISPERSION
Types of These measures gives us an idea These measures are calculated for
about the amount of dispersion in the comparison of dispersion in
Dispersion a set of observations. two or more than two sets of
observations.
These quantities measures the
dispersion in the same units as the These measures are free of the
units of original data. units in which the original data is
measured.
Absolute measures cannot be used
to compare the variation of two or These measures are a sort of ratio
more series/ data sets. and are called coefficients
10
Range Coefficient of Range
Measures of
dispersion
13
Range is defined as the difference between the maximum and the
minimum observation in the data set.
The Range If denotes the minimum observation, then range is defined as:
(Absolute Range= -
Measure) Where,
= maximum observation
= minimum observation
Example 1: Cheryl took 7 math tests in one marking period. What
is the range of her test scores?
89, 73, 84, 91, 87, 77, 94
Example-1 Range = 94-73=21
The range of these test scores is 21 points
15
It is a relative measure of dispersion and is based on the
Co-efficient of value of range. It is also called range coefficient of
dispersion. It is defined as
Range
Co-efficient of Range =
(Relative
The range is standardized by the total .
Measure)
Example -2
A testing lab wishes to test two experimental brands of outdoor paint to see how long each will last before fading.
The testing lab makes 6 gallons of each paint to test. Since different chemical agents are added to each group and only six cans
are involved, these two groups constitute two small populations. The results (in months) are shown
Find the range for the paints.
We see that the coefficient of range for Brand A is greater than that of brand B.
Thus, there is greater dispersion or variation in Brand A. The time duration for paint fading in Brand B outdoor paints
are more stable than Brand A.
Example 3: Following are the
wages of 8 workers of a
factory. Find the range and
the co-efficient of range.
Solution:
Wages in $.
1400,1450,1520,1380,1485,149 Largest value Xm= 1575 and
5,1575,1440
Smallest value = X0= 1380
Range = Xm – X0 = 1575-1380
= $195
Co-efficient of Range =
=
Example 4: Set A contains the
marks of 5 students in Set B: English
Set A: Mathematics
Mathematics out of 25 marks 30,35,40,45,50
and group B contains marks of 10,15,18,20,20
the same students in English Range = Xm – X0
out of 100 marks. Compare the Range = Xm – X0
= 50-30 = 20
data with Coefficient of Range. = 20-10 = 10
Co-efficient of Range
Co-efficient of Range = = =
= ==
= 0.33
= 0.25
We see that coefficient of range for set A is greater than that of set B. Thus there is greater dispersion in
set A.
Merits and Demerits of
Range
MERITS DEMERITS
22
Quartile It is based on the lower quartile Q1 and the upper quartile Q3. The
Deviation difference Q3- Q1 is called the interquartile range. The
difference Q3- Q1 divided by 2 is called semi-interquartile range or
(Absolute the quartile deviation.
Dispersion) Quartile Deviation = =
The quartile deviation is a slightly better measure of absolute
dispersion than the range. But it ignores the observations on the
tails.
Coefficient of
Quartile A relative measure of dispersion based on the quartile deviation is called the
Deviation coefficient of quartile deviation. It is defined as
Solution:
After arranging the observations in ascending order, we get
1040,1080,1120,1200,1240,1320,1342,1360,1440,1470,
1600,1680,1720,1730,1750,1755,1785,1880,1885,1960
= Value of
= Value of
= Value of 3
= Value of 3
= Value of
=15thObservation+0.75(16thObservation-15thObservation)
= 1750+ 0.75(1755-1750) = 1750+3.75 = 1753.75
Solution:
First we have to arrange our data set in ascending or descending order
13 13 20 23 26 27 31 34 34 34 34 35 36 37 38
41 41 41 45 47 47 47 50 51 53 54 56 62 67 82
Set B:
Set A:
nB=5
nA=5
5,8,10,12,15
4, 8,10,11 15
= Value of
= Value of
= Value of
= Value of
Demerits
Not based on all the observations.
of Quartile Easy to calculate and understand Not representative value of data.
Deviation Not affected by extreme values It ignores the first 25% and last
as in the case of range. 25% of the observations.
Very useful in estimating Not a good measure of dispersion
dispersion in grouped data with since it depends on two position
open ended class measure.
PRACTICE QUESTIONS
33
Mean Deviation
(Absolute Measure)
The mean deviation or the average deviation is defined as the mean of the absolute
deviations of observations from some suitable average which may be the arithmetic
mean or the median.
The difference (X- Average) is called deviations and when we ignore the negative sign,
this deviation is written as .
Mean deviation from mean=
M.D(about mean) =
Mean deviation from median=
M.D(about median) =
Mean
A relative measure of dispersion based on the mean deviation is
Coefficient
called theof
coefficient of the mean deviation or the mean coefficient
of dispersion. It is defined as the ratio of the mean deviation to the
Dispersion
average used in the calculation of the mean deviation.
(Relative
Mean Coefficient of Dispersion = =
Measure)
Median Coefficient of Dispersion==
Example 8: Sale of a coffee café in newYork city of 10 days are given as:
120,121,110,115,118,108,111,112,111,124 (in $).
(i) M.D(about mean) and Mean coefficient of dispersion
(ii) M.D(about median) and Median coefficient of dispersion.
X
108
Solution: (i) n = 10 110 5
=120,121,110,115,118,108,111,1 111 4
12,111,124 111 4
112 3
115 0
118 3
M.D(about mean) = 120 5
= 121 6
124 9
Total 46
Example 8: Cont..
Video Aid:
https://youtu.be/WpcocBR6Pr4
https://youtu.be/GdIkEngwGNU
Shop 1
65,68,60,76,65,74,58, 56, 61,57
Example 9: The weight measurements in grams of 20 eggs are given. Compare the two group of eggs.
gms
= 1,4, 4, 12, 1, 10, 6, 8,3, Shop 2
= 56 63,64,72,66,65,67,65,63, 67,67
M.D(about mean) = gms
= gms = 2.9,1.9,6.1.0.1,0.9,1.1.0.9,
Mean Coefficient of Dispersion = 2.9,1.1,1.1
= 0.0875 = 19
M.D(about mean) =
= gms
Mean Coefficient of Dispersion =
= 0.0288
Mathematically illogical
Easy to understand and to assume all negatives
compute. as positives.
Based on all the observations. Can’t be calculated with
Is not affected by extreme distributions with open-
values. end class.
Used to make healthy
comparisons
1.
• Find the mean deviation from mean
and median for the following
values. Also find their coefficients.
• 30, 36, 32, 33, 35, 39, 36.5, 35 and
34
2.
82, 75, 204, 81, 84, 118, 104
• Consignment 2: 78, 90, 115, 110,
98, 106, 99, 107,84, 76
• Find (i) Mean Deviation (about
mean)
• (ii) Compare the variability in two
consignment with mean coefficient
of dispersion.
4. Standard deviation
42
The Standard
Rather than relying on the location of the value, variance
Deviation and measures dispersion by calculating how far observations
Variance are from the mean.
(Absolute Variance = Average of the distance from the mean of each
observation (squared).
Dispersion) High variance means that many/most observations are far
from the mean but could be heavily influenced by outliers.
Low variance means that many/most observations are
close to the mean.
The Standard deviation Takes square root of variance to put measure in the same
unit as the observations.
Example: The average rating of the Conservatives is 4.8
and the standard deviation is 2.8. This tells us that the
average amount that the ratings differ from the mean is 2.8
points on the 11 point scale used to measure feeling
towards the Conservative Party.
In contrast, the variance is 8.0, which can be interpreted as
8 squared points on the 11 point scale.
This explanation is confusing and has little intuitive power.
44
The Standard Deviation Standard Deviation of the observations is defined as the positive
square root of the mean of the squares of deviations of observations
taken from their arithmetic mean.
Cont…..
Let ‘X’ be a random variate which takes on ‘n’ values, the standard
deviation of these n observations is given by.
Where
45
Alternatively,
The Standard Deviation
Cont. .. But the formula is used when the items are very small.
The term is positive, therefore ‘S’ is always positive.
If the original observations are in cm, the value of Standard
Deviation will also be in cm.
The Variance
48
Solution
Let x denote the 2008 market value (in billions of dollars) of a company.
X
=
75 5625
=
= 22920 107 11449
71 5041
Thus, the standard deviation of the market values of
these five companies is $82.08 billion.
49
Example 11:
Following are the 2009 earnings (in thousands of dollars) before taxes for all six employees of a small
company.
88.50 108.40 65.50 52.50 79.80 54.60
Calculate the variance and standard deviation for these data.
X
Solution:
88.50 13.62 185.5
108.40 33.52 1123.59
65.50 -9.38 87.98
= = =388.9
52.50 -22.38 500.86
thousand
79.80 4.92 24.20
=$19,721 54.60 -20.28 411.27
Thus, the standard deviation of the 2009 earnings =2333.4
of all six employees of this company is
$19,721.
50
Practice Questions
1.
• Find the sample variance and standard deviation for the
amount of European auto sales for a sample of 6 years
shown. The data are in millions of dollars. 11.2, 11.9,
12.0, 12.8, 13.4, 14.3
2.
• The following data are the ages (in years) of six students.
• 19 19 19 19 19 19 Calculate the standard deviation. Is its
value zero? If yes, why?
3.
• The following data give the number of highway
collisions with large wild animals, such as deer or
moose, in one of the northeastern states during each
week of a 9-week period.
• 7 1 03 8 2 5 7 4 9
• Find the variance,and standard deviation.
Coefficient of Variation (Relative Measure)
= 2.0976 cm
72 -2 4
74 0 0
76 2 4
(ii) C.V = = = 2.8345 %
Total 44
Example 13: The quarterly profits (in lakhs of Rs.) of two companies are given below:
Calculate their Standard Deviation and compare the variability of two companies.
Quarters Company A Company B
Jan-March 9.5 15.5
April-June 13.7 21.2
July-September 10.4 23.4
October-December 8.6 18.8
=9.5+13.7+10.4+8.6=42.2 =15.5+21.2+23.4+18.8=78.9
=(9.5)2+(13.7)2+(10.4)2+(8.6)2=460.06 =(15.5)2+(21.2)2+(23.4)2+(18.8)2=397.6275
= = = =
= Rs. 1.9267 = Rs. 2.9321
C.V (X) = = = 18.26 % C.V(Y) = = = 14.86 %
By comparing the coefficient of variations, it can be observed that Company B provides the more
Consistent Performance.
Example 14: A company has two sections with 40 and 65 employees respectively. Their average weekly wages are $450 and
$350. The standard deviation are 7 and 9.
(i) Which section has a larger wage bill?
(ii) Which section appears to be more consistent in performance?
Section A Section B
(i) Wage Bill section A (i) Wage Bill section B
= =
450 x 40 = $18000 350 x 65 = $22750
(ii) Variability of A (ii) Variability of B
C.V= C.V=
C.V = C.V=
C.V = 1.56% C.V= 2.57%
Section A appears to be more consistent in Section B has larger wage bill
performance as having less value of C.V.
Video Aid:
https://www.youtube.com/watch?v=wDAd_QHKoOg
Merits and MERITS DEMER I TS
1.
results:
• City A: mean =100, S=30
• City B: mean = 79.93, S=13.49 and median=75
• In which city is the distribution of income
more variable?
2.
experiment in the laboratory:
81,94,64,80,75,69,96,66,80,91,85,79
Find, Variance, Standard Deviation and
Coefficient of Variation
(i) The standard deviation is zero if all the observations has some
constant value. If ‘C’ is a constant, then
S.D(C)=0
Properties of (ii) The standard deviation do not change by change of Origin
Standard Deviation S.D(X A) = S.D(X), where A is a constant & S.D(A) =0
S.D(X - A) = S.D(X)
(iii) The standard deviation is affected by change of Scale
S.D(AX) = S.D(X), where A is a constant.
Also S.D() = S.D(X)
(iv) If X and Y are independent random variables then,
S.D (X Y) = S.D(X) + S.(Y) ,
where a and b are constants.
Properties of (V) If we have two sets of data with n1 and n2 observations having
means and with variances and respectively, then the
Standard combined standard deviation, SC of all the n = n1 + n2
Deviation observations is given by:
Cont…..
Alternatively,
Where =
Video Aid:
https://youtu.be/ctEGvROA7Bg
https://youtu.be/JYKcTA_IL1I
Example 5: