Lecture 3-4Descriptive Statistics Measures of Central Tendency (1)
Lecture 3-4Descriptive Statistics Measures of Central Tendency (1)
2
Descriptive Statistics
3
Types of data presentation
4
5
6
ex1
• A sample of 80 individuals, 20 males and
60 females. • Calculate suitable summary
measures. The proportion of male in the
sample = 20/80*100=25%
• The ratio of males = 20:60=1:3 1 male for
each 3 females
7
8
A way of summarizing the data
using a single value that is most
representative of the entire data
set. 1. Midrange 2. Mode 3.
Median 4. Mean • Each one has
advantages and disadvantages
and useful in certain situation.
9
10
11
• Advantages of the midrange: • Easily and
rapid to calculate • Used only when we
want to obtain a rough and quick idea
about a set of observations
12
• Disadvantages and limitations of the
midrange:
• 1. Doesn't include all values.
• 2. Affected by extreme values.
• 3. The midrange cannot be used in
statistical analysis
13
14
1. The Mode
• The mode is a value that has the highest frequency in a data
set.
• A distribution may have one, more than one, or no mode at all.
• Defined for both qualitative and quantitative data.
• For a grouped distribution, we talk about the modal class.
• Useful when data consist of a small number of unique values.
• A data set that has only one value that occurs with the greatest
frequency is said to be unimodal.
• If a data set has two values that occur with the same greatest
frequency, both values are considered to be the mode and the
data set is said to be bimodal.
• If a data set has more than two values that occur with the same
greatest frequency, each value is used as the mode, and the
data set is said to be multimodal.
• Excel functions MODE.SNGL(data range) and
MODE.MULT(data range) 15
Mode for Qualitative Data
• 2 Modes: Black and White (7 times each)
W B B R Y
Y G G R B
B B W Y W
G W W W R
R Y W B B
16
Mode for Qualitative Data
• Mode = Black (8 times)
B B B R Y
Y G G R B
B B W Y W
G W W W R
R Y W B B
17
Mode for Quantitative Data
18
Example
a. 5 5 5 3 1 5 1 4 3 5 Mode is 5
b. 1 2 2 2 3 4 5 6 6 6 7 9 Bimodal - 2 and 6
c. 1 2 3 6 7 8 9 10 No Mode
19
Mode for Quantitative Data
Example:
The prices of 50 cars sold by a dealer. Determine
the mode.
112 100 130 120 136 118 102 110 109 112
110 118 117 116 118 122 114 114 102 109
107 112 114 115 118 117 118 128 106 110
116 108 110 125 113 120 119 111 104 111
120 113 120 117 102 110 118 112 114 114
20
2. The Median
• The median, MD, is the midpoint of the entire data array.
Middle value when data are ordered from smallest to largest.
This results in an equal number of observations above the
median as below it.
– Unique for each set of data
– Not affected by extremes
– Meaningful for ratio, interval, and ordinal data
• To determine the median:
1. Sort the data values.
2. Pick the value in the middle
– For n data values,
If n is odd,
then MD = X(n + 1) / 2
If n is even,
then MD = (X(n/2) + X(n/2 + 1)) / 2
(Note: this need not be a data value)
• Excel function MEDIAN(data range)
21
Median
• Determine the median of the following data.
112 100 130 120 136 118 102 110 109 102
22
Solution
• We arrange the data in an ascending
order,
23
Example
6.72 3.46 3.60 6.44
3.46 3.60 6.44 6.72
(even number of values)
Xi
i 1
N
• For a sample of size n, the mean, X , is given by
n
X i
X i 1
n
• Excel function AVERAGE(data range)
27
Properties of the Mean
• Meaningful for interval and ratio data.
• All data used in the calculation.
• Unique for every set of data.
• Affected by unusually large or small observations
(outliers)
• The only measure of central tendency where the
sum of the deviations of each value from the
measure is zero; i.e.,
(xi – x ) = 0
• The mean is approximated up to a one more
decimal place than the data. E.g. if the data were
integers, so we approximated the mean to a one
decimal place. 28
Example
• Find the mean of the given data,
1, 2, 7, 6, 2, 9, 10
Solution
• The mean = (1+ 2+ 7+ 6+ 2+ 9+10)/7 =5.3
29
Example
30
Mean for grouped data
1. Find class midpoints (Xm).
2. Multiply the midpoint of each class by its
frequency (f).
3. Add up the products (f.Xm).
4. Divide by the sum of the frequencies.
31
Example
A student received an A in English Composition 1 (3 credits), a C
in Introduction to Psychology (3 credits), a B in Biology 1 (4
credits), and a D in Physical Education ( 2 credits). Assuming
A=4 grade point, B=3 grade points, C=2 grade points, D=1 grade
point, F=0 grade points, find the student’s grade point average.
Solution:
Course Credits (w) Grade (X)
English Composition 1 3 A (4 points)
Introduction to Psychology 3 C (2 points)
Biology 1 4 B (3 points)
Physical Education 2 D (1 points)
X
w X
3.4 3.2 4.3 2.1 32
2.7
w 3342 12