Chapter 3
Chapter 3
Numerical Measures
Chapter 3
1
Learning Objectives
LO3-1 Compute and interpret the mean, the median, and the mode
LO3-2 Compute a weighted mean
LO3-3 Compute and interpret the geometric mean
LO3-4 Compute and interpret the range, variance, and standard deviation
LO3-5 Explain and apply Chebyshev’s theorem and the Empirical Rule
LO3-6 Compute the mean and standard deviation of grouped data
2
Measures of Location
3-3
Population Mean
Where:
• μ represents the population mean. It is the Greek lowercase letter “mu.”
• N is the number of values in the population.
• x represents any particular value.
• Σ is the Greek capital letter “sigma” and indicates the operation of adding.
• Σx is the sum of the x values in the population
2-4
Population Mean
3-5
Example: Population Mean
There are 42 exits on I-75 through the state of Kentucky. Listed below are the
distances between exits (in miles).
There are 42 exits on I-75 through the state of Kentucky. Listed below are the distances between exits (in
miles).
3-7
Sample Mean
3-9
Example: Sample Mean
3-10
Parameter Versus Statistics
3-12
Properties of the Arithmetic Mean
The Median
MEDIAN The midpoint of the values after they have been ordered
from the minimum to the maximum values.
3-14
Characteristics of the Median
3-15
Finding the Median
3 5 7 5 9 1 3 9 17 10
Arranging the data in ascending order gives:
1 3 3 5 5 7 9 9 10 17
3-17
Relative Positions of Mean, Median, and Mode
3-18
The Weighted Mean
• The Carter Construction Company pays its hourly employees $16.50, $19.00, or $25.00
per hour. There are 26 hourly employees: 14 are paid at the $16.50 rate, 10 at the
$19.00 rate, and 2 at the $25.00 rate.
• What is the mean hourly rate paid for the 26 employees?
3-19
The Geometric Mean
• The geometric mean is the nth root of the product of n positive values
• The formula for geometric mean is
• The geometric mean is also used to find the rate of change from one period to another
3-20
Why Study Dispersion?
• A measure of location, such as the mean, median, or mode, only describes the center of
the data. It is valuable from that standpoint, but it does not tell us anything about the
spread of the data. A small value for a measure of dispersion indicates that the data are
clustered closely, say, around the arithmetic mean. The mean is therefore considered
representative of the data. Conversely, a large measure of dispersion indicates that the
mean is not reliable.
• A second reason for studying the dispersion in a set of data is to compare the spread in
two or more distributions.
3-21
Range
• The range is the difference between the maximum and minimum values in a set of data
• The formula for range is
The variance measures the mean amount by which the values in a population, or sample, vary
from their mean.
The variance has an important advantage over the range. It uses all the values in the
computation. Recall that the range uses only the highest and the lowest values.
3-23
Population Variance
where:
• σ2 is the population variance (σ is the lowercase Greek letter sigma). It is read as “sigma squared.”
• x is the value of a particular observation in the population.
• μ is the arithmetic mean of the population.
• N is the number of observations in the population
3-25
Sample Variance
where:
s2 is the sample variance.
x is the value of each observation in the sample.
xത is the mean of the sample.
n is the number of observations in the sample.
3-26
Example: Sample Variance
The hourly wages for a sample of 5 part-time employees at Home Depot are: $12, $20, $16,
$18, and $19.
What is the sample variance?
3-27
Standard Deviation
where:
s is the sample standard deviation
𝑥 is the value of each observation in the sample
𝑥ҧ is the mean of the sample
n is the number of observations in the sample
3-28
Standard Deviation
3-29
Interpretations and Uses of the Standard Deviation
CHEBYSHEV’S THEOREM For any set of observations (sample or population), the proportion
of the values that lie within k standard deviations of the mean is at
least 1 – 1/k2, where k is any value greater than 1.
The arithmetic mean biweekly amount contributed by the Dupree Paint employees to
the company’s profit-sharing plan is $51.54, and the standard deviation is $7.51. At
least what percent of the contributions lie within plus 3.5 standard deviations and
minus 3.5 standard deviations of the mean?
3-30
Interpretations and Uses of the Standard Deviation
THE EMPIRICAL RULE For a symmetrical, bell-shaped frequency distribution, approximately 68% of the
observations will lie within plus and minus one standard deviation of the mean, about
95% of the observations will lie within plus or minus 2 standard deviations of the
mean, and practically all (99.7%) will lie within 3 standard deviations of the mean.
3-31
Sample Mean of Grouped Data
3-32
Sample Mean of Grouped Data
The computations for the arithmetic mean of data grouped into a frequency distribution will be shown based
on the Applewood Auto Group profit data. Recall in Chapter 2, in Table 2–7 on page 30, we constructed a
frequency distribution for the vehicle profit. The information is repeated below. Determine the arithmetic
mean profit per vehicle.
3-33
Sample Mean of Grouped Data
3-34
Standard Deviation of Grouped Data
3-35
Calculating the Standard Deviation of Grouped Data
Step 1: Subtract the mean from the class midpoint. That is, find (M − 𝑥ҧ )
Step 2: Square the difference between the class midpoint and the mean.
Step 3: Multiply the squared difference between the class midpoint and the mean by the class
frequency.
Step 4: Sum the f(M − 𝑥ҧ )2. Using formula (3-12) to calculate the standard deviation.
The mean and the standard deviation calculated from the data grouped into a frequency distribution are
usually close to the values calculated from raw data. The grouped data result in some loss of information.
3-36
Calculating the Standard Deviation of Grouped Data
Applewood Auto Group Frequency Distribution Compute the standard deviation of the vehicle
profits.
68% x180=122.4
- min $1199
- max $2503
3-37
Sample Mean and Standard Deviation of Grouped Data
The mean and the standard deviation calculated from the data grouped into a
frequency distribution are usually close to the values calculated from raw data.
The grouped data result in some loss of information.
3-38
Ethics and Reporting Results
• Useful to know the advantages and disadvantages of mean, median, and mode
as we report statistics and as we use statistics to make decisions
• Important to maintain an independent and principled point of view
• Statistical reporting requires objective and honest communication of any
results
3-39