Quantitative Methods
Quantitative Methods
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
What is Meant by Statistics?
1-2
Why Study Statistics?
1-3
Why Study Statistics
1-4
What is Meant by Statistics?
1-5
Formal Definition of Statistics
1-6
Who Uses Statistics?
1-7
Types of Statistics – Descriptive Statistics and
Inferential Statistics
1-8
Types of Statistics – Descriptive Statistics and
Inferential Statistics
1-9
Population versus Sample
1-10
Why take a sample instead of studying every
member of the population?
1-11
A Sampling Unit
1-12
Population Parameter
1-13
Sample Statistic
1-14
Usefulness of a Sample in Learning about a
Population
1-15
Components of Statistics
1-16
Data and Data Quality
Define data.
The usefulness of data depends of the
quality of data collected.
The quality of data depends on:
i) The data type
ii) The source of data
iii) Methods of data collection.
1-17
Types of Variables
1-18
Quantitative Variables - Classifications
1-20
Four Levels of Measurement or
Measurement scales
Nominal level - data that is Interval level - similar to the ordinal
classified into categories and level, with the additional property
cannot be arranged in any that meaningful amounts of
particular order. differences between data values
can be determined. There is no
natural zero point.
EXAMPLES: eye color, gender, EXAMPLE: Temperature on the
religious affiliation. Fahrenheit scale.
Ordinal level – data arranged in Ratio level - the interval level with an
some order, but the differences inherent zero starting point.
between data values cannot be Differences and ratios are
determined or are meaningless. meaningful for this level of
measurement.
EXAMPLE: During a taste test of 4 soft
drinks, Mellow Yellow was ranked EXAMPLES: Monthly income of surgeons, or
number 1, Sprite number 2, Seven-up distance traveled by manufacturer’s
number 3, and Orange Crush number 4. representatives per month.
1-21
Nominal-Level Data
Properties:
1. Observations of a qualitative variable can
only be classified and counted.
2. There is no particular order to the labels.
3. Eg. Gender (1-male; 2- female); mode of
transport ( 1- bus, 2-car, 3-train, 4- bicycle)
1-22
Ordinal-Level Data
Properties:
1. Data classifications are
represented by sets of labels or
names (high, medium, low);
lower, middle, upper, that have
relative values.
2. Because of the relative values,
the data classified can be
ranked or ordered.
3. Is associated with categorical
data
1-23
Interval-Level Data
1-24
Examples of interval-scaled data
1-25
Ratio-Level Data
1-26
Properties of Ratio data
1-27
Why Know the Level of Measurement of a
Data?
1-28
Summary of the Characteristics for
Levels of Measurement
1-29
Data Sources
1-31
Data collection methods
i. Observation.
ii. Panels
iii. Focus groups
iv. Surveys
Face-to-face interviews
Telephone interviews
Computer assisted interviews
personal interviews,
e-surveys).
1-32
What are the advantages and
disadvantages of gathering data
using each method listed
above?
1-33
Describing Data:
Frequency Tables, Frequency
Distributions, and Graphic Presentation
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Frequency Table and Frequency Distribution
1-37
EXAMPLE – Creating a Frequency
Distribution Table
1-38
Constructing a Frequency Table - Example
1-39
Relative Frequency Distribution
1-40
Graphic Presentation of a Frequency
Distribution
1-41
Histogram
1-42
Frequency Polygon
A frequency polygon
also shows the shape
of a distribution and is
similar to a histogram.
It consists of line
segments connecting
the points formed by
the intersections of the
class midpoints and the
class frequencies.
1-43
Histogram Versus Frequency Polygon
1-44
Cumulative Frequency Distribution
1-45
Describing Data:
Numerical Measures
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Parameter Versus Statistics
1-47
Population Mean
For ungrouped data, the population mean is the sum of all the population values divided by the total number of
population values. The sample mean is the sum of all the sample values divided by the total number of sample
values.
EXAMPLE:
1-48
The Median
MEDIAN The midpoint of the values after they have been ordered from the smallest to the largest, or the largest to
the smallest.
EXAMPLES:
The ages for a sample of five college students are: The heights of four basketball players, in inches, are:
1-49
The Mode
1-50
The Relative Positions of the Mean,
Median and the Mode
1-51
The Geometric Mean
Useful in finding the average change of percentages, ratios, indexes, or growth rates over time.
It has a wide application in business and economics because we are often interested in finding the
percentage changes in sales, salaries, or economic figures, such as the GDP, which compound or
build on each other.
The geometric mean will always be less than or equal to the arithmetic mean.
The formula for the geometric mean is written:
EXAMPLE:
Suppose you receive a 5 percent increase in salary this year and a 15 percent
increase next year. The average annual percent increase is 9.886, not 10.0. Why is
this so? We begin by calculating the geometric mean.
1-52
Measures of Dispersion
A measure of location, such as the mean or the median, only describes the center of the data. It is valuable from
that standpoint, but it does not tell us anything about the spread of the data.
For example, if your nature guide told you that the river ahead averaged 3 feet in depth, would you want to wade
across on foot without additional information? Probably not. You would want to know something about the variation
in the depth.
A second reason for studying the dispersion in a set of data is to compare the spread in two or more distributions.
RANGE
MEAN DEVIATION
1-53
EXAMPLE – Mean Deviation
EXAMPLE:
The number of cappuccinos sold at the Starbucks location in the Orange Country
Airport between 4 and 7 p.m. for a sample of 5 days last year were 20, 40, 50, 60,
and 80. Determine the mean deviation for the number of cappuccinos sold.
Step 2: Subtract the mean (50) from each of the observations, convert to positive if difference
is negative
Step 3: Sum the absolute differences found in step 2 then divide by the number of
observations
1-54
Variance and Standard Deviation
VARIANCE The arithmetic mean of the squared deviations from the mean.
The variance and standard deviations are nonnegative and are zero only
if all observations are the same.
For populations whose values are near the mean, the variance and
standard deviation will be small.
For populations whose values are dispersed from the mean, the
population variance and standard deviation will be large.
The variance overcomes the weakness of the range by using all the
values in the population. PLEASE NOTE THAT THE DENOMINATOR IS
“N” for a population but “n-1” for sample variance
1-55
EXAMPLE – Population Variance and
Population Standard Deviation
The number of traffic citations issued during the last five months in Beaufort County, South Carolina, is
reported below:
Step 2: Find the difference between each observation and the mean, and square that difference.
Step 3: Sum all the squared differences found in step 3
Step 4: Divide the sum of the squared differences by the number of items in the population.
2 ( X ) 2
1,488
124
N 12
1-56
Sample Variance and
Standard Deviation
Where :
s 2 is the sample variance
X is the value of each observatio n in the sample
X is the mean of the sample
n is the number of observatio ns in the sample
EXAMPLE
The hourly wages for a sample of part-time
employees at Home Depot are: $12, $20,
$16, $18, and $19.
1-57
Chebyshev’s Theorem and Empirical Rule
1-58
The Arithmetic Mean and Standard
Deviation of Grouped Data
EXAMPLE:
Determine the arithmetic mean vehicle selling
EXAMPLE
price given in the frequency table below.
PLEASE NOTE THAT THE Compute the standard deviation of the vehicle
DENOMINATOR IS “N” for a population selling prices in the frequency table below.
variance but “n-1” for sample variance
1-59
Describing Data:
Displaying and Exploring Data
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Dot Plots
A dot plot groups the data as little as possible and the identity of an individual observation is not lost.
To develop a dot plot, each observation is simply displayed as a dot along a horizontal number line
indicating the possible values of the data.
If there are identical observations or the observations are too close to be shown individually, the dots are
“piled” on top of each other.
EXAMPLE
Reported below are the number of vehicles sold in the last 24 months at Smith Ford Mercury Jeep, Inc.,
in Kane, Pennsylvania, and Brophy Honda Volkswagen in Greenville, Ohio. Construct dot plots
and report summary statistics for the two small-town Auto USA lots.
1-61
Stem-and-Leaf
Stem-and-leaf display is a statistical technique to present a set of data. Each numerical value is divided
into two parts. The leading digit(s) becomes the stem and the trailing digit the leaf. The stems are located
along the vertical axis, and the leaf values are stacked against each other along the horizontal axis.
Two disadvantages to organizing the data into a frequency distribution:
(1) The exact identity of each value is lost
(2) Difficult to tell how the values within each class are distributed.
EXAMPLE
Listed in Table 4–1 is the number of 30-second radio advertising spots purchased by each of the 45
members of the Greater Buffalo Automobile Dealers Association last year. Organize the data into a
stem-and-leaf display. Around what values do the number of advertising spots tend to cluster?
What is the fewest number of spots purchased by a dealer? The largest number purchased?
1-62
Quartiles, Deciles and Percentiles
The standard deviation is the most widely used measure of dispersion.
Alternative ways of describing spread of data include determining the location of values that divide a set of
observations into equal parts.
To formalize the computational procedure, let Lp refer to the location of a desired percentile. So if we wanted
to find the 33rd percentile we would use L33 and if we wanted the median, the 50th percentile, then L50.
The number of observations is n, so if we want to locate the median, its position is at (n + 1)/2, or we could
write this as (n + 1)(P/100), where P is the desired percentile
1-63
Percentiles - Example
EXAMPLE
Listed below are the commissions earned last month by a sample of 15 brokers at Salomon
Smith Barney’s Oakland, California, office.
Locate the median, the first quartile, and the third quartile for the commissions earned.
Step 2: Compute the first and third quartiles. Locate L25 and L75 using:
25 75
L25 (15 1) 4 L75 (15 1) 12
100 100
Therefore, the first and third quartiles are located at the 4th and 12th
positions, respective ly
L25 $1,721
L75 $2,205
1-64
Boxplot - Example
Step 2: Draw a box that starts at Q1 (15 minutes) and ends at Q3 (22
minutes). Inside the box we place a vertical line to represent the median (18 minutes).
Step 3: Extend horizontal lines from the box out to the minimum value (13
minutes) and the maximum value (30 minutes).
1-65
Skewness
Another characteristic of a set of data is the shape.
There are four shapes commonly observed: symmetric, positively skewed, negatively skewed, bimodal.
1-66
Contingency Tables – An Example
Using the contingency table able, the quality of the three shifts can be compared. For
example:
1. On the day shift, 3 out of 20 windows or 15 percent are defective.
2. On the afternoon shift, 2 of 15 or 13 percent are defective and
3. On the night shift 1 out of 15 or 7 percent are defective.
4. Overall 12 percent of the windows are defective
1-67
Probability Concepts
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Probability, Experiment, Outcome, Event:
Defined
An experiment is a process
that leads to the occurrence
of one and only one of several
possible observations.
An outcome is the particular
result of an experiment.
An event is the collection of
one or more outcomes of an
experiment.
1-69
Approaches in Probability
Classical Approach
It is an approach in which the sampling
distribution of all outcomes is known with
certainty i.e the outcomes from a given
experiment are fixed.
The probabilities of the events are equally
likely e.g tossing a die, tossing a coin (head
or tail)
1-70
Relative Approach
1-71
Classical and Empirical Probability
EXAMPLE:
On February 1, 2003, the Space Shuttle Columbia
exploded. This was the second disaster in 113 space
missions for NASA. On the basis of this information,
what is the probability that a future mission is
successfully completed?
1-72
Subjective/ Judgemental Approach
1-73
Mutually Exclusive Events and
Collectively Exhaustive Events
Events are mutually exclusive if the occurrence of any one event means that
none of the others can occur at the same time.
Events are collectively exhaustive if at least one of the events must occur
when an experiment is conducted.
The sum of all collectively exhaustive and mutually exclusive events is 1.0 (or
100%)
collectively exhaustive
and mutually exclusive
events
Events are independent if the occurrence of one event does not affect the
occurrence of another.
1-74
Subjective Probability - Example
1-75
Summary of Types of Probability
1-76
Rules of Addition
Rules of Addition EXAMPLE:
Special Rule of Addition - If two events
A and B are mutually exclusive, the An automatic Shaw machine fills plastic bags with a mixture of
probability of one or the other event’s beans, broccoli, and other vegetables. Most of the bags contain
occurring equals the sum of their the correct weight, but because of the variation in the size of the
probabilities.
beans and other vegetables, a package might be underweight or
P(A or B) = P(A) + P(B) overweight. A check of 4,000 packages filled in the past month
revealed:
The General Rule of Addition - If A
and B are two events that are not
mutually exclusive, then P(A or B) is
given by the following formula:
P(A or B) = P(A) + P(B) - P(A and B)
1-77
The Complement Rule
The complement rule is used to determine EXAMPLE
the probability of an event occurring An automatic Shaw machine fills plastic bags with a mixture of
by subtracting the probability of the beans, broccoli, and other vegetables. Most of the bags contain
event not occurring from 1. the correct weight, but because of the variation in the size of the
P(A) + P(~A) = 1 beans and other vegetables, a package might be underweight or
or P(A) = 1 - P(~A). overweight. Use the complement rule to show the probability of
a satisfactory bag is .900
P(B) = 1 - P(~B)
= 1 – P(A or C)
= 1 – [P(A) + P(C)]
= 1 – [.025 + .075]
= 1 - .10
= .90
1-78
The General Rule of Addition and Joint Probability
1-79
Special and General Rules of Multiplication
The special rule of multiplication requires that The general rule of multiplication is used to find the joint
two events A and B are independent. probability that two independent events will occur.
Two events A and B are independent if the
occurrence of one has no effect on the
probability of the occurrence of the other.
This rule is written: P(A and B) = P(A)P(B)
EXAMPLE
EXAMPLE
A golfer has 12 golf shirts in his closet. Suppose 9 of these shirts
A survey by the American Automobile are white and the others blue. He gets dressed in the dark,
association (AAA) revealed 60 percent of its so he just grabs a shirt and puts it on. He plays golf two
members made airline reservations last year. days in a row and does not do laundry.
Two members are selected at random. Since
What is the likelihood both shirts selected are white?
the number of AAA members is very large,
we can assume that R1 and R2 are
independent. What is the probability both
made airline reservations last year?
Solution: The event that the first shirt selected is white is W1. The
The probability the first member made an airline probability is P(W1) = 9/12
reservation last year is .60, written as P(R1) = The event that the second shirt (W2 )selected is also white. The
.60
conditional probability that the second shirt selected is white,
The probability that the second member selected
made a reservation is also .60, so P(R2) = .60. given that the first shirt selected is also white, is
Since the number of AAA members is very large, you P(W2 | W1) = 8/11.
may assume that R1 and R2 are independent. To determine the probability of 2 white shirts being selected we
use formula: P(AB) = P(A) P(B|A)
P(R1 and R2) = P(R1)P(R2) = (.60)(.60) = .36 P(W1 and W2) = P(W1)P(W2 |W1) = (9/12)(8/11) = 0.55
1-80
Contingency Tables
A CONTINGENCY TABLE is a table used to classify sample observations according to two or more identifiable
characteristic
EXAMPLE:
A sample of executives were surveyed about
their loyalty to their company. One of the Event A1 happens if a randomly selected executive
questions was, “If you were given an offer will remain with the company despite an equal or
by another company equal to or slightly slightly better offer from another company. Since
better than your present position, would there are 120 executives out of the 200 in the
you remain with the company or take the survey who would remain with the company
other position?” The responses of the 200 P(A1) = 120/200, or .60.
executives in the survey were cross- Event B4 happens if a randomly selected executive
classified with their length of service with has more than 10 years of service with the
the company. What is the probability of company. Thus, P(B4| A1) is the conditional
randomly selecting an executive who is probability that an executive with more than 10
loyal to the company (would remain) and years of service would remain with the company.
who has more than 10 years of service? Of the 120 executives who would remain 75 have
more than 10 years of service, so
P(B4| A1) = 75/120.
1-81
Tree Diagrams
1-82
Permutation and Combination
EXAMPLE EXAMPLE
Suppose that in addition to selecting the There are 12 players on the Carolina Forest High
group, he must also rank each of the School basketball team. Coach Thompson
players in that starting lineup according to must pick five players among the twelve on
their ability. the team to comprise the starting lineup.
How many different groups are possible?
12! 12!
12 P 5 95,040 12 C5 792
(12 5)! 5!(12 5)!
1-83
Probability Distributions
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
What is a Probability Distribution?
PROBABILITY DISTRIBUTION A listing of all the outcomes of an experiment and the probability associated
with each outcome.
Experiment:
1-85
Random Variables
RANDOM VARIABLE A quantity resulting from an experiment that, by chance, can assume different values.
EXAMPLES EXAMPLES
1. The number of students in a class. 1. The length of each song on the latest Tim McGraw
2. The number of children in a family. album.
3. The number of cars entering a carwash in a 2. The weight of each student in this class.
hour. 3. The temperature outside as you are reading this
4. Number of home mortgages approved by book.
Coastal Federal Bank last week. 4. The amount of money earned by each of the more
than 750 players currently on Major League Baseball
team rosters.
1-86
The Mean and Variance of a Discrete
Probability Distribution
MEAN
•The mean is a typical value used to represent the central location of a probability distribution.
•The mean of a probability distribution is also referred to as its expected value.
The standard deviation is found by taking the positive square root of the variance.
1-87
Mean, Variance, and Standard
Deviation of a Probability Distribution - Example
VARIANCE
1-89
Binomial Distribution - Example
EXAMPLE Binomial – Shapes for Varying (n constant)
Five percent of the worm gears produced by
an automatic, high-speed Carter-Bell
milling machine are defective.
1-90
Poisson Probability Distribution
The Poisson probability distribution describes the number of times some event occurs during a specified
interval. The interval may be time, distance, area, or volume.
Examples include:
• The number of misspelled words per page in a newspaper.
• The number of calls per hour received by Dyson Vacuum Cleaner Company.
• The number of vehicles sold per day at Hyatt Buick GMC in Durham, North Carolina.
• The number of goals scored in a college soccer game.
1-91
Poisson Probability Distribution - Example
EXAMPLE
Assume baggage is rarely lost by Northwest Airlines.
Suppose a random sample of 1,000 flights shows a
total of 300 bags were lost. Thus, the arithmetic
mean number of lost bags per flight is 0.3
(300/1,000). If the number of lost bags per flight
follows a Poisson distribution with u = 0.3, find the
probability of not losing any bags.
.
Use Appendix B.5 to find the probability that no bags will
be lost on a particular flight.
1-92
More About the Poisson Probability
Distribution
•The Poisson probability distribution is always positively skewed and the random variable has no
specific upper limit.
•The Poisson distribution for the lost bags illustration, where µ=0.3, is highly skewed.
1-93
Normal Probability Distribution
1-94
The Standard Normal Probability Distribution
1-95
The Normal Distribution – Example
1-96
The Empirical Rule - Example
1-97
Normal Distribution – Finding Probabilities
EXAMPLE
The mean weekly income of a shift
foreman in the glass industry is
normally distributed with a mean of
$1,000 and a standard deviation of
$100.
1-98
Normal Distribution – Finding Probabilities
(Example 2)
What is the probability of selecting a shift foreman in What is the probability of selecting a shift foreman in the
the glass industry whose income is: glass industry whose income is:
Between $790 and $1,000? Between $840 and $1,200
1-99
Using Z in Finding X Given Area - Example
x - 67,900
- 1.75 , then solving for x
2,050
- 1.75(2,050) x - 67,900
x 67,900 - 1.75(2,050)
x 64,312
1-100
Normal Approximation to the Binomial
The normal distribution (a continuous distribution) yields a good approximation of the binomial
distribution (a discrete distribution) for large values of n.
The normal probability distribution is generally a good approximation to the binomial probability
distribution when n and n(1- ) are both greater than 5. This is because as n increases, a
binomial distribution gets closer and closer to a normal distribution.
1-101
Normal Approximation to the Binomial - Example
Suppose the management of the Santoni Pizza Restaurant found that 70 percent of its new
customers return for another meal. For a week in which 80 new (first-time) customers
dined at Santoni’s, what is the probability that 60 or more will return for another meal?
1-103