Eco2061 Week 2
Eco2061 Week 2
Chapter 1:
*Slides are based on Statistics for Managers: Using Microsoft Excel, 6th Edition by David M. Levine, Mark L.
Berenson, Timothy C. Krehbiel, David F. Stephan; Statistics for Business and Economics, by Newbold, P.,
Carlson, W.L., Thorne, B., 8th ed. Pearson; and Statistics for Business and Economics by Anderson, Sweeney,
Williams, Freeman, Shoesmith3rd ed., slides from
Outline
Why learn statistics
How business uses statistics
turning data into information to facilitate decision making
Why collect data?
Source of data
Difference between descriptive and inferential statistics
The basic vocabulary of statistics
The types of data used in business
To develop tables and charts for categorial and numerical data
Presenting graphs
Why Learn Statistics
Statistics
The branch of mathematics that transforms data into
useful information for decision makers.
Collect data
e.g., Survey
Present data
e.g., Tables and graphs
Characterize data
e.g., The sample mean
1-8
Inferential Statistics
Estimation
e.g., Estimate the population
1-9
Sources of Data
Primary Sources: The data collector is the one using the data
for analysis
Data from a political survey
Data collected from an experiment
Observed data
Secondary Sources: The person performing data analysis is
not the data collector
Analyzing census data
Examining data from print journals or data published on the internet.
Sources of data fall into four categories
A designed experiment
A survey
An observational study
Examples Of Data Distributed By
Organizations or Individuals
VARIABLES
Variables are a characteristics of an item or individual and are what you
analyze when you use a statistical method.
DATA
Data are the different values associated with a variable.
OPERATIONAL DEFINITIONS
Data values are meaningless unless their variables have operational
definitions, universally accepted meanings that are clear to all associated
with an analysis.
Basic Vocabulary of Statistics
POPULATION
A population consists of all the items or individuals about which
you want to draw a conclusion. The population is the “large
group”
SAMPLE
A sample is the portion of a population selected for analysis. The
sample is the “small group”
PARAMETER
A parameter is a numerical measure that describes a
characteristic of a population.
STATISTIC
A statistic is a numerical measure that describes a characteristic of
a sample.
Population vs. Sample
Population Sample
Example:
Suppose you wish to sample n = 9 items from a
population of N = 72.
j = N / n = 72 / 9 = 8
Variables
Categorical Numerical
(Qualitative) (Quantitative)
Examples:
Marital Status
Political Party Discrete Continuous
Eye Color
(Defined categories)
Examples: Examples:
Number of Children Weight
Defects per hour Voltage
(Counted items) (Measured characteristics)
Levels of Measurement
Categorical Variables
Categorical Numerical
Variables Variables
Categorical
Data
Frequency
Distribution Bar Pie Pareto
Table Chart Chart Diagram
Categorical Data Are Organized By Utilizing
Tables
Categorical
Data
Tallying Data
One Two
Categorical Categorical
Variable Variables
Summary Contingency
Table Table
Data is collected with tally marks which are frequencies or occurrences in a
tally chart.
Organizing Categorical Data: Summary
Table
Banking Preference
Banking Preference? %
16% ATM
ATM 16% 24%
Automated or live 2% 2% Automated or live
telephone telephone
Internet 24%
Internet
41%
A Contingency Table Helps Organize Two
or More Categorical Variables
100% 100%
% in each category
Cumulative %
(line graph)
(bar graph)
60% 60%
ATM 16%
40% 40%
Automated or live 2%
20% 20% telephone
Drive-through service at 17%
0% 0% branch
In person Internet Drive- ATM Automated
at branch through or live In person at branch 41%
service at telephone Internet 24%
branch
Graphical
Presentation of Data
Categorical Numerical
Variables Variables
Price ($)
5.000
4.500
4.000
3.500
3.000
Price ($)
2.500
2.000
1.500
1.000
0.500
0.000
1975 1980 1985 1990 1995 2000 2005 2010
Visualizing Numerical Data
By Using Graphical Displays
Numerical Data
Frequency Distributions
Ordered Array and
Cumulative Distributions
Stem-and-Leaf
Display Histogram Ogive
Organizing Numerical Data:
Ordered Array
24, 35, 17, 21, 24, 37, 26, 46, 58, 30, 32, 13, 12, 38, 41, 43, 44, 27, 53,
27
Organizing Numerical Data:
Frequency Distribution Example
Relative
Class Frequency Frequency Percentage
Cumulative Cumulative
Class Frequency Percentage
Frequency Percentage
Frequency
with gaps from empty classes 2
1.5
Can give a poor indication of how 1
0.5
frequency varies across classes 0
4
8
12
16
20
24
28
32
36
40
44
48
52
56
60
More
Temperature
Frequency
yield a blocky distribution 6
4
can obscure important patterns of 2
variation. 0
0 30 60 More
Temperature
(X axis labels are upper class endpoints)
Visualizing Numerical Data:
The Polygon
7
6
Frequency
5
4
3
2
(In a percentage
1
polygon the vertical axis 0
would be defined to
show the percentage of 5 15 25 35 45 55 65
Class Midpoints
observations per class)
Visualizing Numerical Data:
The Ogive
Lower % less
class than lower
Class boundary boundary
10 but less than 20 10 15
20 but less than 30 20 45
30 but less than 40 30 70
40 but less than 50 40 90 Ogive: Age Of Students
50 but less than 60 50 100
100
80
Cumulative Percentage 60
40
20
In an ogive the percentage 0
of the observations less
than each lower class 10 20 30 40 50 60
boundary are plotted versus
the lower class boundaries. Lower Class Boundary
Visualizing Two Numerical
Variables: The Scatter Plot
Scatter plots are used for numerical data consisting of paired
observations taken from two numerical variables
26 140 200
Cost per Day
150
29 146
100
33 160 50
38 167 0
20 30 40 50 60 70
42 170
Volume per Day
50 188
55 195
60 200
Data Presentation Errors