Week 01 Introduction and Graphical Statistics
B3
• Introduction
• Excel data analysis toolpak installation demo
• Motivation and examples
• Population and sample
• Graphical statistics
• Descriptive statistics
Population and Sample (1.3)
POPULATION
A population consists of all the items or individuals about which you want to draw a conclusion. The population is the “large group”.
SAMPLE
A sample is the portion of a population selected for analysis. The sample is the “small group”.
Population vs. Sample
[Figure: diagram labeled Population and Sample]
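The relationship between the "large group" and the "small group" can be sketched in a few lines of Python. This is only an illustration: the population of 1,000 ID numbers, the sample size of 25, and the seed are all assumptions made for the example, not values from the course.

```python
import random

# Hypothetical population: ID numbers of 1,000 items we want to draw a conclusion about.
population = list(range(1, 1001))

# Fixed seed so the sketch is reproducible; the seed value itself is arbitrary.
rng = random.Random(1)

# The sample: 25 items selected from the population without replacement.
sample = rng.sample(population, k=25)

print("Population size:", len(population))
print("Sample size:", len(sample))
print("Every sampled item belongs to the population:", set(sample) <= set(population))
```

Every item in the sample comes from the population, but the sample is much smaller, which is what makes it practical to analyze.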
Graphical Statistics (2.3-2.5)
Before you do anything with your data, look at it.
Types of Variables
Variables
• Categorical (defined categories)
  Examples: Marital Status, Political Party, Eye Color
• Numerical
  • Discrete (counted items)
    Examples: Number of Children, Defects per hour
  • Continuous (measured characteristics)
    Examples: Weight, Voltage
Levels of Measurement
A nominal scale classifies data into distinct categories in
which no ranking is implied.
Interval and Ratio Scales
Visualizing Categorical Data: The Bar Chart
In a bar chart, each category is shown as a bar whose length represents the
amount, frequency, or percentage of values falling into that category. These
values come from the summary table of the variable.
Banking Preference
Banking Preference?                  %
ATM                                  16%
Automated or live telephone          2%
Drive-through service at branch      17%
In person at branch                  41%
Internet                             24%
[Figure: bar chart of Banking Preference, one bar per category]
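To make the link between a summary table and its bar chart concrete, here is a minimal Python sketch that prints a text bar chart from the banking preference percentages. The function name text_bar_chart and the "#" symbol are choices made for this illustration; in the course itself the chart would be built in Excel.

```python
# Summary table from the slide: banking preference and percentage of responses.
summary = {
    "ATM": 16,
    "Automated or live telephone": 2,
    "Drive-through service at branch": 17,
    "In person at branch": 41,
    "Internet": 24,
}

def text_bar_chart(table, symbol="#"):
    """Return one line per category; the bar length equals the percentage."""
    width = max(len(name) for name in table)
    return [f"{name:<{width}} | {symbol * pct} {pct}%" for name, pct in table.items()]

lines = text_bar_chart(summary)
print("\n".join(lines))
```

Each bar is one "#" per percentage point, so the longest bar (In person at branch) is immediately visible.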
Visualizing Numerical Data: The Histogram
[Table columns: Class, Frequency, Relative Frequency, Percentage]
[Figure: histogram; horizontal axis shows class midpoints 5, 15, 25, 35, 45, 55, More]
(In a percentage histogram the vertical axis would be defined to show the
percentage of observations per class)
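The columns behind a histogram (class, frequency, relative frequency, percentage) can be computed as in the sketch below. The 20 observations are hypothetical, and the class width of 10 starting at 0 is an assumption chosen so that the class midpoints are 5, 15, ..., 55; the function name frequency_distribution is also just for illustration.

```python
# Hypothetical observations (not from the slide), all between 0 and 60.
data = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30,
        32, 35, 37, 38, 41, 43, 44, 46, 53, 58]

def frequency_distribution(values, start, width, n_classes):
    """Count values in the classes [start, start + width), [start + width, ...), ..."""
    counts = [0] * n_classes
    for v in values:
        index = int((v - start) // width)
        if 0 <= index < n_classes:  # values outside all classes are not counted
            counts[index] += 1
    total = len(values)
    rows = []
    for i, freq in enumerate(counts):
        lower = start + i * width
        rows.append({
            "class": f"{lower} but less than {lower + width}",
            "midpoint": lower + width / 2,
            "frequency": freq,
            "relative_frequency": freq / total,
            "percentage": 100 * freq / total,
        })
    return rows

table = frequency_distribution(data, start=0, width=10, n_classes=6)
for row in table:
    print(f"{row['class']:<20} {row['frequency']:>3} "
          f"{row['relative_frequency']:.2f} {row['percentage']:.0f}%")
```

A frequency histogram plots the frequency column against the classes; a percentage histogram plots the percentage column instead, and the shape of the chart is the same.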
Visualizing Two Numerical Variables: Scatter Plot
29  146
33  160
38  167
42  170
50  188
55  195
60  200
[Figure: scatter plot of the pairs above; horizontal axis: Volume per Day]
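As a rough illustration of what a scatter plot does, the sketch below places the slide's pairs on a character grid. The horizontal range 20 to 70 follows the axis ticks on the slide; the vertical range 0 to 250, the grid size, and the function name text_scatter are assumptions made for this example.

```python
# (x, y) pairs from the slide; x is Volume per Day, the name of y is not shown there.
pairs = [(29, 146), (33, 160), (38, 167), (42, 170), (50, 188), (55, 195), (60, 200)]

def text_scatter(points, x_range=(20, 70), y_range=(0, 250), cols=26, rows=11):
    """Place one '*' per point on a character grid; the first line is the top of the plot."""
    grid = [[" "] * cols for _ in range(rows)]
    for x, y in points:
        col = round((x - x_range[0]) / (x_range[1] - x_range[0]) * (cols - 1))
        row = round((y - y_range[0]) / (y_range[1] - y_range[0]) * (rows - 1))
        grid[rows - 1 - row][col] = "*"
    return ["".join(line) for line in grid]

plot = text_scatter(pairs)
print("\n".join(plot))

# In this data set the second value rises every time the first one does.
increasing = all(y1 < y2 for (_, y1), (_, y2) in zip(pairs, pairs[1:]))
print("y increases with x:", increasing)
```

The points climb from lower left to upper right, which is the visual signature of a positive relationship between the two variables.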
Visualizing Two Numerical Variables: Time Series Plot
Year    Number of Franchises
1996    43
1997    54
1998    60
1999    73
2000    82
2001    95
2002    107
2003    99
2004    95
[Figure: time series plot "Number of Franchises, 1996-2004"; horizontal axis: Year; vertical axis: Number of Franchises]
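What a time series plot shows at a glance (growth, a peak, then a decline) can also be read off numerically. The following is a small sketch using the franchise counts from the slide; the variable names are illustrative only.

```python
# Number of franchises per year, copied from the slide's table.
franchises = {1996: 43, 1997: 54, 1998: 60, 1999: 73, 2000: 82,
              2001: 95, 2002: 107, 2003: 99, 2004: 95}

years = sorted(franchises)

# Change from the previous year; the first year has no predecessor.
changes = {year: franchises[year] - franchises[year - 1] for year in years[1:]}

# Year with the largest number of franchises.
peak_year = max(franchises, key=franchises.get)

for year in years:
    change = changes.get(year)
    note = "" if change is None else f"{change:+d}"
    print(year, franchises[year], note)
print("Peak:", peak_year, franchises[peak_year])
```

The year-over-year changes are positive through 2002 and negative afterwards, matching the rise and fall visible in the plot.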
Examples
Construct a bar chart and a pie chart. Make conclusions.
Appliances        % of electricity consumption
AC                18
Dryers            5
Washers           24
Computers         1
Cooking           2
Dishes            2
Freezers          2
Lighting          16
Fridges           9
Heating           7
Water heating     8
TV etc            6
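One possible way to start on this exercise is sketched below: rank the appliances from largest to smallest share, print a text bar for each, and compute the pie slice angle (percentage times 3.6 degrees). This is a sketch rather than the expected solution; ranking the categories and looking at the top three are choices made for this illustration.

```python
# Percent of electricity consumption by appliance, copied from the exercise table.
consumption = {
    "AC": 18, "Dryers": 5, "Washers": 24, "Computers": 1,
    "Cooking": 2, "Dishes": 2, "Freezers": 2, "Lighting": 16,
    "Fridges": 9, "Heating": 7, "Water heating": 8, "TV etc": 6,
}

# Rank categories from largest to smallest share (a common choice for bar charts).
ranked = sorted(consumption.items(), key=lambda item: item[1], reverse=True)

# Pie chart slice angle: each percentage point corresponds to 3.6 degrees.
angles = {name: pct * 3.6 for name, pct in consumption.items()}

for name, pct in ranked:
    print(f"{name:<14}{'#' * pct} {pct}%  (pie slice {angles[name]:.1f} degrees)")

top_three_share = sum(pct for _, pct in ranked[:3])
print("Share of the three largest categories:", top_three_share, "%")
```

The ranked output makes one conclusion easy to state: washers, AC, and lighting together account for more than half of the consumption.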
Examples
#2.39, p.58 “Cost of baseball games”.
Dataset BBcost2011 (BBcost2015).
Construct a histogram.