0% found this document useful (0 votes)
19 views

Week 01 Introduction and Graphical Statistics

Copyright
© © All Rights Reserved
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views

Week 01 Introduction and Graphical Statistics

Copyright
© © All Rights Reserved
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 19

COQTA1.

B3
• Introduction
• Excel data analysis toolpak installation demo
• Motivation and examples
• Population and sample
• Graphical statistics
• Descriptive statistics
Population and Sample (1.3)
POPULATION
A population consists of all the items or
individuals about which you want to draw a
conclusion. The population is the “large
group”

SAMPLE
A sample is the portion of a population
selected for analysis. The sample is the “small
group”
Population vs. Sample
Population Sample

All the items or individuals about A portion of the population of


which you want to draw conclusion(s) items or individuals
Probability Sample:
• Simple
Every individual Random
or item Sample
from the frame has an equal chance
of being selected

• Samples obtained from table of random numbers or


computer random number generators.
Examples

• Wrong sampling practice. 1936 US Presidential


Elections. Literary Digest collected a sample of
size n=10,000,000 which was heavily biased. Got a
wrong prediction.

• Good sampling practice. 1980 trial of Chrysler


Corporation vs. United States Environmental
Protection Agency. A very clean sample of n=10
cars provided a bulletproof evidence.

Chap 1-5
Graphical Statistics (2.3-2.5)
Before you do anything with your data,
look at it

In Excel: INSERT → CHARTS


Data Analysis Toolpak → Histogram
Types of Variables

 Categorical (qualitative) variables have values


that can only be placed into categories, such as
“yes” and “no”; major; architectural style; etc.

 Numerical (quantitative) variables have values


that represent quantities.
 Discrete variables arise from a counting process
 Continuous variables arise from a measuring process

.
Types of Variables
Variables

Categorical Numerical

Examples:
 Marital Status
 Political Party Discrete Continuous
 Eye Color
(Defined categories) Examples: Examples:
 Number of Children  Weight
 Defects per hour  Voltage
(Counted items) (Measured characteristics)

. Chap 1-8
Levels of Measurement
A nominal scale classifies data into distinct categories in
which no ranking is implied.

Categorical Variables Categories

Personal Computer Yes / No


Ownership

Type of Stocks Owned Growth / Value/ Other

Internet Provider AT&T, Verizon, Time Warner Cable

.
Levels of Measurement (con’t.)

An ordinal scale classifies data into distinct


categories in which ranking is implied
Categorical Variable Ordered Categories

Student class designation Freshman, Sophomore, Junior,


Senior
Product satisfaction Satisfied, Neutral, Unsatisfied

Faculty rank Professor, Associate Professor,


Assistant Professor, Instructor
Standard & Poor’s bond ratings AAA, AA, A, BBB, BB, B, CCC, CC,
C, DDD, DD, D
Student Grades A, B, C, D, F

.
Levels of Measurement (con’t.)

 An interval scale is an ordered scale in which the


difference between measurements is a meaningful
quantity but the measurements do not have a true
zero point.

 A ratio scale is an ordered scale in which the


difference between the measurements is a
meaningful quantity and the measurements have a
true zero point.

.
Interval and Ratio Scales

.
Visualizing Categorical Data:
The Bar Chart
 In a bar chart, a bar shows each category, the length of which
represents the amount, frequency or percentage of values falling
into a category which come from the summary table of the variable.

Banking Preference

Banking Preference? % Internet


ATM 16%
In person at branch
Automated or live 2%
telephone
Drive-through service at branch
Drive-through service at 17%
branch
In person at branch 41% Automated or live telephone

Internet 24%
ATM

0% 5% 10% 15% 20% 25% 30% 35% 40% 45%


Visualizing Categorical Data:
The Pie Chart
 The pie chart is a circle broken up into slices that represent categories.
The size of each slice of the pie varies according to the percentage in
each category.
Banking Preference

Banking Preference? %
16% ATM
ATM 16% 24%
2% Automated or live
Automated or live 2%
telephone
telephone
Drive-through service at
Drive-through service at 17%
17% branch
branch
In person at branch
In person at branch 41%
Internet 24% Internet
41%
Visualizing Numerical Data:
Class The Histogram
Frequency
Relative
Frequency
Percentage

10 but less than 20 3 .15 15


20 but less than 30 6 .30 30
30 but less than 40 5 .25 25

40 but less than 50 4 .20 20


50 but less than 60 2 .10 10
8
Total 20 1.00 100Histogram: Age Of Students
6
Frequency

(In a percentage 2
histogram the vertical
axis would be defined to 0
show the percentage of 5 15 25 35 45 55 More
observations per class)
Visualizing Two Numerical Variables: Scatter
Plot

Volume Cost per


per day day
Cost per Day vs. Production Volume
23 125
26 140 250
200
Cost per Day

29 146
150
33 160
100
38 167 50
42 170 0
20 30 40 50 60 70
50 188
Volume per Day
55 195
60 200
Visualizing Two Numerical Variables: Time
Series Plot

Number of
Year Franchises Number of Franchises, 1996-2004
120
1996 43
100
1997 54 Franchises
Number of
80
1998 60 60
1999 73 40

2000 82 20
0
2001 95
1994 1996 1998 2000 2002 2004 2006
2002 107 Year
2003 99
2004 95
Examples
% of electricity
Appliances consumption Construct a bar chart and a pie
AC 18 chart.
Dryers 5
Washers 24
Computers 1
Make conclusions.
Cooking 2
Dishes 2
Freezers 2
Lighting 16
Friges 9
Heating 7
Water heating 8
TV etc 6
Examples
#2.39, p.58 “Cost of baseball games”.
Dataset BBcost2011 (BBcost2015).
Construct a histogram.

#2.54, p.62 “Stock performance”.


Construct a time series plot. Is there a pattern?

You might also like