Week 1 - Introduction To Descriptive Statistics
Week 1 - Introduction To Descriptive Statistics
Essentials of Modern
Business Statistics (7e)
Anderson, Sweeney, Williams, Camm, Cochran
© 2018 Cengage Learning
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 1
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Chapter 3, Part A
Descriptive Statistics: Numerical Measures
❑ Measures of Location 位置度量
可变性度量
❑ Measures of Variability
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 2
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Numerical Measures
▪ If the measures are computed for data from a sample, they are
called sample statistics.
▪ If the measures are computed for data from a population, they are
called population parameters.
▪ A sample statistic is referred to as the point estimator of the
corresponding population parameter.
使用正确的词汇 vocabulary,提升沟通的效率。
样本统计数据 - 人口统计数据,用样本统计整体的情况。
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 3
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Measures of Location
❑ Mean
位置度量: “这个样本处于什么位置”
❑ Median 比如体重、智商、一管牙膏的体积。
一个统计数据的中心在哪儿
❑ Mode
❑ Weighted Mean
❑ Geometric Mean
❑ Percentiles
❑ Quartiles
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 4
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Mean
Perhaps the most important measure of location is the mean.
▪ The mean provides a measure of central location.
▪ The mean of a data set is the average of all the data values.
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 5
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 6
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Population Mean m
和上页唯一的区别是大写N,因为是总值/population的总个数
现实世界中,无法访问到所有population中的每一个个体。所以要抽
样调查获得某种事实。
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 7
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 8
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 9
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Median, Part 1
▪ The median of a data set is the value in the middle when the data
items are arranged in ascending order.
▪ Whenever a data set has extreme values, median is the preferred
measure of central location.
▪ The median is the measure of location most often reported for
annual income and property value data.
▪ A few extremely large incomes or property values can inflate the
mean.
中位数 - 排序后处于中间的值。
有时候比均值更有代表性,因为中位数排除了个别极端值的影响
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 10
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Median, Part 2
For an odd number of observations:
7 observations
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 11
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Median, Part 3
For an even number of observations: 样本量是偶数个数时候,取中间两个值求个简单平均
8 observations
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 12
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Median, Part 4
Example: Monthly Starting Salary
Averaging the 6th and 7th data values:
中位数,排除了个别极端值的
影响
3,710 3,755
3,850 3,880
3,880 3,890
3,920 3,940
3,950 4,050
Note: The data is in ascending order 4,130 4,325
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 13
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Trimmed Mean
▪ Another measure sometimes used when extreme values are
present, is the trimmed mean.
▪ It is obtained by deleting a percentage of the smallest and largest
values from a data set and then computing the mean of the
remaining values.
▪ For example, the 5% trimmed mean is obtained by removing the
smallest 5% and the largest 5% of the data values and then
computing the mean of the remaining values.
去掉最高最低的一部分后求平均,也是为了修剪掉极端值
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 14
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
▪ The mode of a data set is the value that occurs with greatest
frequency. 数据集的mode 模态
▪ The greatest frequency can occur at two or more different values.
▪ If the data have exactly two modes, the data are bimodal.
▪ If the data have more than two modes, the data are multimodal.
双模态 多模态
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 15
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Mode, Part 2
Example: Monthly Starting Salary
The only monthly starting salary that occurs more than once is $3,880.
Mode = 3,880
Monthly Starting Monthly Starting
Salary ($) Salary ($)
3,710 3,755
3,850 3,880 (circled)
3,880 3,890
3,920 3,940
3,950 4,050
4,130 4,325
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 17
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 3850
2 3950
3 4050
4 3880
5 3755
6 3710
7 3890
8 4130
9 3940
10 4325
11 3920
12 3880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 18
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Graduate Monthly
Salary ($)
1 3850
2 3950
3 4050
4 3880
5 3755
6 3710
7 3890
8 4130
9 3940
10 4325
11 3920
12 3880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 19
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 20
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 21
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 22
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 24
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 25
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
4 4.9 1.049
5 15.8 1.158
6 5.5 1.055
7 -37.0 .630
8 26.5 1.265
9 15.1 1.151
10 2.1 1.021
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 26
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 27
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 28
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 -22.1 .779
2 28.7 1.287
3 10.9 1.109
4 4.9 1.049
5 15.8 1.158
6 5.5 1.055
7 -37 0.63
8 26.5 1.265
9 15.1 1.151
10 2.1 1.021
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 29
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 (22.1) .779
2 28.7 1.287
3 10.9 1.109
4 4.9 1.049
5 15.8 1.158
6 5.5 1.055
7 (37.0) 0.63
8 26.5 1.265
9 15.1 1.151
10 2.1 1.021
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 30
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Percentiles, Part 1
▪ A percentile provides information about how the data are spread
over the interval from the smallest value to the largest value.
▪ Admission test scores for colleges and universities are frequently
reported in terms of percentiles.
▪ The pth percentile of a data set is a value such that at least p
percent of the items take on this value or less and at least (100 - p)
percent of the items take on this value or more.
百分位,确定分布的位置,比如第90百分位数,90%的数值是等于或小于这个值,10%大于这个值。
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 31
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Percentiles, Part 2
Arrange the data in ascending order.
Compute Lp, the location of the pth percentile.
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 32
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
3,710 3,755
3,850 3,880
3,880 3,890
3,920 3,940
3,950 4,050
4,130 4,325
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
33
Modern Business Statistics (6e)
3,710 3,755
3,850 3,880
3,880 3,890
3,920 3,940
3,950 4,050
4,130 4,325
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 34
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 35
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 3850
2 3950
3 4050
4 3880
5 3755
6 3710
7 3890
8 4130
9 3940
10 4325
11 3920
12 3880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 36
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 3850
2 3950
3 4050
4 3880
5 3755
6 3710
7 3890
8 4130
9 3940
10 4325
11 3920
12 3880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 37
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Quartiles
Quartiles are specific percentiles.
❑ First Quartile = 25th Percentile
❑ Second Quartile = 50th Percentile = Median
❑ Third Quartile = 75th Percentile
主要需要了解四分位数,比如说箱型图的使用。
第25个百分位数,第50个百分位数,第75个百分位数
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 38
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
(the 9th value plus .75 times the difference between the
10th and 9th values)
3,710 3,755
3,850 3,880
3,880 3,890
3,920 3,940
3,950 4,050
4,130 4,325
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use.
39
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 40
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 3850
2 3950
3 4050
4 3880
5 3755
6 3710
7 3890
8 4130
9 3940
10 4325
11 3920
12 3880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 41
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 3850
2 3950
3 4050
4 3880
5 3755
6 3710
7 3890
8 4130
9 3940
10 4325
11 3920
12 3880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 42
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 43
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
❑ Range
❑ Interquartile Range
❑ Variance
❑ Standard Deviation
❑ Coefficient of Variation
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 44
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Range, Part 1
▪ The range of a data set is the difference between the largest and
smallest data values.
Range = Largest value – Smallest value
▪ It is the simplest measure of variability.
▪ It is very sensitive to the smallest and largest data values.
范围:最大值减去最小值,很简单。
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 45
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Range, Part 2
Example: Monthly Starting Salary
Range = largest value - smallest value
3,850 3,880
3,880 3,890
3,920 3,940
3,950 4,050
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 46
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Interquartile Range
▪ The interquartile range of a data set is the difference between the
third quartile and the first quartile.
▪ It is the range for the middle 50% of the data.
▪ It overcomes the sensitivity to extreme data values.
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 47
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 48
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Variance, Part 1
▪ The variance is a measure of variability that utilizes all the data.
方差,很重要的概念,是后续很多活动的基础。
使用数据集中的每个值,基于每个值与均值之间的差异。
使用方差可以比较各种不同的特征。
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 49
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Variance, Part 2
▪ The variance is the average of the squared differences between
each data value and the mean.
▪ The variance is computed as follows:
基于样本使用x拔,基于总体使
用μ。
不论是整体还是样本,都要使用
数据的每个点与均值之差的平
方。
注意分母的区别(为什么?) for a for a
sample population
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 50
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
标准差就是前面两个值的平方根。
标准差与数据的单位是相同的,而不是平方。
比如说IQ的标准差也是IQ值而不是平方。
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 51
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
For a For a
sample population
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 52
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 53
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Coefficient of Variation
▪ The coefficient of variation indicates how large the standard
deviation is in relation to the mean.
The coefficient of variation is computed as follows
变异系数:标准差除以平均值乘
以100%
表示标准差与均值之间的差异大
小
for a for a
sample population
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 54
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
Standard Deviation
Coefficient of Variation
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or
otherwise on a password-protected website or school-approved learning management system for classroom use. 55
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 56
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
1 3,850
2 3,950
3 4,050
4 3,880
5 3,755
6 3,710
7 3,890
8 4,130
9 3,940
10 4,325
11 3,920
12 3,880
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 57
otherwise on a password-protected website or school-approved learning management system for classroom use.
Modern Business Statistics (6e)
© 2018 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or 58
otherwise on a password-protected website or school-approved learning management system for classroom use.