0% found this document useful (0 votes)

111 views39 pages

Numerical Descriptive Measures 1

statistics

Uploaded by

Dolores Abangan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

111 views39 pages

Numerical Descriptive Measures 1

statistics

Uploaded by

Dolores Abangan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 39

Numerical Descriptive

Measures
Measures of Central Tendency

Mean, Median, Mode, Geometric Mean

Quartiles
Measures of Variation

Range, Interquartile Range, Variance and Standard Deviation,

Coefficient of Variation
Shape

Symmetric, Skewed
Using Box-and-Whisker Plots
Coefficient of Correlation
Pitfalls in Numerical Descriptive Measures and Ethical Issues

Summary Measures
Summary Measures

Central Tendency
Mean

Quartiles

Variation

Mode
Median

Range

Coefficient of
Variation

Variance

Geometric Mean

Standard Deviation

Introduction
Think of a sample portfolio composed of three

stocks.

200
shares
100 shares ARR =
ARR = 10% 15%

100 shares
ARR = 20%

A central measure for this portfolios ARR for is 15%.

Now observe the following portfolio
A central measure of this portfolios ARR for is 15% too.
200
shares
100
100 shares
shares ARR =
ARR
ARR == 5%
5% 15%

100 shares
ARR = 25%

Considering the average ARR only the two

portfolios are equal. But are they really?

Is the dispersion of ARR the same for the two
portfolio?
The dispersion (variability) is an important
property when describing a set of numbers, at
least as important as the central location.

Measures of Central
Tendency
Central Tendency

Mean

Median

Mode

X
i 1

i 1

Chap 3-5

Geometric Mean

X G X 1 X 2 L X n

2004 Prentice-Hall, Inc.

1/ n

Measures of Central Tendency

The central data point reflects the
locations of all the actual data points.
How?
With two data points,
the central location
With one data point
should fall in the middle
clearly the central
location is at the point between them (in order
to reflect the location of
itself.
both of them).

Measures of Central
Tendency
The central data point reflects the
locations of all the actual data points.
How?
If the third data point appears in the center
the measure of central location will remain
in the center, but (click)

Measures of Central
Tendency
The central data point reflects the
locations of all the actual data points.
How?
But if the third data point
appears on the left hand-side
of the midrange, it should pull
the central location to the left.

Measures of Central
Tendency
As more and more data points are added, the
central location moves (left and right) as required
in order to reflect the effects of all the points.

Mean (Arithmetic Mean)

Mean (Arithmetic Mean) of Data Values
Sample mean

Sample Size

X
i 1

Population mean
N

X
i 1

X1 X 2 L X n

n
Population Size

X1 X 2 L X N

Mean (Arithmetic Mean)

The Most Common Measure of Central Tendency
Affected by Extreme Values (Outliers)

0 1 2 3 4 5 6 7 8 9 10
Mean = 5

0 1 2 3 4 5 6 7 8 9 10 12 14
Mean = 6

Median
Robust Measure of Central Tendency
Not Affected by Extreme Values
0 1 2 3 4 5 6 7 8 9 10
Median = 5

0 1 2 3 4 5 6 7 8 9 10 12 14
Median = 5

In an Ordered Array, the Median is the Middle

Number
If n or N is odd, the median is the middle number
If n or N is even, the median is the average of the 2

middle numbers

Mode
A Measure of Central Tendency
Value that Occurs Most Often
Not Affected by Extreme Values
There May Not Be a Mode
There May Be Several Modes
Used for Either Numerical or Categorical Data

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Mode = 9

0 1 2 3 4 5 6
No Mode

Geometric Mean
Useful in the Measure of Rate of Change of a

Variable Over Time

X G X 1 X 2 L X n

1/ n

Geometric Mean Rate of Return

Measures the status of an investment over time

RG 1 R1 1 R2 L 1 Rn

1/ n

Example
An investment of $100,000 declined to $50,000 at the
end of year one and rebounded back to $100,000 at end
of year two:

R1 0.5 (or 50%)

R2 1 (or 100% )

Average rate of return:

( 0.5) (1)
R
0.25 (or 25%)
2
Geometric rate of return:
RG 1 0.5 1 1
0.5 2

1/ 2

1 11/ 2 1 0 (or 0%)

Quartiles
Split Ordered Data into 4 Quarters

25%

i n 1
Position of i-th Quartile Qi
4
Q and
1

Q3 are Measures of Non-central Location

Q2 = Median, a Measure of Central Tendency

Quartiles
The lower half of a data set is the set of all values that are

to the left of the median value when the data has been put
into increasing order.
The upper half of a data set is the set of all values that are
to the right of the median value when the data has been
put into increasing order.
The first quartile, denoted by Q1 , is the median of
the lower half of the data set. This means that about 25%
of the numbers in the data set lie below Q1 and about 75%
lie above Q1 .
The third quartile, denoted by Q3 , is the median of

the upper half of the data set. This means that about 75%
of the numbers in the data set lie below Q3 and about 25%
lie above Q3 .

Quartiles
Data in Ordered Array: 11 12 13 16 16 17 17 18 21
Median

1 9 1
Position of Q1
2.5
4

12 13

12.5

(17 18)
Q3 2 17.5

Measures of Variation
Measures of central location fail to tell the whole

story about the distribution.

A question of interest still remains unanswered:

How much are the values of a given set

spread out around the mean value?

Measures of Variation
Variation

Range

Variance

Interquartile
Range
Population
Variance
Sample
Variance

Standard
Deviation
Population
Standard
Deviation
Sample
Standard
Deviation

Coefficient
of Variation

Range
Measure of Variation
Difference between the Largest and the Smallest

Observations:

Range X Largest X Smallest

Ignores How Data are Distributed
Range = 12 - 7 = 5

Range = 12 - 7 = 5

Chap 3-21

2004 Prentice-Hall, Inc.

Interquartile Range
Measure of Variation
Also Known as Midspread
Spread in the middle 50%

Difference between the First and Third Quartiles

Data in Ordered Array: 11 12 13 16 16 17 17 18 21

Interquartile Range Q3 Q1 17.5 12.5 5

Not Affected by Extreme Values

Variance
Important Measure of Variation
Shows Variation about the Mean
Sample Variance:
n

S2
Population Variance:

X
i 1

n 1

X
i 1

The Variance
Example
Find the variance of the following set of numbers,
representing annual rates of returns for a group of
mutual funds. Assume the set is (i) a sample, (ii) a
population: -2, 4, 5, 6.9, 10

Solution:

The Variance
Solution:

Assuming a sample

Standard Deviation
Most Important Measure of Variation
Shows Variation about the Mean
Has the Same Units as the Original Data
Sample Standard Deviation:

S
Population Standard Deviation:

X
i 1

n 1

X
i 1

Standard Deviation
Example

The daily percentage of defective items in two weeks

of production (10 working days) were calculated for
two production lines?
Which line provides good items more consistently?
Line 1: 8.3, 6.2, 20.9, 2.7, 33.6, 42.9, 24.4, 5.2, 3.1, 30.05
Line 2: 12.1, 2.8, 6.4, 12.2, 27.8, 25.3, 18.2, 10.7, 1.3, 11.4

Standard Deviation
Solution:
Line 1:

Standard Deviation
Solution:
Line 2:

Standard Deviation
Line 1 should be considered less consistent
because the standard deviation of its defective
proportion is larger (i.e. therefore the standard
deviation of the good item proportion is also
larger).

Interpreting the Standard Deviation

The standard deviation can be used to
compare the variability of several distributions
make a statement about the general shape of a

distribution.

When describing the shape of a distribution we

refer to

A distribution with any shape

A mound shaped distribution

Standard Deviation
From a Frequency Distribution

(continued)

Approximating the Standard Deviation

Used when the raw data are not available and the

only source of data is a frequency distribution

m
j 1

X fj
2

n 1
n sample size
c number of classes in the frequency distribution
m j midpoint of the jth class
f j frequencies of the jth class

Comparing Standard
Deviations
Data A

11 12

Mean = 15.5
s = 3.338

20 21

Data B
Mean = 15.5

11 12

20 21

s = .9258

Data C
Mean = 15.5

11 12
Chap 3-33

20 21

s = 4.57

2004 Prentice-Hall, Inc.

Coefficient of Variation
Measure of Relative Variation
Always in Percentage (%)
Shows Variation Relative to the Mean
Used to Compare Two or More Sets of Data

Measured in Different Units

S
CV 100%
X

Sensitive to Outliers

Comparing Coefficient
of Variation
Stock A:
Average price last year = $50
Standard deviation = $2

Stock B:
Average price last year = $100
Standard deviation = $5

Coefficient of Variation:
Stock A:
Stock B:

$2
S

CV 100%
100% 4%
X
$50

$5
S

CV 100%
100% 5%
X
$100

Shape of a Distribution
Describe How Data are Distributed
Measures of Shape
Symmetric or skewed

Left-Skewed

Symmetric

Mean < Median < Mode Mean = Median =Mode

Right-Skewed
Mode < Median < Mean

Exploratory Data Analysis

Box-and-Whisker Plot

Graphical display of data using 5-number summary

X smallest Q
1

Median( Q2)

Xlargest

Distribution Shape &

Box-and-Whisker Plot
Left-Skewed

Q2 Q3

Symmetric

Q1Q2Q3

Right-Skewed

Q1 Q2 Q3

The Empirical Rule

For Data Sets That Are Approximately Bell-

shaped:
Roughly 68% of the Observations Fall Within 1

Standard Deviation Around the Mean

Roughly 95% of the Observations Fall Within 2
Standard Deviations Around the Mean
Roughly 99.7% of the Observations Fall Within 3
Standard Deviations Around the Mean

Numericals On Samping Distribution
0% (1)
Numericals On Samping Distribution
3 pages
Z Score
No ratings yet
Z Score
12 pages
Homework 4
No ratings yet
Homework 4
4 pages
9B BMGT 220 THEORY of ESTIMATION 2
No ratings yet
9B BMGT 220 THEORY of ESTIMATION 2
4 pages
SPSS ANNOTATED OUTPUT Discriminant Analysis 1
No ratings yet
SPSS ANNOTATED OUTPUT Discriminant Analysis 1
14 pages
2 Ukuran Numerik Dan Deskriptif
No ratings yet
2 Ukuran Numerik Dan Deskriptif
31 pages
Cheat Sheet
No ratings yet
Cheat Sheet
4 pages
Revision Data Description
No ratings yet
Revision Data Description
1 page
Unit-11 IGNOU STATISTICS
No ratings yet
Unit-11 IGNOU STATISTICS
23 pages
Chapter2 Sampling Simple Random Sampling
No ratings yet
Chapter2 Sampling Simple Random Sampling
24 pages
Bus. Statt. Chapter-Lecture 2+3
No ratings yet
Bus. Statt. Chapter-Lecture 2+3
43 pages
Dr. K. M. Salah Uddin Associate Professor Dept. of MIS, DU
No ratings yet
Dr. K. M. Salah Uddin Associate Professor Dept. of MIS, DU
41 pages
Wanous 1997
No ratings yet
Wanous 1997
6 pages
Empirical Rule
No ratings yet
Empirical Rule
25 pages
Simulex PDF
No ratings yet
Simulex PDF
43 pages
Chapter 7:sampling and Sampling Distributions
No ratings yet
Chapter 7:sampling and Sampling Distributions
50 pages
UCCM2233 - Chp3 Num Descriptive Measures-Wble
No ratings yet
UCCM2233 - Chp3 Num Descriptive Measures-Wble
103 pages
A Study To Evaluate The Effectiveness of Planned Teaching Programme On Knowledge Regarding Prevention of Home Accidents Among Mothers of Under-Five Children in Community Area Bagalkot
No ratings yet
A Study To Evaluate The Effectiveness of Planned Teaching Programme On Knowledge Regarding Prevention of Home Accidents Among Mothers of Under-Five Children in Community Area Bagalkot
3 pages
STATISTICS
No ratings yet
STATISTICS
12 pages
Sequence and Series One Shot Bounceback PDF
100% (1)
Sequence and Series One Shot Bounceback PDF
131 pages
BBA LLB 121ques Bank Unit 1 - 124047
No ratings yet
BBA LLB 121ques Bank Unit 1 - 124047
13 pages
Non Parametric Tests - A
No ratings yet
Non Parametric Tests - A
13 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
16 pages
PPT1
No ratings yet
PPT1
23 pages
Generating Good Pseudo-Random Numbers: Computational Statistics & Data Analysis December 2006
No ratings yet
Generating Good Pseudo-Random Numbers: Computational Statistics & Data Analysis December 2006
10 pages
Ken Black QA 5th Chapter 11 Solution
No ratings yet
Ken Black QA 5th Chapter 11 Solution
30 pages
Absence From School Related To Cancer
No ratings yet
Absence From School Related To Cancer
7 pages
Study On Automatic Separation of Coal and Gangue Based On Image Processing Techniques
No ratings yet
Study On Automatic Separation of Coal and Gangue Based On Image Processing Techniques
5 pages
Statistics For Business and Economics: Describing Data: Numerical
No ratings yet
Statistics For Business and Economics: Describing Data: Numerical
55 pages
Course MTH-161 Introduction To Statistics Instructor Credit Hours
No ratings yet
Course MTH-161 Introduction To Statistics Instructor Credit Hours
7 pages
1957 - Antonowitz - An Analysis of The Java Ratio
No ratings yet
1957 - Antonowitz - An Analysis of The Java Ratio
3 pages
OSMEÑA COLLEGES - Docx Syllabus For Fundamentals of Stat.
No ratings yet
OSMEÑA COLLEGES - Docx Syllabus For Fundamentals of Stat.
10 pages
Mahatma Gandhi University, Kottayam Mgu Bba (Honours) : Answer Any 5 Questions Carrying 2 Marks Each
No ratings yet
Mahatma Gandhi University, Kottayam Mgu Bba (Honours) : Answer Any 5 Questions Carrying 2 Marks Each
13 pages
Week 3 Chapter 3 Numerical Decriptive Measures
No ratings yet
Week 3 Chapter 3 Numerical Decriptive Measures
57 pages
Chapter 6 Processing and Analysis of Data
No ratings yet
Chapter 6 Processing and Analysis of Data
30 pages
Regression Models Course Project
100% (1)
Regression Models Course Project
4 pages
MCQ Time Series With Correct Answers
No ratings yet
MCQ Time Series With Correct Answers
6 pages
Mathematics: Answer Key
No ratings yet
Mathematics: Answer Key
5 pages
Ken Black QA ch18
No ratings yet
Ken Black QA ch18
54 pages
Basic Statistics Note.1
No ratings yet
Basic Statistics Note.1
47 pages
TEMPLATE School Data Analysis Report 1st Periodic Test
No ratings yet
TEMPLATE School Data Analysis Report 1st Periodic Test
3 pages
Final Project - Regression Models
100% (1)
Final Project - Regression Models
35 pages
Assignment Week 1 (Jio Q1)
No ratings yet
Assignment Week 1 (Jio Q1)
3 pages
Quant
No ratings yet
Quant
7 pages
CH 16
100% (1)
CH 16
54 pages
One Sample Tests of Hypothesis: ©the Mcgraw Hill Companies, Inc. 2008 Mcgraw Hill/Irwin
100% (1)
One Sample Tests of Hypothesis: ©the Mcgraw Hill Companies, Inc. 2008 Mcgraw Hill/Irwin
45 pages
Groebner ch05
No ratings yet
Groebner ch05
69 pages
Ruck Man
No ratings yet
Ruck Man
180 pages
Wholesale Custumer
100% (1)
Wholesale Custumer
32 pages
Statistical Methods For Decision Making
100% (1)
Statistical Methods For Decision Making
15 pages
Bivariate Analysis: Measures of Association
100% (1)
Bivariate Analysis: Measures of Association
38 pages
Duration - and - Convexity
No ratings yet
Duration - and - Convexity
22 pages
Unit3 160420200647 PDF
No ratings yet
Unit3 160420200647 PDF
146 pages
Intro To SEM - Day 3 - Nov2012
No ratings yet
Intro To SEM - Day 3 - Nov2012
50 pages
STASTIC
No ratings yet
STASTIC
12 pages
Unit 7 Single Sampling Plans: Structure
No ratings yet
Unit 7 Single Sampling Plans: Structure
30 pages
Business Statistics: Methods For Describing Sets of Data
No ratings yet
Business Statistics: Methods For Describing Sets of Data
103 pages
CS2610 Final Exam: If Is - Nan Print
No ratings yet
CS2610 Final Exam: If Is - Nan Print
5 pages
Assignment
No ratings yet
Assignment
9 pages
Statistics For Business and Economics: Bab 6
No ratings yet
Statistics For Business and Economics: Bab 6
26 pages
Linear Regression
No ratings yet
Linear Regression
71 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
18 pages
Chapter 1: Descriptive Statistics: 1.1 Some Terms
No ratings yet
Chapter 1: Descriptive Statistics: 1.1 Some Terms
15 pages
Introduction To Rstudio: Creating Vectors
No ratings yet
Introduction To Rstudio: Creating Vectors
11 pages
Analysis of Complex Sample Survey Data: Multinomial and Ordinal Logistic Regression For Complex Samples
No ratings yet
Analysis of Complex Sample Survey Data: Multinomial and Ordinal Logistic Regression For Complex Samples
39 pages
Pengantar Ekonometrika Terapan
No ratings yet
Pengantar Ekonometrika Terapan
23 pages
15 Linear Regression in Geography
No ratings yet
15 Linear Regression in Geography
24 pages
Probability and Statistics Sheet: A A S, T & M T
No ratings yet
Probability and Statistics Sheet: A A S, T & M T
57 pages
R Examples
No ratings yet
R Examples
56 pages
Ch07 - Dummy Variables - Ver1
No ratings yet
Ch07 - Dummy Variables - Ver1
29 pages
Linear Regression
No ratings yet
Linear Regression
28 pages
QM Statistic Notes
No ratings yet
QM Statistic Notes
24 pages
A Short Course in Multivariate Statistical Methods With R
No ratings yet
A Short Course in Multivariate Statistical Methods With R
11 pages
Groebner Business Statistics 7 Ch07
No ratings yet
Groebner Business Statistics 7 Ch07
34 pages
2.1 Descriptive Statistics Contd
No ratings yet
2.1 Descriptive Statistics Contd
20 pages
Normal Distribution
No ratings yet
Normal Distribution
16 pages
Session3 and 4 - RKS - PredictiveAnalytics
No ratings yet
Session3 and 4 - RKS - PredictiveAnalytics
46 pages
Statistics For Business and Economics: Describing Data: Numerical
No ratings yet
Statistics For Business and Economics: Describing Data: Numerical
40 pages
12.simple Regression NLS Edit
No ratings yet
12.simple Regression NLS Edit
62 pages
Tutorial 03 - S2 - 2017 - Solutions For Business Statistics
No ratings yet
Tutorial 03 - S2 - 2017 - Solutions For Business Statistics
15 pages
Slides Prepared by John S. Loucks St. Edward's University
100% (1)
Slides Prepared by John S. Loucks St. Edward's University
44 pages
Chap 2
No ratings yet
Chap 2
41 pages
Regression Analysis
No ratings yet
Regression Analysis
7 pages
EC2303 Final Formula Sheet PDF
No ratings yet
EC2303 Final Formula Sheet PDF
8 pages
Chapter 3. Describing Data-Numerical Measures
No ratings yet
Chapter 3. Describing Data-Numerical Measures
47 pages
Method Chooser Basic Statistical Tests
100% (1)
Method Chooser Basic Statistical Tests
36 pages
The Chicago Guide To Writing About Numbers 1st Edition Jane E. Miller Download
100% (1)
The Chicago Guide To Writing About Numbers 1st Edition Jane E. Miller Download
47 pages
R - Tutorial: Matrices Are Vectors
No ratings yet
R - Tutorial: Matrices Are Vectors
13 pages
Reubs High School: Statistics Project
No ratings yet
Reubs High School: Statistics Project
13 pages

Numerical Descriptive Measures 1

Uploaded by

Numerical Descriptive Measures 1

Uploaded by

Numerical Descriptive

Mean, Median, Mode, Geometric Mean

Range, Interquartile Range, Variance and Standard Deviation,

A central measure for this portfolios ARR for is 15%.

Considering the average ARR only the two

portfolios are equal. But are they really?

2004 Prentice-Hall, Inc.

Measures of Central Tendency

Mean (Arithmetic Mean)

Mean (Arithmetic Mean)

In an Ordered Array, the Median is the Middle

Variable Over Time

Geometric Mean Rate of Return

R1 0.5 (or 50%)

Average rate of return:

1 11/ 2 1 0 (or 0%)

Q3 are Measures of Non-central Location

Q2 = Median, a Measure of Central Tendency

story about the distribution.

How much are the values of a given set

Range X Largest X Smallest

2004 Prentice-Hall, Inc.

Difference between the First and Third Quartiles

Data in Ordered Array: 11 12 13 16 16 17 17 18 21

Interquartile Range Q3 Q1 17.5 12.5 5

The daily percentage of defective items in two weeks

Interpreting the Standard Deviation

When describing the shape of a distribution we

A distribution with any shape

Approximating the Standard Deviation

only source of data is a frequency distribution

2004 Prentice-Hall, Inc.

Measured in Different Units

Mean < Median < Mode Mean = Median =Mode

Exploratory Data Analysis

Graphical display of data using 5-number summary

Distribution Shape &

The Empirical Rule

Standard Deviation Around the Mean

You might also like