4.1 Descriptive Stat - Part 1
Outline
List of symbols used
Overview
Descriptive Statistics
• Describes the important characteristics of a set of
data.
• Organizes, presents, and summarizes data:
1. Graphically
2. Numerically
Important Characteristics of Quantitative Data
“Shape, Center, and Spread”
• Center: A representative or average value that
indicates where the middle of the data set is located.
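As a rough illustration of "center", the sketch below computes two common representative values, the mean and the median, with Python's standard `statistics` module. The data list is hypothetical and chosen only to keep the arithmetic easy to follow.

```python
import statistics

# Hypothetical data set (illustrative only, not from the slides).
data = [2, 3, 4, 5, 5, 5, 6, 7, 8]

# Two common ways to describe where the middle of the data lies.
mean = statistics.mean(data)      # arithmetic average: sum / count
median = statistics.median(data)  # middle value of the sorted data

print("mean:", mean)
print("median:", median)
```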
Symmetric
• Data is symmetric if the left half of its histogram is
roughly a mirror image of its right half.
Skewed
• Data is skewed if it is not symmetric and if it
extends more to one side than the other.
Uniform
• Data is uniform if it is equally distributed (on a
histogram, all the bars are the same height or
approximately the same height).
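The shape definitions above can be sketched as simple checks on a list of histogram bar heights. This is only an illustration: the function names, the tolerance `tol`, and the example bar heights are assumptions made for this sketch, and what counts as "roughly" symmetric or uniform is a judgment call in practice.

```python
def is_roughly_symmetric(freqs, tol=1):
    """True if each bar is within `tol` of its mirror-image bar."""
    return all(abs(a - b) <= tol for a, b in zip(freqs, reversed(freqs)))


def is_roughly_uniform(freqs, tol=1):
    """True if all bars are within `tol` of one another in height."""
    return max(freqs) - min(freqs) <= tol


# Hypothetical bar heights, listed left to right along the horizontal axis.
symmetric = [2, 5, 9, 5, 2]
uniform = [6, 6, 7, 6, 6]
skewed = [9, 6, 3, 2, 1]  # extends further to the right than to the left

for name, bars in [("symmetric", symmetric), ("uniform", uniform), ("skewed", skewed)]:
    print(name, is_roughly_symmetric(bars), is_roughly_uniform(bars))
```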
The Shape of Distributions
[Figure panels: Symmetric, Uniform]
Outliers
• Data values that are unusual compared with the rest of the set.
They may be distinguished by gaps in a histogram.
Section 2.1: Frequency Distributions and Their Graphs
Frequency Distributions
Frequency Distribution
• A table that organizes data values into classes or
intervals along with the number of values that fall in
each class (frequency, f ).
1. Ungrouped Frequency Distribution – for data
sets with few different values. Each value is in
its own class.
[Figure panels: Ungrouped, Grouped]
Data values:
6 5 4 5 5
6 2 3 5 5
5 5 7 4 3
4 5 4 5 6
5 1 6 2 6
6 6 6 6 4
4 5 4 5 3
5 5 7 6 5
Value   Frequency, f
2       2
3       5
4       9
5       18
6       12
7       3
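An ungrouped frequency distribution can be built by counting how often each distinct value occurs. The sketch below uses `collections.Counter` from the Python standard library. The data list is hypothetical and deliberately small; it is not the data set from the slide.

```python
from collections import Counter

# Hypothetical data values (illustrative only).
data = [3, 5, 4, 5, 6, 5, 4, 7, 5, 6]

# Each distinct value is its own class; the count is its frequency f.
freq = Counter(data)

print("Value  Frequency, f")
for value in sorted(freq):
    print(f"{value:<6} {freq[value]}")
```

The frequencies always add up to the number of data values, which is a quick check that no value was missed.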
Graphs of Frequency Distributions:
Frequency Histograms
Frequency Histogram
• A bar graph that represents the frequency distribution.
• The horizontal scale is quantitative and measures the
data values.
• The vertical scale measures the frequencies of the
classes.
• Consecutive bars must touch.
Number of Peas   Frequency, f
2                2
3                5
4                9
5                18
6                12
7                3
[Figure: frequency histogram; horizontal axis: Number of Peas, vertical axis: Frequency, f]
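As a rough text rendering of a frequency histogram, the sketch below prints one bar per class using the frequencies listed for the number-of-peas example. It is only an approximation of the real graph: the bars run horizontally instead of vertically, and the function name `text_histogram` is an assumption made for this sketch.

```python
# Frequencies from the number-of-peas example (value -> frequency f).
freq = {2: 2, 3: 5, 4: 9, 5: 18, 6: 12, 7: 3}


def text_histogram(freq):
    """Return one line per class: the value, a bar of '#', and the count."""
    lines = []
    for value in sorted(freq):
        lines.append(f"{value} | {'#' * freq[value]} ({freq[value]})")
    return lines


for line in text_histogram(freq):
    print(line)
```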
Relative Frequency Distributions and
Relative Frequency Histograms
Relative Frequency Distribution
• Shows the portion or percentage of the data that falls
in a particular class.
• Relative frequency = $\dfrac{\text{class frequency}}{\text{sample size}} = \dfrac{f}{n}$
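The relative frequency formula can be applied class by class. The sketch below reuses the frequencies listed for the number-of-peas example and assumes that those listed classes make up the whole sample, so that n is simply their sum.

```python
# Frequencies from the number-of-peas example (value -> frequency f).
freq = {2: 2, 3: 5, 4: 9, 5: 18, 6: 12, 7: 3}

# Assumption: the listed classes are the entire sample.
n = sum(freq.values())

# Relative frequency of each class is f / n.
rel_freq = {value: f / n for value, f in freq.items()}

for value in sorted(rel_freq):
    print(f"{value}: {rel_freq[value]:.3f}")
```

Relative frequencies always sum to 1 (or 100%), which is a useful check on the arithmetic.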
Labeling Grouped Frequency
Distributions
• Class midpoint: the value halfway between the lower class
limit (LCL) and the upper class limit (UCL):
$\text{Midpoint} = \dfrac{\text{LCL} + \text{UCL}}{2}$
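A short sketch of the midpoint calculation, halfway between the lower and upper class limits. The class limits below are hypothetical and only serve to show the arithmetic.

```python
def class_midpoint(lower, upper):
    """Value halfway between the lower and upper class limits."""
    return (lower + upper) / 2


# Hypothetical classes given as (lower class limit, upper class limit).
classes = [(1, 5), (6, 10), (11, 15)]

midpoints = [class_midpoint(lo, hi) for lo, hi in classes]

for (lo, hi), mid in zip(classes, midpoints):
    print(f"{lo}-{hi}: midpoint {mid}")
```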
Other Graphs
Dot plot
• A graph in which each data value is plotted as a
point (dot) along a scale of values.
Figure 2-5
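A dot plot can be approximated in plain text by stacking one mark per data value above a number line. The sketch below is a minimal version that assumes small, single-digit integer data so that the columns line up; the function name `dot_plot` and the data are hypothetical.

```python
from collections import Counter


def dot_plot(data):
    """Return text rows: stacked '*' marks above a scale of values."""
    counts = Counter(data)
    scale = range(min(data), max(data) + 1)
    rows = []
    # Build from the tallest stack down to the bottom row.
    for level in range(max(counts.values()), 0, -1):
        rows.append(" ".join("*" if counts[v] >= level else " " for v in scale))
    # The last row is the scale of values itself.
    rows.append(" ".join(str(v) for v in scale))
    return rows


# Hypothetical single-digit data (illustrative only).
data = [1, 2, 2, 3, 3, 3, 5]

for row in dot_plot(data):
    print(row)
```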
Time Series
(Paired data)
Time Series
• A data set composed of quantitative entries taken at
regular intervals over a period of time.
e.g., the amount of precipitation measured each
day for one month.
• Use a time series chart to graph the data.
[Figure: time series chart; vertical axis: quantitative data, horizontal axis: time]
Larson/Farber 4th ed.
Time-Series Graph
Figure 2-8
Ex. www.eia.doe.gov/oil_gas/petroleum/
Graphing Qualitative Data Sets
Pie Chart
• A circle is divided into sectors
that represent categories.
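Each sector of a pie chart takes a share of the full 360 degrees equal to its category's relative frequency. The sketch below computes those central angles; the category names and counts are hypothetical.

```python
# Hypothetical category counts (illustrative only).
counts = {"A": 10, "B": 20, "C": 10}

n = sum(counts.values())

# Central angle of each sector = relative frequency * 360 degrees.
angles = {category: 360 * f / n for category, f in counts.items()}

for category, angle in angles.items():
    print(f"{category}: {angle} degrees")
```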
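Pareto Chart
• A vertical bar graph in which the height of each bar
represents frequency or relative frequency. The bars are
arranged in order of decreasing height, with the tallest
bar at the left.
[Figure: Pareto chart; vertical axis: Frequency, horizontal axis: Categories]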
Figure 2-6
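In a Pareto chart the bars are conventionally drawn from tallest to shortest, left to right. The sketch below shows only that ordering step, sorting hypothetical category counts by decreasing frequency before they would be plotted.

```python
# Hypothetical category counts (illustrative only).
counts = {"A": 12, "B": 30, "C": 7, "D": 21}

# Sort categories by frequency, largest first, to get the bar order.
ordered = sorted(counts.items(), key=lambda item: item[1], reverse=True)

for category, f in ordered:
    print(f"{category:<3} {'#' * f} ({f})")
```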
THANK YOU.