Statistics and Probability Theory: Fasih Ur Rehman

This document discusses various methods for representing and describing the center and spread of data, including histograms, measures of central tendency (median), variability (range), quartiles, and box-and-whisker plots. Specifically, it provides examples to illustrate histograms, finding the median and range of data, calculating quartiles and the interquartile range, and constructing box-and-whisker plots from a data set. Examples of stem-and-leaf plots and comparing histograms to bar charts are also presented.


Statistics and Probability Theory
Lecture 02
Fasih ur Rehman
Last Class

• Introduction to the course
• What statistics and statistical methods are
• Data and its representation
Today’s Agenda

• Continue with data representation
– Histogram
• Center and Spread of Data
• Quartiles
• Box and Whisker Plot
Data Representation (Example)
• 89 84 87 81 89 86 91 90 78 89 87 99 83 89
• Sort this data
• 78 81 83 84 86 87 87 89 89 89 89 90 91 99
• Group this data
– Make 5 groups

Group      No. of Elements
75 - 79    1
80 - 84    3
85 - 89    7
90 - 94    2
95 - 99    1

• Stem and leaf plot will be
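A minimal sketch of how such a plot can be built from the data above. It assumes that each stem (tens digit) is split into two rows, leaves 0-4 and leaves 5-9, and that the leftmost column holds the running count of values. The function name stem_and_leaf is illustrative and not part of the lecture.

```python
def stem_and_leaf(values):
    """Return rows of (cumulative count, stem, leaves) using split stems.

    Assumption: each tens digit gets two rows, one for leaves 0-4
    and one for leaves 5-9.
    """
    rows = {}
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        key = (stem, leaf >= 5)  # False = leaves 0-4, True = leaves 5-9
        rows.setdefault(key, []).append(leaf)

    result = []
    cumulative = 0
    for (stem, _upper_half), leaves in sorted(rows.items()):
        cumulative += len(leaves)  # running total of values so far
        result.append((cumulative, stem, "".join(str(d) for d in leaves)))
    return result


data = [89, 84, 87, 81, 89, 86, 91, 90, 78, 89, 87, 99, 83, 89]
for cum, stem, leaves in stem_and_leaf(data):
    print(f"{cum:>2} | {stem} | {leaves}")
# Prints:
#  1 | 7 | 8
#  4 | 8 | 134
# 11 | 8 | 6779999
# 13 | 9 | 01
# 14 | 9 | 9
```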


Data Representation (Example)

• The entries in the leftmost column of the stem-and-leaf plot are called the Cumulative Absolute Frequency (CAF), i.e. the sum of the absolute frequencies of the values up to the line of the leaf.
– For example, 11 shows that there are 11 values in the data not exceeding 89.
• Dividing the CAF by n (the total number of entries in the data) gives the Cumulative Relative Frequency.
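The two cumulative frequencies can be checked with a short sketch. It counts the values not exceeding a threshold and divides by n. The function names are illustrative assumptions, not notation from the lecture.

```python
from fractions import Fraction

data = [89, 84, 87, 81, 89, 86, 91, 90, 78, 89, 87, 99, 83, 89]


def cumulative_absolute_frequency(values, x):
    """Number of values not exceeding x."""
    return sum(1 for v in values if v <= x)


def cumulative_relative_frequency(values, x):
    """Cumulative absolute frequency divided by n, kept as an exact fraction."""
    return Fraction(cumulative_absolute_frequency(values, x), len(values))


print(cumulative_absolute_frequency(data, 89))  # 11
print(cumulative_relative_frequency(data, 89))  # 11/14
```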
Histogram

• The areas of the rectangles are proportional to the relative frequencies.

Group      Abs. Freq   Rel. Freq
75 - 79    1           1/14
80 - 84    3           3/14
85 - 89    7           7/14
90 - 94    2           2/14
95 - 99    1           1/14
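A minimal sketch of how the absolute and relative frequencies behind the histogram can be computed. It assumes five non-overlapping classes of width 5 starting at 75 (so the last class is 95 - 99). The function name frequency_table and its parameters are illustrative. With equal class widths, the rectangle heights are themselves proportional to the relative frequencies.

```python
def frequency_table(values, start, width, groups):
    """Return (low, high, absolute frequency, relative frequency) per class."""
    n = len(values)
    table = []
    for i in range(groups):
        low = start + i * width
        high = low + width - 1  # classes are inclusive and do not overlap
        count = sum(1 for v in values if low <= v <= high)
        table.append((low, high, count, count / n))
    return table


data = [89, 84, 87, 81, 89, 86, 91, 90, 78, 89, 87, 99, 83, 89]
table = frequency_table(data, start=75, width=5, groups=5)
for low, high, count, rel in table:
    print(f"{low} - {high}: {count}  ({count}/{len(data)} = {rel:.3f})")
```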
Histogram vs. Bar Chart
Center and Spread of Data

• As a measure of the center (location) of the data values we can take the median.
• 78 81 83 84 86 87 87 89 89 89 89 90 91 99
• 87 and 89 are the middle values (7th and 8th), so the median is (87 + 89)/2 = 88.
• The median need not be one of the data values.
• The spread of the data can be measured by the range, the difference between the largest and smallest values.
• Spread is also called variability.
– In this case the spread is 99 – 78 = 21.
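These two numbers can be reproduced with Python's standard library. This is only a quick check of the arithmetic, using statistics.median for the center and max minus min for the range.

```python
import statistics

data = [78, 81, 83, 84, 86, 87, 87, 89, 89, 89, 89, 90, 91, 99]

center = statistics.median(data)   # mean of the 7th and 8th values
spread = max(data) - min(data)     # range of the data

print(center)  # 88.0
print(spread)  # 21
```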
Example1

• 3, 13, 7, 5, 21, 23, 39, 23, 40, 23, 14, 12, 56, 23, 29
• Putting the data in order
• 3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 39, 40, 56
• n = 15 is odd, so the median is the middle (8th) value, 23
• The spread is 56 – 3 = 53
Example1

• 3, 13, 7, 5, 21, 23, 23, 40, 23, 14, 12, 56, 23, 29
• Here we have an even number of elements in the data. Putting this data in order
• 3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 40, 56
• n = 14
• The median is the mean of the two middle values (7th and 8th): (21 + 23)/2 = 22.
• The spread is 56 – 3 = 53
• The median separates the data into two equal halves.
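The odd and even cases from the two examples can be written as one rule. The sketch below is a hand-written version for illustration; the function name median is a local helper, not a reference to any particular library.

```python
def median(values):
    """Middle value for odd n, mean of the two middle values for even n."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # single middle value
    return (ordered[mid - 1] + ordered[mid]) / 2  # mean of the two middle values


odd_data = [3, 13, 7, 5, 21, 23, 39, 23, 40, 23, 14, 12, 56, 23, 29]   # n = 15
even_data = [3, 13, 7, 5, 21, 23, 23, 40, 23, 14, 12, 56, 23, 29]      # n = 14

print(median(odd_data))    # 23
print(median(even_data))   # 22.0
print(max(even_data) - min(even_data))  # 53
```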
Quartiles

• With quartiles, the data is divided into 4 groups in the same manner as the median divides it into 2.
• There are three quartiles in the data, called
– Lower Quartile q_l (median of the lower half of the data)
– Middle Quartile q_m (median of the data)
– Upper Quartile q_u (median of the upper half of the data)
• The Interquartile Range (IQR) can be found by
IQR = q_u - q_l
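A minimal sketch of the "median of each half" rule described above. One detail is an assumption, since the slides do not state it: when n is odd, the middle value is left out of both halves. Other conventions exist and can give slightly different quartiles. The function name quartiles is illustrative.

```python
import statistics


def quartiles(values):
    """Return (lower, middle, upper) quartiles as medians of the halves.

    Assumption: for odd n the middle value belongs to neither half.
    """
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower_half = ordered[:half]
    upper_half = ordered[half:] if n % 2 == 0 else ordered[half + 1:]
    return (
        statistics.median(lower_half),
        statistics.median(ordered),
        statistics.median(upper_half),
    )


data = [78, 81, 83, 84, 86, 87, 87, 89, 89, 89, 89, 90, 91, 99]
q_l, q_m, q_u = quartiles(data)
iqr = q_u - q_l
print(q_l, q_m, q_u, iqr)  # 84 88.0 89 5
```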
Example2

• 78 81 83 84 86 87 87 89 89 89 89 90 91 99
• Lower half of data is
• 78 81 83 84 86 87 87
• Lower Quartile is 84
• Upper half of data is
• 89 89 89 89 90 91 99
• Upper Quartile is 89
• Middle Quartile (same as median) is 88
• IQR = 89 – 84 = 5
Box and Whisker Plot

• Also called Box Plot
• A box plot is constructed from 5 values of the data:
– Minimum value of the data
– Three quartiles (lower, middle, upper)
– Maximum value of the data
Example2

• 78 81 83 84 86 87 87 89 89 89 89 90 91 99
• Lower half of data is
• 78 81 83 84 86 87 87
• Lower Quartile is 84
• Upper half of data is
• 89 89 89 89 90 91 99
• Upper Quartile is 89
• Middle Quartile is 88
• IQR = 89 – 84 = 5
Example2

• 91 89 93 91 87 94 92 85 91 90 96 93 89
• Sorted data
• 85 87 89 89 90 91 91 91 92 93 93 94 96
• The five values required for the box plot (minimum, lower quartile, median, upper quartile, maximum) are, respectively,
• 85, 89, 91, 93 and 96
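The five values can be verified with a short sketch. It assumes the quartiles are the medians of the lower and upper halves, with the middle value left out of both halves when n is odd (for this data set, including it gives the same result). The function name five_number_summary is illustrative.

```python
import statistics


def five_number_summary(values):
    """Return (minimum, lower quartile, median, upper quartile, maximum)."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower_half = ordered[:half]
    upper_half = ordered[half + n % 2:]  # skip the middle value when n is odd
    return (
        ordered[0],
        statistics.median(lower_half),
        statistics.median(ordered),
        statistics.median(upper_half),
        ordered[-1],
    )


data = [91, 89, 93, 91, 87, 94, 92, 85, 91, 90, 96, 93, 89]
summary = five_number_summary(data)
print(summary)  # (85, 89.0, 91, 93.0, 96)
```

The box spans the lower to the upper quartile (89 to 93), the line inside it marks the median (91), and the whiskers extend to the minimum (85) and maximum (96).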
Summary

• Data Representation
– Histogram
• Center and Spread of Data
• Quartiles
• Box and Whisker Plot
