Branches of Statistics, Data Types, and Graphs
Branches of Statistics, Data Types, and Graphs
STATISTICS DEFINED
Statistics is the branch of mathematics about data collection (the aspect dealing with obtaining numerical
measurements), data tabulation/presentation (the aspect dealing with organizing data into tables, graphs, or
charts), data analysis (the aspect dealing with extracting relevant information from the given data), and data
interpretation (the aspect dealing with drawing conclusions from the analyzed data). (Pagoso et al., 1992).
Based on this definition, data is the major component of statistics. Hence, statistics is succinctly referred to as
the science of data.
BRANCHES OF STATISTICS
Statistics is divided into two: descriptive statistics and inferential statistics. Procedures focused on
collecting and describing a set of data to obtain relevant information are concerns of descriptive statistics.
These procedures apply only to the group (whether sample or population) from which the data has been
collected. On the other hand, procedures concerned with the analysis of the data from the sample in order to
make predictions or inferences about the population are the concerns of inferential statistics. These procedures
may be about making generalizations from samples to populations, doing estimations and hypothesis tests,
finding relationships among variables, and making predictions. It is important to note that population as used in
statistics refers to the totality of the group under study while sample is just a subset of this group.
Example 1: Describing the enrollment in a university in terms of the percentages per level (freshman,
sophomore, junior, and senior) is a concern of descriptive statistics.
Example 2: Testing the hypothesis that male and female students significantly differ in performance in
mathematics test is a concern of inferential statistics.
Data are referred to as pieces/bits of information that function as the basic component of any
statistical investigation. These are obtained whenever measurements are done or observations are recorded.
TYPES OF DATA
Examples of quantitative data: number of siblings, speed of a car, blood pressure reading, etc.
Examples of qualitative data: eye color, year level, socioeconomic status, etc.
Data may also be classified according to the different measurement characteristics. Numbers have
the following functions: to classify and to compare values either by ranking, getting differences, or forming
quotients. Nominal data are data where numbers can be assigned to categories but they cannot be ranked,
and no mathematical computation can be done. Ordinal data are data where numbers can be assigned to
categories and these numbers can now be compared by ranking. Interval data are data where the numbers
can be subtracted and these differences can now be compared. Ratio data are data obtained from
measurements with a unique origin.
Examples:
Gender is nominal because if the number 1 is assigned to male and 2 to female, you cannot compare
the numbers. You cannot say that 1 < 2. You cannot perform subtraction either. You cannot say that 2 – 1 = 1.
Grade levels in basic education are ordinal. You can now compare the grade levels. The statement 1
< 2 is now true. It simply means that grade level 1 is lower than grade level 2. But just like in nominal data,
mathematical computations are not possible. There is no meaning in the statement “5 – 3 = 2”.
Temperature readings are intervals. There is no unique origin for temperature reading in the Celsius
scale. But you can now compare differences. If, on Monday, the highest temperature is 36oC and the lowest is
30oC, the difference in temperature is 6oC. If, on Tuesday, the highest temperature is 37oC and the lowest is
29oC, the difference in temperature is 8oC. These differences can now be compared. The difference in
temperature on Monday is lower than the difference in temperature on Tuesday.
Height of a person is a ratio of data. There is a unique origin in the instrument being used for
measurement. Comparing by ranking, forming differences, and forming quotients are now possible.
TYPES OF GRAPHS
Bar – a graph made of bars, with heights representing the frequencies (or
percentages) of respective categories
Example: The graph shows the frequency of students from different levels in a
university with 1300 students.
Example: The graph shows the percentages of students from different levels in
a community college with 650 students.
Line – a graph which shows the relationship between two or more set of
quantities using lines
Example of a line graph: The graph below shows the enrolment for schools A
and B from 2017 to 2020.
The definitions and interpretations of the different measures of position are given in the boxes below.
PROCEDURE IN COMPUTING THE FRACTILES
Different authors suggest different ways of computing the fractiles. The procedure adapted here is the one used
by Bluman (2013) and Freund & Simon (1997).
○ If pn is not an integer, use the next higher integer for the pth fractile position.
○ If pn is an integer, use the mean of the values in positions pn and (pn + 1) as the pth fractile.
Example:
20 16 18 30 10 12 18 13 25 28
ARRANGE: 10 12 13 16 18 18 20 25 28 30
MEASURES OF VARIABILITY
1. Range . This is a measure of spread obtained by subtracting the smallest value from the largest value in a
data set.
Example: The data below are the scores of 8 students in a Statistics examination.
43 46 41 39 36 48 41 28
DEFINITIONS
An experiment refers to some situation of
interest whose outcome is determined by
chance.
A sample space S is a set of all possible
outcomes of an experiment. Each element
in a sample space is called an outcome or
sample point.
An event is any subset of a sample space.
Example 1:
A bowl has 4 orange marbles, 5 green marbles, and 6 yellow marbles. One marble is then picked at random.
Determine the probability that it is green.
Solution:
Let S = the experiment of picking a marble from the bowl
n(S) = 15
E = the event of selecting a green marble
n(E) = 5