Classification of Data (1)
Classification of Data (1)
DEFINITION
Time series are usually listed in chronological order, normally starting with
the earliest period. When emphasis falls on the most recent events, a
reverse time order may be used.
Qualitative Classification
In qualitative classification data are classified on the basis of some attribute
or quality such as sex, color of hair, literacy, religion, etc. The attribute under
study cannot be measured; one can only find out whether it is present or
absent in the units of the population under study. For example population
under study may be divided into two categories as follows:
Population
Urban Rural
In a similar manner, we may classify population on the basis of sex, i.e.
into males and females, or literacy, i.e., into literate and illiterate, and
so on.
Population
Males Females
A discrete variable is that which can vary only by finite jumps and
cannot manifest every conceivable fractional value. For instance, the
number of rooms in a house can only take values such as 1, 2, 3, etc.
Discrete data are obtained by counting.
Series which can be described by a continuous variable are called continuous
series. Series represented by a discrete variable are called discrete series.
DISCRETE
CONTINUOUS
1 0 2 3 4 5 6
7 2 3 4 0 2 5
8 4 5 12 6 3 2
7 6 5 3 3 7 8
9 7 9 4 5 4 3
Represent the data in the form as of a discrete frequency distribution.
Formation of a continuous frequency
distribution
The following technical terms are important when a continuous frequency
distribution is formed or data are classified according to class-interval:
Class limits: the class limits are the lowest and the highest values that can be
included in the class. For example, take the class20 − 40. The two boundaries of
class are known as the lower limit and the upper limit of the class. The lower limit
of a class is the value below which there can be no item in the class. The upper
limit of a class is the value above which no item can belong to that class.
Class intervals: the difference between the upper and lower limit of a class is
known as class interval. For example, in the class 200-300, the class interval is 100
(i.e., 300 minus 200).
Class frequency: The number of observations corresponding to a particular class
is known as the frequency of that class or the frequency.
Class mid-point or class mark: it is the value lying half-way between the
lower and upper class limits of a class-interval.
𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑜𝑜𝑜𝑜 𝑡𝑡𝑡𝑡𝑡 𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐+𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑜𝑜𝑜𝑜 𝑡𝑡𝑡𝑡𝑡 𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐
𝑀𝑀𝑀𝑀𝑀𝑀 − 𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝 𝑜𝑜𝑜𝑜 𝑎𝑎 𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐 =
2
Methods of classifying data according
to class-intervals
Exclusive method: When the class intervals are so fixed that the upper limit
of one class is the lower limit of the next class it is called the exclusive
method of classification.
Income (Tsh) No. of persons
1000-1100 50
1100-1200 100
1200-1300 200
1300-1400 150
1400-1500 40
1500-1600 10
Total 550
Inclusive method: The upper limit of one class is included in that class itself.
Prepare a frequency table for the following data with width of each class
interval as 10. Use exclusive method of classification:
57 44 80 75 00 18 45 14 04 64
72 51 69 34 22 83 70 20 57 28
96 56 50 47 10 34 61 66 80 46
22 10 84 50 47 73 42 33 48 65
10 34 66 53 75 90 58 46 39 69
Solution:
Marks Tallies Frequency
0 − 10 || 2
10 − 20 |||| 5
20 − 30 |||| 4
30 − 40 |||| 5
40 − 50 |||| ||| 8
50 − 60 |||| ||| 8
60 − 70 |||| || 7
70 − 80 |||| 5
80 − 90 |||| 4
90 − 100 || 2
𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻 𝟓𝟓𝟓𝟓
Relative Frequency Distribution
Weight
The simplest and most revealing devices for summarizing data and
presenting them in a meaningful fashion is the statistical table.
A table is a systematic arrangement of statistical data in columns and rows.
Rows are horizontal arrangements whereas columns are vertical ones.
The purpose of a table is to simplify the presentation and to facilitate
comparisons.
Role of tabulation
It simplifies complex data: When data are tabulated all unnecessary
details and repetitions are avoided. Data are presented systematically in
columns and rows. Hence, the reader gets a very clear idea of what the
table presents. Also, a large amount of space is saved because of non-
duplicating of his headings and designations: the description at the top of
a column serves for all the terms beneath it.
It facilitates comparison: Since a table is divided into various parts and for
each part there are total and sub-totals, the relationship between different
parts of data can be studied much more easily with the help of a table
than without it.
It gives identity to the data: When the data are arranged in a table with a
title and number they can be distinctly identified and can be used as a
source reference in the interpretation of a problem.
It reveal patterns: Tabulation reveal patterns within the figures which
cannot be seen in the narrative form. It also facilitates the summation of the
figures if the reader desires to check the total.
Type of tables
1. Simple and Complex Tables
The distinction between simple and complex tables is based upon the
number of characteristics studied.
In a simple table only one characteristic is shown. Hence, this type of table
is also known as one-way table.
In a complex table, two or more characteristics are shown. Such tables are
more popular in practice because they enable full information to be
incorporated and facilitate a proper consideration of all related facts.
When two characteristics are shown such a table is known as two-way or
double tabulation. It is formed when either the row or the column is divided
into two coordinate parts.
When three or more characteristic are represented in the same table, such
a table is called higher order table. The need for such a table arises when
we are interested in presenting a number of characteristics simultaneously.
While constructing such a table it is necessary to first establish an order of
precedence among the attributes or characteristics sought to be classified
having regard to their relative importance.
Age (in yers) No. of Employees
𝐵𝐵𝐵𝐵𝐵𝐵𝐵𝐵𝐵𝐵 25 ……..
25 − 35 …… Simple Table
35 − 45 ……..
45 − 55 ……
𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 55 ……..
𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻 … …
Employees 𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻
Age (in years) Males Females
Two-way Table
𝐵𝐵𝐵𝐵𝐵𝐵𝐵𝐵𝐵𝐵 25 …… …… ……
25 − 35 …… …… ……
35 − 45 …… …… ……
45 − 55 …… …… ……
𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 55 …… …… ……
𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻𝑻 …… …… ……
2. General and Special Purpose Tables