
Data Collection 2023

Statistics is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. Data collection is the process of gathering and measuring information on targeted variables in an established system. This chapter typically discusses identifying math and creating graphs. It covers topics like frequency distribution tables, stem and leaf diagrams, histograms, ogives, frequency polygons, frequency curves, and pie charts.

BBA VISION

Sub: Statistics (In English)


Chapter: Data Collection/Introduction
What is Statistics?

Answer: Statistics is the discipline that concerns the collection, organization, analysis,
interpretation, and presentation of data.

What is Data Collection?

Answer: Data collection is the process of gathering and measuring information
on targeted variables in an established system.

In this Data Collection chapter, we usually have to draw graphs.
The problems and graphs usually required in this chapter are the following:
1. Frequency Distribution Table.
2. Stem and Leaf Diagram
3. Histogram
4. Ogive Curve
5. Frequency Polygon
6. Frequency Curve.
7. Pie Chart
Problem-1:
Marks scored by 30 students are given below:
41 55 48 47 53 48 33 32 42 55 44 38 60 65 71 80 41 53 47 48 55 20 31 34 42 51 35 35 26 25

Convert the marks into a continuous series with a class interval of 10 and calculate the percentage
of each class.
Solution:
Here,
Lowest Value = 20
Highest Value = 80
Class Interval = 10
Each class includes its lower limit and excludes its upper limit, so a mark of 60 falls in 60-70 and a mark of 80 falls in 80-90.
Frequency Distribution Table (Continuous Series)
Class    Tally          Frequency   % of Class
20-30    |||            3           10.00%
30-40    ||||| ||       7           23.33%
40-50    ||||| |||||    10          33.33%
50-60    ||||| |        6           20.00%
60-70    ||             2           6.67%
70-80    |              1           3.33%
80-90    |              1           3.33%
N = 30
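The tallying in Problem-1 can be cross-checked with a short script. This is a minimal sketch, not part of the original notes: the function name `frequency_table` and its parameters `start` and `width` are illustrative, and it assumes exclusive classes (lower limit included, upper limit excluded).

```python
from collections import Counter

marks = [41, 55, 48, 47, 53, 48, 33, 32, 42, 55, 44, 38, 60, 65, 71,
         80, 41, 53, 47, 48, 55, 20, 31, 34, 42, 51, 35, 35, 26, 25]

def frequency_table(values, start, width):
    """Return (lower, upper, frequency, percent) for each exclusive class."""
    # Class index k means the class [start + k*width, start + (k+1)*width).
    counts = Counter((v - start) // width for v in values)
    n = len(values)
    rows = []
    for k in range(max(counts) + 1):
        lower = start + k * width
        f = counts.get(k, 0)
        rows.append((lower, lower + width, f, 100 * f / n))
    return rows

for lower, upper, f, pct in frequency_table(marks, start=20, width=10):
    print(f"{lower}-{upper}: {f:2d}  {pct:.2f}%")
```

Counting by integer division avoids the easy slip of placing a boundary value such as 60 in the wrong class.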
Problem-2:
Marks obtained by 50 students are given below
34 54 10 21 51 52 12 43 48 36 48
22 39 26 34 19 10 17 47 38 13 30
30 60 59 15 7 18 40 49 40 51 55
32 41 22 30 35 53 25 14 18 19 40
43 4 17 45 25 43
Construct a frequency table with class intervals 0-9, 10-19, 20-29, 30-39, and so on.

Solution:
Here,
Highest Value = 60
Lowest Value = 4
Frequency Distribution Table
Class    Tally               Frequency
0-9      ||                  2
10-19    ||||| ||||| ||      12
20-29    ||||| |             6
30-39    ||||| |||||         10
40-49    ||||| ||||| ||      12
50-59    ||||| ||            7
60-69    |                   1
N = 50

Exclusive Class: When the lower limit is included but the upper limit is excluded, the class
interval is an exclusive class interval. For example:
10-20
20-30
30-40
Inclusive Class: When both the lower and the upper class limits are included, the class
interval is an inclusive class interval. For example:
10-19
20-29
30-39
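The difference between the two kinds of class shows up at the boundary values. The sketch below is illustrative only: the helper names `exclusive_class` and `inclusive_class` are not from the notes, and the inclusive version assumes whole-number data.

```python
def exclusive_class(value, start, width):
    """Label of the class [lower, upper): upper limit excluded."""
    lower = start + ((value - start) // width) * width
    return f"{lower}-{lower + width}"

def inclusive_class(value, start, width):
    """Label of the class [lower, upper] for whole-number data, e.g. 10-19."""
    lower = start + ((value - start) // width) * width
    return f"{lower}-{lower + width - 1}"

# Boundary values show where the two methods differ only in labelling.
for v in (10, 19, 20):
    print(v, exclusive_class(v, 10, 10), inclusive_class(v, 10, 10))
```

In both methods a value of 20 goes to the class that starts at 20; only the way the upper limit is written changes.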

Problem-3:
Draw a frequency table by exclusive method:

110 175 161 157 155 108 164 128 114 178
165 133 195 151 71 94 87 42 30 62
130 156 167 124 164 146 116 149 104 141
103 204 162 149 79 113 69 121 93 143
140 144 187 184 197 87 40 122 203 148

Calculation of Class Interval:

Here,
Range = Highest Value – Lowest Value
= 204 – 30
= 174
N = 50

We know,
$$\text{Class Interval} = \frac{\text{Range}}{1 + 3.322 \log N} = \frac{174}{1 + 3.322 \log 50} = \frac{174}{6.6439} = 26.189$$
Let the Class Interval = 25

Frequency Distribution Table

Class      Tally              Frequency
30-55      |||                3
55-80      ||||               4
80-105     ||||| |            6
105-130    ||||| ||||         9
130-155    ||||| ||||| |      11
155-180    ||||| ||||| |      11
180-205    ||||| |            6
N = 50
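The class-interval formula used in Problem-3 is easy to evaluate in code. This is a small sketch under the assumption that "log" means the base-10 logarithm, which is what reproduces the 26.189 in the working; the function name `class_interval` is illustrative.

```python
import math

data = [110, 175, 161, 157, 155, 108, 164, 128, 114, 178,
        165, 133, 195, 151, 71, 94, 87, 42, 30, 62,
        130, 156, 167, 124, 164, 146, 116, 149, 104, 141,
        103, 204, 162, 149, 79, 113, 69, 121, 93, 143,
        140, 144, 187, 184, 197, 87, 40, 122, 203, 148]

def class_interval(values):
    """Range / (1 + 3.322 * log10(N))."""
    n = len(values)
    value_range = max(values) - min(values)
    return value_range / (1 + 3.322 * math.log10(n))

print(round(class_interval(data), 3))
```

The result is then rounded to a convenient figure (25 in this problem) before the classes are written down.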

Table for Class Interval Assumption

Result          Class Interval
0-7.49          5
7.50-12.49      10
12.50-17.49     15
17.50-22.49     20
22.50-27.49     25
27.50-32.49     30

HW:
Problem-4:
Prepare a frequency distribution from the following data:

33 32 47 55 21 50 27 12 68 49 40 17 44 62 24
42 33 38 45 26 44 33 48 52 30 50 37 38 45 48

Problem-5:
Using the following data:

106 121 76 115 74 127 150 82 126
78 94 108 177 69 128 96 119 161
128 84 86 79 130 144 115 123 100
84 120 111

i. Prepare a frequency table in which each class interval is 20 and the mid value of the first class is 70.

ii. Construct a Stem & Leaf Diagram.

Problem-6: (HW)
The following data represent the marks in business math (x) and business statistics (y) of 13
students. Prepare a Bi-Variate frequency table from the following data:
x 32 15 21 27 58 37 47 58 46 50 40 54 62
y 75 48 54 70 92 53 74 85 65 62 73 77 40

Problem-7 (HW)
The data given below relate to the height and weight of 20 persons. You are required to form a two-way frequency
table with class intervals 62 to 64, 64 to 66, and so on.
Weight 170 135 136 137 148 124 117 128 143 129
Height 70 65 65 64 69 63 65 70 71 62
Weight 163 139 122 134 140 132 120 148 129 152
Height 70 67 62 68 67 69 66 68 67 67

Problem-8:
Prepare a two-way table (bivariate frequency distribution) from the following data, taking a suitable class interval.

Age of Boys   Age of Girls   Age of Boys   Age of Girls
28            23             27            24
37            30             39            34
42            40             23            20
25            26             33            31
29            25             36            29
47            41             32            35
37            35             22            23
35            25             29            27
23            21             38            34
41            38             48            47

Calculation of Class Interval:


Boys: Girls:
Bivariate Frequency Distribution Table
                              Age of Girls
Age of Boys   20-25   25-30   30-35   35-40   40-45   45-50   Total
20-25         3       -       -       -       -       -       3
25-30         2       3       -       -       -       -       5
30-35         -       -       1       1       -       -       2
35-40         -       2       3       1       -       -       6
40-45         -       -       -       1       1       -       2
45-50         -       -       -       -       1       1       2
Total         5       5       4       3       2       1       20
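A two-way table like the one in Problem-8 can be cross-checked by counting (boy class, girl class) pairs. This is a sketch only: it assumes exclusive classes of width 5 starting at 20, and the helper name `class_of` is illustrative.

```python
from collections import Counter

# Pairs are read row by row from the data table: (age of boy, age of girl).
boys = [28, 37, 42, 25, 29, 47, 37, 35, 23, 41,
        27, 39, 23, 33, 36, 32, 22, 29, 38, 48]
girls = [23, 30, 40, 26, 25, 41, 35, 25, 21, 38,
         24, 34, 20, 31, 29, 35, 23, 27, 34, 47]

def class_of(age, start=20, width=5):
    """Return the exclusive class (lower, upper) containing `age`."""
    lower = start + ((age - start) // width) * width
    return (lower, lower + width)

# Each cell of the two-way table counts one (boy class, girl class) combination.
cells = Counter((class_of(b), class_of(g)) for b, g in zip(boys, girls))
row_totals = Counter(class_of(b) for b in boys)
col_totals = Counter(class_of(g) for g in girls)

for (boy_class, girl_class), f in sorted(cells.items()):
    print(boy_class, girl_class, f)
```

The row totals and column totals must each add up to 20, which is a quick check that no pair was missed.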
Stem & Leaf Diagram
Stem = the "trunk": the leading digit or digits of each value
Leaf = the "leaf": the last digit of each value
Problem-9:
Construct a stem-leaf diagram from the following data.

10 8 7 5 10 13
25 21 30 15 19 40
30 50 28 29 32 33
20 26 37 39 24 42
Here,
Lowest Value = 5
Highest Value= 50

Stem-Leaf Diagram
Stem   Leaf      Ascending   Frequency
0      875       578         3
1      00359     00359       5
2      5189064   0145689     7
3      002379    002379      6
4      02        02          2
5      0         0           1
N = 24
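The ascending column of a stem and leaf diagram can be produced mechanically. The sketch below uses the data of Problem-9 and assumes whole-number data, with the tens digit as the stem and the units digit as the leaf; the function name `stem_and_leaf` is illustrative.

```python
from collections import defaultdict

data = [10, 8, 7, 5, 10, 13, 25, 21, 30, 15, 19, 40,
        30, 50, 28, 29, 32, 33, 20, 26, 37, 39, 24, 42]

def stem_and_leaf(values):
    """Map each stem (tens digit) to its leaves (units digits) in ascending order."""
    diagram = defaultdict(list)
    for v in sorted(values):
        diagram[v // 10].append(v % 10)
    return dict(diagram)

for stem, leaves in stem_and_leaf(data).items():
    print(stem, "|", "".join(map(str, leaves)), "|", len(leaves))
```

Sorting the values first means every row of leaves is already in ascending order, so no separate sorting step per stem is needed.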
Problem-10:
The following data represent the marks in statistics of 25 students.

65.5 66.0 66.9 68.2 65.9


66.7 67.3 69.2 67.9 66.8
69.8 68.4 68.5 65.0 67.5
67.2 69.6 67.3 69.9 69.0
68.3 65.4 65.1 66.2 68.0
Required : Display the data by stem and leaf diagram.

Here,
Lowest Value = 65.0
Highest Value= 69.9

Stem-Leaf Diagram
Stem   Leaf    Ascending   Frequency
65     54109   01459       5
66     70928   02789       5
67     23395   23359       5
68     34520   02345       5
69     86290   02689       5
N = 25

Bivariate Frequency Distribution:

Problem-11:
Using the following data:

106 121 76 115 74 127 150 82 126
78 94 108 177 69 128 96 119 161
128 84 86 79 130 144 115 123 100
84 120 111
Construct a Stem & Leaf Diagram.

Problem-12:
Construct a stem and leaf diagram from the following data:
10, 8, 7, 5, 10, 13, 25, 21, 30, 16, 19, 40, 30, 50, 28, 29, 32, 33, 20, 26, 37,
39, 24, 42.
Histogram

A histogram is a display of statistical information that uses rectangles to show the
frequency of data items in successive numerical intervals of equal size.
To draw a histogram we need 2 things:
1. Class Interval
2. Frequency

Problem-13:
Fig: Histogram

Ogive Curve
There are two types of ogive curve:
1. Less than ogive curve
2. More than ogive curve

To draw an ogive curve, the following elements are needed:
1. Class
2. Less than/More than cumulative frequency

# An ogive curve must be drawn freehand.

# The less than ogive is plotted against the highest value (upper limit) of each class.

# The more than ogive is plotted against the lowest value (lower limit) of each class.

# If the question specifies less than or more than, draw the one specified. But if it only says
"Draw ogive curve", draw both the less than and the more than ogive, and draw both
on the same graph paper.

Problem-14:
Draw ogive curve from the following data and point out the quartiles
Class 10-20 20-30 30-40 40-50 50-60 60-70
Frequency 10 40 50 55 45 20

Table for drawing ogive curve


Class Frequency LCF MCF
10-20 10 10 220
20-30 40 50 210
30-40 50 100 170
40-50 55 155 120
50-60 45 200 65
60-70 20 220 20
N = 220

[Figure: less than and more than ogives on the same axes; X-axis: class limits from 10 to 70, Y-axis: cumulative frequency from 0 to 300; Q1 and Q3 are marked on the X-axis]

Fig: Less than & More than Ogive Curve

Q1 lies at the n/4 = 220/4 = 55th item.
Q3 lies at the 3n/4 = 660/4 = 165th item.
Scale: On ‘OX’ axis, 1 square = 2 units
On ‘OY’ axis, 1 square = 10 units
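The LCF and MCF columns, and the quartile positions read from the graph in Problem-14, can be approximated numerically. This is a sketch, not the graphical method of the notes: it assumes the less than ogive is a straight line inside each class (linear interpolation), and the function name `item_position` is illustrative.

```python
from itertools import accumulate

lower_limits = [10, 20, 30, 40, 50, 60]
width = 10
freq = [10, 40, 50, 55, 45, 20]

# Less than cumulative frequency (plotted at the upper limits).
lcf = list(accumulate(freq))
# More than cumulative frequency (plotted at the lower limits).
mcf = list(accumulate(freq[::-1]))[::-1]

def item_position(rank):
    """X value at which the less than ogive reaches `rank`."""
    below = 0  # cumulative frequency before the current class
    for lower, f, cum in zip(lower_limits, freq, lcf):
        if rank <= cum:
            return lower + (rank - below) / f * width
        below = cum
    raise ValueError("rank exceeds the total frequency")

n = lcf[-1]
q1 = item_position(n / 4)      # 55th item
q3 = item_position(3 * n / 4)  # 165th item
print(lcf, mcf, round(q1, 2), round(q3, 2))
```

A value read from a hand-drawn ogive will only agree approximately with the interpolated figure.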
Problem-15:
Draw less than and more than ogives from the data given below and hence determine the
median
Marks 15-25 25-35 35-45 45-55 55-65 65-75
No. of Stu. 5 3 7 5 3 7

Table for drawing ogive curve


Class Frequency LCF MCF
15-25 5 5 30
25-35 3 8 25
35-45 7 15 22
45-55 5 20 15
55-65 3 23 10
65-75 7 30 7

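[Figure: less than and more than ogives on the same axes; X-axis: marks from 15 to 75, Y-axis: cumulative frequency from 0 to 30; the median is marked on the X-axis]

Fig: Less than & More than Ogive Curve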

Problem-16:

Wages            30-40 40-50 50-60 60-70 70-80 80-90
No. of workers   10    15    13    7     30    12
a. Show the mode from a histogram.
b. Calculate the mode and compare the two.
Frequency Distribution Table

Class Frequency
30-40 10
40-50 15
50-60 13
60-70 7
70-80 30
80-90 12

a.
[Figure: histogram of the wage distribution; X-axis: wages from 30 to 90, Y-axis: frequency from 0 to 30]

Fig: Histogram & Frequency Polygon


Scale: On ‘OX’ axis, 1 square = 2 units
On ‘OY’ axis, 1 square = 2 units

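b. Calculation of Mode:

Here, the highest frequency is 30, so the modal class is 70-80.

Here,
L = 70 (lower limit of the modal class)
$\Delta_1 = 30 - 7 = 23$
$\Delta_2 = 30 - 12 = 18$
C = 10

We know,
$$\text{Mode} = L + \frac{\Delta_1}{\Delta_1 + \Delta_2} \times C = 70 + \frac{23}{23 + 18} \times 10 = 75.61$$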
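The mode calculation for grouped data can be written as a short function. This is a sketch of the same formula; the function name `grouped_mode` is illustrative, and treating a missing neighbouring class as frequency 0 is an assumption that goes beyond the notes.

```python
def grouped_mode(lower_limits, freq, width):
    """Mode = L + d1 / (d1 + d2) * C for the class with the highest frequency."""
    i = max(range(len(freq)), key=freq.__getitem__)  # index of the modal class
    prev_f = freq[i - 1] if i > 0 else 0
    next_f = freq[i + 1] if i < len(freq) - 1 else 0
    d1 = freq[i] - prev_f  # difference from the preceding class
    d2 = freq[i] - next_f  # difference from the following class
    return lower_limits[i] + d1 / (d1 + d2) * width

wages = [30, 40, 50, 60, 70, 80]
workers = [10, 15, 13, 7, 30, 12]
print(round(grouped_mode(wages, workers, 10), 2))
```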

Problem-17: [NU. M. Com. (Hon)1998]


Given below are the marks obtained by 50 students in a certain examination:
87 78 82 60 63 28 70 37 58 60
78 60 30 61 68 90 25 67 57 45
58 48 33 66 53 80 29 81 54 12
67 51 68 62 11 78 45 90 20 27
66 29 76 23 57 32 84 63 28 90

Required:
1. Construct a frequency table taking a suitable class interval.
2. Represent the table by ogive.

Class = Mid Value of Class


Frequency = Frequency
Problem-18:
From the following table draw a histogram:
Wages in Tk. No. of workers
0 – 10 5
10 – 20 8
20 – 30 10
30 – 40 14
40 – 50 11
50 – 60 6
60 – 70 3
Problem-19:
Arrange the following data in a frequency distribution and draw a histogram.
40, 38, 44, 28, 60, 21, 35, 42, 40, 36,
50, 67, 25, 58, 30, 48, 65, 35, 55, 39,
72, 44, 70, 55, 53, 21, 76, 46, 57, 67,
51, 34, 41, 56, 62, 42, 64, 73, 38, 41,

Frequency Curve & Frequency Polygon

To draw a frequency curve or a frequency polygon, the following elements are needed:
1. Mid Value of Class
2. Frequency

# Formula for determining the mid value of a class:
$$\text{Mid Value} = \frac{\text{Highest Value} + \text{Lowest Value}}{2}$$
# A frequency curve must be drawn freehand. A frequency polygon must be drawn with a ruler.

# A frequency polygon and a frequency curve must be drawn on separate graphs.
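The points to be plotted for a frequency curve or polygon are simply (mid value, frequency) pairs. A minimal sketch, using the classes and frequencies of Problem-20; the function name `mid_values` is illustrative.

```python
def mid_values(class_limits):
    """Mid value of each class: (lowest value + highest value) / 2."""
    return [(lower + upper) / 2 for lower, upper in class_limits]

classes = [(10, 20), (20, 30), (30, 40), (40, 50), (50, 60), (60, 70)]
freq = [10, 30, 50, 40, 45, 20]

# Each point is plotted at (mid value on the X-axis, frequency on the Y-axis).
points = list(zip(mid_values(classes), freq))
print(points)
```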
Problem-20:
Draw a Frequency curve from the following data
Class 10-20 20-30 30-40 40-50 50-60 60-70
Frequency 10 30 50 40 45 20

Table for Drawing Frequency curve


Class Mid Value Frequency
10-20 15 10
20-30 25 30
30-40 35 50
40-50 45 40
50-60 55 45
60-70 65 20

[Figure: frequency curve drawn freehand through the points (mid value, frequency); X-axis: mid values from 15 to 65, Y-axis: frequency from 0 to 50]
Fig: Frequency Curve
Scale: On ‘OX’ axis, 1 square = 2 units
On ‘OY’ axis, 1 square = 2 units
Problem-21:
Draw a Frequency Polygon from the following data
Class 0-10 10-20 20-30 30-40 40-50 50-60 60-70
Frequency 5 10 15 30 35 45 25

Table for Drawing Frequency Polygon


Class Mid Value Frequency
0-10 5 5
10-20 15 10
20-30 25 15
30-40 35 30
40-50 45 35
50-60 55 45
60-70 65 25

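[Figure: frequency polygon joining the points (mid value, frequency) with straight lines; X-axis: mid values from 5 to 65, Y-axis: frequency from 0 to 45]
Fig: Frequency Polygon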
Scale: On ‘OX’ axis, 1 square = 2 units
On ‘OY’ axis, 1 square = 2 units
Problem-22:
Draw a Histogram & Frequency Polygon from the following data
Class 10-20 20-30 30-40 40-50 50-60 60-70
Frequency 10 30 50 40 35 20

Class Frequency
10-20 10
20-30 30
30-40 50
40-50 40
50-60 35
60-70 20

[Figure: histogram with the frequency polygon drawn over it; X-axis: class limits from 10 to 70, Y-axis: frequency from 0 to 50]

Fig: Histogram & Frequency Polygon


Scale: On ‘OX’ axis, 1 square = 2 units
On ‘OY’ axis, 1 square = 2 units
Problem-23:
Draw a Histogram & Frequency Curve from the following data
Class 10-20 20-30 30-40 40-50 50-60 60-70
Frequency 10 30 50 40 35 20

Table for Drawing Histogram & Frequency Curve


Class Frequency
10-20 10
20-30 30
30-40 50
40-50 40
50-60 35
60-70 20

[Figure: histogram with the frequency curve drawn over it; X-axis: class limits from 10 to 70, Y-axis: frequency from 0 to 50]

Fig: Histogram & Frequency Curve

Scale: On ‘OX’ axis, 1 square = 2 units


On ‘OY’ axis, 1 square = 2 units

Note:
1. If the question asks for a histogram and a frequency polygon together,
there is no need to draw separate graphs.
After drawing the histogram, select the mid point of each histogram bar
and then join those points with a ruler.
2. If the question asks for a histogram and a frequency curve together,
there is no need to draw separate graphs.
After drawing the histogram, select the mid point of each histogram bar
and then join those points freehand.

Problem-24:
The following are the daily sales (in taka) of 40 shops.
44 57 54 45 62 51 38 62 49 41 36 56 42 36 40 43 37 47 50 46 43 65 49 55 32 48 35 46 44 46
41 38 63 54 51 41 44 57 46 45
(i) Prepare a frequency distribution table taking 5 as class interval.
(ii) Draw a frequency polygon from the table.

Problem-25:
Arrange the following data in a frequency distribution and draw a histogram and a frequency
polygon on the basis of the data:
40, 38, 44, 28, 60, 21, 35, 42, 40, 36, 50, 67, 25, 58, 30, 48, 65, 35, 55, 39, 72, 44, 70, 55, 53,
21, 76, 46, 57, 67, 51, 34, 41, 56, 62, 42, 64, 73, 38, 41
Pie Chart
Problem-26:
Draw a pie chart for the following data: Cost of Materials Tk. 40,000; Cost of Labor Tk.
30,000; Direct Expenses of Manufacture Tk. 10,000; Factory Overhead Expenses Tk. 15,000;
Misc. Expenses Tk. 5,000.
Solution:
Calculation Table for Pie Chart

Expenses      Cost      Calculation                                Degree
Materials     40000     $\frac{40000}{100000} \times 360^\circ$    144°
Labor         30000     $\frac{30000}{100000} \times 360^\circ$    108°
Direct Exp.   10000     $\frac{10000}{100000} \times 360^\circ$    36°
Factory O/H   15000     $\frac{15000}{100000} \times 360^\circ$    54°
Misc. Exp.    5000      $\frac{5000}{100000} \times 360^\circ$     18°
Total         100000                                               360°

[Figure: pie chart with sectors Materials, 144; Labor, 108; Factory O/H, 54; Direct Exp., 36; Misc. Exp., 18]
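The degree column of the pie chart table follows one rule: each item's share of the total, multiplied by 360°. A minimal sketch of that rule; the dictionary name `costs` and its keys are just labels for the data of Problem-26.

```python
costs = {
    "Materials": 40000,
    "Labor": 30000,
    "Direct Exp.": 10000,
    "Factory O/H": 15000,
    "Misc. Exp.": 5000,
}

total = sum(costs.values())
# Angle of each sector = (cost / total cost) x 360 degrees.
angles = {name: cost * 360 / total for name, cost in costs.items()}

for name, angle in angles.items():
    print(f"{name}: {angle:.0f} degrees")
```

The angles should always add up to 360°, which is a useful check before drawing.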

Problem-27: [DU. M. Com (Hons) – 1982]


The annual production of wheat (in tons) from 2010 to 2019 is given below. Draw a line chart of the average monthly
production for each year.
Year 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019
Yearly Production 200 215 225 250 240 260 275 265 280 292

Solution
Table for calculation of average production
Year Yearly Production Monthly Average Production
2010 200 16.67
2011 215 17.92
2012 225 18.75
2013 250 20.83
2014 240 20.00
2015 260 21.67
2016 275 22.92
2017 265 22.08
2018 280 23.33
2019 292 24.33

[Figure: historigram (line chart) of average monthly production; X-axis: Year, from 2009 to 2020; Y-axis: Average Monthly Production, from 14 to 28]
Scale: Value 1 = .5 Inch in X-axis; Value 2 = .5 Inch in Y-axis

Problem-28: [NU. B.Com. (Hons) - 1996]

Daily wages of 40 workers are given below:


88 194 108 68 101 129 119 149 210 203 96 104 146
127 97 89 123 108 94 149 93 118 136 187 102 191
148 163 172 93 104 132 131 142 92 87 144 183 105
108
Prepare the following by suitable class interval:
(i) An absolute frequency distribution;
(ii) A Cumulative relative frequency distribution;
(iii) Draw an ogive on the above frequency distribution and estimate the range of central 50% wages.

Solution
Let us determine a suitable class interval with the help of Sturges' formula, as shown below.
The highest and lowest values are 210 and 68 respectively, so the range is 210 – 68 = 142.
Here, N = total number of observations = 40
$$C = \frac{\text{Range}}{1 + 3.322 \log N} = \frac{142}{1 + 3.322 \log 40} = \frac{142}{1 + 3.322 \times 1.602} = 22.46$$
Here, 20 is taken as the class interval as a round figure.
Frequency Distribution Table
Class Interval   Tally          Frequency   Cumulative       Relative    Cumulative Relative
(Daily wages)    Marks          (f)         Frequency (cf)   Frequency   Frequency
60 – 80          |              1           1                0.025       0.025
80 – 100         ||||| ||||     9           10               0.225       0.25
100 – 120        ||||| |||||    10          20               0.25        0.50
120 – 140        ||||| |        6           26               0.15        0.65
140 – 160        ||||| |        6           32               0.15        0.80
160 – 180        ||             2           34               0.05        0.85
180 – 200        ||||           4           38               0.10        0.95
200 – 220        ||             2           40               0.05        1.00
N = 40
Relative Frequency = Frequency of each class ÷ Total frequency.
Cumulative Relative Frequency = Cumulative frequency of each class ÷ Total frequency.
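The two division rules stated under the table can be checked with a few lines of code. This is a sketch using the class frequencies of Problem-28; the variable names are illustrative.

```python
from itertools import accumulate

freq = [1, 9, 10, 6, 6, 2, 4, 2]  # classes 60-80 up to 200-220
n = sum(freq)

cf = list(accumulate(freq))                # cumulative frequency
relative = [f / n for f in freq]           # frequency / total frequency
cumulative_relative = [c / n for c in cf]  # cumulative frequency / total frequency

for f, c, r, cr in zip(freq, cf, relative, cumulative_relative):
    print(f"{f:2d} {c:2d} {r:.3f} {cr:.3f}")
```

The last cumulative value must equal the total number of workers, and the last ratio must equal 1.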
The ogive for the above data is shown below, with the range of the central 50% of the workers' wages marked on it.
[Figure: less than ogive; X-axis: Upper limit of Class, from 40 to 240; Y-axis: Cumulative Frequency, from 0 to 45; the central 50% wages range is marked]
Scale: Value 20 = 5 Inch in X-axis; Value 5 = 5 Inch in Y-axis

Notes for ogive: To plot a less than ogive, we take the upper limits of the classes on the X-axis and the
cumulative frequency on the Y-axis. After plotting each less than cumulative frequency against the upper limit of its
class and marking the points, we join all the points freehand. The resulting rising curve is called the less than ogive.

Problem-29: [N.U. B. Com (Hons) - 2006]


Represent the following with a suitable graph:
Variable: 1 2 3 4 5 6
Frequency: 50 80 30 12 3 1

Solution
For the given data, the most suitable graph is a line graph, which is shown below:
[Figure: line graph; X-axis: Variable, from 0 to 7; Y-axis: Frequency, from 0 to 90; plotted points (1, 50), (2, 80), (3, 30), (4, 12), (5, 3), (6, 1)]
Problem-30: [DU. BBA (Hon's) – 2019]
Draw an ogive for the following distribution. Read the median from the graph. How many workers earned a
monthly salary between Tk. 25,400 and Tk. 40,700?
Monthly salary No. of workers Monthly salary No. of workers
15,000 – 20,000 6 35,000 – 40,000 16
20,000 – 25,000 10 40,000 – 45,000 12
25,000 – 30,000 22 45,000 – 50,000 15
30,000 – 35,000 30

Solution
Let us arrange the frequency distribution for the less than and more than methods as given below:
Table for less than and more than cumulative frequency
Monthly Salary      (f)    Cumulative Frequency
                           Less than   More than
15,000 – 20,000     6      6           111
20,000 – 25,000     10     16          105
25,000 – 30,000     22     38          95
30,000 – 35,000     30     68          73
35,000 – 40,000     16     84          43
40,000 – 45,000     12     96          27
45,000 – 50,000     15     111         15
With the help of this frequency distribution table, we can draw ogives by the less than and more than methods, as shown
below:
[Figure: less than and more than ogives; X-axis: Monthly Salary (Tk.), from 0 to 55,000; Y-axis: Cumulative frequency (No. of workers), from 0 to 125; the median is marked on the graph as 32,500]

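Number of workers whose salaries lie between Tk. 25,400 and Tk. 40,700:
In the class 25,000 – 30,000, the part above Tk. 25,400 is 30,000 – 25,400 = 4,600 wide. In the class 40,000 – 45,000,
the part below Tk. 40,700 is 700 wide. Assuming the workers are spread evenly within each class:
$$\left(\frac{22}{5{,}000} \times 4{,}600\right) + 30 + 16 + \left(\frac{12}{5{,}000} \times 700\right) = 20.24 + 30 + 16 + 1.68 = 67.92 \approx 68$$
Hence, approximately 68 workers' earnings lie between Tk. 25,400 and Tk. 40,700.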
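The number of workers between two salary values can be estimated by taking, from each class, the fraction of its width that falls inside the range. This is a sketch under the assumption that workers are spread evenly within each class; the function name `count_between` is illustrative.

```python
def count_between(lower_limits, freq, width, low, high):
    """Estimate how many items lie between `low` and `high` in grouped data."""
    total = 0.0
    for lower, f in zip(lower_limits, freq):
        upper = lower + width
        # Width of the part of this class that lies inside [low, high].
        overlap = min(high, upper) - max(low, lower)
        if overlap > 0:
            total += f * overlap / width
    return total

salary_lower = [15000, 20000, 25000, 30000, 35000, 40000, 45000]
workers = [6, 10, 22, 30, 16, 12, 15]
print(round(count_between(salary_lower, workers, 5000, 25400, 40700), 2))
```

Classes that lie wholly inside the range contribute their full frequency, and classes outside the range contribute nothing.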
