Business Statistics & Analytics KMBN104 UNIT-1
The word statistics is derived from the Latin word Status, the Italian word
Statistica, or the German word Statistik. In each case it means ‘an organized political
state’.
In everyday usage, the word statistics refers to numerical data, for example statistics
of national income, market statistics, production statistics, import and export
statistics, etc.
DEFINITION OF STATISTICS
According to A. L. Bowley, "Statistics is the science of counting", and "Statistics may be
called the science of averages".
SCOPE OF STATISTICS
In ancient times the scope of statistics was limited, but nowadays it is so vast and
ever-expanding that it is difficult to define. The scope of
statistics may be discussed under three main parts-
1. Division of statistics
2. Importance of statistics
3. Applications of statistics in various disciplines.
The science of statistics may be classified into the following main divisions-
1. Theoretical statistics
2. Statistical methods
3. Applied statistics
The statistical measures that tell us the location or position of the central value or
central point, and so describe the central tendency of the entire mass of data, are known
as measures of central tendency.
“A measure of central tendency is a single value within the range of the entire mass
of the data that is used to represent the whole data.”
KINDS OF STATISTICAL AVERAGES
The various averages (measures of central tendency) are as given below-
1. Arithmetic average or mean or arithmetic mean
2. Median
3. Mode
Definition-
The arithmetic mean (A.M.) of a group of observations is the quotient obtained
by dividing the sum of all observations by the total number of observations. The A.M. is
denoted by x̄.
Arithmetic mean = Total value of observations divided by total number of
observations.
$$\bar{x} = \frac{\sum x}{N}$$
Where,
N = number of observations.
∑x = total of all observations; it simply means "add up all the
values of x".
Solution-
Size(x) Frequency(f) fx
6 5 30
7 8 56
8 9 72
9 12 108
10 6 60
11 6 66
12 4 48
N or ∑f = 50 ∑fx = 440
Mean:
$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{440}{50} = 8.8$$
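The calculation above can be checked with a short Python sketch. The sizes and frequencies are copied from the solution table; the function name `frequency_mean` is only an illustrative choice, not part of the notes.

```python
# Sizes (x) and frequencies (f) copied from the solution table above.
sizes = [6, 7, 8, 9, 10, 11, 12]
freqs = [5, 8, 9, 12, 6, 6, 4]


def frequency_mean(values, frequencies):
    """Return sum(f * x) / sum(f) for a discrete frequency distribution."""
    total_fx = sum(f * x for x, f in zip(values, frequencies))  # sum of fx
    total_f = sum(frequencies)                                  # N, i.e. sum of f
    return total_fx / total_f


mean = frequency_mean(sizes, freqs)
print(mean)  # 8.8
```

The same function works for any discrete series as long as the two lists have equal length.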
Mean (x̄) = 1810/70 ≈ 25.86
MEDIAN
Definition- If a group of N observations is arranged in ascending or descending order
of magnitude, then the middle value is called the median of these observations and is
denoted by M. That is, M = the $\left(\frac{N+1}{2}\right)$th observation.
COMPUTATION OF MEDIAN
INDIVIDUAL SERIES-
In the case of an individual series, the following steps are taken to calculate the
median-
1)- Arrange the data in ascending or descending order.
2)- Locate the middle value-
If the number of observations N is odd, there is a single value in the middle,
which is taken as the median. Mathematically,
$$M = \left(\frac{N+1}{2}\right)\text{th observation}$$
If the number of observations N is even, there are two middle values and the median
is the average of these two values. Mathematically,
$$M = \frac{\left(\frac{N}{2}\right)\text{th observation} + \left(\frac{N}{2}+1\right)\text{th observation}}{2}$$
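The two steps for an individual series can be sketched in Python as follows. The sample values are made up purely for illustration, and `median_individual` is a hypothetical helper name.

```python
def median_individual(observations):
    """Median of an individual series: sort, then pick the middle value(s)."""
    ordered = sorted(observations)      # step 1: arrange in ascending order
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd N: the (N+1)/2-th observation (index mid when counting from 0).
        return ordered[mid]
    # Even N: average of the (N/2)-th and (N/2 + 1)-th observations.
    return (ordered[mid - 1] + ordered[mid]) / 2


# Illustrative values only (not taken from the notes).
odd_result = median_individual([7, 3, 9, 5, 1])   # sorted: 1 3 5 7 9
even_result = median_individual([7, 3, 9, 5])     # sorted: 3 5 7 9
print(odd_result, even_result)  # 5 6.0
```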
Example 4:
Example 5-
Calculate median for the following table-
Value : 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Frequency: 2 3 8 10 12 16 10 8 6 5 6 4 3 1
Solution-
Value(x) Frequency(f) Cumulative frequency(cf)
2 2 2
3 3 5
4 8 13
5 10 23
6 12 35
7 16 51
8 10 61
9 8 69
10 6 75
11 5 80
12 6 86
13 4 90
14 3 93
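15 1 94
Here N = ∑f = 94, so the median is the size of the (N+1)/2 = 47.5th item. The first
cumulative frequency that reaches 47.5 is 51, which corresponds to x = 7. Hence M = 7.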
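As a rough check of the table above, the cumulative-frequency lookup can be written in Python. This sketch assumes the usual rule for a discrete series, namely that the median is the value whose cumulative frequency first reaches the (N+1)/2-th item; `median_discrete` is an illustrative name.

```python
from itertools import accumulate

# Values and frequencies copied from Example 5.
values = list(range(2, 16))  # 2, 3, ..., 15
freqs = [2, 3, 8, 10, 12, 16, 10, 8, 6, 5, 6, 4, 3, 1]


def median_discrete(values, frequencies):
    """Return the value whose cumulative frequency first reaches (N+1)/2."""
    cumulative = list(accumulate(frequencies))  # the cf column
    n = cumulative[-1]                          # N = sum of frequencies
    position = (n + 1) / 2                      # item number of the median
    for value, cf in zip(values, cumulative):
        if cf >= position:
            return value


median = median_discrete(values, freqs)
print(median)  # 7
```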
Median,
$$M = L_1 + \frac{\frac{N}{2} - C}{f}\,(L_2 - L_1)$$
Where, M = median
L1 = lower limit of the median class.
L2 = upper limit of the median class.
f = frequency of the median class.
C = cumulative frequency of the class preceding the median class.
N = sum of frequencies.
Example 6 -
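Solution-
L1 = 15, L2 = 20, f = 16, C = 20, N = 60
Now, median M
$$
\begin{aligned}
M &= L_1 + \frac{\frac{N}{2} - C}{f}\,(L_2 - L_1) \\
  &= 15 + \frac{\frac{60}{2} - 20}{16}\,(20 - 15) \\
  &= 15 + \frac{30 - 20}{16} \times 5 \\
  &= 15 + \frac{10 \times 5}{16} \\
  &= 15 + 3.125 \\
  &= 18.125
\end{aligned}
$$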
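The substitution above can be reproduced with a small Python function. The parameter names mirror the symbols in the formula; the function name `median_grouped` is an assumption made for this sketch.

```python
def median_grouped(l1, l2, f, c, n):
    """Median of a continuous series: L1 + ((N/2 - C) / f) * (L2 - L1)."""
    return l1 + ((n / 2 - c) / f) * (l2 - l1)


# Values taken from the solution of Example 6.
m = median_grouped(l1=15, l2=20, f=16, c=20, n=60)
print(m)  # 18.125
```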
MODE
The mode is the number that appears more often than any other number in a given
set. It is quoted as a typical value of the variable.
The value of the variable for which the frequency is maximum is called the mode or
modal value, and it is denoted by Z.
Example-
Frequency : 6 9 10 16 12 8 7
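$$Z = L_1 + \frac{f_1 - f_0}{2f_1 - f_0 - f_2} \times i$$
Where L1 is the lower limit of the modal class, f1 is the frequency of the modal class,
f0 and f2 are the frequencies of the classes before and after it, and i is the class
interval.
$$Z = 30 + \frac{16 - 10}{2 \times 16 - 10 - 12} \times 10 = 30 + \frac{6}{10} \times 10 = 36$$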
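The mode calculation can be sketched in Python as below. The frequency row is copied from the example; the lower limit 30 and class interval 10 are taken from the worked substitution, and the symbol names f0, f1, f2 are the conventional ones assumed here for the frequencies around the modal class.

```python
# Frequency row copied from the example above.
freqs = [6, 9, 10, 16, 12, 8, 7]

# The modal class is the one with the highest frequency.
modal_index = freqs.index(max(freqs))
f1 = freqs[modal_index]      # frequency of the modal class
f0 = freqs[modal_index - 1]  # frequency of the preceding class
f2 = freqs[modal_index + 1]  # frequency of the following class


def mode_grouped(l1, f1, f0, f2, i):
    """Mode of a continuous series: L1 + (f1 - f0) / (2*f1 - f0 - f2) * i."""
    return l1 + (f1 - f0) * i / (2 * f1 - f0 - f2)


# L1 = 30 and i = 10 are taken from the worked solution.
z = mode_grouped(l1=30, f1=f1, f0=f0, f2=f2, i=10)
print(z)  # 36.0
```

This simple lookup assumes the modal class is neither the first nor the last class, as is the case here.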
MEASURE OF DISPERSION
Having obtained a measure of location or position of a distribution, we also need to know
how the data are spread about that point. Information about the spread can be given by
one or more measures of dispersion.
THE RANGE
This is the simplest measure of dispersion available in statistical analysis. It uses
only two extreme values. The range is defined as the difference between the
maximum and minimum values of a given data set. Its advantage lies in its simplicity
and its independence of the measure of position. However, it is distorted by
extreme values and tells us nothing about the values between the maximum and minimum.
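A minimal Python sketch of the range, and of how a single extreme value distorts it, is given below. The data values are illustrative ones chosen for this sketch.

```python
def value_range(values):
    """Range = maximum value - minimum value."""
    return max(values) - min(values)


# Illustrative data; 76 is an extreme value compared with the rest.
data = [44, 48, 49, 51, 52, 52, 76]

full_range = value_range(data)            # 76 - 44
without_extreme = value_range(data[:-1])  # 52 - 44, with 76 left out
print(full_range, without_extreme)  # 32 8
```

Dropping one observation changes the range from 32 to 8, which shows how heavily it depends on the two extreme values.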
Example :
Calculate the first and third quartiles for the following data set:
44, 76, 49, 52, 52, 48, 51.
We first arrange the data set in ascending order.
44, 48, 49, 51, 52, 52, 76.
Q1 is the value of the (7+1)/4 = 8/4 = 2nd item.
This is 48.
Q3 is the value of the 3(7+1)/4 = 24/4 = 6th item.
This is 52.
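The quartile lookup in this example can be sketched in Python. The helper `quartile_item` is a hypothetical name, and it deliberately handles only the case where the item number k(N+1)/4 is a whole number, as it is here; fractional positions would need interpolation, which this sketch does not attempt.

```python
def quartile_item(data, k):
    """Return the value of the k(N+1)/4-th item (k = 1 for Q1, k = 3 for Q3)."""
    ordered = sorted(data)                # arrange in ascending order
    numerator = k * (len(ordered) + 1)
    if numerator % 4 != 0:
        raise ValueError("fractional item number; interpolation not handled")
    position = numerator // 4             # item number, counting from 1
    return ordered[position - 1]


data = [44, 76, 49, 52, 52, 48, 51]
q1 = quartile_item(data, 1)
q3 = quartile_item(data, 3)
print(q1, q3)  # 48 52
```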
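First Quartile:
$$Q_1 = L_1 + \frac{q_1 - cf_0}{f} \times i, \qquad q_1 = \frac{N \times 1}{4}\ \text{th item}$$
Third Quartile:
$$Q_3 = L_1 + \frac{q_3 - cf_0}{f} \times i, \qquad q_3 = \frac{N \times 3}{4}\ \text{th item}$$
Quartile Deviation:
$$\text{Q.D.} = \frac{Q_3 - Q_1}{2}$$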
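The formulas above translate directly into Python. In this sketch, cf0 is assumed to be the cumulative frequency of the class preceding the quartile class (the notes do not define it explicitly), and the grouped-data inputs in the second call are hypothetical numbers used only to show the arithmetic.

```python
def quartile_grouped(l1, q, cf0, f, i):
    """Quartile of a continuous series: L1 + ((q - cf0) / f) * i."""
    return l1 + (q - cf0) * i / f


def quartile_deviation(q1, q3):
    """Quartile deviation: (Q3 - Q1) / 2."""
    return (q3 - q1) / 2


# Q1 = 48 and Q3 = 52 come from the individual-series example above.
qd = quartile_deviation(48, 52)
print(qd)  # 2.0

# Hypothetical grouped data: N = 40, quartile class 10-20 with f = 8, cf0 = 4.
n = 40
q1_grouped = quartile_grouped(l1=10, q=n * 1 / 4, cf0=4, f=8, i=10)
print(q1_grouped)  # 17.5
```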
MEAN DEVIATION
STANDARD DEVIATION
If the mean is used as the measure of central location, then the standard deviation is
the natural choice for a measure of dispersion.
The standard deviation measures the differences from the mean; a larger value
indicates greater variation. The standard deviation is denoted by the lowercase Greek
letter sigma (σ) and is in the same units as the actual observations.
To calculate the standard deviation for ungrouped (individual series) data, we
follow these steps.
1) Choose an assumed mean (A) [select A as the minimum value in the given
data].
2) Take the deviation (dx) of each value X from A, that is dx = X − A, and total them as Σdx.
3) Square the deviations (dx²) and total them as Σdx².
4) The formula is:
$$\text{S.D.} = \sigma = \sqrt{\frac{\sum dx^2}{N} - \left(\frac{\sum dx}{N}\right)^2}, \qquad \bar{x} = A + \frac{\sum dx}{N}$$
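The four steps of the assumed-mean method can be sketched in Python as follows. The data values are made up for illustration, and `sd_assumed_mean` is a hypothetical helper name; the sketch follows the notes in taking the minimum value as A.

```python
import math


def sd_assumed_mean(data):
    """Standard deviation and mean of an individual series via an assumed mean."""
    a = min(data)                         # step 1: assumed mean A
    dx = [x - a for x in data]            # step 2: deviations dx = X - A
    n = len(data)
    sum_dx = sum(dx)                      # total of dx
    sum_dx2 = sum(d * d for d in dx)      # step 3: total of dx squared
    sd = math.sqrt(sum_dx2 / n - (sum_dx / n) ** 2)  # step 4: formula
    mean = a + sum_dx / n                 # actual mean recovered from A
    return sd, mean


# Illustrative values only (not taken from the notes).
sd, mean = sd_assumed_mean([2, 4, 4, 4, 5, 5, 7, 9])
print(sd, mean)  # 2.0 5.0
```

Any value of A gives the same result; choosing the minimum simply keeps every deviation non-negative.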
Example-
Solution-
$$\text{S.D.} = \sqrt{\frac{\sum dx^2}{N} - \left(\frac{\sum dx}{N}\right)^2} = 19.76$$
COEFFICIENT OF VARIATION
The coefficient of variation expresses the standard deviation of a set of observations
as a percentage of the arithmetic mean.
C.V. = S.D.*100/mean
C.V. = σ*100/x̄
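A short Python sketch of the coefficient of variation is given below. The standard deviation and mean passed in are illustrative numbers, not results from the notes.

```python
def coefficient_of_variation(sd, mean):
    """C.V. = S.D. * 100 / mean, expressed as a percentage."""
    return sd * 100 / mean


# Illustrative inputs: a series with S.D. = 2 and mean = 5.
cv = coefficient_of_variation(sd=2, mean=5)
print(cv)  # 40.0
```

Because the result is a percentage, it can be used to compare the variability of series measured in different units.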
SKEWNESS
Skewness in a set of data relates to the shape of the histogram that could be
drawn from the data. The literal meaning of the word skewness is ‘lack of
symmetry’. If the frequency distribution on either side of the central value is not
symmetrical, the distribution is said to be skewed.
Definition-
1) “Skewness is the tendency of a distribution to depart from normal in the
balance of its two sides.” – Blair
2) “A distribution is said to be skewed when the mean and median fall at different
points in the distribution, and balance is shifted to one side or the other.”-
Garrett
Types of Skewness
1) Symmetrical distribution
2) Asymmetrical distribution
Measures of Skewness
1) Karl Pearson's coefficient of skewness (Jk) = (Mean − Mode)/S.D.
2) Bowley's coefficient of skewness (JQ) = (Q3 + Q1 − 2 Median)/(Q3 − Q1)
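Both coefficients can be sketched in Python. Bowley's coefficient is written here in its standard form, (Q3 + Q1 − 2 × Median)/(Q3 − Q1). The Pearson inputs are hypothetical numbers; the Bowley inputs reuse Q1 = 48 and Q3 = 52 from the quartile example, together with the median 51 of that data set.

```python
def pearson_skewness(mean, mode, sd):
    """Karl Pearson's coefficient: (Mean - Mode) / S.D."""
    return (mean - mode) / sd


def bowley_skewness(q1, median, q3):
    """Bowley's coefficient: (Q3 + Q1 - 2 * Median) / (Q3 - Q1)."""
    return (q3 + q1 - 2 * median) / (q3 - q1)


# Hypothetical summary figures, for illustration only.
jk = pearson_skewness(mean=50, mode=44, sd=12)
print(jk)  # 0.5

# Data set 44, 48, 49, 51, 52, 52, 76: Q1 = 48, median = 51, Q3 = 52.
jq = bowley_skewness(q1=48, median=51, q3=52)
print(jq)  # -0.5
```

A positive coefficient indicates a longer tail to the right, a negative one a longer tail to the left, and zero a symmetrical distribution.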
SOLVE THESE QUESTIONS
1-Calculate mean, median, mode of the given data-
X 5 10 15 20 25 30 35 40 45
y 3 51 25 5 9 7 12 16 8
3- Calculate mean, median, mode, standard deviation and coefficient of variation of the given
series-
Marks(above) 0 10 20 30 40 50 60
No of students 150 140 100 80 70 30 14
Age under 10 20 30 40 50 60 70 80
No. of persons 5 15 30 55 81 100 120 125