Statatics Chapter 1
Statatics Chapter 1
Mekelle University 1
Quick reference in statistics
Frequency distributions
Frequency distribution is a tabular arrangement of data by classes together with
corresponding frequencies. (Discrete frequency distribution and continuous frequency
distribution)
Mekelle University 2
Quick reference in statistics
5. Class limits the end numbers in a class known as class limits. E.g 15 is lower
class limit and 19 upper class limit of the first class in the above table.
6. Class boundaries (true class limits) are obtained as follows
7. Class mid point (Class mark) is the mid point of a class interval example
(15+19)/2 = 17 is the class mark of the 1st class in the above table.
8. Class width (Class size or class strength) the difference between upper and
lower class boundaries of a class or difference between successive L.C.Ls or
successive upper class limits or successive class marks.
Steps in creating in creating continuous frequency distribution table
1. Determine the largest and smallest numbers in the raw data
2. Calculate the Range
3. Determine the number of classes you want to have, usually the number of classes
ranges from 5 to 20.
4. Divide Range by the number of classes to get class width
5. Identify the lower class limit of the first class (make sure that the minimum value
is included in the first class and the maximum value in the last class).
6. Identify the remaining lower class limits by adding class width
7. Write all upper class limits
8. Start tallying.
Example Create a continuous frequency distribution for the following data of 100
employees and their weekly salaries.
70 61 64 ……….74………65………..72…72…
1. Max = 74 min = 61
2. Range = 74 – 61 = 13
3. Make the number of classes to be 5
4. Class width = Range/5 = 13/5 = 2.63 ≈ﻩ
Table II
Salary Frequency
60 – 62 5
63 – 65 18
66 – 68 42
69 – 71 27
72 - 74 8
Mekelle University 3
Quick reference in statistics
= x1 + x2 + x3 + x4 + x5
= 4 + 2 + 6 +(-5) + 8
= 15
Mekelle University 4
Quick reference in statistics
= x1 + x2 + x3 + x4 = 4 + 6 + (-5) + 8 = 13
= -3 – 10 + 2 + 8 = -3
= 1 + (-4) + (-3) + 16
= 10
=
1.
2.
5.
Mekelle University 5
Quick reference in statistics
Example 2 Given = 7, ,
Find
a. +5yj)
= . . . property 1.
= . . . property 3
= 2(7) = 5(-3) = 14 – 15 = -1
b. (xj – 3) (2yj + 1)
= (2xjyj) + xj – 6yj – 3
= (2xjyj) + - 3 … property 1
Example 2 = 4 and = 10
Arithmetic mean
The arithmetic mean or the mean of a set of n numbers x 1, x2, x3. . . xn
is denoted of by and is defined as
1. Simple mean
Formula I
=7.2
Mekelle University 6
Quick reference in statistics
= 7.67
II weighted mean
If w1 w2 . . . wn are weights of the values (x1, x2, x3. . . xn)
respectively , then
ixi
Salary Frequenc Class (xi)
Mekelle University 7
Quick reference in statistics
y
(i)
60 - 62 5 61 305
63 - 65 18 64 1152
66 - 68 42 67 284
69 - 71 27 70 1890
72 - 74 8 73 584
= 67.45
1. =A+ or = A +
width
d1 = 20020 – 20000 = 20
d2 = 20005 – 20000= 5
d3 = 20008 – 20000= 8
d4 = 19992 – 20000 = -8
d5 = 199987 – 20000 = -13
= 12
Mekelle University 8
Quick reference in statistics
= 20000 + 12
/5 = 20000 + 2.4
= 20002.4
I short method
ii coding method
=A+
= 67 = 3
= 67 + (0.15)3 = 67 = 0.45 = 67.45
Mekelle University 9
Quick reference in statistics
Example If the mean results of scores of three classes were 79, 74, 82
with sizes 32, 25 and 17 respectively , then the mean result of the
scores of the students
= 78
4. If each value is multiplied by a constant number C, then CX, will be the mean for the
new data.
The median
Definition: - The median of a set of numbers arranged in an array is the middle value or
the arithmetic mean of the two middle values
i.e observation, …………………if n=add
……………………….if n is even
Mekelle University 10
Quick reference in statistics
X = = = th
= 4th observation = 15
N=8=even
X = = = = = 26
X=L+
Salary fi
= 50, therefore the median class is 66-68
60-62 5
because the 50th observation is found in the
63-65 18 class.
66-68 42
X = 65.5+ = 65.5 +
69-71 27
72-74 8
X=65.5 + = 65.5+ (0.64)3 = 65.5 + 1.93 = 67.43
Mekelle University 11
Quick reference in statistics
The Mode( )
Definition:- the mode(s) of n values is the value with the highest frequency ( it is most
frequent value).
E.g what is the mode of the values, 8, 3, 2, 3, 4, 7, 3
Soln. 8, 3, 3, 3, 2, 4, 7
=> the mode is 3 because it is the most frequent value.
=L+ C
Mekelle University 12
Quick reference in statistics
Classes fi
42-50 3 The next lower class (fl)
51-59 4
60-68 4 Modal class (fm)
69-77 9
78-86 2 Next higher class (fh)
87-95 3
= 68.5 + 9 = 68.5 + 9
Measures of Variation
= =
Observation:- Even though the two sets of data have the same arithmetic mean, the
values in i are more scattered or dispersed than that of ii.
Mekelle University 13
Quick reference in statistics
Or
Or
Mekelle University 14
Quick reference in statistics
V = S2
Example 1 Compute Range, MD, Standard deviation and variance of the following data.
2, 3, 6, 8, 11
Solution.i Range (R)=Max – Min = 11-2 = 9
iv. S= =
= = = 3.29
v. V = S2 = 10.8
Example 2 Consider the table below and compute the Range, MD, S and V
Classes fi Xi A fi di ui Ui2 fixi fiui fiui2
2–6 1 4 14 9 -10 -2 4 4 -2 4
7 – 11 4 9 14 16 -5 -1 1 36 -4 4
12 – 16 2 14 14 2 0 0 0 28 0 0
17 – 21 2 19 14 12 5 1 1 38 2 2
22 - 26 1 24 14 11 10 2 4 24 2 4
Total 10 50 xi- A 130 -2 14
Solution = = 13
Mekelle University 15
Quick reference in statistics
iii. S= C =5 =5 =5 = 5.83
iv. V = S2 =(5.83)2 = 34
S2 =
R.R =
C.V =
C.M.D =
Mekelle University 16
Quick reference in statistics
Example Refer to the results obtained in the above table and compute the relative
variations.
Solution
Solution
i. Z = = = 1 (above the mean)
Uses of Z-scores
1. Make comparisons in performance
Example Suppose a student scores 90 in statistics exam with mean 75 and standard
deviation 7.5. The same student scored 85 in English with mean 72 and standard
deviation 6. In which course has the student done better?
Solution
Mekelle University 17
Quick reference in statistics
=> Yi =
Example Convert X:7, 9, 1, 11, 13 into another distribution Y with = 12 and Sy = 14
=> Minimum
=> Minimum
Mekelle University 18
Quick reference in statistics
Therefore b =
and a= -b
Therefore Y’ = - b( - X) is the equation of the regression line of Y on X.
If we assume X to be dependent and Y independent we will have regression equation X
on Y then a0 and b0 are obtained as follows
b0 = a0 = -b
Example
Let X = number of hrs. which some students studied for an exam
Y = Grades out of 100 scored.
The following information are obtained
n = 10 students = 95 = 1121 = 652 = 6996
1. b = = =
b= = 3.67
Mekelle University 19
Quick reference in statistics
r=
Note that -1 r 1
Analysis i. r 1 there is a strong positive relationship.
ii. r -1there is a strong negative linear relationship.
iii. r 0 there is almost no liner relationship
X Y X2 Y2 X-Y
1 2 1 4 2
3 4 9 16 12
4 4 16 16 16
5 8 25 64 40
7 12 49 144 84
Total 20 30 100 244 154
Questions
1. Determine the equation of the regression line of Y on X
2. Compute the value of “ r “ and interpret
Answer
1. X = = 4, Y = = 6.
B= = =
= = = 1.57
Mekelle University 20
Quick reference in statistics
2. r = = = = 0.98
Interpretation: There is a strong positive relation ship between the two variables
Mekelle University 21