0% found this document useful (0 votes)
109 views

Oe Statistics Notes

This document discusses bivariate data analysis and correlation. [1] Bivariate data involves two variables that can be represented as pairs of observations. [2] Correlation measures the strength and direction of the linear relationship between two variables. Positive correlation means the variables increase together, while negative correlation means they change in opposite directions. [3] Methods to quantify correlation include scatter plots and Karl Pearson's correlation coefficient r, which ranges from -1 to 1 to indicate the correlation strength and direction.

Uploaded by

Firdose
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
109 views

Oe Statistics Notes

This document discusses bivariate data analysis and correlation. [1] Bivariate data involves two variables that can be represented as pairs of observations. [2] Correlation measures the strength and direction of the linear relationship between two variables. Positive correlation means the variables increase together, while negative correlation means they change in opposite directions. [3] Methods to quantify correlation include scatter plots and Karl Pearson's correlation coefficient r, which ranges from -1 to 1 to indicate the correlation strength and direction.

Uploaded by

Firdose
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 32

Date : 7.1.

2022
Continuation of Module II: Univariate Data Analysis and Bivariate Data Analysis

Discussion on bivariate data

 Data with 2 variables


In bivariate data analysis our objective is to study the relationship between two variables
Representation of bivariate data
Mathematically bivariate data is represented as ( x1 , y1) , ( x2 , y2), ( x3 , y3) ……………. ( xn, yn)
In other words bivariate data for n units can be written as ( xi ,yi) where i = 1,2,3,4,………..,n
Examples: height and weight, temperature and sales, advertising costs and sales, biology and maths
marks, mid-term and end Sem marks, tv watch hours and mid-term marks
Correlation: It is a statistical tool that helps to measure and analyze the degree of relationship between two variables.
To know the extent or magnitude of the relationship between the two variables concerned – this is the subject matter of
correlation.

Correlation Analysis: We have under consideration only two variables. Let us call them X and Y. Our intuition is that the
two variables are related, and we want to know the extent of their relationship. The variable X and Y could be related in
number of ways; however, we focus on the extend of linear relationship between X and Y. By linear relationship we mean
X and Y such that an equal increase/ decrease in X brings about a proportional increase /decrease in Y. Another way of
understanding the word ‘linear’ is a plot of X and Y on a graph sheet yield points that are more or less around the straight
line. · The measure of correlation called the correlation coefficient · The direction of change is indicated by a sign.

Types of correlation:

Positive Correlation: The correlation is said to be positive correlation if the values of two variables changing with same
direction.

i.e. , if both X and Y moves in the same direction ( ↑ x ↑ y or ↓ x ↓ y ) it is said to be positively correlated.

Ex: Pub. Exp. & sales, Height & weight.

Negative Correlation: The correlation is said to be negative correlation when the values of variables change with opposite
direction.

i.e., if X and Y moves in the opposite direction (↑ x ↓ y or ↓ x ↑ y ) it is said to negatively correlated.

Ex: Price & qty. demanded.

More examples:

Positive relationship:

water consumption and temperature.

study time and grades.

Negative relationships:

alcohol consumption and driving ability.

Note: If the direction is positive the sign of the correlation coefficient is + and if the direction is negative the sign
is negative.

Zero correlation:

If X and Y are not related, then ‘r’ between then will be zero. Under such situation X and Y are not correlated.
Methods of measurement of correlation:
Quantification of the relationship between variables is very essential to take the benefit of study of correlation.
For this, we find there are various methods of measurement of correlation, which can be represented as given
below:

Note : We are only learning SCATTER PLOT and KARL PEARSON’S COEFFICIENT OF
CORRELATION/ PRODUCT MOMENT CORRELATION
Scatter Diagram:

 A scatter plot is used to have a visual representation and understanding of the type of correlation existing
between X and Y in a bivariate data . To construct a scatter, plot the two variables X and Y are taken
along the coordinate’s axis on graph.
 Scatter Diagram is a graph of observed plotted points where each point represents the values of X & Y as
a coordinate. It portrays the relationship between these two variables graphically.
Remark:
If r is calculated using the formula discussed for any bivariate data , we may get a sensible value of ‘r’. However it is wrong t
give any interpretations. Such correlation is termed as non - sense correlation.
Ex: foot wear size of a person and monthly income.
Marks of students and heights of students.
Karl Pearson's Coefficient of Correlation or Product Moment Correlation Coefficient
 It is numerical measure of the linear relationship between two variables (between X and Y).
 The formula was developed by Professor Karl Pearson .
 It is denoted by ‘ r’.
The product moment correlation coefficient between two variables x and y is defined as
cov ( x , y )
r=
σxσ y
where cov ( x, y) is the covariance between x and y.
σ x =standard devation of x σ y =standard devation of y
n
1
cov( x ,y) = ∑ x y -x y
n i=1 i i
Remarks:
1.‘r’ lies between -1 and 1. (-1 ≤ r ≤ 1).
2. ‘r’ is location and scale invariant ( ‘ r’ is independent of change in origin and scale).
1 and 2 are the properties of ‘r’.
Sl .No Pattern of points on the scatter plot Values of r
1 Perfect positive correlation +1
2 Perfect positive correlation -1
3 Strong/ High positive correlation r > 0.7
4 Strong High negative correlation r < - 0.7( means for the values less than – 0.7
ie. -0.84, - 0. 79 and so on )
( these two values are given for your
understanding )
5 Weak/Low positive correlation 0.3 ≤ r ≤ 0.7 ( this inequality means for any value
lying between and equal to 0.3 , 0.7)
6 Weak/Low negative correlation -0.3 ≤ r ≤ -0.7 (this inequality means for any value
lying between and equal to -0.3 , -0.7)

Also , - 0.3 < r < 0.3 : Poor correlation between X and Y .


10.1.2022

Problem1:
Find Karl Pearson’s coefficient of correlation between capital employed and profit obtained from the following
data.
Capital Employed (Rs. In Crore) 10 20 30 40 50 60 70 80 90 100
Profit (Rs. In Crore) 2 4 8 5 10 15 14 20 22 50
Solution: Let us assume that capital employed is variable X and profit is variable Y.
Karl Pearson Correlation coefficient is given by
cov (x , y )
r=
σxσ y
where cov ( x, y) is the covariance between x and y.
σ x =standard devation of x ; σ y =standard devation of y
n
1
cov( x ,y) = ∑ x i y i- x y ;
n i=1

x = 550/10 =55 ; y = 150/10 = 15 ; σ x =¿ 28.72 ; σ y =¿ 13.28


n
1
cov( x ,y) = ( ∑ x y )- x y
n i=1 i i
= ( 11500/10) - ( 55 * 15) = 325

r = cov(x,y) / σ x σ y = 0.85
Here capital employed and profit are strongly positively correlated.
Since r = 0.8519 . There is strong positive relationship between variable X and Y or in other words we can say X
and Y are strongly positively correlated.
COMPUTER:
 Scatter Diagram – select data – insert tab – recommended charts- all charts – scatter diagram – fill up all
the chart elements
 Correlation coefficient - Data tab – Data Analysis – correlation – ok – small dialog box will open – a)
input range – select both the columns including heading/ label
b) data is arranged by column
c) check the box – label in first row
d) select output range - click on the text box next to it and then select an
empty cell on worksheet --- click ok --Identify the correlation coefficient
between two different variables.
Regression
Introduction
In our discussion , we shall learn how to express the relationship mathematically , so that we can predict or
forecast one of the variable if we know the value of the other variable.

Simple linear regression

Let there be two variables under consideration x and y . Generally, x is the independent variable and y is the
dependent variable.

Independent variable :

 The variables that are manipulated are changed and whose effect are measured and compared.
 It helps in predicting or forecasting the values of the dependent variable.
 The independent variable is also known as regressor / predictor / input variable.

Dependent variable:

 The dependent variable refers to that type of variable that measure that affect of independent variable on test
units.
 We can also say that dependent variable are the types of variables that are completely dependent on
independent variable.
 They are variables which can be predicted with the help of independent variable.
 The dependent variable is also known as regressed / response / output variable.

Examples :

a) the annual turnover of a company and the bonus given to the employees

independent variable :the annual turnover of a company

dependent variable: the bonus given to the employees

b) the weight of the mother and the weight of the new born babies

independent variable : weight of the mother

dependent variable: the weight of the new born babies

c) daily sales in an ice cream parlor and the daily temperature

independent variable : daily temperature

dependent variable: daily sales of ice cream


Identify the predictor and response variable:

The gross salary of the lecturers in a college and the income tax deduction

Income tax deduction – response variable ( dependent variable)

The gross salary of the lectures – predictor ( independent variable)

Note:

In any real-life situation, we could have situations involving a number of variables. Many independent variables
may affect another variable.

For example:

The number of units sold on a certain number would dependent on the brand , price , the memory , the features
etc

The Sales of a new 2-wheeler will dependent upon the mileage, the capacity of the engine, the advertisement
campaign and many more.

We are interested in establishing the mathematical relationship between x and y and is only focusing on

the linear relationship.

We are interested in establishing a relationship of the form

y=a +bx

here y is dependent variable ; x in independent variable ; a is intercept ( constant ) ;

b or byx is regression coefficient / slope

Regression equation of y on x is given by

( y - y ) = byx ( x - x )

σy
Where byx = cov(x,y) / σ x 2 = r.
σx

Refer to Previous problem:

a) Obtain the regression equation of y on x.


b) Predict the value of y when x = 120

Solution : carry out the problem till you Cov(x,y) and standard deviations of x and y

Now, Regression equation of y on x is given by

( y - y ) = byx ( x - x )
σy
Where byx = cov(x,y) / σ x 2 = r. ; where x = y=¿
σx

( y – 15 ) = (325/ 28.722 ) ( x – 55)

y – 15= 0.39 ( x – 55)

y – 15 = 0.39 x – 21.45

y = 0.39x – 21.45 + 15

y = - 6.45+ 0.39 x - regression equation of y on x ; y =a+ bx

where intercept = -6.45 and regression coefficient = 0.39

Interpretation of Intercept

If X takes 0 the first value that y takes is -6.45

Interpretation of regression coefficient

One unit change in x is 0.39 change in y

Note: + correlation implies positive slope – positive regression coefficient

b) predict the value of y when x = 120

substitute x = 120 in the regression equation

y = -6.45+ ( 0.39*120) = -6.45 + 46.8 = 40.35 ( write units)

Coefficient of Determination

The percentage of change in dependent variable (Y) is explained by independent variable x.

R2= r2 X 100

For the above problem

r =0.85 ; R2 = r2 X 100 = ( 0.85)2 X 100 = 72 %

72% change in Y (Profit) is explained by X ( Capital Employed)

EXCEL

Mount Carmel College, Autonomous

Department of Statistics and Analytics

Open Electives – Elementary Statistics using Excel


Worksheet – Bivariate Data Analysis – Simple Linear Correlation and Regression

1. For each of the following scatter plot choose the correlation which you think is the most appropriate from
the following:
Correlation is a) weak positive b) weak negative c) strong positive d) strong negative e) Zero

Strong Negative

Weak Negative

No Correlation

Strong Positive

2. Mr. Jhon calculated a correlation coefficient r = 0.95. Comment on the strength and direction of r.
Direction : positive sign indicates- upward direction
Strength : 0.95 - strong
There is a strong positive correlation between the two variables.

3. A teacher of a certain college in Bengaluru would like to examine the relationship between the number of
problems students solved during the semester and their final exam marks. He randomly selects 10 students
for study and asked them to keep track of the number of problems completed during the course of the
semester. At the end of the semester the following data is obtained

Problems Solved 50 5 64 65 7 77 7 84 8 91
8 6 8 5
Final Exam 61 6 66 66 7 73 7 73 7 75
Marks 8 2 2 6
a) Draw the scatter diagram and comment. – ( Either on graphsheet or computer

Final Exam Marks


80
75
70
65
60
55
50
40 50 60 70 80 90 100
( Note : Included chart elements)
There is a strong positive correlation between Number of problem solved and final exam marks

b) Compute the product moment correlation coefficient. ( Find r) – both manually and using computer
Formula writing is must.

Final Exam
  Problems Solved Marks

Problems Solved 1
Final Exam
Marks 0.944391864 1
Here r is 0.94 .
There is a strong positive correlation between Number of problem solved and final exam marks

c) Classify the direction and strength of the correlation.


Since r = 0.94 , direction – positive sign indicates upward direction
Strength – 0.94 , strong
There is a strong positive correlation between Number of problem solved and final exam marks

4. The following table gives the length (in mm) and weight (in gram) of a species of aquarium fish.

Length (X in mm) 8 10 15 17 20 22 24 25
Weight (Y in g) 25 30 32 35 37 40 42 45

a) Compute Karl Pearson’s correlation coefficient. What is the direction and strength of correlation
between the two variables? ( Find r )
  Length (X in mm) Weight (Y in g)

Length (X in mm) 1
Weight (Y in g) 0.982681 1

r = 0.98
Since r = 0.98 , direction – positive sign indicates upward direction
Strength – 0.98 , strong
There is a strong positive correlation between Length and weight

b) Identify the dependent and independent variable.


Independent vaiable -- Length (X) dependent variable – weight ( Y)
c)Predict the weight ( in g ) when the length is 23 ( in mm).

Regression equation ( y - y ) = byx ( x - x ) Solve and get in the form as given below

Y=a+bX ; Y = 17.53 + 1.03 X


  Coefficients
Intercept 17.53036
Length (X in
mm) 1.033738

Prediction Y when X = 23
Y = 17.53 + 1.03 *( 23) = 41.22 mm
d) Find coefficient of determination and give suitable interpretation.
R2 = r2 X 100 = 96.56% ; 96.56 % of change in Y( weight ) is influenced by the change in X(length)
R 0.96566
Square 3

5. From the following data on increase in temperature ( X in o C) and the mortality rate ( Y in number
of death per 1000 ) of species of bug.

X 20 32 43 44 39 46

Y 42 72 50 90 45 48

a) Find the product moment correlation coefficient between X and Y and interpret the result.
b) Obtain the regression equation of Y on X and predict the mortality rate ( Y in number
of death per 1000 ) when temperature ( X in o C) is 45.
c) Find coefficient of determination and give suitable interpretation

6. In a bivariate data, on x and y the two variances are 39 and 9. The covariance is -14.5. Find the coefficient
of correlation.
cov ( x , y )
r=
σxσ y

7. In a bivariate data, with 10 observations on x and y, ∑x = 56, ∑y = 138, ∑x2 = 1357, ∑y2 =2136, and
∑xy= 836. Find the coefficient of correlation.
cov (x , y )
r=
σxσ y
where cov ( x, y) is the covariance between x and y.
σ x =standard devation of x ; σ y =standard devation of y
n
1
cov( x ,y) = ∑ x y -x y
n i=1 i i
8. In a bivariate data, with 10 observations on x and y, ∑x = 10, ∑y = 210, ∑x2 = 14, ∑y2=5340, and
∑xy= 180. Find the coefficient of correlation .
*********************

Classical definition of Probability


Let a trial results in ‘N’ mutually exclusive, equally likely and exhaustive outcomes of which
‘m’ are favorable to the occurrence of event E. Then
Number of Favourable cases m
P(E) = Total number of Exhaustive cases = N
Note: a) 0 ≤ P(E) ≤ 1 - make a note
b) P( E) = Probability of success
P(Ec) or P(E’)= Probability of failure
P(E) + P(Ec) = 1
A : rain today
P(A) = 0.5 – there is 50 % of chance that it will rain today
P(A) = 0 - there is no chance that it will rain
P(A) = 1 - there is 100 % chance that it will rain
Problem 1: A uniform dice is thrown at random. Find the probability that the number on it is
a) 5 b) greater than 4 c) even
Solution:
Random Experiment: throwing a dice
Sample Space S={1,2,3,4,5,6}
Number of Exhaustive cases= 6
a) A: getting number 5

Favourable event : {5}


Number of Favourable cases= 1
Number of Favourable cases
P(A) = Total number of Exhaustive cases

1
P(A)= 6 =0.1667

b) B: getting number greater than 4


Favourable events : {5,6}
Number of Favourable cases= 2
Number of Favourable cases
P(B)= Total number of Exhaustive cases

2
P(B ) = 6 =0.3333

c) C: getting even number


Favourable events : {2,4,6}
Number of Favourable cases= 3
Number of Favourable cases
P(C)= Total number of Exhaustive cases

3
P(C) = 6 =0.5

Problem2: In a single throw with two uniform die. Find the probability of throwing sum of the
die is
a) five b) eight
Solution : Random experiment: Throwing two die
Sample space S= { (1, 1), (1, 2), ( 1, 3), (1, 4), (1, 5), (1, 6)
(2, 1), (2, 2), (2, 3), (2, 4), ( 2, 5), ( 2,6)
(3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3,6)
(4, 1), (4, 2), (4, 3), (4, 4), (4, 5), (4,6)
(5, 1), (5, 2), (5, 3), (5, 4), (5, 5), (5,6)
(6, 1), (6, 2), (6, 3), (6, 4), (6, 5), (6,6) }
Exhaustive cases = 36

a) E1 : getting sum of number as 5

Favourable events: { (1,4), (2,3),(3,2),(4,1)}


Number of favorable cases: 4
Number of Favourable cases
P(E1) = Total number of Exhaustive cases

4
= 36 = 0.111

b) E2 : getting sum of number as 8

Favourable events: { (2,6), (3,5),(4,4),(5,3) and (6,2)}


Number of favorable cases: 5
Number of Favourable cases 5
P(E1) = Total number of Exhaustive cases = 36 = 0.1383

c) E3 : getting sum as multiple of 5

Favourable events:{ (1,4),( 2,3),( 3,2), (4,1), (4,6),(5,5),(6,4)}


Number of favourable cases: 7
Number of Favourable cases 7
P(E3) = Total number of Exhaustive cases = 36
d) E4: getting sum as number less than 5
Favourable events: { (1,1) ,(1,2),(1,3),(2,1),(2,2) and ( 3,1)}
Favourable cases = 6
Number of Favourable cases 6 1
P(E3) = Total number of Exhaustive cases = 36 = 6

Important concepts:
Combination: A combination is simply a manner of selecting some objects from a given set of objects in such a way
that the order of their selection doesn’t matter.
A bag is full of 4 green marbles and 5 blue marbles

GREEN BLUE
1. Out of 4 green marble, you are selecting 1 green marble = 4 C1
2. Out of 4 green marbles, you are selecting 2 green marbles = 4 C2
3. Out of 5 blue marbles, you are selecting 3 blue marbles = 5 C3

n
Cr 4
C4 = 1

0! =1
1! = 1
2! = 2 x 1!= 2 X 1 =2
3! = 3 X 2!= 3 x 2 x 1 =6
4! = 4X 3!= 4 X 3 X2! =4 x 3 x 2 x 1 =24
5! = 5 X 4! = 5 x 4 x 3 x 2 x 1 =120
….10!

Formula
n!
n
Cr =
( n−r ) ! r !
For example:
4! 4! 4 X3 X 2 X 1 24
1. 4
C1 = = = =¿ = 4
( 4−1 ) ! 1! 3! 1 !3 X 2 X 1 X1 6
10 ! 10 ! 10 X 9 X 8! 10 X 9
2. 10C8 = = = = 2 X 1 = 5X 9 = 45
( 10−8 ) ! 8 ! 2! 8 ! 2!8! ¿
¿

Using calculator:
Steps: Enter the expression in given format nCr
Consider the example 5C2

Enter the number 5 -> Shift -> -> 2

Problem 3: A card is drawn at a random from a pack of playing cards, what is the probability of getting
a) an Ace b) a king c) a red card d) a club

Solution: Random experiment: Picking a card from a pack of playing cards


Exhaustive cases : 52C1 = 52!/ (52-1)! 1!= 52! /(51! X 1!)
= 52 X 51! / (51! X 1!) = 52/1 = 52
a) E1 : getting an ace
There are 4 aces but we need to draw only one card
Favourable cases : 4C1 = 4

Number of Favourable cases


P(E1) =
Total number of Exhaustive cases
= 4C1 / 52C1
4
=
52

b) E2 : getting a king
Favourable cases : 4C1 = 4
Number of Favourable cases
P(E2) =
Total number of Exhaustive cases
= 4C1 / 52C1
4
=
52

c) E3: getting a red card


Favourable cases : 26C1 = 26
Number of Favourable cases
P(E3) =
Total number of Exhaustive cases
= 26C1 / 52C1
26
=
52

d) E4: getting a club


Favourable cases : 13C1 = 13
Number of Favourable cases
P(E4) =
Total number of Exhaustive cases
= 13C1 / 52C1
13
=
52
Problem 4: Three cards are drawn at a random from a pack of playing cards, what are the probability of getting
all cards as
a) queen b) diamond c) one queen and two king
Solution: Random experiment: Picking three card from a pack of playing cards
52! 52 ! 52 X 51 X 50 X 49 ! 52 X 51 X 50
Exhaustive cases : 52C3 =
( 52−3 ) ! 3 ! 49 ! 3 ! = 49 ! 3 X 2 X 1
= =
6
=

a) A1 : getting 3 queens
Favourable cases : 4C3 = 4
Number of Favourable cases
P(A1) =
Total number of Exhaustive cases
= 4C3 / 52C3

b)A2 : getting 3 diamond


Favourable cases : 13C3

Number of Favourable cases


P(A2) =
Total number of Exhaustive cases
= 13C3 / 52C3
= 286/22100

=0.0129 0≤ P(A)≤1

A) A3 : one queen and two king


Favourable cases : 4C1 X 4C2 Hint : AND X(multiply)
Number of Favourable cases
P(A3) =
Total number of Exhaustive cases
= 4C1 X 4C2 / 52C3 = 4 X6 = 24

Problem 5. A bag has 2 white balls and 4 Red balls. One ball is picked at random. Find the probability of getting
a) White b) Red
Solution: Random experiment: Picking a ball from a bag
Total number of balls = 2 white+ 4 Red = 6 balls
Exhaustive cases : 6C1 = 6
a) B1 : getting a white ball
Favourable cases : 2C1 = 2
Number of Favourable cases
P(E1) =
Total number of Exhaustive cases
= 2C1 / 6C1
2
= = 0.333
6

b) B2 : getting a Red ball


Favourable cases : 4C1 = 4
Number of Favourable cases
P(B2) =
Total number of Exhaustive cases
= 4C1 / 6C1
4
= = 0.66
6
Problem 6: A computer lab consist of 10 computers, out of which 6 computers have internet connection and
remaining doesn’t have internet connection. Find the probability that randomly selected 2 computers
a) have internet connection b) doesn’t have internet connection

Solution: Random experiment: Selecting two computers from a lab


Exhaustive cases : 10C2
a) C1 : getting computer with internet connection
Favourable cases : 6C2
Number of Favourable cases
P(C1) =
Total number of Exhaustive cases
= 6C2 / 10C2 = 15 / 45 = 0.33
= simplify
Note: answer should be in decimal and conclusion to be written in terms of percentage

b) C2 : getting computer with no internet connection


Favourable cases : 4C2
Number of Favourable cases
P(C2) =
Total number of Exhaustive cases
= 4C2 / 10C2
= 6/45 = 0.1333
There is 13.33% of chance of getting two computers without internet connection from a lab of 10
computers
Problem 7. A bag has 4 white balls and 16 Red balls. Two ball are picked at random. Find the probability of
getting one white and one red ball
Solution: Random experiment: Picking a ball from a bag
Total number of balls = 4 white+ 16 Red = 20 balls
Exhaustive cases : 20C2
a) E : getting a white ball and a red ball
Favourable cases : 4C1 X 16C1 - white and red balls are mutually exclusive

Number of Favourable cases


P(E) =
Total number of Exhaustive cases
= 4C1 X 16C1 / 20C2
Axiomatic definition of probability
Let S be sample space and A be any event in sample space, then the function P(A) is called probability of event
A, if it satisfy three axioms
i) P(A) ≥ 0
ii) P(S) = 1
iii) If A1,A2,….An are n mutually exclusive events then
P(A1 ∪A2 ∪….∪An) = P(A1 ) + P(A2)+ ….+P(An)

Note :P( A ) + P( Ac) = P( S) =1


Addition Theorem of Probability
Statement:
Let A and B are two non mutually exclusive events then P(A ∪ B) = P(A) + P(B) – P(A ∩ B)

keywords: either of the event , A or B, atleast one of the event ---- P(A ∪ B)
Example: Suppose an experiment has the outcomes 1, 2, 3, … , 12 where each outcome has an equal chance of
occurring.
Let event A = number less than & equal to 6 - {1, 2, 3, 4, 5, 6} and
event B = number more than & equal to 6 - {6, 7, 8, 9}.
 A intersect B = A ∩ B =  A and B = { 6 } Here A and B are non- mutually exclusive events
 A union B = A ∪ B = A or B = A either B = Atleast A or B = 1,2,3,4,5,6,7,8,9
The Venn diagram is as follows:

P( E1) = fav cases/ exhaustive


P( A U B) = P (A) + P(B) – P( A ∩ B )
= 6/12 + 4/ 12 – 1/12 = 9/12 = 3/4 =0.75

Problem1: A card is drawn from pack of playing card. What is the probability of getting
a)a diamond or king
b) a red or queen
(Or is the keywords – P( A U B))
Solution: Random experiment: A card is drawn
Exhaustive cases: 52C1
a) A: getting a diamond
B: getting a king
Here A and B are non mutually exclusive event.
Because there is one sample point common in both event i.e diamond’s king/ king of diamond card
P( diamond or king) = P(A ∪ B) = P(A) + P(B) – P(A ∩ B) - using addition theorem
= (13C1/52C1) + (4C1/52C1) -(1C1/52C1)
= (13/52) +(4/52) – (1/52)
= 16/52
= 4/13
b) A: getting a red
B: getting a queen
Here A and B are non mutually exclusive event.
Because there are two sample points common in both event i.e queens belongs to red card

P( red or queen ) = P(A ∪ B) = P(A) + P(B) – P(A ∩ B)


= (26C1/52C1) + (4C1/52C1) -(2C1/52C1)
= (26/52) +(4/52) – (2/52)
= 28/52
= 7/13

Note : If A and B are two mutually exclusive events then P(A ∪ B ) = P(A) + P(B)
example:

Problem1: A card is drawn from pack of playing card. What is the probability of getting
a)a queen or king
b) a red or black
a) A: getting a queen
B: getting a king
Here A and B are mutually exclusive event.
Because there is no sample point common in both events

P( queen or king) = P(A ∪ B) = P(A) + P(B)


= (4C1/52C1) + (4C1/52C1)
= (4/52) +(4/52)
= 8/52
b) A: getting a red
B: getting a black
Here A and B are mutually exclusive event.
Because there is no sample point common in both event

P( red or black) = P(A ∪ B) = P(A) + P(B)


= (26C1/52C1) + (26C1/52C1)
= (26/52) +(26/52)
= 52/52
=1

Using classical definition ( Different variety)


Problem 1: Three cards are drawn at a random from a pack of playing cards, what are the probability of getting
b) 1 queen, 1 king and 1 jack b) 2 diamond and 1 heart c) 1 black and 2 red

Solution: Random experiment: Picking three card from a pack of playing cards
Exhaustive cases : 52C3
b) A1 : getting 1 queen, 1 king and 1 jack
Favourable cases : 4C1 X 4C1 X 4C1

Number of Favourable cases


P(A1) =
Total number of Exhaustive cases
= (4C1 X 4C1 X 4C1) / 52C3

b)A2 : 2 diamond and 1 heart


Favourable cases : 13C2 X 13
C1

Number of Favourable cases


P(A2) =
Total number of Exhaustive cases
= (13C2 X C1)/ 52C3
13

c) A 3: 1 black and 2 red card


Favourable cases : 26C1 X 26C2
Number of Favourable cases
P(A2) =
Total number of Exhaustive cases

P(A2)= (26C1 X 26C2) / 52C3

Problem 5. A bag has 2 white balls and 4 Red balls. Three balls are picked at random. Find the probability of
getting
a) 1 White and 2 Red b) 1 Red and 2 white
Solution: Random experiment: Picking a ball from a bag
Exhaustive cases : 6C3
c) B1 : 1 White and 2 Red
Favourable cases : 2C1X 4C2

Number of Favourable cases


P(E1) =
Total number of Exhaustive cases
= (2C1X 4C2)/ 6C3
=
d) B2 : 2 Red and 1 white
Favourable cases : 2C2X 4C1
Number of Favourable cases
P(B2) =
Total number of Exhaustive cases
= (2C2X 4C1 ) / 6C3
Independent events
Two events are said to be independent if the occurrence of one does not affect the occurrence of other in a series
of trial.
Example1 : A: getting tail in first toss
B: getting head in second toss
Example 2: Two cards are drawn one after the other( successively) with replacement.
A : getting queen in first draw 4
C1
B: getting queen in second draw 4
C1
Dependent event
Two events are said to be dependent if the occurrence of one affect the occurrence of other in a series of trial.
Example 1: Two cards are drawn one after the other( successively) without replacement.
A : getting queen in first draw 4
C1
B: getting queen in second draw 3
C1

Multiplication theorem of probability( AND law i.e intersection)


Let A and B be two dependent events in the same sample space S
P(A ∩ B) = P(A) . P(B| A) ; P(A)>0 ( Remark: here event B is depend on event A)
{Or P( B|A) = P(A ∩ B) / P( A) – conditional probability}
( “|” given) i. e P(B|A) probability of B given A i.e given A refers to happening of A
A : first draw
B: second draw
or ( Use this formula when A and B are dependent event ie without
replacement)

P(B ∩ A) = P(A ∩ B) =P(B) . P(A| B) ; P(B)>0


Where P( A|B) = P(A ∩ B) / P( B)
(Remark: here event A is depend on event B)
B: first draw
A: second draw
Note: If A and B are independent event then P(A ∩ B) = P(A) . P(B) ( with replacement)
A: first draw
B: second draw
Addition theorem - P( A union B) -P( A or B) - P( A U B) – non mutually or mutually exclusive events
Multiplication theorem – P( A intersection B) – P( A and B) – P( A∩ B) – dependent or independent
events
Problem 1: Two cards are drawn successively (one after the other) from pack of playing cards.
Find the probability of getting two jacks( jack 1 in first draw and jack in 2nd draw) under the following
conditions
i) with replacement
ii) without replacement

Solution:
Random experiment: Drawing two cards successively
i) With replacement
A: getting jack in first draw P(A) = 4C1/52C1
B: getting jack in second draw P(B) = 4C1/52C1

P( a getting jack in first draw and getting jack in second draw)


( here A and B are independent events)
P(A ∩ B) = P(A) . P(B)
= (4C1/52C1 ) . (4C1/52C1)

ii) Without replacement


A: getting jack in first draw P(A) = 4C1/52C1
B: getting jack in second draw P(B|A) = 3C1/51C1
(here B is dependent on A)

P( a getting jack in first draw and getting jack in second draw)


( here B is dependent on A)
P(A ∩ B) = P(A) . P(B|A)
= (4C1/52C1 ) . (3C1/51C1)

Problem 2: Two cards are drawn successively (one after the other) from pack of playing cards.
Find the probability of getting two red cards under the following conditions
i) with replacement
ii) without replacement

Random experiment: Drawing two cards successively drawn


i) With replacement
A: getting red in first draw P(A) = 26C1/52C1
B: getting red in second draw P(B) = 26C1/52C1

P( a getting red card in first draw and getting red card in second draw)
( here A and B are independent)
P(A ∩ B) = P(A) . P(B)
= (26C1/52C1 ) . (26C1/52C1)

ii) Without replacement


A: getting red card in first draw P(A) = 26C1/52C1
B: getting red card in second draw P(B|A) = 25C1/51C1
( here B is dependent on A)
P( a getting red in first draw and getting red in second draw)
( here B is dependent on A)
P(A ∩ B) = P(A) . P(B|A)
= (26C1/52C1 ) . (25C1/51C1)

Problem 3: Two cards are drawn successively (one after the other) from pack of playing cards.
Find the probability of getting red card in first draw and black card in the second draw under the following
conditions
i) with replacement
ii) without replacement

Solution : Random experiment: Drawing two cards successively


i) With replacement
A: getting red in first draw P(A) = 26C1/52C1
B: getting black in second draw P(B) = 26C1/52C1

P( a getting red card in first draw and getting black card in second draw)
( here A and B are independent)
P(A ∩ B) = P(A) . P(B)
= (26C1/52C1 ) . (26C1/52C1)

ii) Without replacement


A: getting red card in first draw P(A) = 26C1/52C1
B: getting black card in second draw P(B|A) = 26C1/51C1
( here B is dependent on A)

P( a getting red in first draw and getting black in second draw)


( here B is dependent on A)
P(A ∩ B) = P(A) . P(B|A)
= (26C1/52C1 ) . (26C1/51C1)
Problem 4: Two cards are drawn successively (one after the other) from pack of playing cards.
Find the probability of getting king in first draw and getting number divisible by 5 in the second draw under the
following conditions
i) with replacement
ii) without replacement

Solution : Random experiment: Drawing two cards successively


i) With replacement
A: getting king in first draw P(A) = 4C1/52C1
B: getting number divisible by 5 in second draw P(B) = 8C1/52C1
{ S5, S10, C5, C10, D5, D10, H5, H10}
P( a getting king card in first draw and getting number divisible by 5 in second draw)
( here A and B are independent)
P(A ∩ B) = P(A) . P(B)
= (4C1/52C1) . (8C1/52C1)

ii) Without replacement


A: getting king in first draw P(A) = 4C1/52C1
B: getting number divisible by 5 in second draw P(B|A) = 8C1/51C1
{ S5, S10, C5, C10, D5, D10, H5, H10}
( here B is dependent on A)

P(a getting king card in first draw and getting number divisible by 5 in second draw)
( here B is dependent on A)
P(A ∩ B) = P(A) . P(B|A)
= (4C1/52C1 ) . (8C1/51C1)
Conditional Probability ( Refer multiplication theorem)
The conditional probability of occurrence of event A given event B has already occurred is defined as
P(A|B) = P(A ∩ B) / P(B) ; P(B)>0
The conditional probability of occurrence of event B given event A has already occurred is defined as
P(B|A) = P(A ∩ B) / P(A) ; P(A)>0

Problem1 : A sample of 500 respondents were selected in a large metropolitan area to study consumer behavior
with the following results

Enjoys Shopping for clothing Male Female Total


Yes 136 224 360
No 104 36 140
Total 240 260 500

a) Suppose the respondent chosen is female. What is the probability that she does not enjoy shopping for
clothing?
b) Suppose the respondent chosen enjoys shopping for clothing. What is the probability that the individual is
male?
c) Are enjoying shopping for clothing and gender of the individual independent?
Solution : Random Experiment : Selecting a respondent in a survey
a) A: Female respondent
B : Respondent don’t enjoy shopping
The conditional probability of respondent doesn’t enjoy shopping (B) given respondent is female(A) is given by
P(B|A) = P(A ∩ B) / P(A) ; P(A)>0
a)Method 1 : P(A ∩ B) = 36 / 500 and P( A) = 260/500
36/500
P(B|A) = P(A ∩ B) / P(A) = = 224/260 =
260/500
b)Direct Method - P( B | A ) = 36/260 =

a) A: Respondent enjoy shopping


B : Male respondent
The conditional probability of respondent is male (B) given respondent enjoy shopping (A) is given by
P(B|A) = P(A ∩ B) / P(A) ; P(A)>0
a)Method 1 : P(A ∩ B) = 136 / 500 and P( A) = 360 /500
136/500
P(B|A) = P(A ∩ B) / P(A) = = 136/360
360/500
b)Direct Method - P( B | A ) = 136/360 =

Bayes’ Theorem
Suppose E1, E2, ….En are mutually exclusive events. Let A be any event related which occurs along with all of Ei’s
then
P ( A|Ei ) P( Ei) E3
P( Ei| A) = E2
P ( A|E 1 ) . P ( E 1 ) + P ( A|E 2 ) . P ( E 2 )+ … … ..+ P ( A|En ) . P( En) E4
E1
A
E5
En
E6
E7

Problem 1: A company has two machines to produce CDs, Machine M1 produces 45% of the production and
machine M2 produces 55 %. The defective rates for machine M1 is 8% and for machine M2 is 10%
a) What is the probability that an item selected at random is defective?
E2
b) If a defective item is drawn, what is the probability that it is from machine M1? E1
A
Solution:
E1 = CDs produced by machine M1
E2 = CDs produced by machine M2
Here E1 and E2 are mutually exclusive events
Let A be the event of defective item produced
P(E1) = 45% =0.45 ; P(E2) = 55% =0.55
P(A|E1) = 8% =0.08 i.e probability of item being defective from machine M1
P(A|E2) = 10% =0.10 i. e probability of item being defective from machine M2

a) What is the probability that an item selected at random is defective?


P(A) = The probability of defective item produced

Law of total probability - ( denominator of Bayes’theorem)


P(A) =P{(A∩ E1) OR (A ∩ E2)}
=P(A∩ E1) +P (A ∩ E2)
| As the events are mutually exclusive

(A∩ E1) = P(A|E1) P(E1) =0.08 * 0.45 =0.036 ( use multiplication theorem)
(A∩ E2) = P(A|E2) P(E2) = 0.1* 0.55 =0.055

P(A) =P(A∩ E1) +P (A ∩ E2) = P(A|E1) P(E1) + P(A|E2) P(E2)


=0.036+0.055
=0.091
b) If a defective item is drawn, what is the probability that it is from machine M1?
P( E1|A)= Probability of CD drawn from machine 1 given or known that is it a defective one.
From Bayes theorem we have
P ( A|Ei ) P( Ei)
P( Ei| A) =
P ( A|E 1 ) . P ( E 1 ) + P ( A|E 2 ) . P ( E 2 )+ … … ..+ P ( A|En ) . P( En)
P ( A| E1 ) P(E 1)
In this problem, P( E1| A) =
P ( A|E 1 ) . P ( E 1 ) + P ( A|E 2 ) . P ( E 2 )
0.08∗.14 0.08∗.14
= = = 0.39
( 0.08∗0.48 )+(0.1∗0.55) 0.091
If a defective item is drawn, there is 39% of chance that is produced from machine M1.
Problem 2: There are 3 machines producing 10000, 20000 and 30000 bullets per hour
respectively. These machines are known to produce 5%,4% and 2%  defective bullets respectively. One bullet
is taken at random from an   hour’s production of the three machines. What is the probability that it is defective?
If the drawn bullet is defective, what is the probability that this  was produced by the  third machine?
Solution: E1 = Bullet produced by machine I E2 = Bullet produced by machine II
E3 = Bullet produced by machine III
Here E1,E2 and E3 are mutually exclusive
P(E1) = 10000/60000 = 1/6 P(E2) = 20000/60000=1/3 P(E3) = 30000/60000=1/2
Let A be the event of defective bullet produced
P(A|E1) = 5% =0.05 P(A|E2) = 4% =0.04 P(A|E3) = 2% =0.02
a) P(A) = probability of bullet drawn is defective
P(A) =P{(A∩ E1) OR (A ∩ E2) OR (A ∩ E3)}}
=P(A∩ E1) +P (A ∩ E2) +P (A ∩ E3)
= P(A|E1) P(E1) + P(A|E2) P(E2) + P(A|E3) P(E3) - (using multiplication theorem)
=( 0.05*1/6) +( 0.04 *1/3) +(0.02* ½) =
| As the events are mutually exclusive
b) If the drawn bullet is defective, what is the probability that the bullet drawn
was produced by the third machine?

P(E3| A)=? Probability of bullet drawn from machine 3 given that it is defective
From Bayes theorem we have
P ( A|E 3 ) P( E 3)
P( E3| A) = = 0.316
P ( A|E 1 ) . P ( E 1 ) + P ( A|E 2 ) . P ( E 2 )+ P ( A|E 3 ) P(E 3)
(Substitute and solve the problem)
Homework
Problem 3: In a certain assembly plant three machines A, B and C make 30%, 45% and 25% of the products
respectively. It is known from past experience that 2%, 3% and 2% of the products made by
each machines respectively are defective. Suppose a finished product is randomly selected a) what is the
probability that it is defective b) if a product was chosen randomly and found to be defective what is the
probability that it was made by machine C? 
Solution :
E1 : products produced by machine A ; P( E1) = 30% = 0.3
E2 : products produced by machine B; P( E2) = 45% = 0.45
E3 : products produced by machine C ; P( E3) = 25 % = 0.25

Let D be the defective product


P( D| E1) = 2%= 0.02
P( D|E2) = 3% = 0.03
P( D| E3) = 2% =0.02
 
a) what is the probability that it is defective ?

Solution : law of total probability


P( D) = P[ ( D∩ E1) or( D∩ E2)or ( D∩ E3)}
P( D) = P( D∩ E1) + P ( D∩ E2)+ P ( D∩ E3)
P( D) = P( D| E1) P( E1) + P( D| E2) P( E2) + P( D| E3) P( E3) = (0.02*0.3 )+ (0.03*0.45)+ (0.02*0.25)
= 0.0245
P ( D| E3 ) P(E 3)
b ) P( E3| D) = = ( 0.02*0.25) /0.0245 =0.005/0.0245
P ( D|E 1 ) . P ( E 1 ) + P ( D|E 2 ) . P ( E 2 ) + P ( D|E 3 ) P (E 3)
=0.2040
OPEN ELECTIVE
ELEMENTARY STATISTICS USING EXCEL
QUIZ – 2
MEASURE OF UNIVARIATE AND BIVARIATE DATA ANALYSIS
20 MINUTES 10
MARKS

1. Which average will you prefer in the following situation? "To obtain the average speed of Joe when he drives a car at 20
mph for the half of the journey and 30 mph for second half."(0.5 Points)
a. Arithmetic Mean
b. Geometric Mean
c. Harmonic Mean

2. What is the strength and direction in a scatter plot indicated below? (2 points)

3. The given stem and leaf plot displays the number of visitors in Mall during a month. What is median number visitors? (1
Point)
0 123 5 6 7
1 579
2 1134 55 6 6 6 6 7 7 7 8 9 9
3 0057
2 The life in hours) of 15 bulbs of a certain brand were collected and it was found that Q1=900hours and D8 =
1500hours. Interpret Q1 and D8 in the given scenario. (2 points)

3 The following boxplot shows the ages of students in School. Comment on the skewness. (0.5 points)

4 A data is collected on increase in temperature (x) and the sales of ice cream (y). it was found that correlation
coefficient (r) between temperature and sales is 0.85 and regression equation is Y= 2.5+0.7x
Answer the following
a. Identify independent and dependent variable.
b. Predict y when x = 40
c. Find coefficient of determination. (3 points)

5 If the coefficient of variation of product A is 17% and if the cofficient of variation of product B is 20%. Then
we can conclude that product A is ________ than product B. (1 point)
a. More consistent
b. Less consistent

Mount Carmel College, Autonomous, Bengaluru


Department of Statistics and Analytics
Open Electives – Elementary Statistics using Excel
Worksheet – Measures of Central Tendency

Date

1. The temperature (o C) in Himachal Pradesh on 5 days during Winter Season


-3 -8 0 -11 2
What is the median temperature? -3

2. Find the mode of following and comment on distribution


i) -1 -3 0 2 0 3 0 – 0
ii) 5 6 6 7 8 10 9 7 - bimodal ( 6 and 7)
iii) 2 3 3 3 5 2 5 2 7 5 - multimodal ( 2 3 5)

3. What is geometric mean of 12 14 20 0 25? 0

4. The given stem and leaf plot displays the number of birds visitng a Feeder in a
day

n = 29 median = 15th obs


i) What was the median number of birds visiting the feeder each day? 25
ii) What is maximum number birds visiting the feeder each day? 26 - mode

5. The front row in a movie theatre has 23 seats. If you were asked to sit in the seat that occupied
the median position, in which seat would you have to sit? 12th seat

6. The mean weight of five complete computer stations is 167.2 pounds. The weights of four of
the computer stations are 158.4 pounds, 162.8 pounds, 165 pounds, and 178.2 pounds
respectively. What is the weight of the fifth computer station?
Mean = Sum of weight five computer / 5
5 * mean – ( weights of four computers) = fifth computer weight = 172.6 pounds
7. The mean salary of 20 women employees is 15 ( in 1000’s) and the mean salary of 30 male
employees is 25 ( in 1000’s). Then the average salary of women and men employees is
______________( in 1000’s). Combined mean = ( 20*15 +30* 25 ) / ( 20+30)

8. If the mean pocket expenses of a I B.Sc student for June, July and August is Rs 500. If the
pocket expenses for the month September and October is 350 and 400 respectively. What
would be the mean expenses of student in five months?
Method 1 : Mean of ( June, July , Aug) = Sum of J, J A / 3
500 = sum of J J A /3
Sum of J J A = 1500

Mean of Five months = ( sum of J J A + S + O ) / 5


= ( 1500+ 350+ 400 ) / 5 =
Method 2 : mean 1 = 500 ; months =3
Mean 2 = ( S+ O ) / 2 = 350 + 400 / 2 = 750/ 2 = 375 ; 2 months
Combined mean = ( 500 * 3 + 375 *2 ) / (3+2) =

9. The following data represent the number of pop-up advertisements received by 10 families
during the past month.

43 37 35 30 41 23 33 31 06 21

i) Calculate the mean, median and mode number of advertisements received by each family
during the month.
ii) Which measure of central tendency provides best measure of center for this data? Explain.
Median ( because data has an outlier ( i.e 6 ) and there is no mode)

10. The following are the life (in hours) of 15 bulbs of a certain brand. Find the values of the three
quartiles, P68 and D4.
850 997 878 900 902 730 900 950 975 1000 1050 750 885 1100 1075

Interpret your answers. What can you say about the values of the 40th percentile and 5th decile?

11. A person invested certain amount in Reliance stock which gained 10% in the 1st Year, 20%
in the 2nd Year and 30% in the 3rd Year. What is the average rate of gain?
- Geometric Mean ( rate %)

12. What would be the average speed of swimmer if he swims first 5 min. at 15km/hr and
another 5 min. at 10km/hr ? Harmonic mean ( data is Km/Hr)

13. A man drove car from home to his office at a speed of 55 km per hour and returned home at a
speed of 80 km per hour. What was my average speed for the whole trip? Harmonic mean ( data is Km/Hr)
Mount Carmel College, Autonomous, Bengaluru
Department of Statistics and Analytics
Open Electives – Elementary Statistics using Excel
Worksheet – Measures of Dispersion

Date

1. Answer the following:


a) For the data set 23, 23, 23, 23, 23 the standard deviation value is ____0_____________.
b) If the coefficient of variation of Player A is 15.5% and Player B is 27.4% then we can
conclude that Player A is ______more consistent_________ than Player B.
c) The following box-plot shows the ages of students in School.

i) What is the age of the youngest student? 8 ii) What is the age of the oldest student? 17
 iii) What is the range of the data? 9 iv) What is the upper quartile? 14 Q3
v) What is inter-quartile range? 4 ( Q3- Q1) iv) Comment on skewness of the data. Left skewed

d) If for a given data set, find β1, β2 and comment


i) µ2 = 2, µ3 = 6, µ4 = 24 ii) µ2 = 2, µ3 = 0, µ4 = 12

2. The following double box plot represents the average prices of two brands of Sneakers.
Answer the following questions
a) Which brand of sneakers has a price range of $10? Nike
b) Which is the median value of Nike Sneakers? 47
c) Which is the value of lower quartile of Reebok Sneakers? 34
d) Comment on the skewness of Reebok Sneakers. Right Skewed

3. The height of 12 students (in cms) of a certain class is given below:


145 149 158 143 156 140 163 147 161 158 152 160
a) Find the i) range and its coefficient.
ii) inter-quartile range.
iii) variance, standard deviation and coefficient of variation.
b) Construct the box and whisker plot and comment on the distribution. -graphsheet
c) Use the plot to check for outliers.
d) Evaluate skewness and kurtosis. Comment.

4. Sketch a box-and-whisker plot for the data from a certain school giving the average score of 15
students attending remedial classes before appearing for their board exams. Also check for outliers.
Find skewness and comment.
9.66 5.90 8.02 5.79 8.73 0.82 8.01 8.35 0.49 6.68 15.64 4.08 6.17 9.91 5.47

5. The amount of rainfall in a city during a particular season for 6 days are given as
Monday Tuesday Wednesday Thursday Friday Saturday
17.8 cm  19.2 cm 16.3 cm 12.5 cm 12.8cm 11.4cm
a) Find the range, variance and its coefficient.
b) Construct the box and whisker plot and comment on the distribution.
c) Evaluate skewness and comment.

6. For a summer job, you were working in the quality control department for a computer company
that manufactures computer parts. The specific part that you are to evaluate the quality of is supposed to be 8
micrometers in thickness. You obtained samples of five of these parts manufactured by the day shift and five parts
manufactured by the night shift workers. Here are the findings:
Day Shift 7.9, 8.0, 8.2, 8.3, 7.8 Night Shift 2, 4, 12, 14, 20
a) Find (i) mean thickness for each shift. (ii) standard deviations for each shift.

B) Which crew is doing a better job? Why? – Coefficient of variation ( CV day shift < CV night shift)
There is less variation in the measurement produced during day shift as compare to night shift

7. The following data provide the runs scored by two batsmen in the last 10 matches.

Batsman A: 25, 20, 45, 93, 8, 14, 32, 87, 72, 4 Batsman B: 33, 50, 47, 38, 45, 40, 36, 48, 3

Which Batsman is more consistent? Why?

You might also like