ASSIGNMENT: STATISTICS AND
PROBABILITY
NAME: M FARHAN ASLAM
SEAT# P19101036
CS-503 STATISTICAL METHODS
MCS – MORNING
ROLL # 36
Q1a) Define Statistics by giving examples.
Statistics
Statistics is the branch of mathematics that deals with the collection, tabulation
and interpretation of numerical data. The word is also used for the numerical data
themselves.
a. An example of statistics as data is a report giving the number of followers of
each religion in a particular country.
b. An example of statistics as a subject is the statistics course offered in high
schools and colleges.
b) The following data give the measured lengths of 50 neem leaves.
5.5 6.1 5.9 6.2 6.3 5.7 5.9 5.4 5.5 5.9 6.3 5.7 5.5 6.4 6.0 6.6 5.6 6.1 6.9 5.9 6.0 5.1
6.7 6.0 6.2 5.5 6.4 6.9 6.2 6.2 5.3 5.6 6.0 5.4 5.2 5.5 7.0 5.8 6.3 4.9 5.6 5.5 6.0 6.7
6.8 5.8 5.7 6.0 6.1 7.2
i) Construct a stem-and-leaf plot and find the value of the median from it.
ii) Comment on the shape of the distribution.
Ans: The mean (5.98) lies slightly to the left of the median (6.0), which suggests
a slight negative skew. Since the two values are nearly equal, the distribution is
close to symmetric.
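For part (i), the stem-and-leaf plot and the median can be checked with a short Python sketch. It assumes the whole-number part of each length is the stem and the first decimal digit is the leaf. The names `lengths`, `stem_and_leaf` and `leaf_median` are only illustrative.

```python
from collections import defaultdict
from statistics import median

# Leaf lengths exactly as listed in the question (50 values).
lengths = [
    5.5, 6.1, 5.9, 6.2, 6.3, 5.7, 5.9, 5.4, 5.5, 5.9, 6.3, 5.7, 5.5, 6.4,
    6.0, 6.6, 5.6, 6.1, 6.9, 5.9, 6.0, 5.1, 6.7, 6.0, 6.2, 5.5, 6.4, 6.9,
    6.2, 6.2, 5.3, 5.6, 6.0, 5.4, 5.2, 5.5, 7.0, 5.8, 6.3, 4.9, 5.6, 5.5,
    6.0, 6.7, 6.8, 5.8, 5.7, 6.0, 6.1, 7.2,
]


def stem_and_leaf(values):
    """Group values by whole-number stem; each leaf is the first decimal digit."""
    plot = defaultdict(list)
    for v in sorted(values):
        tenths = round(v * 10)           # e.g. 5.5 -> 55, avoids float truncation
        stem, leaf = divmod(tenths, 10)  # 55 -> stem 5, leaf 5
        plot[stem].append(leaf)
    return dict(plot)


plot = stem_and_leaf(lengths)
for stem, leaves in plot.items():
    print(f"{stem} | {''.join(str(leaf) for leaf in leaves)}")

# With 50 values the median is the average of the 25th and 26th ordered values.
leaf_median = median(lengths)
print("median =", leaf_median)
```

Counting leaves from the top of the printed plot down to the 25th and 26th positions gives the same median as the `median` call.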
Q2 The number of road accidents reported by the Police per day for the last two months is given.
a) Construct a frequency distribution
b) Construct a histogram
c) Comment on the shape of the distribution
Ans: The shape of the histogram shows that the distribution is positively skewed.
d) As an insurance person, what do you understand from this for your Motor Department?
Ans: The number of accidents has decreased with each passing day, as recorded over
the last two months.
Q3 Consider the frequency distribution of the lengths of snakes measured in cm.
a) Find the relation between Mean, Median and Mode.
b) Write comments on the nature of the distribution.
Ans: Mode > Median > Mean, so the data are negatively skewed.
c) Calculate the skewness and Kurtosis and write comments on the nature of
the distribution.
Comment: It is positively skewed.
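The calculation in part (c) can be sketched in Python using one common definition of the moment coefficients: skewness = m3 / m2^(3/2) and kurtosis = m4 / m2^2, where mk is the k-th central moment. The snake-length table is not reproduced here, so the class midpoints and frequencies below are hypothetical and only show the method.

```python
def moment_shape(midpoints, freqs):
    """Return (skewness, kurtosis) of a frequency distribution from its central moments."""
    n = sum(freqs)
    mean = sum(f * x for x, f in zip(midpoints, freqs)) / n

    def central_moment(k):
        return sum(f * (x - mean) ** k for x, f in zip(midpoints, freqs)) / n

    m2, m3, m4 = central_moment(2), central_moment(3), central_moment(4)
    skewness = m3 / m2 ** 1.5   # > 0: longer right tail, < 0: longer left tail
    kurtosis = m4 / m2 ** 2     # equals 3 for a normal curve
    return skewness, kurtosis


# Hypothetical tables, NOT the snake data from the question.
symmetric = moment_shape([1, 2, 3], [1, 2, 1])
right_tailed = moment_shape([1, 2, 3, 4], [4, 3, 2, 1])
print(symmetric, right_tailed)
```

A negative skewness from the real table would agree with Mode > Median > Mean, and a positive value would point the other way.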
d) Construct a Histogram and superimpose on it a frequency curve.
e) Discuss the shape of the distribution with the results obtained in part (a) to (d).
Ans: The results obtained in parts (a) to (d) show that the frequency distribution
of the lengths of snakes, measured in cm, is negatively skewed.
Q.4 The following data is incomplete,
Find the missing entries and also calculate the Quartile deviation.
Q.5 a) Explain with suitable examples the term ‘dispersion’. State the relative and
absolute measures of dispersion and describe the situations for using these
measures.
Dispersion is the extent to which the values in a data set are spread out around a
central value. Examples of dispersion measures include:
• Standard deviation.
• Interquartile range (IQR)
• Range.
• Mean absolute difference (also known as Gini mean absolute difference)
• Median absolute deviation (MAD)
• Average absolute deviation (or simply called average deviation)
• Distance standard deviation.
Absolute & relative dispersion are two different ways to measure the spread of a
data set. They are used extensively in biological statistics, as biological phenomena
almost always show some variation and spread. ... Absolute measures always
have units, while relative measures do not.
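Absolute measures of dispersion include:
• The range,
• The quartile deviation,
• The mean deviation,
• The standard deviation and variance.
Relative measures of dispersion are the corresponding unit-free coefficients, such
as the coefficient of range, the coefficient of quartile deviation, the coefficient
of mean deviation and the coefficient of variation. Absolute measures are suitable
when the data sets being compared have the same units and similar means. Relative
measures are suitable when the units or the means differ.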
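The difference between absolute and relative measures can be seen in a small Python sketch. The heights below are hypothetical values chosen only for illustration, and `dispersion_summary` is an illustrative helper name. The same data are expressed once in cm and once in m.

```python
from statistics import mean, pstdev


def dispersion_summary(values):
    """Two absolute and two relative measures of dispersion for one data set."""
    lo, hi = min(values), max(values)
    sd = pstdev(values)  # population standard deviation
    return {
        "range": hi - lo,                         # absolute: carries the data's units
        "std_dev": sd,                            # absolute: carries the data's units
        "coef_of_range": (hi - lo) / (hi + lo),   # relative: unit-free
        "coef_of_variation": sd / mean(values),   # relative: unit-free
    }


heights_cm = [150, 160, 170, 180, 190]     # hypothetical data
heights_m = [h / 100 for h in heights_cm]  # same data in metres

in_cm = dispersion_summary(heights_cm)
in_m = dispersion_summary(heights_m)
print(in_cm)
print(in_m)
```

The range and standard deviation change when the unit changes, while the two coefficients stay the same.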
Q.5 b) The mean of the 6 numbers 6, 9, 3, 2, x, y is 6 and the variance is 10. Find the
values of ‘x’ and ‘y’.
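One way to work this out is sketched below in Python. It assumes the variance in the question is the population variance (divisor n = 6); with the sample variance (divisor n - 1) the numbers would differ. The mean gives x + y, the variance gives x^2 + y^2, and the two together give a quadratic whose roots are x and y.

```python
from math import isqrt
from statistics import mean, pvariance

known = [6, 9, 3, 2]
n, target_mean, target_var = 6, 6, 10

# From the mean: sum of all six numbers = n * mean, so x + y = 36 - 20.
s = n * target_mean - sum(known)

# From the population variance: sum(x_i^2) / n - mean^2 = variance,
# so x^2 + y^2 = n * (variance + mean^2) - (sum of known squares).
q = n * (target_var + target_mean ** 2) - sum(v * v for v in known)

# (x + y)^2 = x^2 + y^2 + 2xy gives the product xy.
p = (s * s - q) // 2

# x and y are the roots of t^2 - s*t + p = 0.
root = isqrt(s * s - 4 * p)
x, y = (s - root) // 2, (s + root) // 2
print("x + y =", s, " x^2 + y^2 =", q, " xy =", p)
print("x, y =", x, y)

# Check the answer against the conditions in the question.
full = known + [x, y]
print(mean(full), pvariance(full))
```

Since the question is symmetric in x and y, the two values may be assigned either way round.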
Q.6a) What is Conditional probability? Explain with the help of an example.
Conditional probability is the probability of one event occurring given that
another event has already occurred. For two events A and B with P(B) > 0, it is
written P(A | B) and is equal to P(A and B) / P(B).
For example:
• Event A is that it is raining outside, and it has a 0.3 (30%) chance of raining
today.
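• Event B is that you will need to go outside, and that has a probability of 0.5
(50%).
The conditional probability P(A | B) is then the probability that it is raining
given that you need to go outside.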
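The rain example can be turned into numbers with a short Python sketch. The text gives P(A) = 0.3 and P(B) = 0.5 but no joint probability, so the value P(A and B) = 0.2 below is a hypothetical assumption made only to complete the illustration.

```python
from fractions import Fraction


def conditional(p_joint, p_given):
    """P(X | Y) = P(X and Y) / P(Y), defined only when P(Y) > 0."""
    if p_given == 0:
        raise ValueError("the conditioning event must have positive probability")
    return p_joint / p_given


p_rain = Fraction(3, 10)     # P(A), from the example
p_outside = Fraction(1, 2)   # P(B), from the example
p_both = Fraction(1, 5)      # P(A and B): hypothetical, not given in the text

p_rain_given_outside = conditional(p_both, p_outside)
p_outside_given_rain = conditional(p_both, p_rain)
print(p_rain_given_outside, p_outside_given_rain)
```

The two results differ, which shows that P(A | B) and P(B | A) are not the same quantity in general.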
b) Let A and B be the two possible outcomes of an experiment and suppose
P(A) = 0.4, P(A ∪ B) = 0.7, and P(B) = p.
i) For what value of p , are A and B mutually exclusive?
ii) For what value of p , are A and B independent?
iii) If A and B are independent events then prove the following:
c) A and B are independent
d) A and B are independent.
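Parts (i) and (ii) can be checked with the Python sketch below. It assumes the values in the question read P(A) = 0.4 and P(A ∪ B) = 0.7, and it uses the addition rule P(A ∪ B) = P(A) + P(B) - P(A ∩ B). Fractions are used so that the arithmetic is exact.

```python
from fractions import Fraction

p_a = Fraction(2, 5)        # P(A) = 0.4 (assumed reading of the question)
p_union = Fraction(7, 10)   # P(A ∪ B) = 0.7 (assumed reading of the question)

# (i) Mutually exclusive: P(A ∩ B) = 0, so P(A ∪ B) = P(A) + p.
p_exclusive = p_union - p_a

# (ii) Independent: P(A ∩ B) = P(A) * p, so P(A ∪ B) = P(A) + p - P(A) * p,
# which rearranges to p = (P(A ∪ B) - P(A)) / (1 - P(A)).
p_independent = (p_union - p_a) / (1 - p_a)

print("mutually exclusive: p =", p_exclusive)
print("independent:        p =", p_independent)
```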
Q.7 The function of the random variable “X” is given by:
a) What is the value of c?
b) Plot the pdf and cdf.
c) Find E(X)
d) What is the cdf of X?
e) Find the value of “M” (the median).
Q8. The random variable ‘X’ representing the number of errors per 100 lines of
software code has the following pdf.
a) Find mean and variance of X.
b) Find mean and variance of Z = 3X-2.
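For part (b), the rules E(aX + b) = a E(X) + b and Var(aX + b) = a^2 Var(X) apply, so E(Z) = 3 E(X) - 2 and Var(Z) = 9 Var(X). The table for X is not reproduced here, so the Python sketch below uses a hypothetical probability table (placeholder values, not the one from the question) to show that the direct calculation and the rules agree.

```python
from fractions import Fraction

# Hypothetical table of P(X = x); replace with the table from the question.
pmf_x = {0: Fraction(1, 2), 1: Fraction(1, 4), 2: Fraction(1, 4)}


def mean_and_variance(pmf):
    """E(X) = sum of x * p(x); Var(X) = sum of (x - E(X))^2 * p(x)."""
    mean = sum(x * p for x, p in pmf.items())
    var = sum((x - mean) ** 2 * p for x, p in pmf.items())
    return mean, var


mean_x, var_x = mean_and_variance(pmf_x)

# Distribution of Z = 3X - 2: same probabilities, transformed values.
pmf_z = {3 * x - 2: p for x, p in pmf_x.items()}
mean_z, var_z = mean_and_variance(pmf_z)

print("X:", mean_x, var_x)
print("Z:", mean_z, var_z)
```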
Q9.a) Determine the value of C so that the given function is a pdf:
f(x) = C(x² + 4); x = 0, 1, 2, 3
b) The cumulative distribution function of X is
i) What is the pmf of X?
ii) Find P(4 < X < 7).
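For Q9 part (a), the probabilities must add up to 1 over x = 0, 1, 2, 3, so C is the reciprocal of the sum of (x² + 4) over those values. A minimal Python sketch, reading "x2" in the question as x squared:

```python
from fractions import Fraction

support = [0, 1, 2, 3]

# Sum of (x^2 + 4) over the support: 4 + 5 + 8 + 13.
total = sum(x ** 2 + 4 for x in support)

# The probabilities C * (x^2 + 4) must sum to 1, so C = 1 / total.
C = Fraction(1, total)
pmf = {x: C * (x ** 2 + 4) for x in support}

print("C =", C)
print(pmf)
```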
Q.10a) Ten percent of the population is left-handed. Use the normal approximation
to the Binomial distribution to find the probability that there are at least 60 left-
handed students in a school of 400 students.
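A sketch of the calculation for part (a) in Python is given below. It uses mean np and standard deviation sqrt(np(1 - p)), with the usual continuity correction, so that "at least 60" becomes X ≥ 59.5 on the normal scale. The upper tail of the standard normal is obtained from `math.erfc`.

```python
from math import erfc, sqrt

n, p = 400, 0.10

mu = n * p                      # mean of the binomial: 40
sigma = sqrt(n * p * (1 - p))   # standard deviation: sqrt(36) = 6

# Continuity correction: P(X >= 60) is approximated by P(Y >= 59.5), Y normal.
z = (59.5 - mu) / sigma

# P(Z >= z) for a standard normal Z.
upper_tail = 0.5 * erfc(z / sqrt(2))

print("z =", round(z, 2))
print("P(at least 60 left-handed) is approximately", round(upper_tail, 6))
```

Without the continuity correction the z-value would be (60 - 40) / 6, which gives a somewhat smaller probability.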
b) Plot the following Normal curves on the same graph paper and comment on the
shape of the curve for different values of the parameters.
(i) N(μ = 10, σ² = 2)
(ii) N(μ = 10, σ² = 4)
(iii) N(μ = 15, σ² = 2)
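The comments on shape can be supported by evaluating the three densities in Python. This sketch assumes the first parameter is the mean and the second is the variance, and it only tabulates values that could then be plotted by hand. `normal_pdf`, `curves` and `peaks` are illustrative names.

```python
from math import exp, pi, sqrt


def normal_pdf(x, mu, var):
    """Density of a normal distribution with mean mu and variance var."""
    return exp(-((x - mu) ** 2) / (2 * var)) / sqrt(2 * pi * var)


# (mean, variance) for the three curves, under the assumed reading.
curves = {"i": (10, 2), "ii": (10, 4), "iii": (15, 2)}

# Each curve is highest at its own mean; the height there is 1 / sqrt(2*pi*var).
peaks = {name: normal_pdf(mu, mu, var) for name, (mu, var) in curves.items()}

for name, (mu, var) in curves.items():
    points = [round(normal_pdf(x, mu, var), 4) for x in range(6, 20, 2)]
    print(name, "peak =", round(peaks[name], 4), "values at x = 6, 8, ..., 18:", points)
```

Curves (i) and (ii) are centred at the same place, but (ii) is lower and wider because its variance is larger. Curve (iii) has the same shape as (i), shifted to the right.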