0% found this document useful (0 votes)
11 views9 pages

21MAT31A - Module 3 - Statistics - QB - Copy

The document outlines the syllabus and questions for a course on Vector Spaces, Sampling Theory, and Optimization, specifically focusing on statistics and probability distributions. It includes topics such as mean, median, mode, standard deviation, correlation, regression, and various probability distributions like binomial and normal distributions. The document also presents a series of questions aimed at applying these statistical concepts in practical scenarios.

Uploaded by

danamma.m.hatti
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views9 pages

21MAT31A - Module 3 - Statistics - QB - Copy

The document outlines the syllabus and questions for a course on Vector Spaces, Sampling Theory, and Optimization, specifically focusing on statistics and probability distributions. It includes topics such as mean, median, mode, standard deviation, correlation, regression, and various probability distributions like binomial and normal distributions. The document also presents a series of questions aimed at applying these statistical concepts in practical scenarios.

Uploaded by

danamma.m.hatti
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

COURSE: VECTOR SPACES, SAMPLING THEORY & OPTIMIZATION

COURSE CODE: 21MAT31A


MODULE – 3: STATISTICS

Syllabus:

Statistics:

Review of Mean, Median, Mode & Standard Deviation-Moments, Skewness & Kurtosis. Analyzing a
Bivariate distribution – Scatter diagram, Correlation – Coefficient of Correlation. Regression – Lines of
Regression, Regression curves, Properties of Regression Coefficients, Angle between lines of Regression

Q.No Questions
1. a) The following is the frequency distribution of a random sample of weekly earnings of
the employees. Calculate the average weekly

Weekly 10 12 14 16 18 20 22 24 26 28 30 32
earning
No. of 3 6 10 15 24 42 75 90 79 55 36 26
employees

b) Calculate the mean of the following frequency distribution

Class 0–8 8 – 16 16 – 24 24 – 32 32 - 40 40 - 48
Frequency 8 7 16 24 15 7

2. a) The total sale (in thousands) of a particular item in a shop, on 10 consecutive days is
reported by a clerk as 35, 29.6, 38, 30, 40, 41, 42, 45, 3.6, and 3.8. Calculate the
average for the data given by the clerk. Later it was found that there was a number
10 in the machine and the reports of 4th to 8th day were 10 more than the true values
and in the last 2 days he put a decimal in wrong place (for example 3.6 was really 36).
Calculate the true mean value.

b) For the two frequency distributions given below the mean calculated from the first
was 25.4 and that the second was 32.5 find the value of the 𝑥 and 𝑦.

Class 10 – 20 20 – 30 30 – 40 40 - 50 50 – 60
Frequency - 1 20 15 10 𝑥 𝑦
Frequency - 2 4 8 4 2𝑥 𝑦
3. a) Find the median of the following data

Number 5 10 15 20 25 30 35 40 45
Frequency 29 224 465 582 634 644 650 653 655

b) Find the median of the following

Class 20 – 30 30 – 40 40 - 50 50 – 60 60 - 70
Frequency 3 5 20 10 5

4. a) In a factory employing 3000 person, 5% earn less than Rs. 3 per hour, 580 earn from
Rs. 3.01 to Rs. 4.5 per hour, 30% earn from Rs. 4.51 to Rs. 6 per hour. 500 earn from
Rs. 6.01 to Rs. 7.5 per hour, 20% earn from Rs. 7.51 to 9 per hour and the rest earn
Rs. 9.01 or more per hour. What is the median wage?

b) A number of particular articles has been classified according to their weight. After
drying for two week the same articles have again be weighted and similarly classified.
It is known that the median weight in the first weight was 20.83 while in the second
weighting it was 17.35. Some frequencies 𝑎 and 𝑏 in the first weighting and 𝑥 and 𝑦
𝑥 𝑦
in the second are missing. It is given that 𝑎 = 3 and 𝑏 = 2. Find out the missing
frequencies.

Class 0–5 5 – 10 10 – 15 15 - 20 20 – 25 25 - 30
Frequency - 1 𝑎 𝑏 11 52 75 22
Frequency - 2 𝑥 𝑦 40 50 30 28

5. a) Calculate the mode form the following frequency distribution

Number 4 5 6 7 8 9 10 11 12 13
Frequency 2 5 8 9 12 14 14 15 13 11

b) Find the mode of the following distribution

Class 0 – 10 10 – 20 20 – 30 30 – 40 40 - 50 50 - 60 60 - 70
Frequency 5 8 7 12 28 20 10

6. a) The median and mode are given to be Rs. 25 and Rs. 24 respectively. Calculate the
missing frequency.

Class 0 – 10 10 – 20 20 – 30 30 – 40 40 - 50
Frequency 14 𝑥 27 𝑦 15

b) The median and mode of the following wages are known to be Rs. 33.5 and Rs. 34
respectively. Find the value of 𝑥, 𝑦 and 𝑧. Given total frequency is 230.

Class 0 – 10 10 – 20 20 – 30 30 – 40 40 - 50 50 - 60 60 - 70
Frequency 4 16 𝑥 𝑦 𝑧 6 4
7. a) Compute the standard deviation for the following data

Class 0- 100 - 200 - 300 - 400 - 500 - 600 - 700 -


interval 100 200 300 400 500 600 700 800
Frequency 10 54 184 264 246 40 1 1

b) For a group of 200 candidates, the mean and standard deviation of scores were found
to be 40 and 15 respectively. Later on it was discovered the score 43 and 35 was
misread as 34 and 53 respectively. Find the corrected standard deviation
corresponding to the corrected figures.

8. a) The scores obtained by two batsmen A and B in 10 matches are given below.
Calculating mean, SD and coefficient of variation for each batsman, determine who is
more efficient and who is more consistent.

A 30 44 66 62 60 34 80 46 20 38
B 34 46 70 38 55 48 60 34 45 30

b) The first group of the two samples has 100 items with mean 15 and standard
deviation 3. If the whole group has 250 items with mean 15.6 and standard
deviation √13.44. Find the standard deviation of the second group.

9. a) Calculate the first four moments of the following distribution about the mean

x 0 1 2 3 4 5 6 7 8
f 1 8 28 56 70 56 28 8 1

b) Calculate the first four moments about the mean for the data given below. The
following distribtuiton relates to the number of assistants in 50 retail establishment.

x 0 1 2 3 4 5 6 7 8 9
f 3 4 6 7 10 6 5 5 3 1

10. a) Calculate the first four moments of the following distribution of wages

x 5 6 7 8 9 10 11 12 13 14 15
f 1 2 5 10 20 51 22 11 5 3 1

b) Compute the Karl Pearson’s measures of skewness for the following data

Heights (inches) 58 59 60 61 62 63 64 65
No. of persons 10 18 30 42 35 28 16 8

11. a) For a distribution, Bowley’s coefficient of skewness is 0.6. The sum of upper and
lower quartiles is 100 and median is 38. Find the upper and lower quartiles.

b) Compute the Coefficient of skewness based on third moment for the following data
and comment upon the nature of skewness
Class 0 – 10 10 – 20 20 – 30 30 – 40 40 - 50 50 - 60 60 - 70
Frequency 6 12 22 24 16 12 8

12 a) The first four moments about the working mean 28.5 of a distribution are 0.294,
7.144, 42.409, 454.98. Evaluvate the Kurtosis of the distribution.

b) The first four moments of a distribution about the value 4 of the variable are -1.5, 17,
-30 and 108. State whether the distribution is playkurtic or leptokurtic.

13. a) Evaluvate the Kurtosis for the following data

x 1 3 5 7 9
f 1 4 6 4 1

b) Define: (i) Bivariate analysis (ii) Scatter diagram and state the types of correlation
between x and y with neat sketch

14. a) Make a scatter diagram for the following data and state the type of correlation between x
and y.
x 1 2 3 4 5 6 7
y 7 14 21 28 35 42 49

b) Make a scatter diagram for the following data and state the type of correlation between x
and y.
x 2 4 6 8 10 12 14
y 35 30 25 20 15 10 5

15. 𝜎𝑥 2 + 𝜎𝑦 2 − 𝜎𝑥−𝑦 2
a) Establish the formula 𝑟 = 2 𝜎𝑥 𝜎𝑦

b) State any five properties of regression coefficient

16. a) Find the correlation co-efficient between 𝑥 and 𝑦 from the given data:

𝑥 78 89 97 69 59 79 68 57
𝑦 125 137 156 112 107 138 123 108

b) Find the correlation co-efficient between 𝑥 and 𝑦 from the given data:

𝑥 21 23 30 54 57 58 72 78 87 90
𝑦 60 71 72 83 110 84 100 92 113 135
17. a) Obtain the (i) the correlation coefficient (ii) regression lines of y on x and x on y for
the following data:

𝑥 1 2 3 4 5
𝑦 2 5 3 8 7

b) Obtain the (i) the correlation coefficient (ii) regression lines of y on x and x on y for
the following data:

x 2 4 6 8 10
y 5 7 9 8 11

18. a) The following results were obtained from records of age(x) and blood pressure (y) of
a group of 10 men, given Σ (𝑥 − 𝑥̅ )(𝑦 − 𝑦̅) = 1220. Find the appropriate regression
equation and use it to estimate the blood pressure of a man whose age is 45

𝑥 𝑦
Mean 53 142
Variance 130 165

b) In a partially destroyed laboratory record of correlation data, the following result only
are a variable, variance of x is 9, regression equation y on x and x on y are 4𝑥 − 5𝑦 +
33 = 0, 20𝑥 − 9𝑦 − 107 = 0 respectively. Calculate the coefficient of correlation, 𝑥̅ ,
𝑦̅ and 𝜎𝑦

19. a) If θ is the acute angle between the two regression lines relating the variables x and y,
𝜎𝑥 𝜎𝑦 1−𝑟 2
show that tan 𝜃 = 𝜎 2 +𝜎 2(
)
𝑥 𝑦 𝑟

b) Find the co-efficient of correlation between x and y given 2𝜎𝑥 = 𝜎𝑦 and the angle
3
between the lines of regression is tan−1 (5)

20. a) If the co-efficient of correlation between x and y is 0.5 and the acute angle between
3
the lines of regression is tan−1 (5) then show that 2𝜎𝑥 = 𝜎𝑦 or 𝜎𝑥 = 2𝜎𝑦

b) The regression lines of x on y and y on x are 5𝑥 − 𝑦 = 6 and 2𝑥 − 3𝑦 = −8


respecitively. Find the angle between these lines.
COURSE: VECTOR SPACES, SAMPLING THEORY & OPTIMIZATION
COURSE CODE: 21MAT31A
MODULE – 4: PROBABILITY DISTRIBUTIONS AND SAMPLING DISTRIBUTION

Syllabus:

Probability Distributions:
Introduction to probability, Random variables (discrete and continuous), Probability mass function,
Probability density function, Probability distributions: Binomial distribution, Poisson distributions, Normal
distributions

Sampling Distribution:
Sampling distribution, Central limit theorem, Sampling with & without replacement, Confidence limits for
means, Student’s t-distribution

Q.No Questions

1. a) The Random Variable X has the following probability mass function, find (i) k (ii) P(X<3)
(iii) P(3<X≤ 5) (iv) variance

X 0 1 2 3 4 5
P(X) K 3K 5K 7K 9K 11K

b) A random variable X has the following probability distributions. Find (i) k (ii) p(x<6) (iii)
p(x>6) (iv) Mean and also find the probability distribution

x 0 1 2 3 4 5 6 7
P(x) 0 K 2k 2k 3k 𝑘2 2𝑘 2 2
7𝑘 + 𝑘

2. a) Find ‘k’ such that the following distribution represents a finite probability distribution. Hence
find (i)mean (ii) p(x≤1) (iii) p(x>1) (iv) p(-1<x≤2)

x -3 -2 -1 0 1 2 3
P(x) k 2k 3k 4k 3k 2k k

b) A random variable (X=x) has the following probability function for various values of x. Find
(i)k (ii) p(x<1) (iii) p(x>-1)

X -2 -1 0 1 2 3
P(X) 0.1 k 0.2 2k 0.3 k

3. a) Find the mean and variance of Binomial Distribution.

b) The probability that a pen manufactured by a factory bedefective is 1/10. If 12 such pens are
manufactured, what is the probability that (i)exactly 2 are defective (ii) atleast 2 are defective
(iii)none of them are defective.
4. a) The number of telephone lines busy at an instant of time is a binomial variate with
probability 0.1 that a line is busy. If 10 lines are chosen at random, what is the probability
that (i)no lines is busy (ii) all lines are busy (iii)atleast one line is busy (iv) almost 2 lines are
busy.

b) If 800 familes with 5 childrens each, how many childrens would be expected to have (i) 3 boys
(ii) 5 girls (iii) either 2 or 3 boys (iv) almost 2 girls, by assuming probability of boys and girls
are to be equal
5. a) When a coin is tossed 4 times, find the probability of getting (i) exactly one head (ii) almost 3
heads (iii) atleast 2 heads

b) Derive mean and variance for the Poisson Distribution

6. a) 2% of the fuses manufactured by a firm are found to be defective. Find the probability that a
box containing 200 fuses which contains (i) No defective fuse (ii) 3 or more defective fuses
(iii) atleast one defective fuse

b) The number of misprints on a page of the Daily Mercury has a Poisson distribution with mean
1.2. Find the probability that the number of errors (i) on page four is 2 (ii) on page three is
less than 3.

7. c) A certain screw making machines produces on an average two defective out of 100 and pack
them in boxes of 500. Find the probability that the box contains (i)three defectives (ii) At least
one defective.

d) The number of industrial injuries per working week in a particular factory is known to follow a
Poisson distribution with mean 0.5. Find the probability that
(a) in a particular week there will be:
(i) less than 2 accidents,
(ii) more than 2 accidents;
(b) in a three week period there will be no accidents.

8. a) The probability density function f(x) of continuous random variable is given by


𝑘𝑥(1 − 𝑥), 0 ≤ 𝑥 ≤ 1
f(𝑥) = { Determine k, 𝑃(0 < 𝑥 < 1/3).
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

b) A continuous random variable has the following density function


2
𝑓(𝑥) = {𝑘𝑥 , −3 ≤ 𝑥 ≤ 3Evaluate (i) k, (ii) P(1≤ 𝑥 ≤ 2), (iii) P( 𝑥 ≤ 2), (iv) P(x>1).
0 𝑒𝑙𝑠𝑒𝑤ℎ𝑒𝑟𝑒

9. 𝑘
a) A continuous random variable has the density function𝑓(𝑥) = , −∞ < 𝑥 < ∞.
1+𝑥 2
Determine (i) k, (ii) 𝑃(𝑥 > 0), (𝑖𝑖𝑖)𝑃(0 < 𝑥 < 1).

b) The probability density function of continuous random variable is given by (𝑥) = 𝑘𝑒 −|𝑥| ,
−∞ < 𝑥 < ∞. Prove that 𝑘 = 1⁄2 and also find mean and variance.
10. a) Derive mean and variance for normal distribution.

b) In examination 7% of students score less than 35% marks and 89% of students score less than
60% marks, Find the mean and standard deviation, if the marks are normally distributed. It is
1 𝑧 2
given that if p (z) =
√2𝜋
∫0 𝑒 −𝑧 /2 𝑑𝑧 then p(1.2263) = 0.39 p(1.4757) = 0.43.

11. a) The mean weight of 1,000 students during medical examination was found to be 70kg and
S.D weight 6kg. Assume that the weight are normally distributed, find the number of students
having weight (i) less than 65kg (ii) more than 75kg (iii) between 65kg to
75kg.[P(0.83)=0.2967]

b) A sales tax officer has reported that the average sales of the 500 business that he has to deal
with during a year is Rs.36,000 with a standard deviation of 10,000. Assuming that the sales
in these business are normally distributed, find (i) the number of business as the sales of
which exceed Rs.40,000 (ii) the percentage of business the sales of which are likely to range
between Rs.30,000 and Rs. 40,000. [P(0.4)=0.1554, P(0.6)=0.2257]

12. a) A sample of 100 dry battery cells tested to find the length of life produced by a company and
following results are recorded: mean life is 12 hrs, SD is 3 hrs. Assuming data to be normally
distributed, find the expected life of a dry cell. (i) have more than 15 hrs (ii) between 10 and
14 hrs.[P(0.667)=0.2486,P(1)=0.3413].

b) Explain the following


i) Null hypothesis
ii) Alternative hypothesis
iii) Type I and type II error
iv) Level of significance
v) Standard error

12. a) A population has mean 75 and standard deviation 12.


(i) Random samples of size 121 are taken. Find the mean and standard deviation of the
sample.
(ii) How would the answers to part a) change if the size of the samples were 400 instead of
121?

b) A population has mean 5.75 and standard deviation 1.02.


(i) Random samples of size 81 are taken. Find the mean and standard deviation of the
sample.
(ii) How would the answers to part a) change if the size of the samples were 25 instead of
81?
13. a) The weights of 1500 ball bearings are normally distributed with a mean of 635 gms and S.D of
1.36gms. If 300 random samples of size are drawn from this population, determine the
expected mean and S.D of the sampling distribution of means if sampling is done a) with
replacement b) without replacement.

b) 500 ball bearings have a mean weight of 142.30 gms and S.D. of 8.5 gms. Find the probability
that a random sample of 100 ball bearings chosen from this group will have a combined
weight a) between 14061 and 14175 gms b) more than 14460 gms (ϕ(1.99)=0.4767,
ϕ(0.65)=0.2422, ϕ(2.706)=0.4971)
14. c)
15. d)
16. e)
17. f)
18. g)
19. h)
20. i)

Missing question will be updated Soon

You might also like