EEPF Finals
EEPF Finals
1. (5 × 2 marks) Provide an example for each of the following, or argue why no example is possible.
X such
(e) Random variable that E[ X ] = 0, var( X ) = 1/3, and, for X1 , . . . , Xn ∼ iid X, the sample
mean X n ∼ U −n1 , n1 .
2. (10 marks) In a virtual cricket league, suppose the runs scored off any delivery is distributed as
3/8 1/2 1/8
{ 0 , 1 , 2 }. In the first two deliveries, (1) the probability of scoring (2, 2) is 0, (2) the probability of
−1
scoring (1, 2) is equal to that of (2, 1), and (3) the covariance of runs scored is .
16
(a) (5 marks) Develop a joint PMF model for the runs scored in the first two deliveries. How many
parameters (or unknowns) are needed in the model (after using all provided information)?
(b) (5 marks) In a new season, the runs in the first two deliveries of 8 matches are observed to be
(0, 0), (0, 1), (1, 0), (0, 0), (0, 2), (0, 0), (0, 1), (1, 1). What is the ML estimate of the parameters of
the above joint PMF model? State your assumptions about the data.
3. (10 marks) HoD (EE) suspects that the academic performance of BTech students in EE, as measured
by CGPA after 8 semesters, has deteriorated from the graduating class of 2012 to that of 2022 - both
on average (which was 8.0 in 2012) and on variability across students.
(a) (4 marks) Develop a test to statistically verify the suspicion of the HoD on average perfor-
mance. Describe all aspects of the test (sampling, null/alternative hypothesis, test statistic,
significance level computation). State all your assumptions clearly.
(b) (4 marks) Repeat the above for the suspicion that variability in performance has increased.
(c) (2 marks) Suppose Dean (AC) has the same suspicions for all BTech students of IITM. Do you
have to alter any aspect of your sampling and tests?
(a) (6 marks) Assuming a and b are parameters, let ( â ML , b̂ ML ) be ML estimates of ( a, b). Consider
the estimate θ̂plugin = b̂ ML − â ML for θ. Find the bias and MSE of θ̂plugin .
(b) (4 marks) Consider the class of estimators θ̂ (α) = α(b̂ ML − â ML ) for θ. Find α for which θ̂ (α) is
unbiased. Find α for which θ̂ (α) has least MSE.
5. (10 marks) n children are chosen uniformly at random from a group of 2n children and given one
chocolate each. The group consists of n brother-sister pairs. A brother-sister pair is termed lucky if
they both got chocolates, and unlucky if neither got. Assume n is even.
(a) (4 marks) Find the expected number of lucky brother-sister pairs. Repeat for unlucky pairs.
(b) (6 marks) Find PMF of the number of lucky brother-sister pairs. Repeat for unlucky pairs.
Distributions
Distribution PMF/PDF CDF E[ X ] var( X )
0 x<a
( b − a )2
x − a
1 a+b
U [ a, b] ,a<x<b a<x<b
b−a
b−a 2 12
1 x>b
(
1 − (1 − p ) b k c k ≥ 1 1 1− p
Geometric( p) (1 − p)k−1 p, k = 1, 2, . . .
0 else p p2
2 2 x −µ
N (µ, σ2 ) √ 1 e−( x −µ) /2σ FZ σ , Z ∼ N (0, 1) µ σ2
2πσ
Formulae
(R
x
g( x ) f X ( x )dx PDF f X ( x )
Expected value E[ g( X )] =
(R ∑ x g( x ) p X ( x ) PMF p X ( x )
x,y
g( x, y) f XY ( x, y)dxdy Joint PDF f XY ( x, y)
E[ g( X, Y )] =
∑ x,y g( x, y) p XY ( x, y) Joint PMF p XY ( x, y)
[Co]variance var( X ) = E[ X 2 ] − E[ X ]2 , Cov( X, Y ) = E[ XY ] − E[ X ] E[Y ]
X1 +···+ Xn ( X1 − X n )2 +···+( Xn − X n )2
Sample statistics X1 , . . . , Xn ∼ iid X, X n = n , Sn2 = n −1