1 Academic Integrity and Collaboration Policy For This Assignment
1 Academic Integrity and Collaboration Policy For This Assignment
submitted
via Canvas in pdf format
2 Reading assignment
Class notes: Read the class notes up to and including Chapter 37 on
random vectors (Chapters 1 through 36 were previously assigned.).
Weiss textbook: Sections 7.1, 7.2, 7.3, 10.1, 10.2, 6.1, 6.2, 6.3, 6.4, 9.1,
9.2, 9.3 (Chapters 1 through 5, and Sections 8.1, 8.2, 8.3, 8.4, and 8.5 were
previously assigned.)
Schaum’s book (Hwei Hsu): Skip the “notes” (use our notes and Weiss’s
textbook). However, carefully work through exercises 2.27 – 2.36 (all),
2.44 – 2.50 (all), 2.57, 2.58. Do not turn these in, but make sure you
understand the details of the calculations and how to solve these exercises.
1
3 Written Assignment (to be turned in)
3.1 Exercise 1 (general random variables and cdfs) (10
points –5 points per part)
3.1.1 Background
Read Proposition 8.2 from Weiss’s book carefully. In class, we derived this
result. In this exercise, you will make use of it. Here is an example. Study it
carefully, and proceed similarly for the assigned problem below. You will have
to make some adjustments, since the cdf for the assigned problem is di¤erent.
Example 1 Solve Exercise 8.25a from the textbook. Also, solve Exercise 8.25e
from the textbook but only for the case a = 1; b = 2 (as in part a).
Solution: We calculate
Next, we’ll …nd these probabilities by using the pmf and the Fundamental
Probability Formula. The values a for which P (fW = ag) > 0 are 0; 1; 2; and 3;
2
and so P (fW 2 Ag) = 1; where A = f0; 1; 2; 3g : When using the Fundamental
Probability Formula for these exercises, we can intersect the interval B with the
set A; as follows:
3
a) Prove that Y is an absolutely continuous random variable, and …nd a pdf for
Y; proceeding as in the class notes.
Suggestion: I am not requiring that you do this on this particular problem,
but I recommend that you quickly verify that the function you …nd as a pdf
is non-negative and integrates to 1; that’s a good way to catch most careless
errors.
b) Calculate P (f2 Y < 6g) by using the pdf you found in part a.
4
Calculation 2: Using the cdf, we have
FY (y) = FX (g (y))
5
let u = g (y) and proceed as follows:
0 d
FY (y) = FY (y)
dy
d
= FX (g (y))
dy
d
= FX (u)
dy
d du
= FX (u)
du dy
du
= fX (u)
dy
= fX (g (y)) g 0 (y) ;
assuming fX is continuous at u:
b) Let T be a triangle (the boundary and interior) in the plane, having base
length 4 and height 13: As it turns out, the shape of the triangle itself will not
matter (but its base length and height will).
A point ! is chosen at random from the interior of T:
We de…ne a function Y : ! R as follows: for each ! 2 ; we let Y (!)
denote the perpendicular distance from ! to the base of the triangle.
Find FY ; the cdf of Y:
6
Hint: First, the area of any triangle is half the length of its base multiplied by
its height. Now proceed as follows:
1) On a sheet of paper draw a triangle with base 4 and height 13:
2) Draw a “random” point inside the triangle. Call the point A:
3) Draw a dotted line segment connecting the point A to the base (extend
the base parallel to itself with dotted lines if necessary), making sure that the
dotted line segment is perpendicular to the base (this is crucial). Let y denote
the distance, along this line segment, from the point A to the base.
4) Draw a dotted line segment parallel to the base and passing through the
point A; and extend it so that it meets both sides (other than the base) of the
triangle.
5) The dotted line segment you drew in 4) divides the triangle T into two parts.
It is easy to show with elementary geometry that the part of T on or above
that line segment is similar (look up “similar triangles” if you wish to review
the topic) to the entire triangle T: The part of T on or above the dotted line
from 4) is fY > yg : By formulas concerning similar triangles (namely, that the
ratio of their areas is the square of the ratios of their heights), we have
2
Area (fY > yg) (height of fY > yg)
= :
Area (T ) height of T
The height of the top triangle fY > yg can be found easily in terms of h
and/or y using your picture.
The height of T can be found easily in terms of h and/or y:
Area (T ) can be found easily in terms of h and/or y:
You can now easily solve for Area (fY > yg) : Then use it to …nd P (fY > yg) :
You can then use that to …nd P (fY yg) ; which you can use to get the cdf of
Y: Remember to proceed with cases, as described in the background above.
d) Let X Exp (5) ; and let Y = max (X; 7) : I.e., if X (!) 7 then Y (!) = 7;
while if X (!) > 7 then Y (!) = X (!) : Find the cdf of X: Is X absolutely
continuous? If so, explain why, and …nd a pdf of X: If not, explain why not.
Hint: Let FX and FY denote the cdfs for X and for Y; respectively. Finding
FX is easy, since there is a right tail probability formula for exponential random
variables. The strategy is to …nd FY in terms of FX : For any real number y;
consider the event fY yg : There are two cases: y 7 and y < 7:
Case 1: y < 7: In this case, Y y if and only if both X and 7 are less than
or equal to y: However, 7 cannot be less than or equal to y in Case 1, so Y y
cannot occur. I.e., fY yg = ;; and so FY (y) = P (fY yg) = 0:
Case 2: y 7: In this case, Y y if and only if both X and 7 are less than
or equal to y; which (since we’re in Case 2) is true if and only if X y: I.e.,
7
fY yg = fX yg ; and so FY (y) = P (fY yg) = P (fX yg) = FX (y) ;
which can be found once we …nd the cdf of X:
1 (1=2)[(x )= ]2
f (x) = p e ;
2
where is a real number (called the mean or expected value of X) and is a
positive real number (called the standard deviation of X). These assumptions
about and are always to be made when we write X N ; 2 :
The quantity 2 ; the square of the standard deviation, is called the variance
of X:
With a fair amount of work (see the class notes; it is an exercise in integra-
tion), it can be shown that f is a legitimate pdf for each and as above.
The graph of f is bell-shaped and is called a “bell curve.” Its center (where
the highest point of the curve is located) is at x = ; and the graph is left-right
symmetric about the vertical line x = : Varying a¤ects how much of the
area will be located near the mean, : smaller values of correspond to taller,
narrower graphs with most of the area close to the line x = :
Your calculator has built-in features for approximate evaluation of integrals
of normal pdfs. Start a new document, select Add Calculator, and then press
MENU-5-5-2 to select the normal cdf option. Then enter the left endpoint, the
right endpoint, ; and in the designated …elds, and press ENTER. You will
get the probability that the normal random variable you speci…ed (with and
as you entered them) takes a value between the two endpoints you selected.
For practice, try setting 5 as the left endpoint, 11 as the right endpoint, = 8;
and = 3: You should get about 0:6827:
8
Be sure you know how to use that feature and that you also can evaluate
integrals of normal pdfs by entering the pdf directly. I recommend you try both
methods for each of these problems to check your work.
Suppose X B (n; p) ; where n is a positive integer and p 2 (0; 1) :
Suppose that B is a union of intervals and/or integers, and suppose we wish
to estimate
P (fX 2 Bg) :
When p is su¢ ciently close to 0 or to 1; we can use a Poisson approximation
(switching the roles of success and failure if p is close enough to 1), but what
about when p is not close to either?
It turns out that when p is far enough away from both 0 and 1 we may use
a normal approximation to the binomial! Our speci…c criteria in practice are
np 5 and n (1 p) 5 simultaneously.
9
b) Normal random variables are so frequently used that there is a “rule” for
estimating certain probabilities involving normal random variables. The Em-
pirical Rule says that the probability
P (f k <X< + k g)
P (f k X + k g) = P (f k <X< + k g)
as well.
For example, a N 0; 12 random variable is called a standard normal
random variable. It is easy to see with a calculator that the probability X
takes a value within 3 standard deviations of its mean (i.e., between 0 3 1 = 3
and 0 + 3 1 = 3) is
Z 3
1 (1=2)x2
p e dx = 0:99730020 : : :
3 2
Rounding to four decimal places, …nd the values of P (f 10 < X < 14g) for
N 2; 122 ; N 2; 62 ; and N 2; 42 random variables.
X (!)
Z (!) = ; whenever ! 2 :
b
fX < bg = Z< :
Proceed the same way but with double inequalities at each step.
10
d) Solve Exercise 8.104a from the textbook as follows. Let X be the measured
BAC with units of percents. We are told that X is a normal random variable
with = 0:11 and = 0:005: Find a pdf of X; and use it and the Fundamental
Probability Formula to calculate the required probability. Remember that X is
in terms of percents, so do not divide any percents by 100.
E (X) V AR (X)
X is discrete Table 7.4, p. 332 Table 7.10, p. 358
X is absolutely continuous Table 10.1, p. 568 Table 10.2, p. 588
V AR (aX + b) = a2 V AR (X) ;
ST DDEV (aX + b) = jaj ST DDEV (X) ;
11
3.7.2 Instructions and hints
For each of the following random variables, …rst indicate what type of random
variable X is (its type and parameter(s), if this information is not given). For
these exercises only (to save you time), you do not need to justify your claims
about what type of random variable X is. You can just assert X B (20; 0:4) ;
or X G (0:9) ; or whatever, without proof or justi…cation.
If you can use the tables in the book to quickly evaluate expected values,
variances, or standard deviations for these exercises then do so.
If you can use the a¢ ne transformation formulas above to quickly evaluate
expected values, variances, or standard deviations for these exercises then do
so.
If no other way Zworks, you can use the de…nition of expected value (as
X
x p (x) ; or as x f (x) dx), or the de…nition of variance, to …nd the
expected value or variance of the given random variable (and with the variance
you can easily …nd the standard deviation).
In some cases, you will be able to proceed quite quickly as follows. If you
are able to determine, for instance, that X B (10; 0:6) ; then you can use the
tables to quickly …nd Ep(X) = 10 0:6p= 6; V AR (X) = 10 0:6 0:4 = 2:4;
and ST DDEV (X) = V AR (X) = 2:4: If you need V AR (2X 7) ; that
is simply 22 V AR (X) = 9:6; according to the a¢ ne transformation formulas
above. Many of the problems below can be solved quickly in this fashion. In
other cases, you will have to work harder.
In each case, use exact distributions –do not approximate X with any other
type of random variable, even if such an approximation would be reasonable to
make.
Also, saying “X has a binomial distribution” is the same as saying “X is a
binomial random variable.”
Some of the random variable types might be types we have not yet worked
with. They are all in the book, as is the relevant information about their
expected values and variances.
All …nal answers for E (X) and V AR (X) must be numerical and cannot
have any unknown parameters in them. Enough information is given to allow
you to solve for all of the parameters exactly.
The formulas for E (X) and V AR (X) can help us determine the parameters
for X: For instance, if we are told X has a binomial distribution with n = 20; and
if we are further told that X has an average value of 8; then since E (X) = np
we can substitute E (X) = 8 and n = 20 to …nd that p = 0:4; and we can use
that information to …nd V AR (X) :
12
b) The Economic Security Index (ESI) is one measure of the percentage of
Americans who have experienced a “signi…cant …nancial loss.” For a more pre-
cise de…nition of this index, and for more details, you may consult economicse-
curityindex.org if you wish (it is not necessary to do so to solve this problem).
State by state results are also available. One of the reports from that site can
be found here (it is not necessary to read this to solve this problem):
http://economicsecurityindex.org/upload/media/esiupdate_11_16_13.pdf
The ESI (nationwide) for 2012 was estimated to be about 17.8. I.e., in
the technical sense of the de…nition of the ESI, about 17.8% of Americans had
experienced a signi…cant …nancial loss. This was an improvement compared to
2011 when the number was 18.9%.
A random sample of 10 Americans is selected. Let X denote the number of
Americans in our selected sample who had experienced a signi…cant …nancial
loss in the technical sense of the de…nition of the ESI. Find E (X) and V AR (X) ;
assuming there were 315,000,000 Americans in 2012.
c) Suppose the number of cars getting onto the Key Bridge from Virginia be-
tween 3 p.m. and 3:30 p.m. on a Monday, Tuesday, Wednesday, or Thursday
in November has a Poisson distribution with an average of 14 cars per minute.
Let X be the number of cars getting onto the Key Bridge on a given Wednesday
in November, between 3:07 p.m. and 0 seconds and 3:08 p.m. and 0 seconds.
Find E (X) and V AR (X) :
13
People are tested one at a time until 30 individuals with the crucial variety of
antibody have been found, and then no one else is tested. Let X be the number
of individuals who receive blood tests as a result of these e¤orts. E.g., if the
2000th person tested is the 30th person to have this variety of antibody then
X = 2000: (For the purposes of this problem, suppose su¢ cient independence
that we may regard the blood tests as Bernoulli trials.)
Find E (X) and V AR (X) :
14