Probability
Probability
Basic Definitions
·
Union of A and B , CAUB) :
the probability of A OR B
occurring
no . of outcomes in A
P(A)
sample space
· =
total no
. of outcomes
notation Multiplication Rule
·
-
=
·
P(A)20 for all events ALE P(AnB) = P(B) x P(A/B)
·
Pr(E) = 1
Pr($) 0
, A representing empty P(A(B) x P(B) + P(a(B') xP(B)
· =
set
P(A)
.
·
Not'A' >
-
Pr(A)) = 1 -
P(a) (complement)
·
Pr( = VB) = Pr(A) + Pr(A1B) -> addition rule
Mutually exclusive :
nothing in common >
-
events do not add up to 1
Pr(A (B) =
Pr(A)
Pr(B(A) Pr(B) P(AnB)
P(A(B)
·
=
=
-
·
T
P(T) x P(T)
T TT
Discrete Random Variables
Two conditions :
1
>
- SUM of all probabilities =
u =
E(X) = 2xxP(X = x)
2
[x
=
xp(x)
x
E(X) of a function of X :
E(g(x)) =
2g(x) xp(x)
E(X + Y) =
E(X) +
E(y)
E(X) of aX + b -
E(aX + b) = aE(X) + b
(for a
,
b constant)
E((X u)2]
2
o =
var(X) = -
or var(x) = E(x2) -
[E(X)]2
standard deviation :
o = sd(X) =
vvar(X)
Var(aX + b) Var(x)
=
Linear function >
- = a
* In
general for many
,
random variables X,
Pr(m -
20 = X = m + 20) = 0 . 95
Bernoulli Sequence
· success or failure (two outcomes)
·
p
=
probability of success on a
single trial (failure = 1 -
p)
·
trials are independent
where (2) =
x ! (n -
x) !
Notation : XvBi(n p) ,
>
-
where
,
n = no .
of trials Mean =
M= E(x) =
nxP
p
=
probability of success
-
7 Variance 02
= =
nxp(1 -
P)
1- P =
probability of other outcome Standard Deviation = o = vo
(failure)
·
(8) =
1 ·
(Y) = n
Pr(X21) > 0 95 .
·
(n) = 1 -
(n 1)
-
= n
1 -
Pr(X 0) =
>0 95 .
# · (0 52)" < 0 .
05
.
0 .
52" < 0 05 .
n
= 4 58 .
.. n = 5
Pr(X= 2) 70 .
95
1 -
Pr(X(2)70 95 .
Pr(X(2) < 0 05
%0 48' (0 32)"
.
40 (0 52)" +
05
# .
.
10 .
-
0 .
524+ 0 .
48n (0 .
52)" 10 . 05
n
= 7 . 7986
·
. n = S
-
Continuous Random Variables
f(x
one that can take any value . probability
M
-
* total area = 1
Probability density function
:
F(x) = P(a = x = b) =
f f(x)dx = 0 O a b
-x
P(a(x = b) =
p(a = x(b) = P(acx = b) =
P(a = x = b)
for x[a b] ,
Sf(x)di = 1
·
for xe[a c) ,
(f(x)dx = 1
(P,f(x)dx
P D F must
A . .
satisfy 2
properties :
>
-
f(x) = O for all I
>
-
ff(x)dx =
1
2
Variance and Standard Deviation Var(x) =
E(X) -
E(X)" = o
·
68 % of data = 1S D .
of mean
. P(M -
0 < XM +
0) = 0 68.
·
95 % of data = 2S D of . .
mean P(M-20X = M + 20) = 0 .
95
· 99 7 % of data
. = 3 S D .
of mean . ↑) (M 30
-
= X =M + 30) = 0 . 997
(b) (P)
Mean M = E(X) =
S xx f(x)d IQR = 75th percentile - 25th percentile
· 25th percentive =
(f(x) =
0 25
.
E(g(()) =
(g(x)xf(x)dx
·
75th
percentile =
Sf(x) = 0 . 15
·
E(Xz) (x2xf(x)dx
=
·
E(aX + b) = af(x) + b
0 5 .
=
Sw f(x)dx = P(X = M) =
P(X)M)
Normal Distribution
1 -2 ) =)
f(x) =
oven
Ye for -
>x >
·
Notation : X-N(M , 02)
Properties :
·
Extends to co in both directions
·
curve symmetrical about the mean .
Pr(axx(b) =
S f(x)dx
Area =
S % f(x) do
.
= 1
5 - 4 -
1
5 - 4 - 2 randInt (lower bound, upper bound, no of results
no . of population wh attribute
Population proportion p =
population size
no .
in sample with attribute
sample proportion P =
sample size
(na+ 1)
distribution : !
Hypergeometric (3) =
n
is the
D = # desired characteristic in x ! (n-1) !
pop
(a)x( 2)
.
=
d = # desired characteristic in sample no of ways to select
P(X d)
.
=
=
N pop size .
objects from n objects
(i)
= .
x
n =
sample size
Distribution
Sampling
↑
01
P( =
p) t 0
E(p) =
[(pXP(p)) ·
SD(F) VE(P2) = -
E(p)2
LARGE population .
E(P) =
p(pop proportion)
.
,
sa(i) =
Large pop Small .
sample >
- Binomial distribution .
distribution
Large pop Large Sample Normal
.
>
-
Confidence Intervals for the Population Proportion
·
p(pop proportion)
. is UNKNOWN.
90 % C .
I 1 . 65 note : E values give the
level of confidence
.
997
95 % C I
.
C(upper) + (Clower)
= 2