Statistics 111 - Lecture 6: Probability
Statistics 111 - Lecture 6: Probability
Probability
Introduction to Probability,
Conditional Probability and
Random Variables
June 3, 2008
Administrative Note
Homework 2 due Monday, June 8th
Look at the questions now!
June 3, 2008
Course Overview
Collecting Data
Exploring Data
Probability Intro.
Inference
Comparing Variables
Means
June 2, 2008
Proportions
Regression
Contingency Tables
Events
Sample Space
The sample space S of a random process is
the set of all possible outcomes
Example: one coin toss
S = {H,T}
Example: three coin tosses
S = {HHH, HTH, HHT, TTT, HTT, THT, TTH, THH}
Example: roll a six-sided dice
S = {1, 2, 3, 4, 5, 6}
Example: Pick a real number X between 1 and 20
S = all real numbers between 1 and 20
Combinations of Events
The complement Ac of an event A is the event that A
does not occur
Probability Rule 3:
P(Ac) = 1 - P(A)
The union of two events A and B is the event that
either A or B or both occurs
The intersection of two events A and B is the event
that both A and B occur
Event A
June 3, 2008
Complement of A
Union of A and B
Intersection of A and B
Disjoint Events
Two events are called disjoint if they can not
happen at the same time
Events A and B are disjoint means that the
intersection of A and B is zero
Independent events
Events A and B are independent if knowing that A
occurs does not affect the probability that B occurs
Example: tossing two coins
Event A = first coin is a head
Event B = second coin is a head
Independent
10
11
June 3, 2008
12
13
14
15
Break
June 3, 2008
16
Outline
Moore, McCabe and Craig: Section
4.3,4.5
Conditional Probability
Discrete Random Variables
Continuous Random Variables
Properties of Random Variables
Means of Random Variables
Variances of Random Variables
June 4, 2008
17
Conditional Probabilities
The notion of conditional probability can be found
in many different types of problems
Eg. imperfect diagnostic test for a disease
Disease +
Disease -
Total
Test +
30
10
40
Test -
10
50
60
Total
40
60
100
18
June 4, 2008
19
20
Random variables
A random variable is a numerical outcome of
a random process or random event
Example: three tosses of a coin
S = {HHH,THH,HTH,HHT,HTT,THT,TTH,TTT}
Random variable X = number of observed tails
Possible values for X = {0,1, 2, 3}
21
P(X)
0.413
0.236
0.211
0.090
0.032
0.018
June 4, 2008
22
Another Example
Random variable X = the sum of two dice
X takes on values from 2 to 12
Use equally-likely outcomes rule to
calculate the probability distribution:
X
10
11
12
# of
Outco
mes
P(X)
1/36
2/36
3/36
4/36
5/36
6/36
5/36
4/36
3/36
2/36
1/36
23
Probability Histograms
Probability histogram of sum of two dice:
24
June 4, 2008
25
26
June 4, 2008
27
28
10
11
12
P(X)
1/36
2/36
3/36
4/36
5/36
6/36
5/36
4/36
3/36
2/36
1/36
29
30
10
11
12
P(X)
1/36
2/36
3/36
4/36
5/36
6/36
5/36
4/36
3/36
2/36
1/36
31
June 4, 2008
32