Open navigation menu
Close suggestions
Search
Search
en
Change Language
Upload
Sign in
Sign in
Download free for days
0 ratings
0% found this document useful (0 votes)
14 views
Stats 1 - IITM BS Notes - Part 5
Continuation (Part 5) of notes on Statistics 1 for Data Science by IIT Madras
Uploaded by
ryandonovan.des
AI-enhanced title
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here
.
Available Formats
Download as PDF or read online on Scribd
Download now
Download
Save Stats 1_IITM BS notes_Part 5 For Later
Download
Save
Save Stats 1_IITM BS notes_Part 5 For Later
0%
0% found this document useful, undefined
0%
, undefined
Embed
Share
Print
Report
0 ratings
0% found this document useful (0 votes)
14 views
Stats 1 - IITM BS Notes - Part 5
Continuation (Part 5) of notes on Statistics 1 for Data Science by IIT Madras
Uploaded by
ryandonovan.des
AI-enhanced title
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here
.
Available Formats
Download as PDF or read online on Scribd
Download now
Download
Save Stats 1_IITM BS notes_Part 5 For Later
Carousel Previous
Carousel Next
Download
Save
Save Stats 1_IITM BS notes_Part 5 For Later
0%
0% found this document useful, undefined
0%
, undefined
Embed
Share
Print
Report
Download now
Download
You are on page 1
/ 17
Search
Fullscreen
n independent trials, X = number of successes > Let there be m independent Bernoulli trials. © X O12 - --7 > Let p is probability of success. word F > X= number of succeses in 1 independent trials > The probabilities of outcomes of the independent trials are ‘SNo | Outcome | Number | Probabilities of sue- a pxPeeP noi | pxpx...* (I~) n=1 | p..xpx(l-p)xP 1 (=p) x (i—p)... x px (Mp) 1 (= p)x(=9)... ep 0 (l= p)x (l=) Visualising Binomial Distribution For a small n, distribution is skewed except for when the probability is 0.6. 1 | a al «a oa For larger n values, the distribution begins to get symmetric 02 Paw Rrmts| a on Hi oa | ff bel on | aeExpectation and Variance of a Binomial Random Variable Expectation and Variance of Binomial Random Variable Pm X=M+ t+... tM > E(X) = E(X1 + Xo t+ -.. + Xp) = E(X) + E(%) +--+ E(%) > E(X)=ptp+...+p=np V(X) = V(X + Xo + + Xn) = VOX) + VOX) + » V(X) = (1 — p) + p(1— p) +... + p(1—p) Result Using the fact that the expectation of the sum of random variables is ‘equal to the sum of their expectations, we see that Expectation of a Binomial random variable X is +V(X,) p(1 — p) E(X) = np Also, since the variance of the sum of independent random variables is ‘equal to the sum of their variances, we have variance of 2 Binomial random variable is Sn Hypergeometric Distribution Imagine a bag of black and white balls, Picking two balls will be dependent. For the hypergeometric to work, > The population must be dividable into two and only two independent subsets (black balls and white balls in our example) > The experiment must have changing probabilities of success with each experiment (the fact that balls are not replaced after the draw in our example makes this true in this case) > Another way to say this is that you sample without replacement and therefore each draw is not independent.Understanding the Hypergeometric distribution If we randomly select n items without replacement from a set of NV items of which: m of the items are of one type and NM ~ m of the items are of a second type " B)(B)...18](w)iw) ....w)~ ~(B)...[e) (wy ...[w Pe Nom ¥ mx items of type 1, then the probability mass Let X be the number of function of the discrete random variable, X, is called the hypergeometric distribution and is of the form: ‘m) (Nm n-x ‘When you draw this out: X=2 wt be #~swast” ¢ NSExample: Sampling from a deck of cards > Take a deck of 52 cards. Draw five cards from the deck. Let the random variable X denote the number of aces in the random sample of five cards. What is the probability distribution of X? > This is a Hypergeometric distribution with N=52,.n=5,m=4 > The pmf is given by @ ran ldlo=s) Finally: If we randomly select 1 items without replacement from a set of V items of which: m of the items are of one type and N — m of the items are of a second type The probability mass function of the discrete random variable is called the hypergeometric distribution and is of the form: pixana (has) () ix Sx
X ~ Hypergeometric(N, m,n) nm > EX) = FT » Var(X) =n > ¥~ Bin( 2 NomN=n ROW NAT > EY) > Var(Y) = nem 4) am N > Consider the term: N=" > For n= 1, replacement has no effect both are Bernoulli trial > For n=, the whole population is sampled- hence variance is zero. > If the population IV is very large compared to the sample size e. N >> n) then Hypergeometric(N, m, n) is about PL | Comparison of Binomial and Hypergeometric distributions To see how this comparison behaves, see this video from 24:00 onwards. Poisson distribution - Distribution of Poisson random variable This gives the probability of a number of events occurring in a given period of time.> The Poisson probability distribution gives the probability of a number of events occurring in a fixed interval of time or space. > We assume that these events happen with a known average rate, \, and independently of the time since the last event. > Let X denote the number of times an event occurs in an interval of time (or space). > We say X ~ Poisson(), in other words, X is a random variable that follows Poisson distribution with parameter 2. > The Poisson distribution may be used to approximate the Binomial distribution if the probability of success is “small” and the number of trials is “larg Poisson as Binomial approximation > Define “succes as exactly one event happening in a short ath dt The m events happening in Taterval of length r can be viewed 8 n successes happening in n intervals of length dt, with each a3 pba > Hence the problem canbe viewed as 9 Bin (n/p = experiment Try to understand why the red area is defined as it is. Probability mass function of Poisson The distribution, with an average number of J events per interval, is defined as Poisson discrete random variable, X ~ Poisson(), with the p.m given by > The random varaible X represents the number of events per time interval (In the example: number of vehicles passing per ‘minute) > eis the mathematical constant 2.718To see how the probability mass function is derived, see this video between 14:00 and 18:00. As 2 increases, the distribution looks more symmetric as seen below: Graph of pmf for 7 Ty Graph of pmf for \= 8 Li os 1 wal ep + 03 lr 02 1 a 1 5 al I wor aa AML ak set Expectation - Poisson Distribution Expectation of Poisson distributionVariance - Poisson Distribution > Now, E(X?) = E(X(X ~1)) + E(X) =? +A > Hence Var(X) = E(X?) — (E(X))? = 9? +=? Note that both the expectation and variance of a Poisson distribution are 2 (which is the average rate). Examples of Poisson distribution > Events occurring in fixed interval of time 1. Number of vehicles passing through a traffic intersection in a fixed time interval of one minute. 2. Number of people withdrawing money from a bank in a fixed time interval of fifteen minutes. 3. Number of telephone calls received per minute at a call center > Events occurring in fixed interval of space 1. ‘Number of typos (incorrect spelling) in a book. 2, Number of defects in a wire cable of finite length. 3, Number of defects per meter in a roll of cloth Modeling number of killings The number of dogs that are killed on a particular stretch of road in Chennai in any one day can be modeled by a Poisson(.42) random variable. 1, Calculate the probability that exactly two dogs are killed on a given day on this stretch of road. Let X= number of dogs killed in one day X ~ Poisson(\ = 0.42) 84? x 0.427 P(X =2)= = 0.058 2. Find the probability that exactly two dogs are killed over a 5-day period on this stretch of road. Let X= number of dogs killed in five day X ~ Poisson( = 2.1) eee P(X = 2) = SABE = 0.2%Week 12 Continuous Random Variable > A continuous random variable is one that has possible values that form an interval along the real number line. In other words, a continuous random variable can assume any value ‘over an interval or intervals. Probability density function (pdf) While discrete random variables have probability mass functions, continuous random variables have probability density functions, > Every continuous random variable X has a curve associated with it > The probability distribution curve of @ continuous random variable is also called its probability density function. It is denoted by F(x) Area under a pdf > Consider any two points a and b, where a is less than b. > The probability that X assumes a value that lies between a and b is equal to the area under the curve between a and b. ‘That is, P(X € [a,b]) = Pla < X < b) is area under curve between a and b Plasx< w= [resProperties of pdf 1, The area under the probability distribution curve of a continuous random variable between any two points is between 0 and 1. # sultan Few! between Oand 1 2 \ =P(x-a)20 a 4 Properties of pdf 2, Total area under the probability distribution curve of a ‘continuous random variable is always 1. fe) ‘area under curve is 1 fae 1 R Zhe ma) teCumulative distribution function Cumulative distribution function For a continuous random variable X F(a) = P(X
Expected value: E(X) = fx F(x)dx. > Variance: Var(X) = f(x — E(X))? F(x)dx Uniform distribution Uniform distribution U(a, b) > A continuous random variable has a[uniform distribution) denoted X ~ U(a, ),Uniform distribution U(a, b) > A continuous random variable has a uniform distribution, denoted X ~ U(a,6), probability density function is: = 7 &® { 0 otherwise Standard uniform distribution When the uniform distribution is across the interval of 0 to 1, it is a standard uniform distribution. a Standard uniform distribution 4 > A random variable has the standard uniform distribution with minimum 0 and maximum 1 if its probability density function is given by 1 0
Verify F(x) is a pdf > F(x) 20, ford
LS Flddx = Ip faye = 1 Graph of pdf of a Standard uniform distribution U(0, 1)Cumulative distribution of Uniform distribution For X ~ U(a,b) 0 forx
b To see how you get the value of f(x) when x lies between a and b, see below: Cumulative distribution of Uniform distribution cat. Faye P(Xee) For X ~ U(a,b) It is important to note here that the slope of the line between a and b is 1/b-a. Cumulative distribution of Uniform distribution a: 2 be 4 1 og | F(X) 06 04 02 > When x < a F(x) =0.Expectation - Continuous Random variable Expectation of X ~ U(a, b) > X ~ Ula,b); atb 2 E(X) = E(X) =f? xf(x). dx Variance - Continuous Random Variable Variance of X ~ U(a, b) > X~ Ula. b); _ (b~aP Var(X) = “T > E(x?) = SP #(x), dx = fox? idx = = Bisttab = 3 > Var(X) = E(X2) - EX)? = Pigs _ (og)? = (6-2) =eNon Uniform Distribution Suppose that the number of minutes of playing time of a certain college basketball player in a randomly chosen game has the following density curve. F(x) Jou-(2-Or8, een piebas 3? ony 228 ov Triangular Distribution Triangular distribution It is now 2 p.m., and Joan is planning on studying for her statistics test until 6 p.m., when she will have to go out to dinner. However, she knows that she will probably have interruptions and thinks that the amount of time she will actually spend studying in the next 4 hours is a random variable whose probability density curve is as follows: ie Tae ek ald F(x)Exponential Distribution x
You might also like
Notes ch2 Discrete and Continuous Distributions
PDF
No ratings yet
Notes ch2 Discrete and Continuous Distributions
48 pages
Chapter Six
PDF
No ratings yet
Chapter Six
45 pages
statatics and probability chapter 3 and 4
PDF
No ratings yet
statatics and probability chapter 3 and 4
10 pages
Probability Distributions 2
PDF
No ratings yet
Probability Distributions 2
36 pages
Chapter-6
PDF
No ratings yet
Chapter-6
12 pages
1853_Random Variable & Distribution
PDF
No ratings yet
1853_Random Variable & Distribution
43 pages
Business Inferential Statistics Lessons
PDF
No ratings yet
Business Inferential Statistics Lessons
7 pages
lecture 6
PDF
No ratings yet
lecture 6
43 pages
Lecture-4 - Random Variables and Probability Distributions
PDF
67% (3)
Lecture-4 - Random Variables and Probability Distributions
42 pages
Chapter 3: Probability: Experiment
PDF
No ratings yet
Chapter 3: Probability: Experiment
12 pages
Chapter 5 Prob
PDF
No ratings yet
Chapter 5 Prob
6 pages
QT-Random Variable and Probability Distribution-1
PDF
No ratings yet
QT-Random Variable and Probability Distribution-1
4 pages
Chapter 6 PPT Bio
PDF
No ratings yet
Chapter 6 PPT Bio
39 pages
01 SOC Chapter 1 Probability Distributions - Nov 26, 2023
PDF
No ratings yet
01 SOC Chapter 1 Probability Distributions - Nov 26, 2023
41 pages
Chapter 1 (3)
PDF
No ratings yet
Chapter 1 (3)
47 pages
MetNum1 2023 1 Week 12
PDF
No ratings yet
MetNum1 2023 1 Week 12
61 pages
Probability Distribution
PDF
0% (1)
Probability Distribution
21 pages
Random Variables: Petter Mostad 2005.09.19
PDF
No ratings yet
Random Variables: Petter Mostad 2005.09.19
24 pages
Chapter 6
PDF
No ratings yet
Chapter 6
11 pages
Probability Distribution
PDF
No ratings yet
Probability Distribution
33 pages
Bus Stat CHP 6&7
PDF
No ratings yet
Bus Stat CHP 6&7
7 pages
Lesson 6
PDF
No ratings yet
Lesson 6
4 pages
Lecture Two (2)
PDF
No ratings yet
Lecture Two (2)
13 pages
Chapter 7 Eng
PDF
No ratings yet
Chapter 7 Eng
59 pages
Chap 1
PDF
No ratings yet
Chap 1
4 pages
Business Statistics Lecture 5 New
PDF
No ratings yet
Business Statistics Lecture 5 New
68 pages
CH 6
PDF
No ratings yet
CH 6
31 pages
23 4 14 Prob Distribution
PDF
100% (1)
23 4 14 Prob Distribution
132 pages
Chapter 4
PDF
No ratings yet
Chapter 4
31 pages
Lec 01
PDF
No ratings yet
Lec 01
44 pages
Sta Statistical Theory
PDF
No ratings yet
Sta Statistical Theory
104 pages
Unit V Probability Distributions
PDF
No ratings yet
Unit V Probability Distributions
52 pages
Lecture 3-Discrete random variables
PDF
No ratings yet
Lecture 3-Discrete random variables
46 pages
Lec Note E6
PDF
No ratings yet
Lec Note E6
7 pages
Unit 2 Ma 202
PDF
No ratings yet
Unit 2 Ma 202
13 pages
Topic Probability Distributions
PDF
100% (1)
Topic Probability Distributions
25 pages
BS UNIT 2 Note # 3
PDF
No ratings yet
BS UNIT 2 Note # 3
7 pages
STAT 342 Statistical Methods For Engineers
PDF
No ratings yet
STAT 342 Statistical Methods For Engineers
26 pages
A3 - Random Variables and Distributions
PDF
100% (1)
A3 - Random Variables and Distributions
19 pages
Random Variables: - Definition of Random Variable
PDF
No ratings yet
Random Variables: - Definition of Random Variable
29 pages
EDA-Discrete Probability Distribution
PDF
No ratings yet
EDA-Discrete Probability Distribution
35 pages
Notion of Random Variables and Binomial and Poisson Distribution
PDF
No ratings yet
Notion of Random Variables and Binomial and Poisson Distribution
36 pages
Chapter 3 - Special Probability Distributions
PDF
No ratings yet
Chapter 3 - Special Probability Distributions
45 pages
Chap 6
PDF
No ratings yet
Chap 6
38 pages
Chapter 2 Elemtry vs Probabilty Distr
PDF
No ratings yet
Chapter 2 Elemtry vs Probabilty Distr
77 pages
Class 4 SP
PDF
No ratings yet
Class 4 SP
23 pages
5 Discrete RV
PDF
No ratings yet
5 Discrete RV
44 pages
Random Variables and Probability Distributions: Week IV
PDF
No ratings yet
Random Variables and Probability Distributions: Week IV
4 pages
Chap05 - Discrete Random Variables
PDF
No ratings yet
Chap05 - Discrete Random Variables
35 pages
R-6 Theory
PDF
No ratings yet
R-6 Theory
4 pages
Probability
PDF
No ratings yet
Probability
36 pages
FOW9 - SB - Note Chapter 6&7
PDF
No ratings yet
FOW9 - SB - Note Chapter 6&7
13 pages
Eda Lec 3
PDF
No ratings yet
Eda Lec 3
35 pages
4 - Probability Theory II
PDF
No ratings yet
4 - Probability Theory II
85 pages
Discrete Distribution
PDF
No ratings yet
Discrete Distribution
19 pages
RV 1
PDF
No ratings yet
RV 1
22 pages
DISC: 203 - Probability & Statistics: Lecture 7 - 9 Probability Distributions - I
PDF
No ratings yet
DISC: 203 - Probability & Statistics: Lecture 7 - 9 Probability Distributions - I
34 pages
Chapter Three
PDF
No ratings yet
Chapter Three
12 pages
Stats 1 - IITM BS Notes - Part 4
PDF
No ratings yet
Stats 1 - IITM BS Notes - Part 4
16 pages
Stats 1 - IITM BS Notes - Part 3
PDF
No ratings yet
Stats 1 - IITM BS Notes - Part 3
16 pages
Stats 1 - IITM BS Notes - Part 2
PDF
No ratings yet
Stats 1 - IITM BS Notes - Part 2
16 pages
Stats 1 - IITM BS Notes - Part 1
PDF
No ratings yet
Stats 1 - IITM BS Notes - Part 1
16 pages