0% found this document useful (0 votes)

158 views

Poisson Distribution Explained - Intuition, Examples, and Derivation - Towards Data Science

The document discusses the Poisson distribution, its applications, and how it relates to the binomial distribution. Specifically: 1. The Poisson distribution can model the probability of a given number of discrete, independent events occurring within a fixed time interval, when the average rate of occurrences is known. 2. It is useful when only the average rate of occurrences is known, rather than both the number of trials and probability of success as required by the binomial distribution. 3. The Poisson distribution can be derived from the binomial distribution by taking the limits as the number of trials approaches infinity and the probability of success approaches zero, while their product remains constant.

Uploaded by

Qurban khaton Hussainyar

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

158 views

Poisson Distribution Explained - Intuition, Examples, and Derivation - Towards Data Science

Uploaded by

Qurban khaton Hussainyar

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Open in app Sign up Sign In

Search Medium

Published in Towards Data Science

You have 2 free member-only stories left this month. Sign up for Medium and get an extra one

Ms Aerin Follow

Jun 1, 2019 · 7 min read · · Listen

Save

Poisson Distribution — Intuition, Examples, and

Derivation
When to use a Poisson Distribution
Before setting the parameter λ and plugging it into the formula, let’s pause a second
and ask a question.

Why did Poisson have to invent the Poisson Distribution?

Why does this distribution exist (= why did he invent

this)?

When should Poisson be used for modeling?

38
1. Why did Poisson invent Poisson Distribution?
7.6K
To predict the # of events occurring in the future!

More formally, to predict the probability of a given number of events occurring in a

fixed interval of time.

If you’ve ever sold something, this “event” can be defined, for example, as a customer
purchasing something from you (the moment of truth, not just browsing). It can be
how many visitors you get on your website a day, how many clicks your ads get for the
next month, how many phone calls you get during your shift, or even how many
people will die from a fatal disease next year, etc.

Below is an example of how I’d use Poisson in real life.

Every week, on average, 17 people clap for my blog post.

I’d like to predict the # of ppl who would clap next week because I
get paid weekly by those numbers.
What is the probability that exactly 20 people (or 10, 30, 50, etc.)
will clap for the blog post next week?

2. For now, let’s assume we don’t know anything about the Poisson
Distribution. Then how do we solve this problem?
One way to solve this would be to start with the number of reads. Each person who
reads the blog has some probability that they will really like it and clap.

This is a classic job for the binomial distribution, since we are calculating the
probability of the number of successful events (claps).

A binomial random variable is the number of successes x in n repeated trials. And we

assume the probability of success p is constant over each trial.

However, here we are given only one piece of information — 17 ppl/week, which is a
“rate” (the average # of successes per week, or the expected value of x). We don’t know
anything about the clapping probability p, nor the number of blog visitors n.

Therefore, we need a little more information to tackle this problem. What more do we
need to frame this probability as a binomial problem? We need two things: the
probability of success (claps) p & the number of trials (visitors) n.

Let’s get them from the past data.

The stat of my Medium blog post about Gradient Descent

These are stats for 1 year. A total of 59k people read my blog. Out of 59k people, 888 of
them clapped.

Therefore, the # of people who read my blog per week (n) is 59k/52 = 1134. The # of
people who clapped per week (x) is 888/52 =17.

# of people who read per week (n) = 59k/52 = 1134

# of people who clap per week (x) = 888/52 = 17

Success probability (p) : 888/59k = 0.015 = 1.5%

Using the Binomial PMF, what is the probability that I’ll get exactly 20 successes (20 people
who clap) next week?
<Binomial Probability for different x’s>

╔══════╦════════════════╗
║ x ║ Binomial P(X=x)║
╠══════╬════════════════╣
║ 10 ║ 0.02250 ║
║ 17 ║ 0.09701 ║ 🡒 The average rate has the highest P!
║ 20 ║ 0.06962 ║ 🡒 Nice. 20 is also quite Likely!
║ 30 ║ 0.00121 ║
║ 40 ║ < 0.000001 ║ 🡒 Well, I guess I won’t get 40 claps..
╚══════╩════════════════╝

We just solved the problem with a binomial distribution.

Then, what is Poisson for? What are the things that only Poisson can do, but Binomial
can’t?

3. The shortcomings of the Binomial Distribution

a) A binomial random variable is “BI-nary” — 0 or 1.

In the above example, we have 17 ppl/wk who clapped. This means 17/7 = 2.4 people
clapped per day, and 17/(7*24) = 0.1 people clapping per hour.
If we model the success probability by hour (0.1 people/hr) using the binomial
random variable, this means most of the hours get zero claps but some hours will get
exactly 1 clap. However, it is also very possible that certain hours will get more than 1
clap (2, 3, 5 claps, etc.)

The problem with binomial is that it CANNOT contain more than 1 event in the unit of
time (in this case, 1 hr is the unit time). The unit of time can only have 0 or 1 event.

Then, how about dividing 1 hour into 60 minutes, and make unit time smaller, for
example, a minute? Then 1 hour can contain multiple events. (Still, one minute will
contain exactly one or zero events.)

Is our problem solved now?

Kind of. But what if, during that one minute, we get multiple claps? (i.e. someone
shared your blog post on Twitter and the traffic spiked at that minute.) Then what? We
can divide a minute into seconds. Then our time unit becomes a second and again a
minute can contain multiple events. But this binary container problem will always
exist for ever-smaller time units.

The idea is, we can make the Binomial random variable handle multiple events by
dividing a unit time into smaller units. By using smaller divisions, we can make the
original unit time contain more than one event.

Mathematically, this means n → ∞.

Since we assume the rate is fixed, we must have p → 0. Because otherwise, n*p, which
is the number of events, will blow up.

Using the limit, the unit times are now infinitesimal. We no longer have to worry
about more than one event occurring within the same unit time. And this is how we
derive Poisson distribution.

b) In the Binomial distribution, the # of trials (n) should be known beforehand.

If you use Binomial, you cannot calculate the success probability only with the rate
(i.e. 17 ppl/week). You need “more info” (n & p) in order to use the binomial PMF.
The Poisson Distribution, on the other hand, doesn’t require you to know n or p. We
are assuming n is infinitely large and p is infinitesimal. The only parameter of the
Poisson distribution is the rate λ (the expected value of x). In real life, only knowing
the rate (i.e., during 2pm~4pm, I received 3 phone calls) is much more common than
knowing both n & p.

4. Let’s derive the Poisson formula mathematically from the Binomial PMF.

Deriving Poisson from Binomial

Now you know where each component λ^k , k! and e^-

λ come from!
Finally, we only need to show that the multiplication of the first two terms n!/((n-
k)!*n^k) is 1 when n approaches infinity.
It is 1.

We got the Poisson Formula!

From https://en.wikipedia.org/wiki/Poisson_distribution

Now the Wikipedia explanation starts making sense.

Plug your own data into the formula and see if P(x)
makes sense to you!
Below is mine.

< Comparison between Binomial & Poisson >

╔══════╦═══════════════════╦═══════════════════════╗
║ k ║ Binomial P(X=k) ║ Poisson P(X=k;λ=17) ║
╠══════╬═══════════════════╬═══════════════════════╣
║ 10 ║ 0.02250 ║ 0.02300 ║
║ 17 ║ 0.09701 ║ 0.09628 ║
║ 20 ║ 0.06962 ║ 0.07595 ║
║ 30 ║ 0.00121 ║ 0.00340 ║
║ 40 ║ < 0.000001 ║ < 0.000001 ║
╚══════╩═══════════════════╩═══════════════════════╝
* You can calculate both easily here:
Binomial: https://stattrek.com/online-calculator/binomial.aspx
Poisson : https://stattrek.com/online-calculator/poisson.aspx

A few things to note:

1. Even though the Poisson distribution models rare events, the rate λ can be any
number. It doesn’t always have to be small.

2. The Poisson Distribution is asymmetric — it is always skewed toward the right.

Because it is inhibited by the zero occurrence barrier (there is no such thing as
“minus one” clap) on the left and it is unlimited on the other side.

3. As λ becomes bigger, the graph looks more like a normal distribution.

https://en.wikipedia.org/wiki/Poisson_distribution

4. The Poisson Model Assumptions

a. The average rate of events per unit time is constant.

This means the number of people who visit your blog per hour might not follow a
Poisson Distribution, because the hourly rate is not constant (higher rate during the
daytime, lower rate during the nighttime). Using monthly rate for consumer/biological
data would be just an approximation as well, since the seasonality effect is non-trivial
in that domain.

b. Events are independent.

The arrivals of your blog visitors might not always be independent. For example,
sometimes a large number of visitors come in a group because someone popular
mentioned your blog, or your blog got featured on Medium’s first page, etc. The
number of earthquakes per year in a country also might not follow a Poisson
Distribution if one large earthquake increases the probability of aftershocks.

5. Relationship between a Poisson and an Exponential distribution

If the number of events per unit time follows a Poisson distribution, then the amount
of time between events follows the exponential distribution. The Poisson distribution
is discrete and the exponential distribution is continuous, yet the two distributions are
closely related.

Let’s go deeper: Exponential Distribution Intuition

Statistics Mathematics Probability Data Science Machine Learning

Enjoy the read? Reward the writer.Beta

Your tip will go to Ms Aerin through a third-party platform of their choice, letting them know you appreciate their story.

Give a tip
Sign up for The Variable
By Towards Data Science

Every Thursday, the Variable delivers the very best of Towards Data Science: from hands-on tutorials and cutting-edge
research to original features you don't want to miss. Take a look.

By signing up, you will create a Medium account if you don’t already have one. Review
our Privacy Policy for more information about our privacy practices.

Get this newsletter

About Help Terms Privacy

Get the Medium app

(Monographs and Surveys in Pure and Applied Mathematics) A K Gupta, D K Nagar - Matrix Variate Distributions-Chapman and Hall - CRC (1999)
No ratings yet
(Monographs and Surveys in Pure and Applied Mathematics) A K Gupta, D K Nagar - Matrix Variate Distributions-Chapman and Hall - CRC (1999)
384 pages
Testing of Hypothesis A02
No ratings yet
Testing of Hypothesis A02
38 pages
4 Continuous Probability Distribution.9188.1578362393.1974
No ratings yet
4 Continuous Probability Distribution.9188.1578362393.1974
47 pages
ANTSE Class 4 Previous Year Paper (2008-2013)
No ratings yet
ANTSE Class 4 Previous Year Paper (2008-2013)
85 pages
Important Questions
No ratings yet
Important Questions
11 pages
Lecture 2 Chapter3
No ratings yet
Lecture 2 Chapter3
53 pages
Nonparametric Tests in R
No ratings yet
Nonparametric Tests in R
5 pages
Opt 428 D
No ratings yet
Opt 428 D
156 pages
Probability Theory III (B.Stat. 2017-2020)
No ratings yet
Probability Theory III (B.Stat. 2017-2020)
173 pages
Odds Ratio
No ratings yet
Odds Ratio
16 pages
Error Detection
No ratings yet
Error Detection
25 pages
Quantitative Analysis
0% (1)
Quantitative Analysis
84 pages
Slides PyConfr Bordeaux Calcagno
No ratings yet
Slides PyConfr Bordeaux Calcagno
46 pages
Ab Initio Molecular Orbital Theory Warren Hehre
No ratings yet
Ab Initio Molecular Orbital Theory Warren Hehre
8 pages
Lecture Slides 2 Descriptive Statistics
No ratings yet
Lecture Slides 2 Descriptive Statistics
149 pages
0262072629.MIT Press - Peter D. Grunwald, in Jae Myung, Mark A. P Advances in Minimum Description Length & Applications - Apr.2009
No ratings yet
0262072629.MIT Press - Peter D. Grunwald, in Jae Myung, Mark A. P Advances in Minimum Description Length & Applications - Apr.2009
455 pages
Vedantu Test Answers
No ratings yet
Vedantu Test Answers
22 pages
Probability and Statistics
100% (1)
Probability and Statistics
26 pages
Normal Distribution
No ratings yet
Normal Distribution
19 pages
Lecture 8: Gradient Descent and Logistic Regression
No ratings yet
Lecture 8: Gradient Descent and Logistic Regression
39 pages
CBSE Syllabus For Class 11 Maths 2022-23 (Revised) PDF Download
No ratings yet
CBSE Syllabus For Class 11 Maths 2022-23 (Revised) PDF Download
6 pages
LN ML Rug
No ratings yet
LN ML Rug
267 pages
Bayesian Network
No ratings yet
Bayesian Network
32 pages
Introduction To Statistics: Hazar Khogeer
No ratings yet
Introduction To Statistics: Hazar Khogeer
74 pages
An Introduction To Objective Bayesian Statistics PDF
No ratings yet
An Introduction To Objective Bayesian Statistics PDF
69 pages
29.measuring Data Similarity and Dissimilarity Introduction
No ratings yet
29.measuring Data Similarity and Dissimilarity Introduction
43 pages
B. Tech. Biotechnology Syllabus CDFST
No ratings yet
B. Tech. Biotechnology Syllabus CDFST
42 pages
Diabetic Retinopathy Detection-IRO-Journals-4 1 3
No ratings yet
Diabetic Retinopathy Detection-IRO-Journals-4 1 3
8 pages
MA3351 QB Part A - B 01 - by LearnEngineering - in
No ratings yet
MA3351 QB Part A - B 01 - by LearnEngineering - in
21 pages
Childhood Asthma Prediction Model Using SVM
No ratings yet
Childhood Asthma Prediction Model Using SVM
9 pages
MATH1510 Financial Mathematics I: Jitse Niesen University of Leeds January - May 2012
No ratings yet
MATH1510 Financial Mathematics I: Jitse Niesen University of Leeds January - May 2012
20 pages
Physics Circle Oscillations
No ratings yet
Physics Circle Oscillations
5 pages
LectureNotes LinearAlgebra
No ratings yet
LectureNotes LinearAlgebra
98 pages
Advanced Certification in Data Science and Artificial Intelligence
No ratings yet
Advanced Certification in Data Science and Artificial Intelligence
15 pages
BG3104 Set 1 2014-15
No ratings yet
BG3104 Set 1 2014-15
108 pages
Deep Learning in Mining Biological Data
100% (1)
Deep Learning in Mining Biological Data
33 pages
Neutrosophic Hypersoft Matrices With Application To Solve Multiattributive Decision-Making Problems
No ratings yet
Neutrosophic Hypersoft Matrices With Application To Solve Multiattributive Decision-Making Problems
18 pages
Mixing Process
No ratings yet
Mixing Process
147 pages
UE20CS302 Unit3 Slides
No ratings yet
UE20CS302 Unit3 Slides
308 pages
Tute 2 2021 - R - U3 - Part - VI - Compressed
No ratings yet
Tute 2 2021 - R - U3 - Part - VI - Compressed
27 pages
Applied Mathematics II Lecture Note
No ratings yet
Applied Mathematics II Lecture Note
40 pages
c06FunctionsAndRelations Web
No ratings yet
c06FunctionsAndRelations Web
90 pages
Mann Stats 8e PPT Ch04 (Main)
No ratings yet
Mann Stats 8e PPT Ch04 (Main)
143 pages
Homework2 Ans
No ratings yet
Homework2 Ans
5 pages
Basic Probability
No ratings yet
Basic Probability
70 pages
Quantum Sheet by Quanta Institute
No ratings yet
Quantum Sheet by Quanta Institute
130 pages
Latest Q Bank (Ed-8.5) of Maths IV
No ratings yet
Latest Q Bank (Ed-8.5) of Maths IV
28 pages
FATS-WPS Office
No ratings yet
FATS-WPS Office
10 pages
Class Syllabus ME850 AF
No ratings yet
Class Syllabus ME850 AF
8 pages
Observation of A Strange Attractor
No ratings yet
Observation of A Strange Attractor
10 pages
University of Central Punjab: Laser Physics Assignment # 1
No ratings yet
University of Central Punjab: Laser Physics Assignment # 1
7 pages
Statistical Tables and Formulae PDF
No ratings yet
Statistical Tables and Formulae PDF
93 pages
مختصر المعادلات التفاضلية الجزئية
No ratings yet
مختصر المعادلات التفاضلية الجزئية
14 pages
Nonlinear Curve Fitting
No ratings yet
Nonlinear Curve Fitting
43 pages
Itf
No ratings yet
Itf
31 pages
Lecture Notes in Statistics 149 (Daquis) (Nov 08)
No ratings yet
Lecture Notes in Statistics 149 (Daquis) (Nov 08)
81 pages
Detector Varex PaxScan 4343RC 17x17
No ratings yet
Detector Varex PaxScan 4343RC 17x17
2 pages
Syllabus STA220 - E02-USEK-Spring 2022-2023-202320-CRN 20566
No ratings yet
Syllabus STA220 - E02-USEK-Spring 2022-2023-202320-CRN 20566
3 pages
Untitled
No ratings yet
Untitled
349 pages
Lectures on the Coupling Method
From Everand
Lectures on the Coupling Method
Torgny Lindvall
No ratings yet
Poisson Distributions - Definition, Formula & Examples
No ratings yet
Poisson Distributions - Definition, Formula & Examples
10 pages
Poisson Distribution - Wikipedia
No ratings yet
Poisson Distribution - Wikipedia
57 pages
Basic Concepts of The Poisson Process
No ratings yet
Basic Concepts of The Poisson Process
8 pages
Poisson Distribution - Brilliant Math & Science Wiki
No ratings yet
Poisson Distribution - Brilliant Math & Science Wiki
2 pages
2.6 Applications of Poisson Distribution - Business Statistics
No ratings yet
2.6 Applications of Poisson Distribution - Business Statistics
2 pages
Stat 202 Assignment 2
No ratings yet
Stat 202 Assignment 2
2 pages
Queuing Theory AEM
No ratings yet
Queuing Theory AEM
11 pages
PSLP notes
No ratings yet
PSLP notes
13 pages
ALY6000 Module 6.0
No ratings yet
ALY6000 Module 6.0
54 pages
S1 Topic 6 Exam Questions 2001 - 2024
No ratings yet
S1 Topic 6 Exam Questions 2001 - 2024
91 pages
Variance MLE
No ratings yet
Variance MLE
7 pages
Statistical Tables Complete
No ratings yet
Statistical Tables Complete
30 pages
PSet6 Solutions
No ratings yet
PSet6 Solutions
6 pages
Excel For Discrete Prob. Distributions
No ratings yet
Excel For Discrete Prob. Distributions
9 pages
Question Bank PT&PR1
No ratings yet
Question Bank PT&PR1
3 pages
Statistical Model Specification
No ratings yet
Statistical Model Specification
3 pages
Chapter 4 Probability Distribution
No ratings yet
Chapter 4 Probability Distribution
29 pages
Mathematics For Informatics 4a: The Story of The Film So Far..
No ratings yet
Mathematics For Informatics 4a: The Story of The Film So Far..
6 pages
Random Variable Expectation and Variance
No ratings yet
Random Variable Expectation and Variance
30 pages
Binomial Distribution. (Application)
100% (1)
Binomial Distribution. (Application)
5 pages
Mediator Versus Moderator Variables
No ratings yet
Mediator Versus Moderator Variables
2 pages
DeepSeek图解10页
No ratings yet
DeepSeek图解10页
11 pages
PRP Rejinpaul
No ratings yet
PRP Rejinpaul
5 pages
07 GLM
No ratings yet
07 GLM
49 pages
Grade 10 April 01
No ratings yet
Grade 10 April 01
95 pages
Poisson + Multinomial
No ratings yet
Poisson + Multinomial
2 pages
Mathematics-Iii (Probability & Statistics) : Inst Ruct Ions T O Candidat Es
No ratings yet
Mathematics-Iii (Probability & Statistics) : Inst Ruct Ions T O Candidat Es
2 pages
CHAPTER 3. Some Common Probability Distributions 2023
No ratings yet
CHAPTER 3. Some Common Probability Distributions 2023
6 pages
Probability and Statistics 2 Reader PDF
No ratings yet
Probability and Statistics 2 Reader PDF
140 pages
Regression Models for Categorical Dependent Variables Using Stata 3rd Edition J. Scott Long download
100% (1)
Regression Models for Categorical Dependent Variables Using Stata 3rd Edition J. Scott Long download
55 pages
Random Variable
No ratings yet
Random Variable
39 pages
CT 111
No ratings yet
CT 111
6 pages
m 24 may
No ratings yet
m 24 may
2 pages
MTL390 L0 Introduction
No ratings yet
MTL390 L0 Introduction
12 pages
Presentation Risk Management
No ratings yet
Presentation Risk Management
53 pages