Maximum Likelihood Estimation - I
Dr. A. Ramesh
DEPARTMENT OF MANAGEMENT STUDIES
Agenda
• This lecture will provide the intuition behind MLE using theory and
examples.
Maximum Likelihood Estimation
• The method of maximum likelihood was first introduced by R. A.
Fisher, a geneticist and statistician, in the 1920s.
• Most statisticians recommend this method, at least when the
sample size is large, since the resulting estimators have certain
desirable efficiency properties.
• Maximum likelihood estimation (MLE) is a method for finding the density
function that is most likely to have generated the data.
• MLE requires one to make a distributional assumption first.
An intuitive view on likelihood
[Figure: three density curves with parameters μ = −2, σ² = 1; μ = 0, σ² = 1; and μ = 0, σ² = 4]
Maximum Likelihood Estimation: Problem
• A sample of ten new bike helmets manufactured by a certain company is
obtained. Upon testing, it is found that the first, third, and tenth helmets
are flawed, whereas the others are not.
• Let p = P(flawed helmet), i.e., p is the proportion of all such helmets that
are flawed.
• Define (Bernoulli) random variables X1, X2, . . . , X10 by Xi = 1 if the ith helmet is flawed and Xi = 0 otherwise.
Source: Probability and Statistics for Engineering and the Sciences, Jay L Devore, 8th Ed, Cengage
Maximum Likelihood Estimation: Problem
• Then for the obtained sample, X1 = X3 = X10 = 1 and the other seven Xi’s are
all zero
• The probability mass function of any particular Xi is f(xi; p) = p^xi (1 – p)^(1 – xi),
which becomes p if xi = 1 and 1 – p when xi = 0
• Now suppose that the conditions of various helmets are independent of
one another
• This implies that the Xi’s are independent, so their joint probability mass
function is the product of the individual pmf’s.
Maximum Likelihood Estimation: Binomial Distribution
• Joint pmf evaluated at the observed Xi’s is
f(x1, . . . , x10; p) = p(1 – p)p · · · p = p^3(1 – p)^7    (1)
• Suppose that p = .25. Then the probability of observing the sample that
we actually obtained is (.25)^3(.75)^7 = .002086.
• If instead p = .50, then this probability is (.50)^3(.50)^7 = .000977.
• For what value of p is the obtained sample most likely to have occurred?
• That is, for what value of p is the joint pmf (eq 1) as large as it can be?
• What value of p maximizes (eq 1)? (A numerical check is sketched below.)
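As a quick illustration (not part of the original slides), here is a minimal Python sketch that evaluates the joint pmf p^3(1 – p)^7 at the candidate values quoted above and at p = .30:

```python
def likelihood(p):
    """Joint pmf of the observed sample: 3 flawed and 7 unflawed helmets."""
    return p**3 * (1 - p)**7

print(likelihood(0.25))  # ~0.002086, as quoted on the slide
print(likelihood(0.50))  # ~0.000977, as quoted on the slide
print(likelihood(0.30))  # ~0.002224, larger than both candidates
```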
Maximum Likelihood Estimation: Binomial Distribution
• Figure shows a graph of the likelihood (eq 1) as a function of p.
• It appears that the graph reaches its peak above p = .3 = the proportion of
flawed helmets in the sample.
[Figure: graph of the likelihood (joint pmf) (eq 1) as a function of p]
Graph of the natural logarithm of the likelihood
• Figure shows a graph of the
natural logarithm of (eq 1)
• Since ln[g(u)] is a strictly
increasing function of g(u),
finding u to maximize the
function g(u) is the same as
finding u to maximize ln[g(u)].
Maximum Likelihood Estimation: Binomial Distribution
• We can verify our visual impression by using calculus to find the value of p
that maximizes (eq 1).
• Working with the natural log of the joint pmf is often easier than working
with the joint pmf itself, since the joint pmf is typically a product so its
logarithm will be a sum.
• Here ln[f(x1, . . . , x10; p)] = ln[p^3(1 – p)^7]
= 3 ln(p) + 7 ln(1 – p)
Maximum Likelihood Estimation: Binomial Distribution
Thus
d/dp {ln[f(x1, . . . , x10; p)]} = d/dp [3 ln(p) + 7 ln(1 – p)] = 3/p – 7/(1 – p)
Interpretation
• Equating this derivative to 0 and solving for p gives
3(1 – p) = 7p, from which 3 = 10p and so p = 3/10 = .30 as conjectured
• That is, our point estimate is p̂ = .30.
• It is called the maximum likelihood estimate because it is the parameter
value that maximizes the likelihood (joint pmf) of the observed sample
• In general, the second derivative should be examined to make sure a
maximum has been obtained, but here this is obvious from the figure.
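As an illustration of the calculus argument above, here is a small Python sketch (assuming NumPy is available; not part of the original slides) that evaluates the log-likelihood 3 ln(p) + 7 ln(1 – p) and checks that its derivative 3/p – 7/(1 – p) vanishes at p̂ = 0.3:

```python
import numpy as np

def log_lik(p):
    # log of the joint pmf p^3 (1 - p)^7
    return 3 * np.log(p) + 7 * np.log(1 - p)

def d_log_lik(p):
    # derivative of the log-likelihood with respect to p
    return 3 / p - 7 / (1 - p)

p_hat = 3 / 10
print(d_log_lik(p_hat))               # 0.0: the first-order condition holds
print(log_lik(0.30) > log_lik(0.25))  # True
print(log_lik(0.30) > log_lik(0.50))  # True
```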
Maximum Likelihood Estimation: Binomial Distribution
• Suppose that rather than being told the condition of every helmet, we had
only been informed that three of the ten were flawed.
• Then we would have the observed value of a binomial random variable X =
the number of flawed helmets.
• The pmf of X is b(x; 10, p) = C(10, x) p^x(1 – p)^(10 – x). For x = 3, this becomes
b(3; 10, p) = C(10, 3) p^3(1 – p)^7
• The binomial coefficient C(10, 3) is irrelevant to the maximization, so again p̂ = 0.30 (see the sketch below).
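A short sketch (illustrative, assuming NumPy; not from the original slides) showing why the binomial coefficient does not matter: multiplying the likelihood by the constant C(10, 3) = 120 rescales the curve but leaves its maximizer unchanged.

```python
import numpy as np
from math import comb

grid = np.linspace(0.001, 0.999, 999)        # candidate values of p
bernoulli_lik = grid**3 * (1 - grid)**7      # joint pmf of the ten Bernoulli trials
binomial_lik = comb(10, 3) * bernoulli_lik   # binomial pmf of X = 3 flawed helmets

# The constant factor 120 does not move the peak of the curve
print(grid[np.argmax(bernoulli_lik)])  # ~0.30
print(grid[np.argmax(binomial_lik)])   # ~0.30
```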
Maximum Likelihood Function Definition
• Let X1, X2, …, Xn have joint pmf or pdf
f(x1, x2, …, xn; θ1, …, θm)    (a)
where the parameters θ1, …, θm have unknown values.
• When x1, …, xn are the observed sample values and (a) is regarded as a function of θ1, …, θm, it is called the likelihood function.
• The maximum likelihood estimates (mle's) θ̂1, …, θ̂m are those values of the θi's that maximize the likelihood function, so that
f(x1, x2, …, xn; θ̂1, …, θ̂m) ≥ f(x1, x2, …, xn; θ1, …, θm)  for all θ1, …, θm
• When the Xi's are substituted in place of the xi's, the maximum likelihood estimators result.
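In practice the maximization in this definition is usually carried out numerically on the log scale. A hedged sketch (assuming SciPy is available; the data and variable names are illustrative) that recovers p̂ = 0.30 for the helmet example by minimizing the negative log-likelihood:

```python
import numpy as np
from scipy.optimize import minimize_scalar

x = np.array([1, 0, 1, 0, 0, 0, 0, 0, 0, 1])   # observed helmet sample (1 = flawed)

def neg_log_lik(p):
    # negative Bernoulli log-likelihood; minimizing it maximizes the likelihood
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

result = minimize_scalar(neg_log_lik, bounds=(1e-6, 1 - 1e-6), method="bounded")
print(result.x)   # ~0.30
```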
Interpretation
• The likelihood function tells us how likely the observed sample is as a
function of the possible parameter values.
• Maximizing the likelihood gives the parameter values for which the
observed sample is most likely to have been generated—that is, the
parameter values that “agree most closely” with the observed data.
Estimation of Poisson Parameter
• Suppose we have data generated from a Poisson distribution. We want to
estimate the parameter λ of the distribution.
• The probability of observing a particular value X is
P(X; λ) = e^(–λ) λ^X / X!
• The joint likelihood is obtained by multiplying the individual probabilities together:
P(X1, X2, …, Xn; λ) = [e^(–λ) λ^X1 / X1!] [e^(–λ) λ^X2 / X2!] · · · [e^(–λ) λ^Xn / Xn!]
• Dropping the factorials, which do not depend on λ, gives
L(λ; X) = e^(–nλ) λ^(Σ Xi) = e^(–nλ) λ^(n X̄)
Estimation of Poisson Parameter
• Note that in the likelihood function the factorials have disappeared.
• This is because they are constant with respect to λ and so do not influence
the relative likelihood of different values of the parameter.
• It is usual to work with the log likelihood rather than the likelihood.
• Note that maximising the log likelihood is equivalent to maximising the likelihood.
• Take the natural log of the likelihood function:
L(λ; X) = e^(–nλ) λ^(n X̄)
ℓ(λ; X) = –nλ + n X̄ log(λ)
• Find where the derivative of the log likelihood is zero:
dℓ/dλ = –n + n X̄/λ = 0, which gives λ̂ = X̄
• Note that here the MLE is the same as the moment estimator.
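A small numerical check (simulated data, not from the slides; assuming NumPy) that the maximizer of the Poisson log-likelihood ℓ(λ; X) = –nλ + n X̄ log(λ) is indeed the sample mean:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.poisson(lam=4.2, size=500)     # simulated Poisson data, true lambda = 4.2
n, xbar = len(x), x.mean()

def log_lik(lam):
    # Poisson log-likelihood with the constant factorial terms dropped
    return -n * lam + n * xbar * np.log(lam)

grid = np.linspace(0.1, 10.0, 2000)
print(grid[np.argmax(log_lik(grid))])  # maximizer on the grid, close to xbar
print(xbar)                            # lambda-hat = sample mean
```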
Estimation of Exponential Distribution Parameter
• Suppose X1, X2, . . . , Xn is a random sample from an exponential
distribution with parameter λ. Because of independence, the likelihood
function is a product of the individual pdf's:
f(x1, . . . , xn; λ) = (λ e^(–λ x1)) · · · (λ e^(–λ xn)) = λ^n e^(–λ Σ xi)
• The natural logarithm of the likelihood function is
ln[f(x1, . . . , xn; λ)] = n ln(λ) – λ Σ xi
Estimation of Exponential Distribution Parameter
• Equating (d/dλ)[ln(likelihood)] to zero results in
n/λ – Σ xi = 0, or λ = n/Σ xi = 1/x̄
• Thus the MLE is λ̂ = 1/X̄
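A minimal sketch (simulated data; assuming NumPy; not from the slides) comparing the closed-form MLE λ̂ = 1/X̄ with a direct grid maximization of the log-likelihood n ln(λ) – λ Σ xi:

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.exponential(scale=1 / 0.5, size=1000)   # exponential data with rate lambda = 0.5

def log_lik(lam):
    # n ln(lambda) - lambda * sum(x_i)
    return len(x) * np.log(lam) - lam * x.sum()

grid = np.linspace(0.01, 5.0, 5000)
print(grid[np.argmax(log_lik(grid))])  # grid maximizer, close to 0.5
print(1 / x.mean())                    # closed-form MLE: 1 / sample mean
```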
Estimation of Parameters of Normal Distribution
• Let X1, . . . , Xn be a random sample from a normal distribution.
• The likelihood function is
f(x1, . . . , xn; μ, σ²) = (2πσ²)^(–n/2) exp[–Σ(xi – μ)²/(2σ²)]
• so
ln[f(x1, . . . , xn; μ, σ²)] = –(n/2) ln(2πσ²) – Σ(xi – μ)²/(2σ²)
Estimation of Parameters of Normal Distribution
• To find the maximizing values of μ and σ², we must take the partial derivatives
of ln(f) with respect to μ and σ², equate them to zero, and solve the resulting
two equations.
• Omitting the details, the resulting MLE's are
μ̂ = X̄ and σ̂² = Σ(Xi – X̄)²/n
• The MLE of σ² is not the unbiased estimator S² (which divides by n – 1), so two
different principles of estimation (unbiasedness and maximum likelihood) yield
two different estimators.
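A short sketch (simulated data; assuming NumPy; not from the slides) computing the normal MLEs μ̂ = X̄ and σ̂² = Σ(Xi – X̄)²/n, and contrasting the latter with the unbiased estimator S² that divides by n – 1:

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(loc=10.0, scale=3.0, size=20)    # simulated normal sample

mu_hat = x.mean()                                          # MLE of mu
sigma2_mle = np.sum((x - mu_hat)**2) / len(x)              # MLE of sigma^2 (divides by n)
sigma2_unbiased = np.sum((x - mu_hat)**2) / (len(x) - 1)   # unbiased S^2 (divides by n - 1)

print(mu_hat, sigma2_mle, sigma2_unbiased)
# np.var reproduces both: ddof=0 gives the MLE, ddof=1 the unbiased estimator
print(np.var(x, ddof=0), np.var(x, ddof=1))
```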
Thank you