EM Convergence Properties
Yu Chen
Department of Mathematics
Technical University of Munich
Germany
17.12.2020
Abstract
This thesis reviews Jeff Wu’s paper ON THE CONVERGENCE PROPERTIES OF THE
EM ALGORITHM [4], which studied two questions: (1) does the EM algorithm find a local
maximum or only a stationary value of the target likelihood over incomplete data? (2) does the
parameter sequence generated by the EM iteration converge? Wu was dedicated to correcting
the error that appeared in the paper MAXIMUM LIKELIHOOD FROM INCOMPLETE
DATA VIA THE EM ALGORITHM (WITH DISCUSSION) [2], on which his paper is based,
by presenting seven theorems and one corollary. In this thesis, we focus on studying the
relationships among these theorems.
1 Introduction
Expectation Maximization (EM), consisting of an E-step and an M-step, is an iterative algorithm
that tries to maximize the likelihood over incomplete data. The algorithm is popular not only in
statistics but also in optimization, machine learning and computer vision. Due to its wide range of
applications, it has many forms describing the E-step and M-step. Dempster, Laird and Rubin
(abbreviated DLR) [2] introduced a general form of the EM algorithm and analysed some of its
properties; however, their proof of convergence of an EM sequence is not entirely correct. Jeff Wu
corrected the error in his paper [4]. In this thesis, we adopt the general EM form from the DLR
paper and review Wu's corrected proof of the EM convergence properties.
Specifically, this thesis answers two questions regarding the convergence of EM. (1) Does
EM finally reach a global maximum, a local maximum, or merely a stationary value of the likelihood?
(2) Does the sequence generated by the EM iteration converge to a limit? The key to answering
the first question is the Global Convergence Theorem [5]. Based on this theorem, Jeff Wu derived
three theorems answering the first question [4]. In addition, four further theorems are introduced to
answer the second question. The EM sequence converges to the unique maximizer
when the likelihood is unimodal and a differentiability condition is satisfied [4].
Following DLR [2], we denote the density of the observed data by g(y|φ), where y = y(x) is the
observed incomplete data in Y and φ ∈ Ω, the parameter space. The relationship between the two
sample spaces X and Y is a many-to-one mapping from X to Y, which we discuss in detail in
section 3. One way to express this relationship is

g(y|φ) = ∫ f(x, y|φ) dx
In order to present the generalized EM, we first write down the likelihood of the incomplete obser-
vation y. To this end, we introduce the conditional density of x given y and φ (note that f(x|φ) =
f(x, y|φ), since y is determined by x):

k(x|y, φ) = f(x|φ) / g(y|φ),

so that

g(y|φ) = f(x|φ) / k(x|y, φ).
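As a concrete illustration (our own toy example, not taken from DLR or Wu), suppose the complete data is x = (y, z), where y is observed and z ∈ {1, 2} is an unobserved component label of a two-component Gaussian mixture with weights π1, π2 (π1 + π2 = 1) and unit variances. Writing N(y; μ, 1) for the normal density with mean μ and unit variance, we then have

f(x|φ) = πz N(y; μz, 1),    g(y|φ) = π1 N(y; μ1, 1) + π2 N(y; μ2, 1),

and k(x|y, φ) = πz N(y; μz, 1)/g(y|φ) is simply the posterior probability (the "responsibility") of label z given y under the parameter φ = (π1, μ1, μ2).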
Then the log likelihood L(φ′) is

L(φ′) = log g(y|φ′)
      = E_{x∼k(x|y,φ)} [log g(y|φ′)]
      = E_{x∼k(x|y,φ)} [log( f(x|φ′) / k(x|y, φ′) )]
      = E_{x∼k(x|y,φ)} [log f(x|φ′) − log k(x|y, φ′)]
      = E{log f(x|φ′) | y, φ} − E{log k(x|y, φ′) | y, φ}
      = Q(φ′|φ) − H(φ′|φ),                                (1)

where we assume that Q(φ′|φ) and H(φ′|φ) exist for all pairs (φ′, φ).
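Continuing the toy mixture example above, the expectation over x ∼ k(x|y, φ) reduces to a sum over the two possible labels, so for a single observation y

Q(φ′|φ) = Σ_{z=1}^{2} k(z|y, φ) log f(y, z|φ′),    H(φ′|φ) = Σ_{z=1}^{2} k(z|y, φ) log k(z|y, φ′),

and one can check directly that Q(φ′|φ) − H(φ′|φ) = log g(y|φ′) = L(φ′) for every fixed φ, in agreement with (1).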
Now we are interested in maximizing the log likelihood L(φ′) = Q(φ′|φ) − H(φ′|φ) over φ′. The
EM algorithm solves this via an iterative process:

φp → φp+1 ∈ M(φp)
However, Q(φ|φp) can be very complex, and it may not be numerically feasible to maximize it, so
we need a more general way to describe the EM algorithm [4]. Since we are only interested in the
convergence properties of the EM algorithm, we do not care about the specific method used for the
M-step; we only need the properties that the EM must satisfy, which are collected in the Generalized EM (GEM).
Dempster, Laird and Rubin (1977) defined the GEM algorithm in their DLR paper [2] as an
iterative scheme:
φp+1 ∈ M (φp )
where φ → M(φ) is a point-to-set map such that

Q(φ′|φ) ≥ Q(φ|φ)  ∀ φ′ ∈ M(φ)                      (2)
So we see that EM is a special case of the GEM. Moreover, two properties of the GEM have
been summarized in DLR (Theorem 1 and Lemma 1): a GEM iteration never decreases the likelihood,
i.e. L(φ′) ≥ L(φ) for every φ′ ∈ M(φ), and

H(φ|φ) ≥ H(φ′|φ)  ∀ φ′ ∈ Ω                          (3)
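To make the scheme concrete, the following sketch implements an EM (hence GEM) iteration for the toy two-component mixture above, with equal weights and unit variances so that only the two means are updated; the model, the function names and the data are our own choices for illustration and are not part of DLR or Wu.

    import numpy as np

    def e_step(y, mu, sigma=1.0):
        # E-step: responsibilities k(z|y, phi) for the toy equal-weight,
        # unit-variance two-component Gaussian mixture
        dens = np.exp(-0.5 * (y[:, None] - mu[None, :]) ** 2 / sigma**2)
        return dens / dens.sum(axis=1, keepdims=True)

    def m_step(y, resp):
        # M-step: the global maximizer of Q(.|phi_p) has a closed form here
        # (responsibility-weighted means), so phi_{p+1} satisfies (2)
        return resp.T @ y / resp.sum(axis=0)

    def log_likelihood(y, mu, sigma=1.0):
        # incomplete-data log likelihood L(phi) = sum_i log g(y_i | phi)
        dens = np.exp(-0.5 * (y[:, None] - mu[None, :]) ** 2 / sigma**2)
        return np.sum(np.log(0.5 * dens.sum(axis=1) / np.sqrt(2 * np.pi * sigma**2)))

    def em(y, mu0, n_iter=200):
        mu = np.asarray(mu0, dtype=float)
        for _ in range(n_iter):
            mu = m_step(y, e_step(y, mu))      # phi_{p+1} in M(phi_p)
        return mu

    rng = np.random.default_rng(0)
    y = np.concatenate([rng.normal(-2, 1, 200), rng.normal(3, 1, 200)])
    mu_hat = em(y, mu0=[-1.0, 1.0])
    print(mu_hat, log_likelihood(y, mu_hat))

Along the iterates, L(φp) can be checked to increase monotonically, which is exactly the GEM property L(φ′) ≥ L(φ) noted above.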
To analyze the convergence of a GEM sequence, the following assumptions on Ω and L are imposed [4]:
• 1) Ω is a subset of the q-dimensional Euclidean space Rq
• 2) Ωφ0 = {φ ∈ Ω : L(φ) ≥ L(φ0 )} is compact for any L(φ0 ) > −∞
• 3) L(φ) is continuous in Ω and differentiable in the interior of Ω
• 4) {L(φp )}p≥0 is bounded above for any φ0 ∈ Ω
• 5) φp is in the interior of Ω, int(Ω)
• 6) φp converges to some φ∗ ∈ int(Ω) such that the Hessian matrices ∇²Q(φ∗|φ∗) and
∇²H(φ∗|φ∗) (taken with respect to the first argument) exist, and ∇²Q(φ′|φ) is continuous in (φ′, φ)
where assumption 4) is a consequence of the previous three assumptions. Assumption 6) gives us
the tools to analyze whether L∗ is a global maximum, a local maximum, or just a stationary value.
In the M-step of EM, we globally maximize Q(φ′|φ) for the current φ, so based on assumption 6),
∇²Q(φ∗|φ∗) is non-positive definite (n.p.d.). According to Lemma 2 of DLR,
−∇²H(φ∗|φ∗) is non-negative definite (n.n.d.). Therefore, the Hessian matrix of the log-likelihood,
∇²L(φ∗) = ∇²Q(φ∗|φ∗) − ∇²H(φ∗|φ∗), may not be n.p.d., i.e. φ∗ may not be a local maximum of L.
Murray [3] gave an example in which the EM converges to a stationary value rather than a
local maximum.
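A quick numerical check (our own toy numbers, not from either paper) illustrates why the difference of two n.p.d. matrices need not be n.p.d.:

    import numpy as np

    Q_hess = np.array([[-1.0, 0.0], [0.0, -5.0]])   # stands in for grad^2 Q(phi*|phi*), n.p.d.
    H_hess = np.array([[-4.0, 0.0], [0.0, -1.0]])   # stands in for grad^2 H(phi*|phi*), n.p.d.
    L_hess = Q_hess - H_hess                        # grad^2 L(phi*) = grad^2 Q - grad^2 H
    print(np.linalg.eigvalsh(L_hess))               # [-4.  3.]: indefinite, so not n.p.d.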
Now, to answer the global/local/stationary question, we introduce the notion of a point-to-set map M
on a set X, i.e. M maps points of X to subsets of X. M is called closed at x∗ if xk ∈ X,
xk → x∗, yk ∈ M(xk) and yk → y∗ together imply y∗ ∈ M(x∗). With this concept, we introduce
the following theorem.
Global Convergence Theorem.
• {xk}∞k=0 is generated by xk+1 ∈ M(xk), where M is a p2s map on a set X
• Solution set Γ ⊂ X
Suppose:
- 1) xk ∈ S for all k, where S ⊂ X is compact
- 2) M is closed over ΓC
- 3) there is a continuous function α on X such that
a) if x ∈ ΓC , then α(y) > α(x) ∀ y ∈ M(x);
b) if x ∈ Γ, then α(y) ≥ α(x) ∀ y ∈ M(x).
Then all limit points of {xk} are in Γ, and α(xk) converges monotonically to α(x∗) for some x∗ ∈ Γ.
PROOF see Appendix. Figure 1 helps to understand the relationships among the concepts in the
theorem and illustrates the first consequence, namely that the limit points of the sequence {xk} fall into
the solution set Γ.
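As a toy illustration of the theorem (our own example, not from Zangwill or Wu), take gradient ascent on a smooth function as the map M — a continuous point-to-point map, which is automatically closed — and the objective itself as the ascent function α:

    import numpy as np

    def alpha(x):
        # ascent function: its stationary points are x = -1, 0, 1
        return -0.25 * x**4 + 0.5 * x**2

    def M(x, step=0.1):
        # one gradient-ascent step; a continuous point-to-point map is closed
        return x + step * (x - x**3)

    x = 0.3                                  # initial point x_0
    for k in range(200):
        x_new = M(x)
        assert alpha(x_new) >= alpha(x)      # the ascent property, condition 3)
        x = x_new
    print(x, alpha(x))                       # approaches the stationary point x* = 1

Here α(xk) increases monotonically and the iterates approach the solution set Γ of stationary points, mirroring the two conclusions of the theorem.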
Now, we set M as the point-to-set map in a GEM iteration and set the function α as the
log-likelihood function L. Additionally, let the solution set Γ be one of the following:
• M: the set of local maxima in the interior of Ω
• ϕ: the set of stationary points in the interior of Ω
Then we get Theorem 1 as a special case of the Global Convergence Theorem.
Theorem 1
φp is a GEM sequence generated by φp+1 ∈ M (φp ), and suppose:
1) M is a closed p2s map over ϕC (or MC)
2) L(φp+1) > L(φp) ∀ φp ∉ ϕ (or M)
Then all limit points of {φp} are stationary points (or local maxima) of L, and L(φp) converges
monotonically to L∗ = L(φ∗) for some φ∗ ∈ ϕ (or M).
Note that if Q(ψ|φ) is continuous in both ψ and φ, then condition 1) in Theorem 1 is satisfied [4].
In fact, for an EM sequence this continuity also implies condition 2) of Theorem 1, which gives Theorem 2.
Theorem 2
If Q(ψ|φ) is continuous in ψ and φ, then all limit points of {φp } in an EM are stationary points of
L, and L(φp ) converges monotonically to L∗ = L(φ∗ ) for some point φ∗ .
PROOF
Condition 1) of Theorem 1 already holds, so we only need to verify condition 2).
∵ H(φ|φ) ≥ H(φ′|φ) ∀ φ′ ∈ Ω (equation (3)), i.e. H(φ′|φp) is maximized at φ′ = φp
∴ ∇H(φp|φp) = 0
∴ ∇L(φp) = ∇Q(φp|φp) ≠ 0 ∀ φp ∉ ϕ
∵ the M-step of EM globally maximizes Q(·|φp), so Q(φp+1|φp) > Q(φp|φp) whenever ∇Q(φp|φp) ≠ 0, and H(φp|φp) ≥ H(φp+1|φp)
∴ L(φp+1) > L(φp)
Theorem 2 can easily be applied because the continuity condition is not too strong. For example,
if the complete-data density f(x|φ) belongs to the curved exponential family, then the continuity
condition holds.
Curved Exponential Family
X is a random vector with p.d.f. f(x|φ) on a sample space X, of the form

f(x|φ) = A(φ) exp( Σ_{i=1}^{k} Ti(x) ηi(φ) ) h(x),

where each Ti(x) is a real-valued statistic, each ηi(φ) is a real-valued function on the parameter space
Ω ⊆ Rq, and q < k ∈ N.
If covφ(T⃗) (with T⃗ = [T1, T2, . . . , Tk]) is positive definite, then X belongs to the curved exponential family.
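A standard instance (our own illustration) is the normal family N(μ, μ²) with a single parameter φ = μ > 0, so that q = 1 < k = 2:

f(x|φ) = (2πμ²)^(−1/2) e^(−1/2) exp( x·(1/μ) + x²·(−1/(2μ²)) ),

i.e. T1(x) = x, T2(x) = x², η1(φ) = 1/μ, η2(φ) = −1/(2μ²), A(φ) = (2πμ²)^(−1/2) e^(−1/2) and h(x) ≡ 1. Since x and x² are not affinely related under a normal distribution, covφ(T⃗) is positive definite, so this family is a curved exponential family.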
Note that Theorem 2 does NOT apply to M (consider some φp ∈ ϕ with φp ∉ M). So how can we
ensure that the sequence L(φp) converges to a local maximum? We need another condition, which
leads to the following theorem.
Theorem 3
If Q(ψ|φ) is continuous in ψ and φ, and sup_{φ′∈Ω} Q(φ′|φ) > Q(φ|φ) ∀ φ ∈ ϕ\M,
then all limit points of {φp} in an EM are local maxima of L, and L(φp) converges monotonically
to L∗ = L(φ∗) for some local maximum φ∗.
However, the new condition in Theorem 3 is hard to verify in real applications. Therefore,
Theorem 1 is the most general answer to our first question, and Theorem 2 provides a practical
basis for applications.
We have now answered the first question: it is not easy to guarantee that L(φp)
converges to a local maximum. However, we still do not know whether the sequence {φp} generated
by the EM process converges to a specific point. Even though we know that L(φp) converges
to L∗, this does not imply the convergence of the GEM (EM) sequence {φp}, because
this sequence is generated by a point-to-set map M. We study this question in the next section.
To study the limit points of {φp}, define, for a real number a,

ϕ(a) = {φ ∈ ϕ : L(φ) ≡ a},    M(a) = {φ ∈ M : L(φ) ≡ a}
According to these definitions, if L(φp) → L∗, then the limit points of φp are in ϕ(L∗) (or
M(L∗)). Notice that if ϕ(L∗) (or M(L∗)) consists of a single point φ∗, then lim_{p→∞} φp = φ∗, so we
have the following theorem.
Theorem 4
φp is a GEM sequence generated by φp+1 ∈ M (φp ), and suppose:
1) M is a closed p2s map over ϕC (or MC)
2) L(φp+1) > L(φp) ∀ φp ∉ ϕ (or M)
If ϕ(L∗) = {φ∗} (or M(L∗) = {φ∗}), where L∗ = lim_{p→∞} L(φp), then lim_{p→∞} φp = φ∗.
Note that conditions 1) and 2) in Theorem 4 are the two conditions from Theorem 1; we only
introduce the extra condition ϕ(L∗) = {φ∗} (or M(L∗) = {φ∗}). This new condition can be
relaxed to lim_{p→∞} ||φp+1 − φp|| = 0, which gives the next theorem, Theorem 5.
Theorem 5
φp is a GEM sequence generated by φp+1 ∈ M (φp ), and suppose:
1) M is a closed p2s map over ϕC (or MC)
2) L(φp+1) > L(φp) ∀ φp ∉ ϕ (or M)
If lim_{p→∞} ||φp+1 − φp|| = 0, then all limit points of {φp} are in a connected and compact subset of
ϕ(L∗) (or M(L∗)), where L∗ = lim_{p→∞} L(φp).
(Here, a connected set is one that cannot be written as a union of two disjoint nonempty relatively open subsets.)
In particular, if ϕ(L∗) (or M(L∗)) is discrete, then φp converges to some φ∗ in ϕ(L∗) (or M(L∗)).
PROOF
see Theorem 28.1 of Ostrowski (1967) [1] and Theorem 1.
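In practice, the relaxed condition lim_{p→∞} ||φp+1 − φp|| = 0 can only be checked empirically. A minimal sketch (assuming a hypothetical user-supplied function em_step performing one E-step plus M-step, which is not defined in Wu's paper) might simply track the successive differences:

    import numpy as np

    def run_em(phi0, em_step, n_iter=500):
        # iterate phi_{p+1} = em_step(phi_p) and record ||phi_{p+1} - phi_p||
        phi = np.asarray(phi0, dtype=float)
        diffs = []
        for _ in range(n_iter):
            phi_next = em_step(phi)            # em_step is hypothetical here
            diffs.append(np.linalg.norm(phi_next - phi))
            phi = phi_next
        return phi, diffs                      # diffs -> 0 is consistent with Theorem 5

    # example with a contraction standing in for an EM update
    phi_star, diffs = run_em([0.0, 0.0], em_step=lambda p: 0.5 * p + 1.0)
    print(phi_star, diffs[-1])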
Both Theorem 4 and Theorem 5 inherit conditions 1) and 2) from Theorem 1, but we can impose a
condition that implies both of them and at the same time strengthen the conditions after "if" in
Theorems 4 and 5, which yields a new theorem, Theorem 6. To do this, we define a new set:

ψ(L) = {φ ∈ Ω : L(φ) ≡ L}
Theorem 6
φp is a GEM sequence generated by φp+1 ∈ M(φp) with ∇Q(φp+1|φp) = 0, and suppose ∇Q(φ′|φ)
is continuous in φ′ and φ.
If either (a) ψ(L∗) = {φ∗}, or (b) lim_{p→∞} ||φp+1 − φp|| = 0 and ψ(L∗) is discrete, then φp
converges to a stationary point φ∗ with L(φ∗) = L∗, the limit of L(φp).
PROOF
The continuity of ∇Q(φ′|φ) implies conditions 1) and 2) of Theorem 1.
Condition (a) or (b) in Theorem 6 is stronger than the condition after "if" in Theorem 4 or Theorem 5, respectively.
The continuity of ∇Q(φ′|φ) together with ∇Q(φp+1|φp) = 0 implies ∇L(φ∗) = ∇Q(φ∗|φ∗) = 0, so φ∗ is a stationary point.
Condition (a) in Theorem 6 can be replaced by the assumption that L(φ) is unimodal in Ω with φ∗
being the only stationary point, which gives a corollary of Theorem 6 (Theorem 7).
Theorem 7
Suppose that L(φ) is unimodal in Ω with φ∗ being the only stationary point, and that ∇Q(φ′|φ)
is continuous in φ′ and φ. Then for any EM sequence {φp}, φp converges to the unique maximizer
φ∗ of L(φ).
5 Summary
• (1) For an EM sequence {φp } that increases the likelihood L(φp ), if L(φp ) is bounded above,
it converges to some L∗ .
• (2) If Q(ψ|φ) is continuous in ψ and φ, then all limit points of {φp } in an EM are stationary
points of L, and L(φp ) converges monotonically to L∗ = L(φ∗ ) for some point φ∗ . The curved
exponential family satisfies the continuity condition of Q. Additionally, if {φp} converges to
some limit φ∗, then φ∗ is a stationary point under the condition that ∇Q(φ′|φ) is continuous
in φ′ and φ.
• (3) To ensure that the limit of L(φp) is not merely a stationary value but a local maximum under
the conditions of the previous item, we need the additional condition sup_{φ′∈Ω} Q(φ′|φ) > Q(φ|φ) ∀ φ ∈
ϕ\M. However, this condition is hard to verify in practice. To deal with this issue, Jeff Wu
suggests launching the EM algorithm from several representative initial points in the parameter
space, because whether EM gets trapped at stationary points that are not local maxima depends
strongly on the initializers (see the sketch following this summary).
• (4) In addition to item (2), if either (a) ψ(L∗) = {φ∗}, or (b) lim_{p→∞} ||φp+1 − φp|| = 0 and
ψ(L∗) is discrete, then φp converges to a stationary point φ∗ with L(φ∗) = L∗, the
limit of L(φp).
• (5) If L(φ) is unimodal in Ω with φ∗ being the only stationary point and ∇Q(φ′|φ) is
continuous in φ′ and φ, then for any EM sequence {φp}, φp converges to the unique maximizer
φ∗ of L(φ).
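The multiple-initialization strategy mentioned in item (3) can be sketched as follows for the toy two-component mixture used earlier; the model and the choice of starting points are our own assumptions for illustration:

    import numpy as np

    def em_two_means(y, mu0, n_iter=200, sigma=1.0):
        # EM for a toy equal-weight, unit-variance two-component Gaussian mixture;
        # only the two component means are estimated
        mu = np.asarray(mu0, dtype=float)
        for _ in range(n_iter):
            dens = np.exp(-0.5 * (y[:, None] - mu[None, :]) ** 2 / sigma**2)
            resp = dens / dens.sum(axis=1, keepdims=True)           # E-step
            mu = resp.T @ y / resp.sum(axis=0)                       # M-step
        dens = np.exp(-0.5 * (y[:, None] - mu[None, :]) ** 2 / sigma**2)
        loglik = np.sum(np.log(0.5 * dens.sum(axis=1) / np.sqrt(2 * np.pi * sigma**2)))
        return mu, loglik

    rng = np.random.default_rng(1)
    y = np.concatenate([rng.normal(-2, 1, 300), rng.normal(2, 1, 300)])

    # launch EM from several representative initial points and keep the best run
    starts = [[-3.0, 3.0], [0.1, 0.2], [5.0, 6.0]]
    runs = [em_two_means(y, mu0) for mu0 in starts]
    best_mu, best_loglik = max(runs, key=lambda r: r[1])
    print(best_mu, best_loglik)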
6 Appendix
6.1 Proof of Global Convergence Theorem
Firstly, we prove the second consequence, that α(xk) converges monotonically to α(x∗) for some
limit point x∗ of {xk}∞k=0. Since all xk lie in the compact set S, such a limit point x∗ exists, and
there is a subsequence {xkj}∞j=0 such that lim_{j→∞} xkj = x∗. Since the ascent function α(·) is
continuous, we have lim_{j→∞} α(xkj) = α(x∗). Additionally, by condition 3) of the theorem, α is
monotonically increasing along {xk}∞k=0, so α(x∗) ≥ α(xk) ∀ k. Since lim_{j→∞} xkj = x∗, for any
ε > 0 there exists a j0 such that α(x∗) − α(xkj) < ε for all j ≥ j0. Hence, for all k ≥ kj0,

α(x∗) − α(xk) = α(x∗) − α(xkj0) + α(xkj0) − α(xk) < ε,

because α(xkj0) − α(xk) ≤ 0 by monotonicity. This implies that α(xk) converges monotonically to α(x∗).
Secondly, we prove x∗ ∈ Γ by contradiction. Suppose x∗ ∉ Γ, and consider the sequence
{xkj+1}∞j=0, which satisfies xkj+1 ∈ M(xkj). Since every xkj+1 lies in the compact set S, this sequence
has a convergent subsequence with limit x∗∗; passing to the corresponding subsequence of {xkj}, we
still have xkj → x∗. Moreover, M is closed on X\Γ and x∗ ∉ Γ, so x∗∗ ∈ M(x∗). By the previous part,
lim_{k→∞} α(xk) = α(x∗), so α(x∗∗) = α(x∗), which contradicts a) of condition 3) in the theorem, since
x∗ ∉ Γ and x∗∗ ∈ M(x∗) would require α(x∗∗) > α(x∗).
References
[1] A. M. Ostrowski. Solution of Equations and Systems of Equations. Academic Press, 1967.
[2] Arthur P. Dempster, Nan M. Laird, and Donald B. Rubin. Maximum likelihood from incomplete
data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological),
39(1):1–22, 1977.
[3] Gordon D. Murray. Contribution to the discussion of the paper by A. P. Dempster, N. M. Laird and D. B. Rubin.
J. Roy. Statist. Soc. Ser. B, 39:27–28, 1977.
[4] C. F. Jeff Wu. On the convergence properties of the EM algorithm. The Annals of Statistics,
pages 95–103, 1983.
[5] Willard I. Zangwill. Nonlinear Programming: A Unified Approach, volume 52. Prentice-Hall,
Englewood Cliffs, NJ, 1969.