An Introduction To Stochastic Approximation
Richard Combes
October 11, 2013
\[
g(x_{n+1}) = g(x_n) + \frac{g'(x_n)}{n}\,(g^* - g(x_n)).
\]
By the fundamental theorem of calculus: $(1/n)|g^* - g(x_n)| \le (g'/n)|x^* - x_n|$ (with $g'$ an upper bound on the derivative of $g$). So for $n \ge g'$, we have either $x_n \le x_{n+1} \le x^*$ or $x_n \ge x_{n+1} \ge x^*$. In both cases, $n \mapsto |g(x_n) - g^*|$ is decreasing for large $n$.
It is also noted that:
\[
x_{n+1} - x_n = \frac{1}{n}\,(g^* - g(x_n)),
\]
so that $x_n$ appears as a discretization (with discretization steps $\{1/n\}$) of the following ordinary differential equation (o.d.e.):
\[
\dot{x} = g^* - g(x).
\]
This analogy will be made precise in the next subsection.
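To make the discretization picture concrete, here is a minimal numerical sketch of the recursion; the increasing function $g(x) = \tanh(x)$ and the target $g^* = 1/2$ are illustrative assumptions, not taken from the text:

```python
import numpy as np

# A minimal sketch of the recursion x_{n+1} = x_n + (1/n)(g* - g(x_n)).
# The iterates follow the o.d.e. xdot = g* - g(x) on the time scale t(n) ~ log n.

def g(x):
    return np.tanh(x)

g_star = 0.5
x = 3.0                          # initial iterate
for n in range(1, 10**5):
    x += (g_star - g(x)) / n     # discretization step 1/n

print(x, np.arctanh(g_star))     # x approaches x* solving g(x*) = g*
```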
Iterative updates: Once again, since their updates are Markovian, stochastic approximation schemes are good models for collective learning phenomena in which a set of agents interact repeatedly and update their behavior depending on their most recent observations. This is why results on learning schemes in game theory rely heavily on stochastic approximation arguments.
Function minimization: the Kiefer-Wolfowitz scheme [4] seeks a minimizer of a function $f$ using finite differences as gradient estimates:
\[
x_{n+1} = x_n - \epsilon_n\,\frac{f(x_n + \delta_n) - f(x_n - \delta_n)}{2\delta_n}.
\]
The associated o.d.e. is $\dot{x} = -\nabla f(x)$, which admits the Liapunov function $V(x) = f(x) - f(x^*)$. With the proper step sizes (say $\epsilon_n = n^{-1}$, $\delta_n = n^{-1/3}$) it can be proven that the method converges to the minimizer: $x_n \to_{n\to\infty} x^*$ almost surely.
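The following sketch simulates this scheme with the step sizes suggested above; the quadratic objective and the Gaussian observation noise are illustrative assumptions, not taken from the text:

```python
import numpy as np

# A sketch of the Kiefer-Wolfowitz iteration with eps_n = 1/n, delta_n = n^(-1/3).

rng = np.random.default_rng(0)

def noisy_f(x):
    return (x - 2.0) ** 2 + rng.normal(scale=0.1)   # f(x) = (x - 2)^2 observed with noise

x = 0.0
for n in range(1, 10**5):
    eps, delta = 1.0 / n, n ** (-1.0 / 3.0)
    grad = (noisy_f(x + delta) - noisy_f(x - delta)) / (2.0 * delta)
    x -= eps * grad              # x_{n+1} = x_n - eps_n * finite-difference gradient

print(x)                          # close to the minimizer x* = 2
```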
Then it can be proven that the behavior of $\{x_n\}$ is still described by the o.d.e. $\dot{x} = h(x)$. Namely, the asymptotic behavior of $\{x_n\}$ is the same as in the case where all its components are updated simultaneously. This is described for instance in [1, Chapter 7].
reward $A^k_{a_1,a_2}$, where $A^1$, $A^2$ are two $A \times A$ matrices with real entries. Define the empirical distribution of actions of player $k$ at time $n$ by:
\[
p^k(a, n) = \frac{1}{n} \sum_{t=1}^{n} \mathbf{1}\{a^k_t = a\}.
\]
A natural learning scheme for agent $k$ is to assume that at time $n+1$, agent $k'$ will choose an action whose probability distribution is equal to $p^{k'}(\cdot, n)$, and to play the best action. Namely, agent $k$ assumes that $\mathbb{P}[a^{k'}_{n+1} = a] = p^{k'}(a, n)$, and chooses the action maximizing his expected payoff given that assumption.
We define
\[
g^k(\cdot, p') = \operatorname*{arg\,max}_{p \in P} \sum_{1 \le a \le A} \sum_{1 \le a' \le A} p(a)\, A^k_{a,a'}\, p'(a'),
\]
with $P$ the set of probability distributions on $\{1, \dots, A\}$. $g^k(\cdot, p')$ is the probability distribution of the action of $k$ maximizing the expected payoff, knowing that player $k'$ will play an action distributed as $p'$.
The empirical probabilities can be written recursively as:
\[
p^k(a, n+1) = p^k(a, n) + \frac{1}{n+1}\left(\mathbf{1}\{a^k_{n+1} = a\} - p^k(a, n)\right).
\]
Using the fact that $\mathbb{E}[\mathbf{1}\{a^k_{n+1} = a\}\,|\,\mathcal{F}_n] = g^k(a, p^{k'}(\cdot, n))$, we recognize that the probabilities $p$ are updated according to a stochastic approximation scheme with $\epsilon_n = 1/(n+1)$, and the corresponding o.d.e. is
\[
\dot{p} = g(p) - p.
\]
It is noted that such an o.d.e. may have complicated dynamics and might not admit a Liapunov function without further assumptions on the structure of the game (the matrices $A^1$ and $A^2$).
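As a concrete illustration of the scheme, the following sketch runs fictitious play on a $2 \times 2$ zero-sum game; the matching-pennies payoff matrices are an assumption of ours, not taken from the text. In this game the empirical frequencies are known to approach the mixed equilibrium $(1/2, 1/2)$.

```python
import numpy as np

# Fictitious play: each player best-responds to the opponent's empirical
# action distribution p^{k'}(., n).

A1 = np.array([[1.0, -1.0],
               [-1.0, 1.0]])        # payoff of player 1, indexed [a1, a2]
A2 = -A1.T                          # payoff of player 2, indexed [a2, a1]

counts = [np.ones(2), np.ones(2)]   # action counts (ones break initial ties)
for n in range(10**4):
    p1, p2 = counts[0] / counts[0].sum(), counts[1] / counts[1].sum()
    a1 = int(np.argmax(A1 @ p2))    # best response of player 1 to p^2(., n)
    a2 = int(np.argmax(A2 @ p1))    # best response of player 2 to p^1(., n)
    counts[0][a1] += 1
    counts[1][a2] += 1

print(counts[0] / counts[0].sum(), counts[1] / counts[1].sum())
```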
2.1 Assumptions
We denote by Fn the σ-algebra generated by (x0 , M0 , . . . , xn , Mn ). Namely Fn contains all
the information about the history of the algorithm up to time n.
We introduce the following assumptions:
(A1) (Lipschitz continuity of h) There exists $L \ge 0$ such that for all $x, y \in \mathbb{R}^d$: $\|h(x) - h(y)\| \le L\|x - y\|$.
(A2) (Diminishing step sizes) We have that $\sum_{n \ge 0} \epsilon_n = \infty$ and $\sum_{n \ge 0} \epsilon_n^2 < \infty$.
(A3) (Martingale difference noise) There exists $K \ge 0$ such that for all $n$ we have that $\mathbb{E}[M_{n+1} | \mathcal{F}_n] = 0$ and $\mathbb{E}[\|M_{n+1}\|^2 | \mathcal{F}_n] \le K(1 + \|x_n\|)$.
(A4) (Boundedness of the iterates) We have that $\sup_{n \ge 0} \|x_n\| < \infty$ almost surely.
(A5) (Liapunov function) There exists a positive, radially unbounded, continuously differentiable function $V : \mathbb{R}^d \to \mathbb{R}$ such that for all $x \in \mathbb{R}^d$, $\langle \nabla V(x), h(x) \rangle \le 0$, with strict inequality if $V(x) \ne 0$.
(A1) is necessary to ensure that the o.d.e. has a unique solution given an initial condition, and that the value of the solution after a given amount of time depends continuously on the initial condition. (A2) is necessary for almost sure convergence, and holds in particular for $\epsilon_n = 1/n$. (A3) is required to control the random fluctuations of $x_n$ around the solution of the o.d.e. (using the martingale convergence theorem), and holds in particular if $\{M_n\}_{n\in\mathbb{N}}$ is independent with bounded variance. (A4) is essential, and can (in some cases) be difficult to prove. We will discuss how to ensure that (A4) holds in later sections. (A5) ensures that all solutions of the o.d.e. converge to the set of zeros of $V$, and that this set is stable (in the sense of Liapunov). Merely assuming that all solutions of the o.d.e. converge to a single point does not guarantee convergence of the corresponding stochastic approximation.
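As a simple point of reference (an illustration of ours, not from the text), the linear field $h(x) = -x$ satisfies these assumptions:
\[
\|h(x) - h(y)\| = \|x - y\| \quad \text{(A1 with } L = 1\text{)}, \qquad
V(x) = \tfrac{1}{2}\|x\|^2, \quad \langle \nabla V(x), h(x) \rangle = -\|x\|^2 \le 0,
\]
with strict inequality whenever $V(x) \ne 0$, so (A5) holds; (A3) holds for instance when $\{M_n\}$ is i.i.d. centered Gaussian noise.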
The proof of theorem 1 is based on an intermediate result stating that the sequence $\{x_n\}$ (suitably interpolated) remains arbitrarily close to the solution of the o.d.e. We define $\Phi_t(x)$ the value at $t$ of the unique solution to the o.d.e. starting at $x$ at time $0$. $\Phi$ is uniquely defined because of (A1) and the Picard–Lindelöf theorem. We define $t(n) = \sum_{k=0}^{n-1} \epsilon_k$, and $x(t)$ the interpolated version of $\{x_n\}_{n\in\mathbb{N}}$. Namely, for all $n$, $x(t(n)) = x_n$, and $x$ is piecewise linear. We define $x^n(t) = \Phi_{t - t(n)}(x_n)$, the o.d.e. trajectory started at $x_n$ at time $t(n)$.
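Before the proof, here is a numerical sketch of this tracking phenomenon under illustrative assumptions of ours: $h(x) = -x$ (so $\Phi_s(x) = x e^{-s}$), $\epsilon_k = 1/(k+1)$, and i.i.d. standard Gaussian noise.

```python
import numpy as np

# On a window [t(n), t(n)+T], the interpolated iterates stay close to the
# o.d.e. trajectory started at x_n; the gap shrinks as n grows.

rng = np.random.default_rng(1)

def eps(k):
    return 1.0 / (k + 1)

x, n = 2.0, 10**4
for k in range(n):                   # run the scheme up to a large index n
    x += eps(k) * (-x + rng.normal())

x_n, s, gap, k = x, 0.0, 0.0, n      # track Phi over a window of length T = 1
while s < 1.0:
    x += eps(k) * (-x + rng.normal())
    s += eps(k)
    gap = max(gap, abs(x - x_n * np.exp(-s)))
    k += 1

print(gap)                            # small, and decreases for larger n
```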
Proof of lemma 1: Since the result holds almost surely, we consider a fixed sample path throughout the proof. Define $m = \inf\{k : t(k) > t(n) + T\}$, so that we can prove the result for $T = t(m) - t(n)$ and consider the time interval $[t(n), t(m)]$. Consider $n \le k \le m$; we are going to start by bounding the difference between $x$ and $x^n$ at time instants $t \in \{t(n), \dots, t(m)\}$, that is $\sup_{n \le k \le m} \|x_k - x^n(t(k))\|$.
We start by re-writing the definitions of $x_k$ and $x^n(t(k))$:
\[
x_k = x_n + \sum_{u=n}^{k-1} \epsilon_u h(x_u) + \sum_{u=n}^{k-1} \epsilon_u M_{u+1},
\qquad
x^n(t(k)) = x_n + \sum_{u=n}^{k-1} \int_{t(u)}^{t(u+1)} h(x^n(v))\,dv,
\]
where we recall that $\int_{t(u)}^{t(u+1)} dv = \epsilon_u$.
Our goal is to upper bound the following difference, decomposed into 3 terms:
\[
C_k = \|x^n(t(k)) - x_k\| \le A_k + \sum_{u=n}^{k-1} B_u + L \sum_{u=n}^{k-1} \epsilon_u C_u, \qquad (1)
\]
with:
\[
A_k = \Big\|\sum_{u=n}^{k-1} \epsilon_u M_{u+1}\Big\|, \qquad
B_u = \int_{t(u)}^{t(u+1)} \|h(x^n(v)) - h(x^n(t(u)))\|\,dv.
\]
From (A3), $\mathbb{E}[\|M_{n+1}\|^2 | \mathcal{F}_n] \le K(1 + \sup_k \|x_k\|) < \infty$. Therefore the sequence $\{S_n\}$, with $S_n = \sum_{u=0}^{n-1} \epsilon_u M_{u+1}$, is a square integrable martingale:
\[
\sum_{n \ge 0} \mathbb{E}[\|S_{n+1} - S_n\|^2 | \mathcal{F}_n] \le K(1 + \sup_n \|x_n\|) \sum_{n \ge 0} \epsilon_n^2 < \infty.
\]
Using the martingale convergence theorem (lemma 4), we have that $S_n$ converges almost surely to a finite value $S_\infty$. This implies that:
\[
\sup_{k \ge n} A_k = \sup_{k \ge n} \|S_k - S_n\| \to_{n \to \infty} 0 \ \text{a.s.}
\]
Therefore, until the end of the proof we choose $n$ large enough so that $A_k \le \delta/2$ for all $k \ge n$, with $\delta > 0$ arbitrarily small.
The discretization term: maximal slope of $x^n$.
In order to upper bound $B_u$, we prove that for $t \in [t(u), t(u+1)]$, $x^n(t)$ can be approximated by $x^n(t(u))$ (up to a term proportional to $\epsilon_u$). To do so we have to bound the maximal slope of $t \mapsto x^n(t)$ on $[t(n), t(m)]$. We know that $\|x^n(t(n))\| = \|x_n\| \le \sup_{n \in \mathbb{N}} \|x_n\|$, which is finite by (A4). Using the fact that $h$ is Lipschitz and applying Gronwall's inequality (lemma 2), there exists a constant $K_T > 0$ such that:
\[
\sup_{t \in [t(n), t(m)]} \|h(x^n(t))\| \le K_T.
\]
We have used the fact that $h$ is Lipschitz so it grows at most linearly: for all $x$, $\|h(x) - h(0)\| \le L\|x\|$, so that $\|h(x)\| \le \|h(0)\| + L\|x\|$. Therefore, by the fundamental theorem of calculus, for $t \in [t(u), t(u+1)]$:
\[
\|x^n(t) - x^n(t(u))\| \le \int_{t(u)}^{t(u+1)} \|h(x^n(v))\|\,dv \le \epsilon_u K_T.
\]
Plugging the bounds on $A_k$ and $B_u$ into (1) and applying the discrete Gronwall inequality (lemma 3), we obtain for $n$ large enough:
\[
\sup_{n \le k \le m} C_k \le \delta e^{LT}.
\]
Applying the fundamental theorem of calculus twice, $x^n(t)$ can be written:
\[
x^n(t) = x^n(t(k)) + \int_{t(k)}^{t} h(x^n(v))\,dv = x^n(t(k+1)) - \int_{t}^{t(k+1)} h(x^n(v))\,dv.
\]
Therefore the error due to linear interpolation can be upper bounded as follows: for $t \in [t(k), t(k+1)]$, $x(t)$ lies between $x_k$ and $x_{k+1}$, while $x^n(t)$ is within $\epsilon_k K_T$ of both $x^n(t(k))$ and $x^n(t(k+1))$, so that
\[
\sup_{t \in [t(n), t(m)]} \|x(t) - x^n(t)\| \le \sup_{n \le k \le m} C_k + K_T \sup_{k \ge n} \epsilon_k \le \delta e^{LT} + K_T \sup_{k \ge n} \epsilon_k,
\]
which can be made arbitrarily small by choosing $n$ large. This concludes the proof of lemma 1.
3 Appendix
3.1 Ordinary differential equations
We state here two basic results on o.d.e.s used in the proof of the main theorem.
Lemma 2 (Gronwall's inequality, continuous case). Consider $K, L \ge 0$ and a continuous function $u : [0, T] \to \mathbb{R}^+$ such that for all $t \in [0, T]$:
\[
u(t) \le K + L \int_0^t u(s)\,ds.
\]
Then for all $t \in [0, T]$: $u(t) \le K e^{Lt}$.
Lemma 3 (Gronwall’s inequality, discrete case). Consider K ≥ 0 and positive sequences
{xn } , {n } such that for all 0 ≤ n ≤ N :
n
X
xn+1 ≤ K + n xn .
u=0
PN
Then we have the upper bound: sup0≤n≤N xn ≤ Ke n=0 n .
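For completeness, the discrete case follows from a standard induction argument (a sketch of ours, not spelled out in the text):
\[
x_{n+1} \le K \prod_{u=0}^{n} (1 + \epsilon_u) \le K \exp\Big(\sum_{u=0}^{n} \epsilon_u\Big),
\]
where the first inequality is proven by induction on $n$ and the second uses $1 + y \le e^y$.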
3.2 Martingales
We state the martingale convergence theorem, which is required to control the random fluctuations of the stochastic approximation in the proof of the main theorem.
Consider a sequence of $\sigma$-fields $\mathcal{F} = (\mathcal{F}_n)_{n\in\mathbb{N}}$, and $\{M_n\}_{n\in\mathbb{N}}$ a sequence of random variables in $\mathbb{R}^d$. We say that $\{M_n\}_{n\in\mathbb{N}}$ is an $\mathcal{F}$-martingale if $M_n$ is $\mathcal{F}_n$-measurable and $\mathbb{E}[M_{n+1} | \mathcal{F}_n] = M_n$. The following theorem (due to Doob) states that if the sum of squared increments of a martingale is finite (in expectation), then this martingale has a finite limit a.s.

Lemma 4 (Martingale convergence theorem). Let $\{M_n\}_{n\in\mathbb{N}}$ be a square integrable $\mathcal{F}$-martingale. If
\[
\sum_{n \ge 0} \mathbb{E}[\|M_{n+1} - M_n\|^2 | \mathcal{F}_n] < \infty \ \text{a.s.},
\]
then there exists a random variable $M_\infty \in \mathbb{R}^d$ such that $\|M_\infty\| < \infty$ a.s. and $M_n \to_{n\to\infty} M_\infty$ a.s.
References
[1] Vivek S. Borkar. Stochastic approximation: a dynamical systems viewpoint. Cambridge
University Press, 2008.
[2] G. W. Brown. Iterative solutions of games by fictitious play. Activity Analysis of Production and Allocation, 1951.
[3] Drew Fudenberg and David K. Levine. The Theory of Learning in Games. The MIT Press, 1998.
[4] J. Kiefer and J. Wolfowitz. Stochastic estimation of the maximum of a regression function. The Annals of Mathematical Statistics, 23(3):462–466, 1952.
[5] L. Ljung. Analysis of recursive stochastic algorithms. IEEE Transactions on Automatic Control, 22(4):551–575, 1977.
[6] Herbert Robbins and Sutton Monro. A stochastic approximation method. The Annals
of Mathematical Statistics, 22(3):400–407, September 1951.