Chapter 15
Introduction to Stochastic Approximation Algorithms¹
Stochastic approximation algorithms are recursive update rules that can be
used, among other things, to solve optimization problems and fixed point equa-
tions (including standard linear systems) when the collected data is subject to
noise. In engineering, optimization problems are often of this type, when you
do not have a mathematical model of the system (which can be too complex)
but still would like to optimize its behavior by adjusting certain parameters.
For this purpose, you can do experiments or run simulations to evaluate the
performance of the system at given values of the parameters. Stochastic ap-
proximation algorithms have also been used in the social sciences to describe
collective dynamics: fictitious play in learning theory and consensus algorithms
can be studied using their theory. In short, it is hard to overemphasize their
usefulness. In addition, the theory of stochastic approximation algorithms, at
least when approached using the ODE method as done here, is a beautiful mix
of dynamical systems theory and probability theory. We only have time to give
you a flavor of this theory but hopefully this will motivate you to explore fur-
ther on your own. For our purpose, essentially all approximate DP algorithms
encountered in the following chapters are stochastic approximation algorithms.
We will not have time to give formal convergence proofs for all of them, but this
chapter should give you a starting point to understand the basic mechanisms
involved. Most of the material discussed here is taken from [Bor08].
15.1 Example: The Robbins-Monro Algorithm
Suppose we wish to find the root θ̄ of the function f : R → R. We can use
Newton's procedure, which generates the sequence of iterates

θ_{n+1} = θ_n − f(θ_n) / f′(θ_n).
¹ This version: October 31, 2009.
Suppose we also know a neighborhood of θ̄, where f(θ) < 0 for θ < θ̄, f(θ) > 0
for θ > θ̄, and f is nondecreasing in this neighborhood. Then if we start
at θ_0 close enough to θ̄, the following simpler (but less efficient) scheme also
converges to θ̄, and does not require the derivative of f:

θ_{n+1} = θ_n − α f(θ_n),   (15.1)
for some fixed and sufficiently small α> 0. Note that if f is itself the derivative
of a function F , these schemes correspond to Newton’s method and a fixed-
step gradient descent procedure for minimizing F , respectively (more precisely,
finding a critical point of F or root of the gradient of F ).
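To make the comparison concrete, here is a minimal sketch of both schemes on the illustrative function f(θ) = θ³ − 2 (a hypothetical example, not from the text), whose unique real root is 2^{1/3}:

```python
# A minimal sketch of the two deterministic schemes on the
# illustrative function f(theta) = theta**3 - 2 (a hypothetical
# example; its unique real root is 2**(1/3) ≈ 1.2599).

def f(theta):
    return theta**3 - 2.0

def f_prime(theta):
    return 3.0 * theta**2

# Newton's procedure: theta_{n+1} = theta_n - f(theta_n) / f'(theta_n)
theta = 1.0
for _ in range(20):
    theta -= f(theta) / f_prime(theta)
newton_root = theta

# Fixed-step scheme (15.1): theta_{n+1} = theta_n - alpha * f(theta_n)
theta = 1.0
alpha = 0.1  # fixed, sufficiently small step size
for _ in range(200):
    theta -= alpha * f(theta)
fixed_step_root = theta

print(newton_root, fixed_step_root)  # both close to 2**(1/3)
```

Note that the fixed-step scheme needs many more iterations than Newton's method, but each iteration is cheaper and no derivative is required.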
Very often in applications, we do not have access to the mathematical model
f, but we can do experiments or simulations to sample the function at particular
values of θ. These samples are typically noisy however, so that we can assume
that we have a black-box at our disposal (the simulator, the lab where we do
the experiments, etc.), which on input θ returns the value y = f(θ) + d, where
d is a noise, which will soon be assumed to be random. The point is that we
only have access to the value y, and we have no way of removing the noise from
it, i.e., of isolating the exact value of f (θ). Now suppose that we still want to
find a root of f as in the problem above, with access only to this noisy black
box.
Assume for now that we know that the noise is i.i.d. and zero-mean. A first
approach to the problem could be, for a given value of θ, to sample sufficiently
many times at the same point θ and get values y_1, …, y_N, and then form an
estimate of f(θ) using the empirical average

f(θ) ≈ (1/N) Σ_{i=1}^{N} y_i.   (15.2)
With sufficiently many samples at every iterate θ_n of (15.1), we can reasonably
hope to find approximately the root of f. The problem is that we might spend
a lot of time taking samples at points θ that are far from θ̄ and are not really
relevant, except for telling us in which direction to move next. This can be a
real issue if obtaining each sample is time-consuming or costly.
An alternative procedure, studied by Robbins and Monro [RM51]², is to
use the noisy version of f directly in a slightly modified version of
algorithm (15.1):

θ_{n+1} = θ_n − γ_n y_n,   (15.3)

where γ_n is a sequence of positive numbers converging to 0 and such that
Σ_n γ_n = ∞ (for example, γ_n = 1/(n + 1)), and y_n = f(θ_n) + d_n is the noisy
version of f(θ_n). Note that the iterates θ_n are now random variables.
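A minimal sketch of the Robbins-Monro iteration (15.3), assuming an illustrative black box y = f(θ) + d with f(θ) = θ − 2 and zero-mean Gaussian noise (these choices are for illustration, not from the text):

```python
import random

# A minimal sketch of the Robbins-Monro iteration (15.3). The black
# box returns y = f(theta) + d with f(theta) = theta - 2 and Gaussian
# zero-mean noise d; both choices are illustrative, not from the text.

random.seed(0)

def noisy_f(theta):
    return (theta - 2.0) + random.gauss(0.0, 1.0)

theta = 0.0
for n in range(100000):
    gamma_n = 1.0 / (n + 1)   # positive steps with gamma_n -> 0, sum = infinity
    theta -= gamma_n * noisy_f(theta)

print(theta)  # close to the root theta_bar = 2
```

Even though each individual observation is corrupted by noise of constant variance, the decreasing steps γ_n let the iterates settle down near the root.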
The intuition behind the decreasing step size γ_n is that it provides a sort
of averaging of the observations. For an analogy in a simpler setting, suppose
² In fact, recursive stochastic algorithms have been used in signal processing (e.g., for
smoothing radar returns) even before the work of Robbins and Monro. However, there was
apparently no general asymptotic theory.
we have i.i.d. observations ξ_1, …, ξ_N of a random variable and wish to form
their empirical average as in (15.2). A recursive alternative to (15.2), extremely
useful in settings where the samples become available progressively with time
(recall for example the Kalman filter), is to form

θ_1 = ξ_1,   θ_{n+1} = θ_n − γ_n [θ_n − ξ_{n+1}],

with γ_n = 1/(n + 1). One can immediately verify that θ_n = (Σ_{i=1}^{n} ξ_i)/n, for
all n.
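This identity is easy to check numerically; the sketch below runs the recursion on illustrative uniform samples and compares with the batch average:

```python
import random

# Sketch of the recursive-averaging identity: with gamma_n = 1/(n+1),
# theta_1 = xi_1 and theta_{n+1} = theta_n - gamma_n * (theta_n - xi_{n+1})
# reproduce the empirical average of the samples (illustrative data).

random.seed(1)
xs = [random.random() for _ in range(1000)]   # i.i.d. observations xi_i

theta = xs[0]                                 # theta_1 = xi_1
for n in range(1, len(xs)):
    gamma_n = 1.0 / (n + 1)
    theta -= gamma_n * (theta - xs[n])        # xs[n] plays the role of xi_{n+1}

empirical_average = sum(xs) / len(xs)
print(abs(theta - empirical_average))  # agrees up to floating-point error
```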
This chapter is concerned with recurrences generalizing (15.3) of the form

θ_{n+1} = θ_n + γ_n [f(θ_n) + b_n + D_{n+1}],   (15.4)

where θ_0 ∈ R^d is possibly random, f is a function R^d → R^d, b_n is a small
systematic perturbation term, such as a bias in our estimator of f(θ_n), and D_{n+1}
is a random noise with zero mean (conditioned on the past). The assumptions
and exact definitions of these terms will be made precise in section 15.3. In
applications, we are typically first interested in the asymptotic behavior of the
sequence {θ_n}.
15.2 The ODE Approach and More Application Examples
The ODE (Ordinary Differential Equation) method says roughly that if the
step sizes γ_n are appropriately chosen, the bias terms b_n decrease appropriately,
and the noise D_n is zero-mean, then the iterates (15.4) asymptotically track
the trajectories of the dynamical system³

θ̇ = f(θ).
We will give a more formal proof of this fact in the basic case in section 15.3.
Typically for the simplest proofs γ_n must be decreasing to 0 and satisfy

Σ_n γ_n = ∞,   Σ_n γ_n² < ∞.
However other choices are possible, including constant small step sizes in some
cases, and in practice the choice of step sizes requires experimentation because
it controls the convergence rate. Some theoretical results regarding convergence
rates are also available but will not be covered here. The ODE is extremely
useful in any case, even if another technique is chosen for formal convergence
proofs, in order to get a quick idea of the behavior of an algorithm. Moreover,
another big advantage of this method is that it can be used to easily create new
stochastic approximation algorithms from convergent ODEs. We now describe
a few more classes of problems where these algorithms arise.
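The heuristic can be seen at work in a small sketch: with the illustrative choice f(θ) = −θ (and b_n = 0), the ODE θ̇ = −θ is globally stable at 0, and the noisy iterates (15.4) converge there despite noise of constant variance.

```python
import random

# Sketch of the ODE heuristic on an illustrative example: with
# f(theta) = -theta the ODE theta_dot = -theta is globally stable
# at 0, and the iterates (15.4) (here with b_n = 0 and Gaussian
# zero-mean noise D_{n+1}) converge there despite the noise.

random.seed(2)

theta = 5.0
for n in range(100000):
    gamma_n = 1.0 / (n + 1)       # sum gamma_n = inf, sum gamma_n^2 < inf
    D = random.gauss(0.0, 1.0)    # zero-mean noise D_{n+1}
    theta += gamma_n * (-theta + D)

print(theta)  # close to 0, the equilibrium of the ODE
```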
³ By definition, ẋ := (d/dt) x(t).