MarkovChainMonteCarloandGibbsSampling资源-CSDN文库

共1个文件

pdf：1个

Markov

Chain

Monte

Carlo

5星 · 超过95%的资源需积分: 10 98 浏览量 2011-04-11 10:52:20 上传评论 4 收藏 290KB RAR 举报

资源推荐

资源详情

资源评论

收起资源包目录

Markov Chain Monte Carlo and Gibbs Sampling.rar （1个子文件）

Markov Chain Monte Carlo and Gibbs Sampling.pdf 412KB

Markov Chain Monte Carlo

and Gibbs Sampling

Lecture Notes for EEB 581, version 26 April 2004

°B. Walsh 2004

A major limitation towards more widespread implementation of Bayesian ap-

proaches is that obtaining the posterior distribution often requires the integration

of high-dimensional functions. This can be computationally very difﬁcult, but

several approaches short of direct integration have been proposed (reviewed by

Smith 1991, Evans and Swartz 1995, Tanner 1996). We focus here on Markov

Chain Monte Carlo (MCMC) methods, which attempt to simulate direct draws

from some complex distribution of interest. MCMC approaches are so-named be-

cause one uses the previous sample values to randomly generate the next sample

value, generating a Markov chain (as the transition probabilities between sample

values are only a function of the most recent sample value).

The realization in the early 1990’s (Gelfand and Smith 1990) that one particu-

lar MCMC method, the Gibbs sampler, is very widely applicable to a broad class

of Bayesian problems has sparked a major increase in the application of Bayesian

analysis, and this interest is likely to continue expanding for sometime to come.

MCMC methods have their roots in the Metropolis algorithm (Metropolis and

Ulam 1949, Metropolis et al. 1953), an attempt by physicists to compute com-

plex integrals by expressing them as expectations for some distribution and then

estimate this expectation by drawing samples from that distribution. The Gibbs

sampler (Geman and Geman 1984) has its origins in image processing. It is thus

somewhat ironic that the powerful machinery of MCMC methods had essentially

no impact on the ﬁeld of statistics until rather recently. Excellent (and detailed)

treatments of MCMC methods are found in Tanner (1996) and Chapter two of

Draper (2000). Additional references are given in the particular sections below.

MONTE CARLO INTEGRATION

The original Monte Carlo approach was a method developed by physicists to use

random number generation to compute integrals. Suppose we wish to compute

a complex integral

h(x) dx (1a)

If we can decompose h(x) into the production of a function f(x) and a probability

2 MCMC AND GIBBS SAMPLING

density function p(x) deﬁned over the interval (a, b), then note that

h(x) dx =

f(x) p(x) dx = E

p(x)

[ f (x) ] (1b)

so that the integral can be expressed as an expectation of f(x) over the density

p(x). Thus, if we draw a large number x

, ···,x

of random variables from the

density p(x), then

h(x) dx = E

p(x)

[ f (x)]'

i=1

f(x

) (1c)

This is referred to as Monte Carlo integration.

Monte Carlo integration can be used to approximate posterior (or marginal

posterior) distributions required for a Bayesian analysis. Consider the integral

I(y)=

f(y|x)p(x)dx, which we approximate by

I(y)=

i=1

f(y |x

) (2a)

where x

are draws from the density p(x). The estimated Monte Carlo standard

error is given by

[

I(y)] =

n−1

i=1

f(y |x

) −

I(y)

(2b)

Importance Sampling

Suppose the density p(x) roughly approximates the density (of interest) q(x), then

f(x) q(x)dx =

f(x)

q(x)

p(x)

p(x)dx = E

p(x)

f(x)

q(x)

p(x)

¶¸

(3a)

This forms the basis for the method of importance sampling, with

f(x) q(x)dx '

i=1

f(x

)

q(x

)

p(x

)

(3b)

where the x

are drawn from the distribution given by p(x). For example, if we

are interested in a marginal density as a function of y, J(y)=

f(y|x)q(x)dx,

we approximate this by

J(y) '

i=1

f(y |x

)

q(x

)

p(x

)

(4)

MCMC AND GIBBS SAMPLING 3

where x

are drawn from the approximating density p.

An alternative formulation of importance sampling is to use

f(x) q(x)dx '

I =

i=1

f(x

)

i=1

, where w

g(x

)

p(x

)

(5a)

where x

are drawn from the density p(x). This has an associated Monte Carlo

variance of

Var

i=1

f(x

) −

i=1

(5b)

INTRODUCTION TO MARKOV CHAINS

Before introducing the Metropolis-Hastings algorithm and the Gibbs sampler, a

few introductory comments on Markov chains are in order. Let X

denote the value

of a random variable at time t, and let the state space refer to the range of possible

X values. The random variable is a Markov process if the transition probabilities

between different values in the state space depend only on the random variable’s

current state, i.e.,

Pr(X

t+1

= s

, ···,X

) = Pr(X

t+1

= s

)(6)

Thus for a Markov random variable the only information about the past needed

to predict the future is the current state of the random variable, knowledge of the

values of earlier states do not change the transition probability. A Markov chain

refers to a sequence of random variables (X

, ···,X

) generated by a Markov

process. A particular chain is deﬁned most critically by its transition probabilities

(or the transition kernel), P (i, j)=P(i→j), which is the probability that a

process at state space s

moves to state s

in a single step,

P (i, j)=P(i→j) = Pr(X

t+1

= s

) (7a)

We will often use the notation P (i → j) to imply a move from i to j, as many texts

deﬁne P (i, j)=P(j→i), so we will use the arrow notation to avoid confusion.

Let

(t) = Pr(X

= s

) (7b)

denote the probability that the chain is in state j at time t, and let π(t) denote the

row vector of the state space probabilities at step t. We start the chain by specifying

a starting vector π(0). Often all the elements of π(0) are zero except for a single

element of 1, corresponding to the process starting in that particular state. As

the chain progresses, the probability values get spread out over the possible state

space.

4 MCMC AND GIBBS SAMPLING

The probability that the chain has state value s

at time (or step) t +1is

given by the Chapman-Kolomogrov equation, which sums over the probability

of being in a particular state at the current step and the transition probability from

that state into state s

(t + 1) = Pr(X

t+1

= s

)

Pr(X

t+1

= s

) · Pr(X

= s

)

P (k → i) π

(t)=

P(k,i)π

(t)(7)

Successive iteration of the Chapman-Kolomogrov equation describes the evolu-

tion of the chain.

We can more compactly write the Chapman-Kolomogrov equations in matrix

form as follows. Deﬁne the probability transition matrix P as the matrix whose

i, jth element is P(i, j), the probability of moving from state i to state j, P(i → j).

(Note this implies that the rows sum to one, as

P (i, j)=

P(i→j)=1.)

The Chapman-Kolomogrov equation becomes

π(t +1)=π(t)P (8a)

Using the matrix form, we immediately see how to quickly interate the Chapman-

Kolomogrov equation, as

π(t)=π(t−1)P =(π(t−2)P)P = π(t − 2)P

(8b)

Continuing in this fashion shows that

π(t)=π(0)P

(8c)

Deﬁning the n-step transition probability p

(n)

as the probability that the process

is in state j given that it started in state insteps ago, i..e.,

(n)

= Pr(X

t+n

= s

) (8d)

it immediately follows that p

(n)

is just the ij-th element of P

Finally, a Markov chain is said to be irreducibile if there exists a positive

integer such that p

)

> 0 for all i, j. That is, all states communicate with each

other, as one can always go from any state to any other state (although it may take

more than one step). Likewise, a chain is said to be aperiodic when the number

of steps required to move between two states (say x and y) is not required to be

multiple of some integer. Put another way, the chain is not forced into some cycle

of ﬁxed length between certain states.

MCMC AND GIBBS SAMPLING 5

Example 1. Suppose the state space are (Rain, Sunny, Cloudy) and weather

follows a Markov process. Thus, the probability of tomorrow’s weather simply

depends on today’s weather, andnot any other previous days. If this is the case, the

observation that it has rained for three straight days does not alter the probability

of tomorrow weather compared to the situation where (say) it rained today but

was sunny for the last week. Suppose the probability transitions given today is

rainy are

P( Rain tomorrow

| Rain today ) = 0.5,

P( Sunny tomorrow

| Rain today ) = 0.25,

P( Cloudy tomorrow

| Rain today ) = 0.25,

The ﬁrst row of the transition probability matrix thus becomes

(0.5, 0.25, 0.25).

Suppose the rest of the transition matrix is given by

P =





0.50.25 0.25

0.500.5

0.25 0.25 0.5





Note that this Markov chain is irreducible, as all states communicate with each

other.

Suppose today is sunny. What is the expected weather two days from now? Seven

days? Here

π(0)=(0 1 0), giving

π(2) = π(0)P

=(0.375 0.25 0.375 )

and

π(7) = π(0)P

=(0.40.20.4)

Conversely, suppose today is rainy, so that π(0)=(1 0 0). The expected

weather becomes

π(2)=(0.4375 0.1875 0.375 ) and π(7)=(0.40.20.4)

Note that after a sufﬁcient amount of time, the expected weather in independent of

the starting value. In other words, the chain has reached a stationary distribution,

where the probability values are independent of the actual starting value.

As the above example illustrates, a Markov chain may reach a stationary

distribution π

∗

, where the vector of probabilities of being in any particular given

state is independent of the initial condition. The stationary distribution satisﬁes

∗

= π

∗

P (9)

评论收藏

内容反馈

zhang2322123

2012-11-04

文章的逻辑性还不错，适合初学者
x421251632

2016-08-14

学习采样知识不错的参考
litaizhi

2013-04-04

好东西，值得下载

xts616

粉丝: 6
资源: 62

Markov Chain Monte Carlo and Gibbs Sampling

Handbook of Markov Chain Monte Carlo - Steve影印版

Markov Chain Monte Carlo in Practice

Markov Chain Monte Carlo in practice

Markov Chain Monte Carlo and Gibbs Sampling.rar

Markov-Chain-Monte-Carlo

Markov Chain Monte Carlo Methods

Monte Carlo and Quasi-Monte Carlo Sampling by Christiane Lemieux

Markov Chain Monte Carlo Simulation Methods in Econometrics

Gibbs Sampling

Bayesian Computation With R

The EM Algorithm and Extensions (2nd Edition)

一本基于matlab的数理统计电子书-Crc Press - Computational Statistics Handbook With Matlab -.part4.rar

一本基于matlab的数理统计电子书-Crc Press - Computational Statistics Handbook With Matlab -.part1.rar

一本基于matlab的数理统计电子书-Crc Press - Computational Statistics Handbook With Matlab -.part2.rar

一本基于matlab的数理统计电子书-Crc Press - Computational Statistics Handbook With Matlab -.part3.rar

Ppattern Recognition and Machine Learning

网络风险压力测试：超越风险价值 (VaR) 的网络风险保险模型：网络时代的风险、不确定性和利润-研究论文

深度学习神经网络(英文版PDF教程）

BNPSeg:使用HDP-MRF的贝叶斯非参数图像分割

python大作业 含爬虫、数据可视化、地图、报告、及源码（整和为一个文件）（2014-2020全国各地区原油加工量）.rar

仿真电路以及操作方法

【纯干货啊】华为IPD流程管理(完整版).pptx

可编程语言标准IEC61131-3中文版.pdf

OFDM完整仿真过程与教程.zip

信号与系统——保研复习资料.pdf

Landsat_WRS2.zip

最新资源

python大作业含爬虫、数据可视化、地图、报告、及源码（整和为一个文件）（2014-2020全国各地区原油加工量）.rar