ProbabilisticApproachtoInverseProblems资源-CSDN文库

需积分: 10 122 浏览量 2010-01-16 12:38:35 上传评论收藏 1.8MB PDF 举报

### 概率论方法在反问题中的应用 #### 引言与背景反问题是指根据间接观测数据估计物理系统未知参数的问题。这类问题在地球物理学、工程地震学等多个领域都有广泛应用。Albert Tarantola 和 Klaus Mosegaard 在2002年合著的手册《概率论方法在反问题中的应用》提供了深入浅出的概率理论基础及其在解决反问题中的应用。该手册可作为 Tarantola 2005年出版的经典教材的补充材料，为从事反问题研究的学者提供重要的参考依据。 #### 基本概念 - **模型参数**：指待确定的物理系统参数。 - **可观测参数**：通过实验或观测获得的数据。 - **不确定性**：包括数据误差、模型参数的先验知识等。 - **物理定律**：连接模型参数与观测数据之间的数学关系。 #### 概率论元素 1. **体积**：在概率论中，体积通常用来表示事件空间的大小。 2. **概率**：指某一事件发生的可能性。 3. **均匀概率分布**：当所有可能的结果都具有相同的概率时，则称其为均匀概率分布。 4. **概率的结合**：表示两个或多个独立事件同时发生的概率。 5. **条件概率密度**：在已知某些条件下某事件发生的概率。 6. **边际概率密度**：只考虑单个变量时的概率分布。 7. **独立性和贝叶斯定理**：独立性是指两个事件的发生互不影响；贝叶斯定理则是关于后验概率的重要定理。 #### 蒙特卡洛方法蒙特卡洛方法是一种统计抽样技术，广泛应用于反问题求解中。具体包括： 1. **随机游走**：一种模拟过程，通过随机选择方向来移动。 2. **Metropolis 规则**：一种接受或拒绝新样本的准则，用于确保样本符合目标分布。 3. **级联 Metropolis 规则**：扩展了 Metropolis 规则，允许更高效的采样。 4. **初始化随机游走**：确定初始状态的方法。 5. **收敛问题**：探讨如何判断模拟是否已经收敛到稳定状态。 #### 反问题的概率论公式化 1. **模型参数与可观测参数**：明确两者之间的联系是解决反问题的基础。 2. **模型参数的先验信息**：利用已有的知识来约束模型参数的取值范围。 3. **测量与实验不确定性**：量化观测数据中的不确定度。 4. **联合“先验”概率分布**：在模型参数与可观测参数空间中的概率分布。 5. **物理定律作为数学函数**： - 物理定律提供了模型参数与观测数据之间关系的基础。 - 反问题的目标是基于观测数据和物理定律来推断模型参数。 6. **物理定律作为概率关联**： - 当物理定律以概率形式表示时，可以通过概率分布来理解模型参数与观测数据之间的关系。 - 这种方法允许处理非线性问题和复杂不确定性。 #### 解决反问题（I）：检查概率分布 - 本章节将探讨如何通过分析模型参数和观测数据之间的概率分布来解决反问题。这包括利用蒙特卡洛方法生成大量样本，从而估计模型参数的概率分布，并评估不同假设下的模型拟合程度。该手册为反问题的研究者提供了一套完整的框架，涵盖了从基本概率理论到高级蒙特卡洛方法的应用。通过阅读和实践这些理论和技术，研究者能够更好地理解和解决复杂的反问题。

资源推荐

资源详情

资源评论

Probabilistic Approach

to Inverse Problems

∗

Klaus Mosegaard

†

& Albert Tarantola

‡

November 16, 2002

Abstract

In ‘inverse problems’ data from indirect measurements are used to estimate unknown parameters of physical

systems. Uncertain data, (possibly vague) prior information on model parameters, and a physical theory relating

the model parameters to the observations are the fundamental elements of any inverse problem. Using concepts

from probability theory, a consistent formulation of inverse problems can be made, and, while the most general

solution of the inverse problem requires extensive use of Monte Carlo methods, special hypotheses (e.g., Gaussian

uncertainties) allow, in some cases, an analytical solution to part of the problem (e.g., using the method of least

squares).

∗

This text has been published as a chapter of the International Handbook of Earthquake & Engineering Seismology (Part A),

Academic Press, 2002, pages 237–265. It is here complete, with its appendixes.

†

Niels Bohr Institute; Juliane Maries Vej 30; 2100 Copenhagen OE; Denmark; mailto:klaus@gfy.ku.dk

‡

Institut de Physique du Globe; 4, place Jussieu; 75005 Paris; France; mailto:tarantola@ipgp.jussieu.fr

Contents

1Introduction 4

1.1 General Comments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

1.2 Brief Historical Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2 Elements of Probability 6

2.1 Volume . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

2.2 Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2.3 Homogeneous Probability Distributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2.4 Conjunction of Probabilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

2.5 Conditional Probability Density . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

2.6 Marginal Probability Density . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

2.7 Independence and Bayes Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

3 Monte Carlo Methods 13

3.1 Random Walks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

3.2 The Metropolis Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

3.3 The Cascaded Metropolis Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

3.4 Initiating a Random Walk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

3.5 Convergence Issues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

4 Probabilistic Formulation of Inverse Problems 17

4.1 Model Parameters and Observable Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

4.2 Prior Information on Model Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

4.3 Measurements and Experimental Uncertainties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

4.4 Joint ‘Prior’ Probability Distribution in the (M, D) Space . . . . . . . . . . . . . . . . . . . . . . . 20

4.5 Physical Laws as Mathematical Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

4.5.1 Physical Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

4.5.2 Inverse Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

4.6 Physical Laws as Probabilistic Correlations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

4.6.1 Physical Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

4.6.2 Inverse Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

5 Solving Inverse Problems (I): Examining the Probability Density 26

6 Solving Inverse Problems (II): Monte Carlo Methods 26

6.1 Basic Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

6.2 Sampling the Homogeneous Probability Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

6.3 Sampling the Prior Probability Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

6.4 Sampling the Posterior Probability Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

7 Solving Inverse Problems (III): Deterministic Methods 28

7.1 Maximum Likelihood Point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

7.2 Misﬁt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

7.3 Gradient and Direction of Steepest Ascent . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

7.4 The Steepest Descent Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

7.5 Estimating Posterior Uncertainties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

7.6 Some Comments on the Use of Deterministic Methods . . . . . . . . . . . . . . . . . . . . . . . . . . 32

7.6.1 Linear, Weakly Nonlinear and Nonlinear Problems . . . . . . . . . . . . . . . . . . . . . . . . 32

7.6.2 The Maximum Likelihood Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

8 Conclusions 34

9Acknowledgements 35

10 Bibliography 35

AVolumetric Probability and Probability Density 38

B Conditional and Marginal Probability Densities 38

B.1 Conditional Probability Density . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

B.2 Marginal Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

C Combining Data and Theories: a Conceptual Example 40

C.1 Contemplative Approach (Conjunction of Probabilities) . . . . . . . . . . . . . . . . . . . . . . . . . 41

C.2 “Ideal Theory” (Conditional Probability Density) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

C.3 Uncertain Analytical Theory (Conjunction of Probabilities) . . . . . . . . . . . . . . . . . . . . . . . 43

D Information Content 45

E Example: Prior Information for a 1D Mass Density Model 45

F Gaussian Linear Problems 46

G The Structure of an Inference Space 47

G.1 Kolmogorov’s Concept of Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47

G.2 Inference Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

G.3 The Interpretation of the or and the and Operation . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

H Homogeneous Probability for Elastic Parameters 50

H.1 Uncompressibility Modulus and Shear Modulus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50

H.2 Young Modulus and Poisson Ratio . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

H.3 Longitudinal and Transverse Wave Velocities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

I Homogeneous Distribution of Second Rank Tensors 53

J Example of Ideal (Although Complex) Geophysical Inverse Problem 54

KAnExample of Partial Derivatives 61

L Probabilistic Estimation of Hypocenters 61

L.1 A Priori Information on Model Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61

L.2 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

L.3 Solution of the Forward Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

L.4 Solution of the Inverse Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

L.5 Numerical Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

L.6 An Example of Bimodal Probability Density for an Arrival Time. . . . . . . . . . . . . . . . . . . . . 64

MFunctional Inverse Problems 65

M.1 The Functional Spaces Under Investigation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

M.2 Duality Product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

M.3 Scalar Product in L

Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

M.4 The Transposed Operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69

M.5 The Adjoint Operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

M.6 The Green Operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

M.7 Born Approximation for the Acoustic Wave Equation . . . . . . . . . . . . . . . . . . . . . . . . . . 74

M.8 Tangent Application of Data With Respect to Parameters . . . . . . . . . . . . . . . . . . . . . . . . 76

M.9 The Transpose of the Fr´echet Derivative Just Computed . . . . . . . . . . . . . . . . . . . . . . . . . 76

M.10The Continuous Inverse Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77

N Random Walk Design 77

O The Metropolis Algorithm 77

P The Borel ‘Paradox’ 79

1Introduction

1.1 General Comments

Given a physical system, the ‘forward’ or ‘direct’ problem consists, by deﬁnition, in using a physical theory to

predict the outcome of possible experiments. In classical physics this problem has a unique solution. For instance,

given a seismic model of the whole Earth (elastic constants, attenuation, etc. at every point inside the Earth) and

given a model of a seismic source, we can use current seismological theories to predict which seismograms should

be observed at given locations at the Earth’s surface.

The ‘inverse problem’ arises when we do not have a good model of the Earth, or a good model of the seismic

source, but we have a set of seismograms, and we wish to use these observations to infer the internal Earth structure

or a model of the source (typically we try to infer both).

Many factors make the inverse problem underdetermined (non-unique). In the seismic example, two diﬀerent

Earth models may predict the same seismograms

, the ﬁnite bandwidth of our data will never allow us to resolve

very small features of the Earth model, and there are always experimental uncertainties that allow diﬀerent models

to be ‘acceptable’.

The term ‘inverse problem’ is widely used. The authors of this text only like this name moderately, as we

see the problem more as a problem of ‘conjunction of states of information’ (theoretical, experimental and prior

information). In fact, the equations used below have a range of applicability well beyond ‘inverse problems’:

they can be used, for instance, to predict the values of observations in a realistic situation where the parameters

describing the Earth model are not ‘given’, but only known approximately.

We take here a probabilistic point of view. The axioms of probability theory apply to diﬀerent situations. One is

the traditional statistical analysis of random phenomena, another one is the description of (more or less) subjective

states of information on a system. For instance, estimation of the uncertainties attached to any measurement

usually involves both uses of probability theory: some uncertainties contributing to the total uncertainty are

estimated using statistics, while some other uncertainties are estimated using informed scientiﬁc judgement about

the quality of an instrument, about eﬀects not explicitly taken into account, etc. The International Organization

for Standardization (ISO) in Guide to the Expression of Uncertainty in Measurement (1993), recommends that

the uncertainties evaluated by statistical methods are named ‘type A’ uncertainties, and those evaluated by other

means (for instance, using Bayesian arguments) are named ‘type B’ uncertainties. It also recommends that former

classiﬁcations, for instance into ‘random’ and ‘systematic uncertainties’, should be avoided. In the present text, we

accept ISO’s basic point of view, and extend it by downplaying the role assigned by ISO to the particular Gaussian

model for uncertainties (see section 4.3) and by not assuming that the uncertainties are ‘small’.

In fact, we like to think of an ‘inverse’ problem as merely a ‘measurement’. A measurement that can be quite

complex, but the basic principles and the basic equations to be used are the same for a relatively complex ‘inverse

problem’ as for a relatively simple ‘measurement’.

We do not normally use, in this text, the term ‘random variable’, as we assume that we have probability

distributions over ‘physical quantities’. This a small shift in terminology that we hope will not disorient the reader.

An important theme of this paper is invariant formulation of inverse problems, in the sense that solutions

obtained using diﬀerent, equivalent, sets of parameters should be consistent, i.e., probability densities obtained as

the solution of an inverse problem, using two diﬀerent set of parameters, should be related through the well known

rule of multiplication by the Jacobian of the transformation.

This paper is organized as follows. After a brief historical review of inverse problem theory, with special

emphasis on seismology, we give a small introduction to probability theory. In addition to being a tutorial, this

introduction also aims at ﬁxing a serious problem of classical probability, namely the non-invariant deﬁnition of

conditional probability. This problem, which materializes in the so-called Borel paradox, has profound consequences

for inverse problem theory.

A probabilistic formulation of inverse theory for general inverse problems (usually called ‘nonlinear inverse

problems’) is not complete without the use of Monte Carlo methods. Section 3 is an introduction to the most

versatile of these methods, the Metropolis sampler. Apart from being versatile, it also turns out to be the most

natural method for implementing our probabilistic approach.

In sections 4, 5 and 6 time has come for applying probability theory and Monte Carlo methods to inverse

problems. All the steps of a careful probabilistic formulations are described, including parameterization, prior

information over the parameters, and experimental uncertainties. The hitherto overlooked problem of uncertain

For instance, we could ﬁt our observations with a heterogeneous but isotropic Earth model or, alternatively, with an homogeneous

but anisotropic Earth.

physical laws (‘forward relations’) is given special attention in this text, and it is shown how this problem is

profoundly linked to the resolution of the Borel paradox.

Section 7 treats the special case of the mildly nonlinear inverse problems, where deterministic (non Monte

Carlo) methods can be employed. In this section, invariant forms of classical inversion formulæ are given.

1.2 Brief Historical Review

Foralong time scientists have estimated parameters using optimization techniques. Laplace explicitly stated the

least absolute values criterion. This, and the least squares criterion were later popularized by Gauss (1809). While

Laplace and Gauss were mainly interested in overdetermined problems, Hadamard (1902, 1932) introduced the

notion of an “ill-posed problem”, that can be viewed in many cases as an underdetermined problem.

The late sixties and early seventies was a golden age for the theory of inverse problems. In this period the

ﬁrst uses of Monte Carlo theory to obtain Earth models were made by Keilis-Borok and Yanovskaya (1967) and

by Press (1968). At about the same time, Backus and Gilbert, and Backus alone, in the years 1967–1970, made

original contributions to the theory of inverse problems, focusing on the problem of obtaining an unknown function

from discrete data. Although the resulting mathematical theory is elegant, its initial predominance over the more

‘brute force’ (but more powerful) Monte Carlo theory was only possibly due to the quite limited capacities of the

computers at that time. It is our feeling that Monte Carlo methods will play a more important role in the future

(and this is the reason why we put emphasis on these methods in this article). An investigation of the connection

between analogue models, discrete models and Monte Carlo models can be found in a paper by Kennett and Nolet

(1978).

Important developments of inverse theory in the fertile period around 1970 were also made by Wiggins (1969),

with his method of suppressing ‘small eigenvalues’, and by Franklin (1970), by introducing the right mathematical

setting for the Gaussian, functional (i.e., inﬁnite dimensional) inverse problem (see also Lehtinen et al., 1989).

Other important papers from the period are Gilbert (1971) and Wiggins (1972). A reference that may interest

some readers is Parzen et al. (1998), where the probabilistic approach of Akaike is described. To the ‘regularizing

techniques’ of Tikhonov (1963), Levenberg (1944) and Marquardt (1970), we prefer, in this paper, the approach

where the a priori information is used explicitly.

For seismologists, the ﬁrst bona ﬁde solution of an inverse problem was the estimation of the hypocenter

coordinates of an earthquake using the ‘Geiger method’ (Geiger, 1910), that present-day computers have made

practical. In fact, seismologists have been the originators of the theory of inverse problems (for data interpretation),

and this is because the problem of understanding the structure of the Earth’s interior using only surface data is a

diﬃcult problem.

Three-dimensional (3D) tomography of the Earth, using travel times of seismic waves, was developed by Keiiti

Aki and his coworkers in a couple of well known papers (Aki and Lee, 1976; Aki, Christoﬀerson and Husebye

1977). Minster and Jordan (1978) applied the theory of inverse problems to the reconstruction of the tectonic

plate motions, introducing the concept of ‘data importance’. Later, tomographic studies have provided spectacular

images of the Earth’s interior. Interesting papers on these inversions are van der Hilst et al. (1997) and Su et al.

(1992).

One of the major current challenges in seismic inversion is the nonlinearity of wave ﬁeld inversions. This is

accentuated by the fact that major experiments in the future most likely will allow us to sample the whole seismic

wave ﬁeld. For the low frequencies wave ﬁeld inversion is linear. Dahlen (1976) investigated the inﬂuence of lateral

heterogeneity on the free oscillations. He showed that the the inverse problem of estimating lateral heterogeneity

of even degree from multiplet variance and skewance is linear. At the time this was published, data accuracy and

unknown ellipticity splitting parameters hindered its application to real data, but later developments, including the

works of Woodhouse and Dahlen (1978) on discontinuous Earth models, led to present-days successful inversions of

low frequency seismograms. In this connection the works of Woodhouse, Dziewonski and others spring to mind

Later, the ﬁrst attempts to go to higher frequencies and nonlinear inversion were made by Nolet et al. (1986), and

Nolet (1990)

Purely probabilistic formulations of inverse theory saw the light around 1970 (see, for instance, Kimeldorf and

Wahba, 1970). In an interesting paper, Rietsch (1977) made nontrivial use of the notion of a ‘noninformative’

prior distribution for positive parameters. Jackson (1979) explicitly introduced prior information in the context of

linear inverse problems, an approach that was generalized by Tarantola and Valette (1982a, 1982b) to nonlinear

problems.

Preliminary Earth Reference Model (PREM), Dziewonski and Anderson, PEPI, 1981. Inversion for Centroid Moment Tensor

(CMT), Dziewonski, Chou and Woodhouse, JGR, 1982. First global tomographic model, Dziewonski, JGR, 1984.

剩余80页未读，继续阅读

评论收藏

内容反馈

苍月纹章

粉丝: 1
资源: 5

Probabilistic Approach to Inverse Problems

Approaches to Probabilistic Model Learning for Mobile Manipulation Robots

Probabilistic Reasoning in Multiagent Systems: A Graphical Models Approach

Restricted Gene Expression Programming: A New Approach for Parameter Identification Inverse Problems of Partial Differential Equation(CI, EI, 2区IF=2.784)

A Probabilistic Collaborative Representation based Approach for

Introduction to Probabilistic Graphical Models

An Introduction to Probabilistic Graphical Models

Skip Lists:A Probabilistic Alternative to Balanced Trees

An Analytical Approach to Calculating

An Introduction to Probabilistic Programing

An introduction to probabilistic graphical models

an introduction to probabilistic graphical models by Michael Jordan

Building Probabilistic Graphical Models with Python

Machine Learning: a probabilistic approach

Mastering Probabilistic Graphical Models using Python(PACKT,2015)

inverse problem

Practical Probabilistic Programming(Manning,2016)

Probabilistic Robotics

COMPUTATIONAL METHODS FOR INVERSE PROBLEMS

Daphne Koller_Probabilistic Graphical Models

Practical.Probabilistic.Programming.2016.3.pdf

概率编程Practical.Probabilistic.Programming

Natural Image Statistics

最新资源