Gaussian processes

Chuong B. Do (updated by Honglak Lee)

November 22, 2008

Many of the classical machine learning algorithms that we talked about during the first half of this course fit the following pattern: given a training set of i.i.d. examples sampled from some unknown distribution,

1. solve a convex optimization problem in order to identify the single “best fit” model for the data, and

2. use this estimated model to make “best guess” predictions for future test input points.

In these notes, we will talk about a different flavor of learning algorithms, known as Bayesian methods. Unlike classical learning algorithms, Bayesian algorithms do not attempt to identify “best-fit” models of the data (or similarly, make “best guess” predictions for new test inputs). Instead, they compute a posterior distribution over models (or similarly, compute posterior predictive distributions for new test inputs). These distributions provide a useful way to quantify our uncertainty in model estimates, and to exploit our knowledge of this uncertainty in order to make more robust predictions on new test points.

We focus on regression problems, where the goal is to learn a mapping from some input space $\mathcal{X} = \mathbb{R}^n$ of $n$-dimensional vectors to an output space $\mathcal{Y} = \mathbb{R}$ of real-valued targets. In particular, we will talk about a kernel-based fully Bayesian regression algorithm, known as Gaussian process regression. The material covered in these notes draws heavily on many different topics that we discussed previously in class (namely, the probabilistic interpretation of linear regression [1], Bayesian methods [2], kernels [3], and properties of multivariate Gaussians [4]).

The organization of these notes is as follows. In Section 1, we provide a brief review of multivariate Gaussian distributions and their properties. In Section 2, we briefly review Bayesian methods in the context of probabilistic linear regression. The central ideas underlying Gaussian processes are presented in Section 3, and we derive the full Gaussian process regression model in Section 4.

[1] See course lecture notes on “Supervised Learning, Discriminative Algorithms.”

[2] See course lecture notes on “Regularization and Model Selection.”

[3] See course lecture notes on “Support Vector Machines.”

[4] See course lecture notes on “Factor Analysis.”

1 Multivariate Gaussians

A vector-valued random variable $x \in \mathbb{R}^n$ is said to have a multivariate normal (or Gaussian) distribution with mean $\mu \in \mathbb{R}^n$ and covariance matrix $\Sigma \in S^n_{++}$ if

$$p(x; \mu, \Sigma) = \frac{1}{(2\pi)^{n/2} |\Sigma|^{1/2}} \exp\left( -\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu) \right). \tag{1}$$

We write this as $x \sim \mathcal{N}(\mu, \Sigma)$. Here, recall from the section notes on linear algebra that $S^n_{++}$ refers to the space of symmetric positive definite $n \times n$ matrices.
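As a small illustration of equation (1), the density can be evaluated numerically. The following is only a sketch using NumPy; the function name `gaussian_density` and the example values of `mu` and `sigma` are assumptions made for illustration and do not come from the notes.

```python
import numpy as np


def gaussian_density(x, mu, sigma):
    """Evaluate the multivariate Gaussian density of equation (1) at x.

    Assumes sigma is symmetric positive definite, so that its inverse
    and a strictly positive determinant exist.
    """
    n = mu.shape[0]
    diff = x - mu
    # Solve sigma @ z = diff rather than forming the explicit inverse.
    quad_form = diff @ np.linalg.solve(sigma, diff)
    norm_const = (2.0 * np.pi) ** (n / 2.0) * np.sqrt(np.linalg.det(sigma))
    return np.exp(-0.5 * quad_form) / norm_const


# Illustrative (hypothetical) parameters for a 2-dimensional Gaussian.
mu = np.array([0.0, 1.0])
sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])

# At x = mu the exponent vanishes, so the density equals the
# normalizing constant 1 / ((2 pi)^{n/2} |sigma|^{1/2}).
density_at_mean = gaussian_density(mu, mu, sigma)
print(density_at_mean)
```

Solving a linear system instead of inverting $\Sigma$ is a common numerical choice; for larger problems a Cholesky factorization would typically be used instead.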

Generally speaking, Gaussian random variables are extremely useful in machine learning

and statistics for two main reasons. First, they are extremely common when modeling “noise”

in statistical algorithms. Quite often, noise can be considered to be the accumulation of a

large number of small independent random perturbations aﬀecting the measurement process;

by the Central Limit Theorem, summations of independent random variables will tend to

“look Gaussian.” Second, Gaussian random variables are convenient for many analytical

manipulations, because many of the integrals involving Gaussian distributions that arise in

practice have simple closed form solutions. In the remainder of this section, we will review

a number of useful properties of multivariate Gaussians.
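The Central Limit Theorem argument above can be checked with a quick simulation. This is a sketch under stated assumptions: the choice of uniform perturbations on $[-1, 1]$, the number of terms, and all variable names are illustrative and not taken from the notes.

```python
import numpy as np

rng = np.random.default_rng(0)

num_terms = 50        # small independent perturbations per measurement
num_samples = 50_000  # number of simulated measurements

# Each perturbation is Uniform(-1, 1): clearly non-Gaussian on its own.
perturbations = rng.uniform(-1.0, 1.0, size=(num_samples, num_terms))
noise = perturbations.sum(axis=1)

# Uniform(-1, 1) has mean 0 and variance 1/3, so the sum has
# variance num_terms / 3. Rescale the sum to unit variance.
standardized = noise / np.sqrt(num_terms / 3.0)

# For a standard Gaussian, about 68% of the mass lies within one
# standard deviation of the mean.
frac_within_one_sd = np.mean(np.abs(standardized) < 1.0)
print(standardized.mean(), standardized.var(), frac_within_one_sd)
```

If the accumulated noise "looks Gaussian," the printed mean should be near 0, the variance near 1, and the last fraction near 0.68.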

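Consider a random vector $x \in \mathbb{R}^n$ with $x \sim \mathcal{N}(\mu, \Sigma)$. Suppose also that the variables in $x$ have been partitioned into two sets $x_A = [x_1 \cdots x_r]^T \in \mathbb{R}^r$ and $x_B = [x_{r+1} \cdots x_n]^T \in \mathbb{R}^{n-r}$ (and similarly for $\mu$ and $\Sigma$), such that

$$x = \begin{bmatrix} x_A \\ x_B \end{bmatrix}, \qquad \mu = \begin{bmatrix} \mu_A \\ \mu_B \end{bmatrix}, \qquad \Sigma = \begin{bmatrix} \Sigma_{AA} & \Sigma_{AB} \\ \Sigma_{BA} & \Sigma_{BB} \end{bmatrix}.$$

Here, $\Sigma_{AB} = \Sigma_{BA}^T$ since $\Sigma = E[(x - \mu)(x - \mu)^T] = \Sigma^T$. The following properties hold:

1. Normalization. The density function normalizes, i.e.,

$$\int_x p(x; \mu, \Sigma)\,dx = 1.$$

This property, though seemingly trivial at first glance, turns out to be immensely useful for evaluating all sorts of integrals, even ones which appear to have no relation to probability distributions at all (see Appendix A.1)!

2. Marginalization. The marginal densities,

$$p(x_A) = \int_{x_B} p(x_A, x_B; \mu, \Sigma)\,dx_B$$

$$p(x_B) = \int_{x_A} p(x_A, x_B; \mu, \Sigma)\,dx_A$$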

Footnote on the covariance matrix $\Sigma$ in Section 1: There are actually cases in which we would want to deal with multivariate Gaussian distributions where $\Sigma$ is positive semidefinite but not positive definite (i.e., $\Sigma$ is not full rank). In such cases, $\Sigma^{-1}$ does not exist, so the definition of the Gaussian density given in (1) does not apply. For instance, see the course lecture notes on “Factor Analysis.”
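The normalization and marginalization integrals listed in Section 1 can be checked numerically on a grid. The sketch below assumes a 2-dimensional example with $r = 1$, so that $x_A$ is the first coordinate and $x_B$ the second; the parameter values and variable names are illustrative only. It also compares the numerical marginal of $x_A$ with the density of $\mathcal{N}(\mu_A, \Sigma_{AA})$, which is the standard closed form for a Gaussian marginal and is taken here as a known fact rather than something derived in the text above.

```python
import numpy as np

# Illustrative (hypothetical) 2-dimensional Gaussian: x_A = x[0], x_B = x[1].
mu = np.array([0.5, -1.0])
sigma = np.array([[1.0, 0.6],
                  [0.6, 2.0]])

# Regular grid wide enough that the mass outside it is negligible.
grid_a = np.linspace(mu[0] - 8.0, mu[0] + 8.0, 801)
grid_b = np.linspace(mu[1] - 12.0, mu[1] + 12.0, 1201)
step_a = grid_a[1] - grid_a[0]
step_b = grid_b[1] - grid_b[0]
mesh_a, mesh_b = np.meshgrid(grid_a, grid_b, indexing="ij")

# Joint density from equation (1), evaluated at every grid point.
diff = np.stack([mesh_a - mu[0], mesh_b - mu[1]], axis=-1)
quad_form = np.einsum("...i,ij,...j->...", diff, np.linalg.inv(sigma), diff)
joint = np.exp(-0.5 * quad_form) / (2.0 * np.pi * np.sqrt(np.linalg.det(sigma)))

# Normalization: a Riemann sum over the whole grid should be close to 1.
total_mass = joint.sum() * step_a * step_b

# Marginalization: sum out x_B to approximate p(x_A) on grid_a.
marginal_a = joint.sum(axis=1) * step_b

# Standard closed form for the marginal: a 1-D Gaussian N(mu_A, Sigma_AA).
closed_form_a = (np.exp(-0.5 * (grid_a - mu[0]) ** 2 / sigma[0, 0])
                 / np.sqrt(2.0 * np.pi * sigma[0, 0]))
max_marginal_error = np.max(np.abs(marginal_a - closed_form_a))

print(total_mass, max_marginal_error)
```

Grid integration is only practical in very low dimensions; it is used here purely to make the two integrals concrete.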
