UCM《机器学习导论笔记》,80页pdf CSE176 Introduction to Machine Learning 从经验中学习的软件开发和分析技术综述。具体主题包括:监督学习(分类、回归);无监督学习(聚类、降维);强化学习;计算学习理论。具体的技术包括:贝叶斯方法、混合模型、决策树、基于实例的方法、神经网络、内核机器、集成等等。
CSE176 Introduction to Machine Learning — Lecture notes
Miguel Á. Carreira-Perpiñán
EECS, University of California, Merced
September 2, 2019
These are notes for a one-semester undergraduate course on machine learning given by Prof.
Miguel Á. Carreira-Perpiñán at the University of California, Merced. The notes are largely based on
the book “Introduction to machine learning” by Ethem Alpaydın (MIT Press, 3rd ed., 2014), with
some additions.
These notes may be used for educational, non-commercial purposes.
© 2015–2016 Miguel Á. Carreira-Perpiñán
1 Introduction
1.1 What is machine learning (ML)?
• Data is being produced and stored continuously ("big data"):
– science: genomics, astronomy, materials science, particle accelerators...
– sensor networks: weather measurements, traffic...
– people: social networks, blogs, mobile phones, purchases, bank transactions...
– etc.
• Data is not random; it contains structure that can be used to predict outcomes, or gain knowledge in some way.
Ex: patterns of Amazon purchases can be used to recommend items.
• It is more difficult to design algorithms for such tasks (compared to, say, sorting an array or calculating a payroll). Such algorithms need data.
Ex: construct a spam filter, using a collection of email messages labelled as spam/not spam.
• Data mining: the application of ML methods to large databases.
• Ex. of ML applications: fraud detection, medical diagnosis, speech or face recognition...
• ML is programming computers using data (past experience) to optimize a performance criterion.
• ML relies on:
– Statistics: making inferences from sample data.
– Numerical algorithms (linear algebra, optimization): optimize criteria, manipulate models.
– Computer science: data structures and programs that solve an ML problem efficiently.
• A model:
– is a compressed version of a database;
– extracts knowledge from it;
– does not have perfect performance but is a useful approximation to the data.
1.2 Examples of ML problems
• Supervised learning: labels provided.
– Classification (pattern recognition):
∗ Face recognition. Difficult because of the complex variability in the data: pose and illumination in a face image, occlusions, glasses/beard/make-up/etc.
(Figure: face recognition training examples and test images.)
∗ Optical character recognition: different styles, slant...
∗ Medical diagnosis: often, variables are missing (tests are costly).
∗ Speech recognition, machine translation, biometrics...
∗ Credit scoring: classify customers into high- and low-risk, based on their income and savings, using data about past loans (whether they were paid or not).
– Regression: the labels to be predicted are continuous:
∗ Predict the price of a car from its mileage.
∗ Navigating a car: angle of the steering.
∗ Kinematics of a robot arm: predict workspace location from angles.
Example discriminant for credit scoring: if income > θ_1 and savings > θ_2 then low-risk else high-risk.
Example regression model for car prices: y = wx + w_0, with input x: mileage and label y: price.
(Figures: the credit-scoring rule shown as axis-aligned thresholds θ_1, θ_2 splitting the (income, savings) plane into low-risk and high-risk regions; and a fitted line in the (mileage, price) plane.)
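To make the regression model y = wx + w_0 above concrete, here is a minimal sketch (not from the original notes; the synthetic mileage/price data and all numbers are invented) that fits w and w_0 by ordinary least squares with NumPy:

    import numpy as np

    # Synthetic (mileage, price) data: price drops roughly linearly with mileage, plus noise.
    rng = np.random.default_rng(0)
    mileage = rng.uniform(0, 150_000, size=100)                        # inputs x
    price = 30_000 - 0.12 * mileage + rng.normal(0, 1_500, size=100)   # labels y

    # Fit y = w*x + w0 by least squares: append a column of ones to absorb the intercept w0.
    A = np.column_stack([mileage, np.ones_like(mileage)])
    (w, w0), *_ = np.linalg.lstsq(A, price, rcond=None)

    print(f"fitted model: y = {w:.4f} * x + {w0:.1f}")

Appending the constant column is the standard trick for turning the intercept into an ordinary least-squares coefficient.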
• Unsupervised learning: no labels provided, only input data.
– Learning associations:
∗ Basket analysis: let p(Y|X) = "probability that a customer who buys product X also buys product Y", estimated from past purchases. If p(Y|X) is large (say 0.7), associate "X → Y". When someone buys X, recommend them Y. (A small code sketch of this estimate appears at the end of this list.)
– Clustering: group similar data points.
– Density estimation: where are data points likely to lie?
– Dimensionality reduction: data lies in a low-dimensional manifold.
– Feature selection: keep only useful features.
– Outlier/novelty detection.
• Semisupervised learning: labels provided for some points only.
• Reinforcement learning: find a sequence of actions (policy) that reaches a goal. No supervised output but delayed reward.
Ex: playing chess or a computer game, robot in a maze.
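Returning to the basket-analysis association above, here is the small code sketch mentioned there (a toy illustration, not from the original notes; the transactions and product names are invented): it estimates p(Y|X) from past purchases and proposes "X → Y" when the estimate is large.

    from itertools import permutations

    # Each past purchase ("basket") is the set of products bought together (toy data).
    transactions = [
        {"chips", "beer"}, {"chips", "salsa"}, {"chips", "beer", "salsa"},
        {"beer"}, {"chips", "beer"}, {"salsa"},
    ]

    # Estimate p(Y|X) = (#baskets containing both X and Y) / (#baskets containing X).
    def conditional(x, y):
        with_x = [t for t in transactions if x in t]
        return sum(y in t for t in with_x) / len(with_x)

    # Associate "X -> Y" when the estimated conditional probability is large (say 0.7).
    threshold = 0.7
    products = set().union(*transactions)
    for x, y in permutations(products, 2):
        p = conditional(x, y)
        if p >= threshold:
            print(f"{x} -> {y}   (estimated p(Y|X) = {p:.2f})")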
2 Supervised learning
2.1 Learning a class from examples: two-class problems
• We are given a training set of labeled examples (positive and negative) and want to learn a
classifier that we can use to predict unseen examples, or to understand the data.
• Input representation: we need to decide what attributes (features) to use to describe the input
patterns (examples, instances). This implies ignoring other attributes as irrelevant.
(Figure: training set for a "family car" and the class region C in the (x_1: price, x_2: engine power) plane.)
Hypothesis class of rectangles: (p_1 ≤ price ≤ p_2) AND (e_1 ≤ engine power ≤ e_2), where p_1, p_2, e_1, e_2 ∈ R.
• Training set: X = {(x_n, y_n)}_{n=1}^{N}, where x_n ∈ R^D is the nth input vector and y_n ∈ {0, 1} its class label.
• Hypothesis (model) class H: the set of classifier functions we will use. Ideally, the true class distribution C can be represented by a function in H (exactly, or with a small error).
• Having selected H, learning the class reduces to finding an optimal h ∈ H. We don’t know the
true class regions C, but we can approximate them by the empirical error:
E(h; X) = ∑_{n=1}^{N} I(h(x_n) ≠ y_n) = number of misclassified instances
(A small numeric sketch of this error for a rectangle hypothesis follows the figure below.)
There may be more than one optimal h ∈ H. In that case, we achieve better generalization by maximizing the margin (the distance
between the boundary of h and the instances closest to it).
(Figures: the hypothesis with the largest margin; and noise with a more complex hypothesis, shown as h_1 and h_2 in the (x_1, x_2) plane.)
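As a concrete illustration of the empirical error defined above, here is a small sketch (not from the original notes; the data and thresholds are invented) that counts the misclassifications of one rectangle hypothesis h from the family-car example:

    import numpy as np

    # Toy training set: each row is (price, engine power); label 1 = family car, 0 = not.
    X = np.array([[12_000, 90], [15_000, 110], [30_000, 200], [8_000, 60], [14_000, 100]])
    y = np.array([1, 1, 0, 0, 1])

    # One rectangle hypothesis h: (p1 <= price <= p2) AND (e1 <= engine power <= e2).
    p1, p2, e1, e2 = 10_000, 20_000, 80, 120

    def h(x):
        return int(p1 <= x[0] <= p2 and e1 <= x[1] <= e2)

    # Empirical error E(h; X): the number of misclassified training instances.
    E = sum(h(x_n) != y_n for x_n, y_n in zip(X, y))
    print(f"E(h; X) = {E} misclassified out of {len(y)}")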
2.4 Noise
• Noise is any unwanted anomaly in the data. It can be due to:
– Imprecision in recording the input attributes: x_n.
– Errors in labeling the input vectors: y_n.
– Attributes not considered that affect the label (hidden or latent attributes, may be unobservable).
• Noise makes learning harder.
• Should we keep the hypothesis class simple rather than complex?
– Easier to use and to train (fewer parameters, faster).
– Easier to explain or interpret.
– Less variance in the learned model than for a complex model (less affected by single
instances), but also higher bias.
Given comparable empirical error, a simple model will generalize better than a complex one.
(Occam's razor: simpler explanations are more plausible; eliminate unnecessary complexity.)
2.5 Learning multiple classes
• With K classes, we can code the label as an integer y = k ∈ {1, . . . , K}, or as a one-of-K binary vector y = (y_1, . . . , y_K)^T ∈ {0, 1}^K (containing a single 1 in position k).
• One approach for K-class classification: consider it as K two-class classification problems, and
minimize the total empirical error:
E({h_k}_{k=1}^{K}; X) = ∑_{n=1}^{N} ∑_{k=1}^{K} I(h_k(x_n) ≠ y_{nk})
where y_n is coded as one-of-K and h_k is the two-class classifier for problem k, i.e., h_k(x) ∈ {0, 1}. (A short code sketch of this coding and error follows this list.)
• Ideally, for a given pattern x only one h_k(x) is one. When none, or more than one, of the h_k(x) is one, the classifier is in doubt and may reject the pattern.
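The short code sketch mentioned above (toy data and hand-set thresholds, only an illustration of the definitions, not a learning algorithm): it builds the one-of-K coding y_nk and evaluates the total empirical error of K two-class classifiers h_k.

    import numpy as np

    # Toy 3-class problem: each row of X is (price, engine power); labels k in {1, 2, 3}.
    X = np.array([[10_000, 80], [15_000, 110], [40_000, 300], [60_000, 250]])
    y = np.array([1, 1, 2, 3])          # 1 = family car, 2 = sports car, 3 = luxury sedan
    K = 3

    # One-of-K coding: y_nk = 1 if example n belongs to class k, else 0.
    Y = np.zeros((len(y), K), dtype=int)
    Y[np.arange(len(y)), y - 1] = 1

    # K two-class classifiers h_k(x) in {0, 1} (arbitrary hand-set thresholds for illustration).
    def h(k, x):
        if k == 1:
            return int(x[0] < 20_000)                      # family car: cheap
        if k == 2:
            return int(x[0] >= 20_000 and x[1] >= 280)     # sports car: powerful
        return int(x[0] >= 50_000)                         # luxury sedan: expensive

    # Total empirical error: sum of disagreements I(h_k(x_n) != y_nk) over all n and k.
    E = sum(h(k, x_n) != Y[n, k - 1]
            for n, x_n in enumerate(X) for k in range(1, K + 1))
    print(f"total empirical error E = {E}")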
(Figures: a 3-class example with classes Family car, Sports car, Luxury sedan in the (price, engine power) plane, with "?" marking regions where the classifier is in doubt and may reject; and regression fits with polynomials of order 1, 2, 6 to (x: mileage, y: price) data. ✐ Exercise: solve for order 1, i.e., the optimal w_0, w_1.)
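For the "solve for order 1" exercise mentioned in the figure caption above, a standard derivation sketch (not reproduced from the notes): minimize the sum of squared errors E(w_0, w_1; X) = ∑_{n=1}^{N} (y_n − w_1 x_n − w_0)^2 by setting its partial derivatives with respect to w_0 and w_1 to zero, which gives the normal equations and the closed-form solution

    w_1 = \frac{\sum_{n=1}^{N} (x_n - \bar{x})(y_n - \bar{y})}{\sum_{n=1}^{N} (x_n - \bar{x})^2},
    \qquad
    w_0 = \bar{y} - w_1 \bar{x},

where \bar{x} and \bar{y} denote the sample means of the inputs and labels.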