


SUPPORT VECTOR MACHINES

Introduction

Support vector machines (SVMs) are a set of related supervised learning methods that analyze data and recognize patterns, and are used for statistical classification and regression analysis. Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that predicts whether a new example falls into one category or the other. Intuitively, an SVM model is a representation of the examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on.

More formally, a support vector machine constructs a hyperplane or set of hyperplanes in a high- or infinite-dimensional space, which can be used for classification, regression or other tasks. Intuitively, a good separation is achieved by the hyperplane that has the largest distance to the nearest training data points of any class (the so-called functional margin), since in general the larger the margin, the lower the generalization error of the classifier.

Whereas the original problem may be stated in a finite-dimensional space, it often happens that in that space the sets to be discriminated are not linearly separable. For this reason it was proposed that the original finite-dimensional space be mapped into a much higher-dimensional space, presumably making the separation easier in that space. SVM schemes use a mapping into a larger space chosen so that inner (dot) products there can be computed easily in terms of the variables in the original space, which keeps the computational load reasonable. The inner products in the larger space are defined in terms of a kernel function K(x,y), which can be selected to suit the problem. The hyperplanes in the larger space are defined as the set of points whose inner product with a vector in that space is constant.
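As a minimal sketch of the idea above, the product in the larger space can be obtained without ever constructing the mapped vectors. The degree-2 polynomial kernel and the function names used here (`poly2_kernel`, `poly2_feature_map`) are illustrative assumptions, not something the text prescribes.

```python
import math


def dot(u, v):
    """Ordinary inner (dot) product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def poly2_kernel(x, y):
    """K(x, y) = (x . y)^2, evaluated entirely in the original 2-D space."""
    return dot(x, y) ** 2


def poly2_feature_map(x):
    """Explicit map of a 2-D point into the 3-D space implied by poly2_kernel."""
    x1, x2 = x
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)


x = (1.0, 2.0)
y = (3.0, 4.0)

via_kernel = poly2_kernel(x, y)
via_mapping = dot(poly2_feature_map(x), poly2_feature_map(y))

# Both routes give the same number (up to rounding), but the kernel route
# never builds the higher-dimensional vectors.
print(via_kernel, via_mapping)
```

For kernels whose implied space is very large or infinite-dimensional, only the kernel route is practical, which is why the mapping is chosen with the kernel in mind.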

The vectors defining the hyperplanes can be chosen to be linear combinations, with parameters α_i, of images of feature vectors x_i that occur in the database. With this choice of a hyperplane, the points x in the feature space that are mapped into the hyperplane are defined by the relation:

$$\sum_i \alpha_i K(x_i, x) = \text{constant}$$

If K(x,y) becomes small as y grows further from x, each element in the sum measures the degree of closeness of the test point x to the corresponding database point x_i. In this way the sum of kernels above can be used to measure the relative nearness of each test point to the data points originating in one or the other of the sets to be discriminated. Note that the set of points x mapped into any hyperplane can be quite convoluted as a result, allowing much more complex discrimination between sets that are far from convex in the original space.
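The kernel sum above can be sketched in a few lines. This example assumes a Gaussian (RBF) kernel, which does become small as the two points move apart, and uses hand-picked coefficients (positive for one set, negative for the other); in a real SVM the coefficients come out of training, so the values below are purely hypothetical.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian kernel: close to 1 for nearby points, close to 0 for distant ones."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def kernel_sum(x, data_points, alphas, gamma=0.5):
    """Weighted sum of kernels between the test point x and each stored data point."""
    return sum(a * rbf_kernel(xi, x, gamma) for a, xi in zip(alphas, data_points))


# Two stored points from each set; the coefficients are illustrative only.
data_points = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0), (6.0, 5.0)]
alphas = [1.0, 1.0, -1.0, -1.0]

near_first_set = kernel_sum((0.5, 0.2), data_points, alphas)
near_second_set = kernel_sum((5.5, 4.8), data_points, alphas)

# The sign of the sum reflects which set the test point lies closer to.
print(near_first_set, near_second_set)
```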

Motivation

Classifying data is a common task in machine learning. Suppose some given data points each belong to one of two classes, and the goal is to decide which class a new data point will be in. In the case of support vector machines, a data point is viewed as a p-dimensional vector (a list of p numbers), and we want to know whether we can separate such points with a (p − 1)-dimensional hyperplane. This is called a linear classifier. There are many hyperplanes that might classify the data. One reasonable choice as the best hyperplane is the one that represents the largest separation, or margin, between the two classes. So we choose the hyperplane so that the distance from it to the nearest data point on each side is maximized. If such a hyperplane exists, it is known as the maximum-margin hyperplane and the linear classifier it defines is known as a maximum-margin classifier.
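To make the comparison between candidate hyperplanes concrete, here is a small sketch that computes the distance from a hyperplane to its nearest data point. It assumes the hyperplane is written as w · x + b = 0 and that the two classes are labeled +1 and −1; the notation, the function name and the sample data are assumptions made for illustration.

```python
import math


def nearest_point_distance(w, b, points, labels):
    """Distance from the hyperplane w . x + b = 0 to the closest data point.

    Returns None if the hyperplane misclassifies (or touches) any point.
    Labels are assumed to be +1 or -1.
    """
    norm = math.sqrt(sum(wi * wi for wi in w))
    distances = []
    for x, y in zip(points, labels):
        signed = y * (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm
        if signed <= 0:
            return None  # this hyperplane does not separate the classes
        distances.append(signed)
    return min(distances)


points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (3.0, 3.0), (4.0, 3.0), (3.0, 4.0)]
labels = [-1, -1, -1, 1, 1, 1]

diagonal = nearest_point_distance((1.0, 1.0), -3.5, points, labels)  # x1 + x2 = 3.5
vertical = nearest_point_distance((1.0, 0.0), -2.0, points, labels)  # x1 = 2

# Both hyperplanes separate the data, but the diagonal one keeps a larger
# distance to its nearest point, so it is the better candidate of the two.
print(diagonal, vertical)
```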

Introduction to Support Vector Machine (SVM) Models

A Support Vector Machine (SVM) performs classification by constructing a hyperplane in an N-dimensional space that optimally separates the data into two categories. SVM models are closely related to neural networks. In fact, an SVM model using a sigmoid kernel function is equivalent to a two-layer perceptron neural network.

Support Vector Machine (SVM) models are a close cousin to classical multilayer perceptron neural networks. Using a kernel function, SVMs are an alternative training method for polynomial, radial basis function and multilayer perceptron classifiers in which the weights of the network are found by solving a quadratic programming problem with linear constraints, rather than by solving a non-convex, unconstrained minimization problem as in standard neural network training.
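For reference, the quadratic programming problem mentioned above is commonly written as follows for the separable two-class case. The symbols w, b, n and the labels y_i ∈ {+1, −1} are notation assumed here; they do not appear in the text above. The primal is shown for the linear case, and the dual is the form in which the kernel K enters.

```latex
% Primal problem (separable case): quadratic objective, linear constraints
\min_{w,\,b} \; \tfrac{1}{2}\,\lVert w \rVert^{2}
\quad \text{subject to} \quad
y_i \,(w \cdot x_i + b) \ge 1, \qquad i = 1, \dots, n

% Dual problem: the data enter only through the kernel K(x_i, x_j)
\max_{\alpha} \; \sum_{i=1}^{n} \alpha_i
  - \tfrac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n}
    \alpha_i \alpha_j \, y_i y_j \, K(x_i, x_j)
\quad \text{subject to} \quad
\alpha_i \ge 0, \qquad \sum_{i=1}^{n} \alpha_i y_i = 0
```

In this form the multipliers α_i are non-negative, and the signed weight attached to each training point is the product α_i y_i. Because the objective is quadratic and the constraints are linear, the problem is convex, in contrast to the non-convex minimization used in standard neural network training.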

In the parlance of SVM literature, a predictor variable is called an attribute, and a transformed attribute that is used to define the hyperplane is called a feature. The task of choosing the most suitable representation is known as feature selection. A set of features that describes one case (i.e., a row of predictor values) is called a vector. So the goal of SVM modeling is to find the optimal hyperplane that separates clusters of vectors in such a way that cases with one category of the target variable are on one side of the plane and cases with the other category are on the other side of the plane. The vectors near the hyperplane are the support vectors. The figure below presents an overview of the SVM process.

A Two-Dimensional Example

Before considering N-dimensional hyperplanes, let’s look at a simple 2-dimensional

example. Assume we wish to perform a classification, and our data has a categorical target

variable with two categories. Also assume that there are two predictor variables with continuous

values. If we plot the data points using the value of one predictor on the X axis and the other on

the Y axis we might end up with an image such as shown below. One category of the target

variable is represented by rectangles while the other category is represented by ovals.

In this idealized example, the cases with one category are in the lower left corner and the

cases with the other category are in the upper right corner; the cases are completely separated.

The SVM analysis attempts to find a 1-dimensional hyperplane (i.e. a line) that separates the

cases based on their target categories. There are an infinite number of possible lines; two

candidate lines are shown above. The question is which line is better, and how do we define the

optimal line.

The dashed lines drawn parallel to the separating line mark the distance between the

dividing line and the closest vectors to the line. The distance between the dashed lines is called

the margin. The vectors (points) that constrain the width of the margin are the support vectors.

The following figure illustrates this.

An SVM analysis finds the line (or, in general, hyperplane) that is oriented so that the

margin between the support vectors is maximized. In the figure above, the line in the right panel

is superior to the line in the left panel.
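The two-dimensional search described above can be sketched with a brute-force scan over line orientations. This is only an illustration of "orient the line so that the margin is maximized", not how SVM software actually finds the line (that is done by solving a quadratic programming problem); the function name, the step count and the sample points are assumptions.

```python
import math


def widest_separating_line(class_a, class_b, steps=360):
    """Scan line orientations and keep the one with the widest margin.

    For each direction n = (cos t, sin t), every point is projected onto n.
    If all of class_b projects beyond all of class_a, the gap between the
    two groups is the margin for that orientation (the distance between
    the two dashed lines). Returns (margin, angle_in_degrees, offset) for
    the best line n . p = offset, or None if no scanned orientation
    separates the classes.
    """
    best = None
    for k in range(steps):
        theta = 2 * math.pi * k / steps
        nx, ny = math.cos(theta), math.sin(theta)
        proj_a = [nx * x + ny * y for x, y in class_a]
        proj_b = [nx * x + ny * y for x, y in class_b]
        gap = min(proj_b) - max(proj_a)
        if gap > 0 and (best is None or gap > best[0]):
            offset = (min(proj_b) + max(proj_a)) / 2  # midway between the classes
            best = (gap, k * 360 / steps, offset)
    return best


lower_left = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
upper_right = [(3.0, 3.0), (4.0, 3.0), (3.0, 4.0)]

margin, angle, offset = widest_separating_line(lower_left, upper_right)
print(margin, angle, offset)
```

The points whose projections equal `max(proj_a)` or `min(proj_b)` at the best orientation are the ones that constrain the width of the margin, i.e. the support vectors.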

If all analyses consisted of two-category target variables with two predictor variables, and

the cluster of points could be divided by a straight line, life would be easy. Unfortunately, this is

not generally the case, so SVM must deal with (a) more than two predictor variables, (b)
