A Practical Guide to Support Vector Classification
Chih-Wei Hsu, Chih-Chung Chang, and Chih-Jen Lin
Department of Computer Science
National Taiwan University, Taipei 106, Taiwan
http://www.csie.ntu.edu.tw/~cjlin
Initial version: 2003. Last updated: April 15, 2010.
Abstract
The support vector machine (SVM) is a popular classification technique.
However, beginners who are not familiar with SVM often get unsatisfactory
results since they miss some easy but significant steps. In this guide, we propose
a simple procedure which usually gives reasonable results.
1 Introduction
SVMs (Support Vector Machines) are a useful technique for data classification. Although SVM is considered easier to use than Neural Networks, users not familiar with
it often get unsatisfactory results at first. Here we outline a “cookbook” approach
which usually gives reasonable results.
Note that this guide is not for SVM researchers nor do we guarantee you will
achieve the highest accuracy. Also, we do not intend to solve challenging or difficult problems. Our purpose is to give SVM novices a recipe for rapidly obtaining
acceptable results.
Although users do not need to understand the underlying theory behind SVM, we
briefly introduce the basics necessary for explaining our procedure. A classification
task usually involves separating data into training and testing sets. Each instance
in the training set contains one “target value” (i.e., the class label) and several
“attributes” (i.e. the features or observed variables). The goal of SVM is to produce
a model (based on the training data) which predicts the target values of the test data
given only the test data attributes.
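The workflow above can be sketched in code. This is a minimal illustration, assuming scikit-learn and a synthetic dataset (the guide itself is written around LIBSVM, so the library and data here are our own choices):

```python
# Sketch of the classification workflow: split data, train on the
# training set, predict target values from test-set attributes only.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Synthetic data standing in for a real task: each instance has
# several attributes (features) and one target value (class label).
X, y = make_classification(n_samples=200, n_features=4, random_state=0)

# Separate data into training and testing sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Train a model on the training data, then predict the target values
# of the test data given only the test data attributes.
model = SVC()
model.fit(X_train, y_train)
predictions = model.predict(X_test)
print(len(predictions))  # one predicted label per test instance
```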
Given a training set of instance-label pairs $(x_i, y_i)$, $i = 1, \ldots, l$, where $x_i \in R^n$ and $y \in \{1, -1\}^l$, the support vector machines (SVM) (Boser et al., 1992; Cortes and Vapnik, 1995) require the solution of the following optimization problem:
$$
\begin{aligned}
\min_{w, b, \xi} \quad & \frac{1}{2} w^T w + C \sum_{i=1}^{l} \xi_i \\
\text{subject to} \quad & y_i (w^T \phi(x_i) + b) \ge 1 - \xi_i, \qquad (1) \\
& \xi_i \ge 0.
\end{aligned}
$$
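In problem (1), the parameter $C > 0$ trades the size of the margin against the training errors measured by the slack variables $\xi_i$. A hedged sketch of this effect, assuming scikit-learn's `SVC` (whose `C` parameter plays the role of $C$ above) and synthetic noisy data of our own choosing:

```python
# Effect of the penalty parameter C in problem (1): a small C tolerates
# more margin violations (larger slack), which typically leaves more
# training points as support vectors; a large C penalizes slack heavily.
from sklearn.datasets import make_classification
from sklearn.svm import SVC

# Noisy labels (flip_y) so that some slack xi_i > 0 is unavoidable.
X, y = make_classification(n_samples=100, n_features=2, n_redundant=0,
                           flip_y=0.2, random_state=1)

for C in (0.1, 100.0):
    model = SVC(kernel="rbf", C=C).fit(X, y)
    print(C, model.n_support_.sum())  # number of support vectors
```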