B. Ensemble-Based Learning With Boosting
Recently, a machine-learning technique known as “boosting”
has received considerable attention in the pattern recognition
community, due to its usefulness in designing ensemble-based
classifiers [31], [32]. The idea behind boosting is to sequentially
apply a base classifier, often called the "learner," to reweighted
versions of the training sample set, thereby generating a set of
classifiers of its kind. The example weights are updated at each
iteration through a classification-error-driven mechanism, so that
examples misclassified in the current round receive more emphasis
in the next one. Although any individual classifier produced by the
learner may perform only slightly better than random guessing, the
formed ensemble can provide a very accurate (strong) classifier.
It has been shown, both theoretically and experimentally, that
boosting is particularly robust in preventing overfitting and re-
ducing the generalization error by increasing the so-called mar-
gins of the training examples [32]–[35]. The margin is defined
as the minimal distance of an example to the decision surface of
classification [36]. For a classifier, a larger expected margin of
training data generally leads to a lower generalization error.
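To make the error-driven reweighting concrete, the sketch below shows one plausible AdaBoost-style round in Python. It is a minimal illustration, not code from the paper; the scikit-learn-style fit/predict interface and all variable names are assumptions.

```python
import numpy as np

def adaboost_round(learner, X, y, weights):
    """One illustrative AdaBoost round for binary labels y in {-1, +1}.

    The learner is trained on the weighted sample set; examples it
    misclassifies receive larger weights for the next round.
    """
    h = learner.fit(X, y, sample_weight=weights)            # weighted training (assumed API)
    pred = h.predict(X)
    err = np.sum(weights * (pred != y)) / np.sum(weights)   # weighted empirical error
    alpha = 0.5 * np.log((1.0 - err) / max(err, 1e-12))     # vote of this classifier
    weights = weights * np.exp(-alpha * y * pred)           # error-driven reweighting
    weights /= weights.sum()                                 # renormalize to a distribution
    return h, alpha, weights
```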
Since its introduction, AdaBoost has become known as the most
accurate general-purpose classification algorithm available [37].
However, the machine-learning community generally regards
ensemble-based learning rules, including boosting and bagging
[38], as unsuited to a strong and stable learner such as LDA [35],
[39]. The reason behind this belief is that the effectiveness
of these rules depends, to a great extent, on the learner’s
“instability,” which means that small changes in the training
set could cause large changes in the resulting classifier [35].
On the other hand, it has been found in practical applications
that boosting may fail when the learner is too weak [32]. In recent
boosting studies, Murua [40] introduced a useful notion of
weak dependence between classifiers constructed with the same
training data, and proposed an interesting upper bound on the
generalization error with respect to the margins of the classifiers
and their dependence. Murua’s bound reveals that to achieve
a low generalization error, the boosting procedure should not
only create the classifiers with large expected margins, but also
keep their dependence low or weak. This suggests, in theory,
that there exists a tradeoff between achieving large margins and
maintaining weak dependence.
The requirement for an appropriately weak learner signifi-
cantly restricts the applicability of the boosting algorithms in
practical applications, given that most state-of-the-art
recognition methods rely on a strong learner.
Therefore, it is highly desirable to improve the traditional
boosting frameworks, so that they are capable of accommo-
dating more general learners in both the pattern recognition and
machine learning communities.
C. Overview of the Contributions
In this paper, a novel weakness analysis theory is developed to
overcome the weak-learner requirement of existing boosting
algorithms. To this end, a new variable called "learning difficulty
degree" (LDD) is introduced along with a cross-validation method.
Together, they are used to analyze and appropriately regulate
the weakness of the classifiers generated by a strong learner
from the training data. In addition, a new loss function with
respect to the LDD is proposed to quantitatively estimate the
generalization power of the produced classifiers. The loss function
achieves this by balancing the averaged empirical error of the
classifiers against their mutual dependence, the two key factors
governing the generalization error of the formed ensemble
classifier, as shown in Murua's theory [40].
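Purely as an illustration of the balance just described (the paper's actual LDD-based loss is defined later and may differ in form), such a criterion can be written generically as

```latex
% Illustrative form only, not the paper's exact definition.
% \bar{\epsilon}: averaged empirical error of the produced classifiers
% \bar{\rho}:     a measure of their mutual dependence
% \lambda:        a hypothetical weight trading off the two terms
\mathcal{L}(\mathrm{LDD}) \;=\; \bar{\epsilon}(\mathrm{LDD}) \;+\; \lambda\,\bar{\rho}(\mathrm{LDD})
```

so that minimizing such a loss over the LDD favors classifiers that are individually accurate yet not strongly dependent on one another.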
The proposed weakness analysis theory is applied to boost
the performance of the traditional LDA-based approaches
in complex FR tasks. Thus, the learners in this work are
LDA-based ones, which differ from the traditional learners used
in boosting in two aspects: 1) they are rather strong and stable;
and 2) they are feature extractors rather than pure classifiers.
The latter makes this work similar in spirit to those of Viola,
Tieu and Jones [41]–[43], where the boosting process is viewed
as a feature selection process. Particularly, to boost the specific
LDA-based learners, a new variable called “pairwise class
discriminant distribution” (PCDD) is also introduced to build
an effective interaction mechanism between the booster and
the learner. As a result, a novel ensemble-based discriminant
learning method is developed here under the boosting frame-
work through the utilization of the PCDD and the weakness
analysis theory. In the proposed method, each round of boosting
generates a new LDA subspace that particularly targets those
examples from the hard-to-separate pairs of classes indicated
by the preceding PCDD, so that the separability between these
classes is enhanced in the new LDA subspace. The final result
obtained by the process is an ensemble of multiple relatively
weak but very specific LDA solutions. The ensemble-based
solution is able to take advantage of both boosting and LDA.
The FR experiments show that it outperforms the single
solutions created by the LDA-based learners in various difficult
learning scenarios, including cases with different SSS settings
and a case with increased nonlinear variations.
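As a rough sketch of the iterative process just described, the following Python-style pseudocode outlines the flow; the function names (lda_learner, update_pcdd, combine) are hypothetical placeholders rather than the paper's algorithm, whose precise steps are given in Section III.

```python
def boosted_lda_sketch(X, y, n_rounds, lda_learner, update_pcdd, combine):
    """Illustrative flow of ensemble-based discriminant learning with boosting.

    Each round generates an LDA subspace that focuses on the class pairs
    currently hardest to separate, as indicated by the PCDD; the final
    ensemble combines these relatively weak but specific LDA solutions.
    All callables are assumed placeholders, not the authors' exact API.
    """
    pcdd = None                              # no pairwise-class emphasis before round 1
    models = []
    for _ in range(n_rounds):
        model_t = lda_learner(X, y, pcdd)    # learner: LDA subspace for hard class pairs
        models.append(model_t)
        pcdd = update_pcdd(models, X, y)     # booster: re-estimate hard-to-separate pairs
    return combine(models)                   # ensemble of the generated LDA solutions
```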
The rest of the paper is organized as follows. In Section II, we
briefly review the AdaBoost approach and its multiclass exten-
sions. Then, in Section III, the theory and algorithm of how to
boost an LDA-based strong learner are introduced and described
in detail. Section IV reports on a set of experiments conducted
on the FERET face database to demonstrate the effectiveness
of the proposed methodologies. Finally, conclusions are sum-
marized in Section V. In addition, a brief introduction to the
adopted LDA-based learners is given in Appendix I.
II. RELATED WORK
Since the boosting method proposed here is developed from
AdaBoost [31], we begin with a brief review of the algorithm
and its multiclass extensions.
In the case of pattern classification, the task of learning from
examples can be formulated in the following way: Given a
training set $\mathcal{Z}=\{\mathcal{Z}_i\}_{i=1}^{C}$ containing $C$ classes, with each
class $\mathcal{Z}_i=\{\mathbf{z}_{ij}\}_{j=1}^{C_i}$ consisting of a number of examples
$\mathbf{z}_{ij}$ and their corresponding class labels $y_{ij}=i$, a total of
$N=\sum_{i=1}^{C}C_i$ examples are available in the set. Let $\mathbb{R}^{J}$ be the
sample space, $\mathbf{z}_{ij}\in\mathbb{R}^{J}$, and let $\mathcal{Y}=\{1,\ldots,C\}$ be the
label set, $y_{ij}\in\mathcal{Y}$. Taking as input such a set $\mathcal{Z}$, the objective of