美赛E题常见参考代码.zip (MCM/ICM Problem E common reference code), CSDN Library resource

107 files in total

.m: 60

.mat: 21

.png: 14


147 views, uploaded 2023-08-23 16:55:49, 5.52 MB ZIP


美赛E题常见参考代码.zip (107 files)

wavenn.asv 3KB

GARCH-Matlab.doc 34KB

bp.doc 25KB

改进灰色马尔科夫模型湖北省用水量预测模型.docx 256KB

模糊综合评价原理+案例讲解与Matlab实现.docx 166KB

figure.fig 9KB

centre.fig 8KB

chapter12.html 15KB

chapter40.html 13KB

train_C4_5.m 13KB

FuzzyNet.m 8KB

R_JD_LDA_BstTrnM2V1aR.m 7KB

crossvalidation_lvq.m 6KB

wavenn.m 5KB

FastNN.m 5KB

F_wJD_LDAV2.m 5KB

chapter12.m 4KB

main.m 4KB

chapter21_lvq.m 4KB

chapter9.m 3KB

BP_Hidden.m 3KB

chapter2_1.m 3KB

wavenn.m 3KB

chapter21_bp.m 3KB

Condensing.m 3KB

waiji.m 3KB

F_JD_LDA_PLossVa.m 3KB

chapter40.m 2KB

vote_C4_5.m 2KB

main.m 2KB

F_AssignLabelM2V2a.m 2KB

Fisher.m 2KB

bestselect.m 2KB

Fassess.m 2KB

F_PartTrainValid.m 1KB

F_RandPartV4.m 1KB

Widrow_Hoff.m 1KB

huise.m 1KB

KNN.m 1KB

Cross.m 1KB

SinglePerceptron.m 1KB

灰色.m 1KB

F_GetWgtBTW.m 1KB

incorporate.m 1KB

draw.m 1KB

Mutation.m 1001B

decisionTree.m 928B

Select.m 912B

fitness.m 901B

rfmain.m 890B

array.m 810B

zh.m 786B

statistics.m 673B

NNforCondense.m 594B

test.m 580B

F_Random.m 540B

concentration.m 479B

excellence.m 455B

dea.m 408B

similar.m 377B

hundun.m 331B

popinit.m 319B

zy.m 225B

plotljz.m 176B

minf.m 150B

d_mymorlet.m 97B

mymorlet.m 90B

aaa.mat 4.02MB

data.mat 84KB

data.mat 45KB

chapter12_wine.mat 20KB

data1.mat 10KB

IAdata.mat 5KB

data2.mat 2KB

traffic_flux.mat 2KB

data2_noisy.mat 224B

data1_noisy.mat 216B

data2.mat 205B

data5.mat 195B

data8.mat 195B

data4.mat 195B

data1.mat 195B

data6.mat 195B

data9.mat 194B

data3.mat 192B

data0.mat 191B

data7.mat 191B

01593701.pdf 874KB

chapter40_08.png 28KB

chapter40_07.png 27KB

chapter40_04.png 16KB

chapter12_02.png 11KB

chapter12_03.png 9KB

chapter40_01.png 8KB

chapter40_03.png 7KB

chapter40_02.png 7KB

chapter12_01.png 7KB


166 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 17, NO. 1, JANUARY 2006

Ensemble-Based Discriminant Learning With Boosting for Face Recognition

Juwei Lu, Member, IEEE, K. N. Plataniotis, Senior Member, IEEE, A. N. Venetsanopoulos, Fellow, IEEE, and Stan Z. Li

Abstract—In this paper, we propose a novel ensemble-based approach to boost the performance of traditional Linear Discriminant Analysis (LDA)-based methods used in face recognition. The ensemble-based approach is based on the recently emerged technique known as “boosting.” However, it is generally believed that boosting-like learning rules are not suited to a strong and stable learner such as LDA. To break the limitation, a novel weakness analysis theory is developed here. The theory attempts to boost a strong learner by increasing the diversity between the classifiers created by the learner, at the expense of decreasing their margins, so as to achieve a tradeoff suggested by recent boosting studies for a low generalization error. In addition, a novel distribution accounting for the pairwise class discriminant information is introduced for effective interaction between the booster and the LDA-based learner. The integration of all these methodologies leads to the novel ensemble-based discriminant learning approach, capable of taking advantage of both the boosting and LDA techniques. Promising experimental results obtained on various difficult face recognition scenarios demonstrate the effectiveness of the proposed approach. We believe that this work is especially beneficial in extending the boosting framework to accommodate general (strong/weak) learners.

Index Terms—Boosting, face recognition (FR), linear discriminant analysis, machine learning, mixture of linear models, small-sample-size (SSS) problem, strong learner.

I. INTRODUCTION

A. Face Recognition

FACE RECOGNITION (FR) has a wide range of applications, such as face-based video indexing and browsing engines, biometric identity authentication, human-computer interaction, and multimedia monitoring/surveillance. Within the past two decades, numerous FR algorithms have been proposed, and detailed surveys of the developments in the area have appeared in the literature [1]–[6]. Among the various FR methodologies, the most popular are the so-called appearance-based approaches, which include the three best-known FR methods, namely Eigenfaces [7], Fisherfaces [8], and Bayes Matching [9]. With a focus on low-dimensional statistical feature extraction, the appearance-based approaches generally operate directly on appearance images of the face object and process them as two-dimensional (2-D) holistic patterns to avoid difficulties associated with three-dimensional (3-D) modeling and shape or landmark detection [5]. Of the appearance-based FR methods, those based on linear discriminant analysis (LDA) have shown promising results, as demonstrated in [8], [10]–[15]. However, statistical learning methods such as the LDA-based ones often suffer from the so-called “small-sample-size” (SSS) problem [16], encountered in high-dimensional pattern recognition tasks where the number of training samples available for each subject is smaller than the dimensionality of the samples. In the experiments reported here, for example, only a small number of training samples per subject are available, while the dimensionality of the sample space is far higher. In addition, the performance of linear appearance-based methods, including LDA, often deteriorates rapidly when face patterns are subject to large variations in viewpoint, illumination, or facial expression. These variations result in a highly nonconvex and complex distribution of face images [17]. Thus, the limited success of these methods should be attributed to their linear nature.

Manuscript received March 23, 2004; revised December 24, 2004. This work was supported in part by the Bell University Laboratories at the University of Toronto.

J. Lu, K. N. Plataniotis, and A. N. Venetsanopoulos are with The Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, ON M5S 3G4 Canada (e-mail: kostas@dsp.toronto.edu).

Stan Z. Li is with the Center for Biometrics and Security Research, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, P.R. China.

Digital Object Identifier 10.1109/TNN.2005.860853
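The small-sample-size problem described in the introduction can be illustrated numerically. The following is a minimal sketch, not taken from the paper: the data are random and the sizes (4 classes, 3 samples each, 50 dimensions) are hypothetical. It shows that when the number of samples is below the dimensionality, the within-class scatter matrix used by LDA is rank deficient and therefore cannot be inverted directly.

```python
import numpy as np

def within_class_scatter(X, y):
    """Unweighted within-class scatter matrix of the rows of X."""
    d = X.shape[1]
    S_w = np.zeros((d, d))
    for label in np.unique(y):
        Xc = X[y == label]
        centered = Xc - Xc.mean(axis=0)   # subtract the class mean
        S_w += centered.T @ centered
    return S_w

rng = np.random.default_rng(0)
n_classes, per_class, dim = 4, 3, 50      # hypothetical sizes: 12 samples in 50 dimensions
X = rng.normal(size=(n_classes * per_class, dim))
y = np.repeat(np.arange(n_classes), per_class)

S_w = within_class_scatter(X, y)
rank = np.linalg.matrix_rank(S_w)
# The rank cannot exceed N - C (here 12 - 4), far below dim, so S_w is singular.
print(rank, "<", dim)
```

Because each class contributes at most (class size minus one) independent directions, the rank is bounded by the number of samples minus the number of classes, whatever the data.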

In general, a nonconvex distribution can be handled either by globally nonlinear models or by a mixture of locally linear models (or ensemble-based methods, as they are known in the machine learning literature [18]). Globally nonlinear methods are not without problems. Approaches such as those based on kernel machines [19]–[26] require the optimization of many design parameters, tend to overfit easily due to the increased algorithmic complexity, and are computationally expensive compared to their linear counterparts. The last point is particularly important for tasks such as face recognition, which are performed in a high-dimensional input space. On the other hand, ensemble-based approaches embody the principle of “divide and conquer,” by which a complex recognition task is decomposed into a set of simpler ones, in each of which a locally linear pattern distribution can be generalized and dealt with by a relatively simple linear solution. As such, the ensemble-based methods are simpler, easier to implement, and more cost effective compared to the nonlinear ones. However, most existing ensemble-based FR methods are developed based on traditional cluster analysis [27]–[30]. As a consequence, a disadvantage for classification tasks is that the submodels’ division/combination criteria used in these clustering techniques are not directly related to the classification error rate (CER) of the resulting classifier, especially the true CER (often referred to as the generalization error rate).

LU et al.: ENSEMBLE-BASED DISCRIMINANT LEARNING WITH BOOSTING FOR FACE RECOGNITION 167

B. Ensemble-Based Learning With Boosting

Recently, a machine-learning technique known as “boosting” has received considerable attention in the pattern recognition community, due to its usefulness in designing ensemble-based classifiers [31], [32]. The idea behind boosting is to sequentially employ a base classifier on a weighted version of the training sample set to generalize a set of classifiers of its kind. Often the base classifier is also called the “learner.” The weights are updated at each iteration through a classification-error-driven mechanism. Although any individual classifier produced by the learner may perform only slightly better than random guessing, the formed ensemble can provide a very accurate (strong) classifier. It has been shown, both theoretically and experimentally, that boosting is particularly robust in preventing overfitting and reducing the generalization error by increasing the so-called margins of the training examples [32]–[35]. The margin is defined as the minimal distance of an example to the decision surface of classification [36]. For a classifier, a larger expected margin of training data generally leads to a lower generalization error.
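To make the notion of a margin concrete, the sketch below computes the normalized voting margin commonly used in the boosting literature for a weighted-vote ensemble of binary classifiers. This is an illustrative assumption rather than the paper's own definition: labels and votes are taken to be ±1, and the function name `voting_margins` and the toy numbers are hypothetical.

```python
import numpy as np

def voting_margins(votes, alphas, y):
    """Normalized voting margin of each example.

    votes  : (T, N) array of +1/-1 predictions from T classifiers
    alphas : (T,) nonnegative combination weights
    y      : (N,) true labels in {+1, -1}
    A margin lies in [-1, 1]; it is positive exactly when the weighted vote is correct.
    """
    alphas = np.asarray(alphas, dtype=float)
    score = alphas @ votes / alphas.sum()   # weighted average vote per example
    return y * score

# Hypothetical ensemble of three classifiers on four examples.
votes = np.array([[ 1,  1, -1, -1],
                  [ 1, -1, -1,  1],
                  [ 1,  1,  1, -1]])
alphas = [0.5, 0.3, 0.2]
y = np.array([1, 1, -1, -1])
margins = voting_margins(votes, alphas, y)
print(margins)
```

All four examples are classified correctly here, but the first (on which every classifier agrees) has the largest margin, while the others are decided by a narrower weighted majority.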

Since its introduction, AdaBoost became known as the most accurate general purpose classification algorithm available [37]. However, the machine-learning community generally regards ensemble-based learning rules, including boosting and bagging [38], as not suited to a strong and stable learner such as LDA [35], [39]. The reason behind this belief is that the effectiveness of these rules depends, to a great extent, on the learner’s “instability,” which means that small changes in the training set could cause large changes in the resulting classifier [35]. On the other hand, it has been found in practical applications that boosting may fail given a too weak learner [32]. In recent boosting studies, Murua [40] introduced a useful notion of weak dependence between classifiers constructed with the same training data, and proposed an interesting upper bound on the generalization error with respect to the margins of the classifiers and their dependence. Murua’s bound reveals that to achieve a low generalization error, the boosting procedure should not only create classifiers with large expected margins, but also keep their dependence low or weak. This suggests in theory that there exists a tradeoff between large margins and weak dependence.

The requirement for an appropriately weak learner significantly restricts the applicability of boosting algorithms in practical applications, given that most state-of-the-art recognition methods involve the utilization of a strong learner. Therefore, it is highly desirable to improve the traditional boosting frameworks, so that they are capable of accommodating more general learners in both the pattern recognition and machine learning communities.

C. Overview of the Contributions

In this paper, a novel weakness analysis theory is developed to overcome the limitation of the weak learners, which are necessary in existing boosting algorithms. To this end, a new variable called “learning difficulty degree” (LDD) is introduced along with a cross-validation method. They are used to analyze and appropriately regulate the weakness of the classifiers generalized by a strong learner via the training data. In addition, a new loss function with respect to the LDD is proposed to quantitatively estimate the generalization power of the produced classifiers. This is achieved in the loss function by balancing the averaged empirical error of the classifiers and their mutual dependence. These are the two key factors in the generalization error of the formed ensemble classifier, as shown in Murua’s theory [40].

The proposed weakness analysis theory is applied to boost the performance of the traditional LDA-based approaches in complex FR tasks. Thus, the learners in this work are LDA-based ones, which differ from the traditional learners used in boosting in two respects: 1) they are rather strong and stable, and 2) they are feature extractors rather than pure classifiers. The latter makes this work similar in spirit to those of Viola, Tieu, and Jones [41]–[43], where the boosting process is viewed as a feature selection process. In particular, to boost the specific LDA-based learners, a new variable called “pairwise class discriminant distribution” (PCDD) is also introduced to build an effective interaction mechanism between the booster and the learner. As a result, a novel ensemble-based discriminant learning method is developed here under the boosting framework through the utilization of the PCDD and the weakness analysis theory. In the proposed method, each round of boosting generalizes a new LDA subspace particularly targeting those examples from the hard-to-separate pairs of classes indicated by its preceding PCDD, so that the separability between these classes is enhanced in the new LDA subspace. The final result obtained by the process is an ensemble of multiple relatively weak but very specific LDA solutions. The ensemble-based solution is able to take advantage of both boosting and LDA. It is shown by the FR experiments to outperform the single solutions created by the LDA-based learners in various difficult learning scenarios, which include the cases with different SSS settings and the case with increased nonlinear variations.

The rest of the paper is organized as follows. In Section II, we briefly review the AdaBoost approach and its multiclass extensions. Then, in Section III, the theory and algorithm of how to boost an LDA-based strong learner are introduced and described in detail. Section IV reports on a set of experiments conducted on the FERET face database to demonstrate the effectiveness of the proposed methodologies. Finally, conclusions are summarized in Section V. In addition, a brief introduction to the adopted LDA-based learners is given in Appendix I.

II. RELATED WORK

Since the boosting method proposed here is developed from AdaBoost [31], we begin with a brief review of the algorithm and its multiclass extensions.

In the case of pattern classification, the task of learning from examples can be formulated in the following way: Given a training set containing a number of classes, with each class consisting of a number of examples and their corresponding class labels, a fixed total number of examples is available in the set. Every example belongs to a sample space and every label to a label set. Taking such a set as input, the objective of learning is to estimate a function or classifier, mapping the sample space to the label set, such that it will correctly classify unseen examples.

Fig. 1. Algorithm of boosting an LDA-style learner (simply replace the learner with JD-LDA or EFM in step 3 to obtain B-JD-LDA or B-EFM). Either of the two quantities indexed by the class pair (p, q) can be used to replace the other during the boosting process.

To this end, AdaBoost works by repeatedly applying a given weak learner to a weighted version of the training set in a series of rounds, and then linearly combining the weak classifiers constructed in each round into a single strong classifier. The most interesting feature of AdaBoost is its surprising ability to reduce the amount of overfitting and the generalization error of classification, even as the number of rounds becomes large [31], [34]. To explain this property, quite a number of perspectives on AdaBoost have emerged since its introduction [44]. The dominant one is the margin theory, which regards AdaBoost as an efficient method for maximizing the margin [34]. However, many researchers have shown that the margin theory provides only partial answers to the puzzle [45], [46]. As a result, AdaBoost remains a mysterious algorithm, and explaining it is considered one of the most important unsolved problems in machine learning [37]. On the other hand, the limitation in the theoretical explanation does not seem to hamper the success of AdaBoost-style approaches in practical applications. For example, Viola and Jones [43] built the first real-time face detection system by using AdaBoost, which is considered a dramatic breakthrough in face detection research.
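The procedure just described (reweight the training set, fit a weak classifier, combine the results linearly) can be sketched for the binary case as follows. This is a minimal illustration under stated assumptions, not the algorithm of Fig. 1: the weak learner is a decision stump chosen by exhaustive search, labels are ±1, and all function names and the toy data are hypothetical.

```python
import numpy as np

def stump_predict(X, f, thr, pol):
    """Predict +1 where pol * (x_f - thr) >= 0, otherwise -1."""
    return np.where(pol * (X[:, f] - thr) >= 0, 1, -1)

def fit_stump(X, y, w):
    """Return (weighted error, feature, threshold, polarity) of the best stump."""
    best = (np.inf, 0, 0.0, 1)
    for f in range(X.shape[1]):
        for thr in np.unique(X[:, f]):
            for pol in (1, -1):
                err = w[stump_predict(X, f, thr, pol) != y].sum()
                if err < best[0]:
                    best = (err, f, thr, pol)
    return best

def adaboost(X, y, rounds=10):
    """Binary AdaBoost: returns a list of (alpha, feature, threshold, polarity)."""
    w = np.full(len(y), 1.0 / len(y))          # start from a uniform sample distribution
    ensemble = []
    for _ in range(rounds):
        err, f, thr, pol = fit_stump(X, y, w)
        if err >= 0.5:                         # no better than random guessing: stop
            break
        err = max(err, 1e-12)                  # guard against division by zero
        alpha = 0.5 * np.log((1 - err) / err)  # weight of this weak classifier
        w = w * np.exp(-alpha * y * stump_predict(X, f, thr, pol))
        w /= w.sum()                           # misclassified examples gain weight
        ensemble.append((alpha, f, thr, pol))
    return ensemble

def predict(ensemble, X):
    """Sign of the weighted vote of all stumps."""
    score = sum(a * stump_predict(X, f, t, p) for a, f, t, p in ensemble)
    return np.where(score >= 0, 1, -1)

# Toy 1-D pattern (+ + - - + +) that no single stump can fit.
X_demo = np.arange(6, dtype=float).reshape(-1, 1)
y_demo = np.array([1, 1, -1, -1, 1, 1])
ensemble = adaboost(X_demo, y_demo, rounds=3)
print((predict(ensemble, X_demo) == y_demo).mean())
```

On this toy pattern the best single stump misclassifies two of the six points, whereas the weighted vote of three stumps fits all of them, which is the sense in which weak classifiers are combined into a stronger one.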

AdaBoost was originally developed to support binary classification tasks. Its multiclass extensions include two variants, AdaBoost.M1 and AdaBoost.M2 [31]. AdaBoost.M1 is the most straightforward generalization. However, the algorithm halts if the classification error rate (CER) of the weak classifier produced in any iterative step is 50% or higher. Research indicates that this limitation often terminates the procedure too early, resulting in insufficient classification capabilities [32], [34]. To avoid the problem, rather than the ordinary CER, AdaBoost.M2 attempts to minimize a more sophisticated error measure called “pseudoloss,” which is expressed as

(1)

where the weighting term (see steps 7 and 8 of Fig. 1 for its definition) is the so-called “mislabel distribution,” defined over the set of all mislabels. With the pseudoloss, the boosting process can continue as long as the weak classifier produced has a pseudoloss slightly better than random guessing. In addition, the introduction of the mislabel distribution enhances the communication between the learner and the booster. In this way, AdaBoost.M2 can focus the learner not only on hard-to-classify examples, but more specifically, on the incorrect labels [31]. For all these reasons, we develop the ensemble-based discriminant algorithm proposed in the next section following the AdaBoost.M2 paradigm.
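As a point of reference, the pseudoloss of AdaBoost.M2 is usually written in the boosting literature as one half of the sum, over all (example, incorrect label) pairs, of the mislabel weight times (1 − h(x, true label) + h(x, incorrect label)), where h returns a confidence in [0, 1]. The sketch below implements that standard form; the array layout, the function names, and the uniform initial distribution are assumptions made for illustration, and the notation is not the paper's.

```python
import numpy as np

def uniform_mislabel_dist(labels, n_classes):
    """Uniform distribution over all (example, incorrect label) pairs."""
    d = np.ones((len(labels), n_classes))
    d[np.arange(len(labels)), labels] = 0.0   # a true label is never a mislabel
    return d / d.sum()

def pseudoloss(h, labels, mislabel_dist):
    """Standard AdaBoost.M2 pseudoloss.

    h             : (N, C) confidences in [0, 1], h[i, y] for example i and label y
    labels        : (N,) integer true labels
    mislabel_dist : (N, C) nonnegative weights, zero at the true labels, summing to 1
    """
    n = len(labels)
    h_true = h[np.arange(n), labels][:, None]        # h(x_i, y_i) as a column
    terms = mislabel_dist * (1.0 - h_true + h)       # true-label entries carry zero weight
    return 0.5 * terms.sum()

labels = np.array([0, 1, 2, 1])
dist = uniform_mislabel_dist(labels, n_classes=3)

perfect = np.eye(3)[labels]          # confidence 1 on the true label, 0 elsewhere
uninformative = np.full((4, 3), 0.5) # same confidence for every label

print(pseudoloss(perfect, labels, dist), pseudoloss(uninformative, labels, dist))
```

A hypothesis that cannot tell labels apart scores exactly one half, so "slightly better than random guessing" means a pseudoloss slightly below 0.5, regardless of the number of classes.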

There are two LDA-based FR approaches (or learners) that are boosted in this work. One is the so-called “Enhanced Fisher LDA Model” (hereafter EFM) [13], and the other is called “Revised Direct LDA” (hereafter JD-LDA) [47], proposed recently by the authors. The EFM method is an improvement of the Fisherfaces method [8], while the JD-LDA method is an LDA variant introduced specifically for face recognition in high-dimensional, small-sample-size scenarios. For completeness, the details of the two learners are described in Appendix I. Compared to the traditional learners used in boosting algorithms, two points about the LDA-based learners should be emphasized again. 1) They are strong and stable learners, which can be successfully used as stand-alone procedures in FR tasks [13], [47], [48]. That obviously contradicts the general belief that boosting solutions should operate only on top of weak learners. 2) The EFM or JD-LDA learner is composed of an LDA-based feature extractor and a nearest center classifier. As can be seen in Appendix I, the learning focus of such a learner is on the feature extractor rather than the classifier. This differs from the original boosting design, where the weak learners are used only as pure classifiers without regard to feature extraction. It makes the AdaBoost learning tend toward an adaptive feature selection process, an idea also seen in [43]. Therefore, accommodating a learner such as JD-LDA or EFM requires a generalized boosting framework, which is not restricted by the assumption that a weak learner is available. To highlight these differences, we call the more general classifier produced by the LDA-based learners a “gClassifier” in the rest of the paper.
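The two-part structure of such a learner, a linear feature extractor followed by a nearest center classifier, can be sketched as below. This is only an illustration: the projection matrix `W` is assumed to be given (in the paper it would come from EFM or JD-LDA, which are not reproduced here), Euclidean distance is an assumption, and the function names and data are hypothetical.

```python
import numpy as np

def fit_centers(X, y, W):
    """Project samples with W (d x m) and return the class labels and class centers."""
    Z = X @ W
    classes = np.unique(y)
    centers = np.stack([Z[y == c].mean(axis=0) for c in classes])
    return classes, centers

def nearest_center_predict(X, W, classes, centers):
    """Assign each sample to the class whose center is closest in the subspace."""
    Z = X @ W
    dists = np.linalg.norm(Z[:, None, :] - centers[None, :, :], axis=2)
    return classes[dists.argmin(axis=1)]

# Hypothetical data: the first coordinate separates the classes, the second is noise.
X_train = np.array([[0.0, 5.0], [1.0, -5.0], [10.0, 5.0], [11.0, -5.0]])
y_train = np.array([0, 0, 1, 1])
W = np.array([[1.0], [0.0]])         # assumed projection that keeps only the first coordinate

classes, centers = fit_centers(X_train, y_train, W)
X_test = np.array([[2.0, 100.0], [9.0, -100.0]])
print(nearest_center_predict(X_test, W, classes, centers))
```

The classifier itself has nothing to learn beyond the class centers, which is why the learning effort of such a learner sits in the choice of the projection.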

III. BOOSTING A LDA-STYLE LEARNER

A. Interaction Between the LDA Learner and the Booster

To boost a learner, we first have to build a strong connection between the learner and the boosting framework. In AdaBoost, this is implemented by manipulating the so-called “sample distribution,” which is a measure of how hard an example is to classify. However, we need a more specific connecting variable in this work, given that LDA is by nature a feature extractor, whose goal is to find a linear mapping that enhances the between-class separability of the samples under learning. For this purpose, a new distribution called “pairwise class discriminant distribution” (PCDD) is introduced here. The PCDD is developed from the mislabel distribution of AdaBoost.M2. Defined on any one pair of classes, the PCDD can be computed at each boosting iteration as in (2), which involves the number of elements in each of the two classes. As is known from the AdaBoost.M2 developments, the mislabel distribution indicates the extent of difficulty in distinguishing an example from an incorrect label, based on the feedback information from the preceding gClassifiers. Thus, the PCDD can be intuitively considered a measure of how important it is to discriminate between the two classes of a pair when designing the current gClassifier. Obviously, a larger PCDD value implies worse separability between the two classes. It is therefore suitable to drive an LDA-based learner through the PCDD, so that it is focused specifically on the hard-to-separate pairs of classes. To this end, rather than the ordinary definition of the between-class scatter matrix (built from the mean of each class and the average of the ensemble), we introduce a variant of it, which can be expressed as

(3)
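One plausible way to turn a mislabel distribution into a weight for every pair of classes is sketched below. It is an assumption made for illustration, not a reproduction of Eq. (2): for two different classes p and q, the weight is taken as half of the mislabel mass that class-p examples put on label q plus the mass that class-q examples put on label p, and it is zero when p equals q. The function name and the numbers are hypothetical.

```python
import numpy as np

def pairwise_class_weights(mislabel_dist, labels, n_classes):
    """Hypothetical class-pair weights aggregated from a mislabel distribution.

    mislabel_dist : (N, C) weights over (example, incorrect label) pairs
    labels        : (N,) integer true labels
    Returns a symmetric (C, C) matrix with a zero diagonal.
    """
    M = np.zeros((n_classes, n_classes))
    for p in range(n_classes):
        # M[p, q]: total mislabel mass from examples of class p toward label q
        M[p] = mislabel_dist[labels == p].sum(axis=0)
    A = 0.5 * (M + M.T)          # treat the pair (p, q) symmetrically
    np.fill_diagonal(A, 0.0)     # a class is never confused with itself
    return A

labels = np.array([0, 0, 1, 2])
mislabel_dist = np.array([[0.00, 0.30, 0.10],
                          [0.00, 0.10, 0.10],
                          [0.20, 0.00, 0.05],
                          [0.05, 0.10, 0.00]])
A = pairwise_class_weights(mislabel_dist, labels, n_classes=3)
print(A)
```

In this toy case classes 0 and 1 receive the largest weight, marking them as the pair that the preceding classifiers found hardest to tell apart.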

It should be noted at this point that the variant, weighted by the PCDD, embodies the design principle behind the so-called “fractional-step” LDA presented in [49]. According to this principle, object classes that are difficult to separate in the low-dimensional output spaces generalized in previous rounds can potentially result in misclassification. Thus, they should receive more attention by being weighted more heavily in the high-dimensional input space of the current round, so that their separability is enhanced in the resulting feature space. It can be easily seen that the variant reduces to the ordinary between-class scatter matrix when the PCDD is equal to a constant.
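The reduction to the ordinary between-class scatter under constant weights can be checked numerically. The sketch below uses a pairwise form with class priors as an assumed normalization; it illustrates the idea of weighting class pairs and is not a transcription of Eq. (3). The function names, the weight matrix `A`, and the random data are hypothetical.

```python
import numpy as np

def class_stats(X, y):
    """Class means and class priors (fraction of samples per class)."""
    classes = np.unique(y)
    means = np.stack([X[y == c].mean(axis=0) for c in classes])
    priors = np.array([(y == c).mean() for c in classes])
    return means, priors

def between_scatter(X, y):
    """Ordinary between-class scatter: sum_i prior_i (m_i - m)(m_i - m)^T."""
    means, priors = class_stats(X, y)
    diff = means - X.mean(axis=0)
    return (priors[:, None] * diff).T @ diff

def pairwise_weighted_between_scatter(X, y, A):
    """Assumed pairwise form: 0.5 * sum_pq A[p,q] prior_p prior_q (m_p - m_q)(m_p - m_q)^T."""
    means, priors = class_stats(X, y)
    S = np.zeros((X.shape[1], X.shape[1]))
    for p in range(len(means)):
        for q in range(len(means)):
            diff = (means[p] - means[q])[:, None]
            S += 0.5 * A[p, q] * priors[p] * priors[q] * (diff @ diff.T)
    return S

rng = np.random.default_rng(1)
y = np.repeat(np.arange(3), [4, 5, 6])            # three classes of unequal size
X = rng.normal(size=(len(y), 5)) + y[:, None]     # shift each class so the means differ

S_b = between_scatter(X, y)
S_const = pairwise_weighted_between_scatter(X, y, np.ones((3, 3)))
print(np.allclose(S_b, S_const))
```

With all pair weights equal to one, the pairwise sum equals the ordinary matrix exactly; raising the weight of one pair stretches the scatter along the direction between those two class means, which is what pushes the learner to separate them.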

Similarly, the weighted version of the within-class scatter matrix can be given as follows:

(4)

where the weight assigned to each example is defined over the training set as the sample distribution, similar to the one given in AdaBoost. Since this distribution is derived indirectly from the pseudoloss, we call it a “pseudo sample distribution” to distinguish it. It can be seen that a larger value of the pseudo sample distribution implies an example that is harder to classify for the preceding gClassifiers.
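A sample-weighted within-class scatter matrix can be sketched along the same lines. Two assumptions are made here and are not taken from Eq. (4): each example's pseudo sample weight is the total mislabel mass over its incorrect labels, and class means are left unweighted. The function names and data are hypothetical.

```python
import numpy as np

def pseudo_sample_distribution(mislabel_dist):
    """Assumed: an example's weight is its total mislabel mass over incorrect labels."""
    return mislabel_dist.sum(axis=1)   # true-label entries are zero by construction

def weighted_within_scatter(X, y, sample_w):
    """Sum over classes and examples of w_ij (z_ij - m_i)(z_ij - m_i)^T."""
    S = np.zeros((X.shape[1], X.shape[1]))
    for c in np.unique(y):
        mask = y == c
        centered = X[mask] - X[mask].mean(axis=0)
        S += (sample_w[mask][:, None] * centered).T @ centered
    return S

rng = np.random.default_rng(2)
y = np.repeat(np.arange(3), 4)
X = rng.normal(size=(len(y), 5))

# Uniform mislabel distribution: every (example, incorrect label) pair weighs the same.
mislabel_dist = np.ones((len(y), 3))
mislabel_dist[np.arange(len(y)), y] = 0.0
mislabel_dist /= mislabel_dist.sum()

w = pseudo_sample_distribution(mislabel_dist)
S_w = weighted_within_scatter(X, y, w)
S_plain = weighted_within_scatter(X, y, np.ones(len(y)))
print(np.allclose(S_w, S_plain / len(y)))
```

Under a uniform mislabel distribution the weighted matrix is just a rescaled ordinary within-class scatter; as boosting concentrates mass on hard examples, those examples dominate the matrix.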

Recently, it has been shown that to achieve a low generalization error, the boosting procedure should not only create classifiers with large expected margins, but also keep their dependence low or weak [40]. Obviously, classifiers trained with more overlapping examples will result in stronger dependence among them. A way to avoid building similar gClassifiers

[Eq. (2): piecewise definition of the PCDD; equation body not recoverable]


Uploader: skyJ
