samples. Thus it is not surprising that, despite the impressive results of regression classifiers and their various extensions, many works [11], [13]–[15] have cast doubt on their validity for image classification.
To overcome the first constraint of regression classifiers, the query-adapted technique, also called distance weights learning [16], offers a way to separate outlier samples from the dictionary atoms: a weight vector is obtained by computing the distances from the query sample to all training samples [17]. Many weighted regression algorithms build on this simple idea, including weighted SRC (WSRC) [18], weighted CRC (WCRC) [19], and locality group sensitive sparse representation (LGSR) [20]. Tang et al. argued that the locality-weighted regularization term of LGSR disrupts the group structure of its sparse solution. Considering
that each class plays a different role in regressing the query samples, they presented a weighted GSC (WGSC) algorithm [21] that accounts for the similarity between the query sample and each class. However, the distribution
structure of the training samples may not coincide with the natural class structure in an FR problem due to the influence
of corruption. As a result, in seeking a weights vector, these
classifiers may undesirably remove some samples which are
in fact needed to represent the query image. Timofte et al. [12] adopted the fixed-point theorem to fully exploit the weight information in the dictionary atoms, without employing the query-adapted technique. They claimed that their method is computationally more efficient than WCRC and WSRC while maintaining the same recognition performance. However, their method requires carefully controlled training images in both quality and quantity, which is difficult to achieve in practice.
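The query-adapted weighting described above can be sketched as follows. The Gaussian kernel, the bandwidth parameter sigma, and the function name are illustrative assumptions on our part, not the exact scheme of [16]–[20]:

```python
import numpy as np

def distance_weights(query, atoms, sigma=1.0):
    """Query-adapted (locality) weights: each dictionary atom is
    weighted by its distance to the query, so distant atoms, which
    are likely outliers for this query, are suppressed in coding."""
    dists = np.linalg.norm(atoms - query, axis=1)   # distance to each atom
    return np.exp(-dists**2 / (2.0 * sigma**2))     # Gaussian kernel weights
```

The resulting weights can then scale an l_1 or l_2 penalty on the coefficients, in the spirit of WSRC and WCRC.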
There are some attempts to alleviate the second constraint
of regression classifiers. RSRC [5] introduces an identity
matrix as a dictionary to code the outlier features (e.g.,
pixels with corruption or occlusion). Naseem et al. [22] and Zhang et al. [23] extended LRC and CRC, respectively, to robust versions, robust linear regression classification (RLRC) and robust collaborative representation classification (RCRC), using the Huber and Laplacian estimators to deal with severe random pixel noise and illumination changes.
To unify the existing robust sparse regression models, namely the additive model for error correction and the multiplicative model for error detection, He et al. [24], [25] created a half-quadratic framework by defining different half-quadratic functions based on the maximum correntropy criterion. Borrowing the idea of matrix-based representation from NMR, Luo et al. [26] and Chen et al. [27] respectively introduced the matrix variate slash distribution and the elliptically contoured distribution into image representation for better noise resistance. In addition, Yang et al. sought
a maximum a posteriori solution and proposed a regularized robust coding (RRC) model for FR [28], which is robust to various types of feature outliers (e.g., corruption and facial
expression). Qian et al. [29] further extended RRC to a robust general regression and representation model (RGRR) by using the prior information of the training set. RGRR works well
when the query samples share the same probability distribution
with the training samples. However, it involves an independent training stage that traditional regression-based classification approaches do not require. To sum up, although much progress has been made, robust FR remains an open problem due to the complex variations of corruption.
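In schematic form, the two model families unified by the half-quadratic framework can be written as follows. This is our notation and a simplified sketch, not the exact formulations of [24], [25]:

```latex
% additive model (error correction): a sparse error vector e absorbs corrupted entries
\min_{\alpha, e}\ \|y - X\alpha - e\|_2^2 + \lambda_1\|\alpha\|_1 + \lambda_2\|e\|_1

% multiplicative model (error detection): per-feature weights w_i downweight outliers
\min_{\alpha}\ \sum_i w_i \big(y_i - (X\alpha)_i\big)^2 + \lambda\|\alpha\|_1
```

In the multiplicative form, the weights w_i are typically derived from a robust estimator (e.g., Huber) of the current residuals.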
In this work, we address the two fundamental limitations of regression-based classification methods. We propose an iterative re-constrained group sparse classification (IRGSC) method to increase the robustness of FR in dealing with severe occlusion, complex corruption, real disguises, and large expression variations. The main contributions of this paper are outlined as follows:
1) A general framework is presented for regression-based classification. It unifies the previous l_1, l_2, and l_{2,1} regularization norms into a general formulation and learns the feature weights and distance weights simultaneously to achieve the optimal representation coefficients. We derive a new and efficient algorithm that iteratively and adaptively updates the weight vectors. In particular, for the feature weights we present a closed-form solution seamlessly connected to the model with only a single parameter, whereas existing approaches all rely on different distribution functions for the corresponding noise types.
2) We extend the convex group norm to a concave surrogate function for a tighter approximation of the l_{2,0}-norm. The weighted l_{2,p}-norm penalty is then enforced on the coefficients to impose both distance locality and group sparsity, where p is released from a fixed value and adapts flexibly to different training sizes and feature dimensions. By introducing the feature weight vector into the computation of the reconstruction residuals and the distance measurement, the proposed approach uses the selected features to reflect the true distribution structure. Compared with WGSC, the sparse solution of IRGSC maintains locality at the feature level and contains more discriminative information.
3) In our implementation, the IRGSC minimization problem
is transformed into an iteratively re-constrained group sparse
coding problem with a reasonably designed weight learning
strategy for robust FR. In theory, we prove that IRGSC monotonically decreases the objective value and that the coefficient sequence converges to a stationary point. Our extensive experiments on benchmark face databases show that IRGSC achieves much better performance than existing regression-based FR classifiers, especially under complicated variations such as severe occlusions and corruptions.
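The alternating structure of such iteratively re-constrained coding can be sketched as follows. This is a deliberately simplified schematic, a plain iteratively reweighted ridge solve with Huber feature weights; the function names, the ridge surrogate, and the Huber choice are our illustrative assumptions, not the actual IRGSC updates:

```python
import numpy as np

def huber_weight(r, delta=1.0):
    """Robust per-feature weight: 1 for small residuals, delta/|r|
    for large ones, so outlier pixels are gradually suppressed."""
    a = np.maximum(np.abs(r), 1e-12)
    return np.minimum(1.0, delta / a)

def irls_coding(X, y, lam=0.1, delta=1.0, n_iter=20):
    """Alternate between (1) a feature-weighted ridge solve for the
    coefficients and (2) re-estimating the feature weights from the
    current residual, as in re-constrained/robust coding schemes."""
    n, d = X.shape
    w = np.ones(n)                      # start by trusting all features
    for _ in range(n_iter):
        # weighted ridge: (X^T W X + lam I) alpha = X^T W y
        G = X.T @ (w[:, None] * X) + lam * np.eye(d)
        alpha = np.linalg.solve(G, X.T @ (w * y))
        w = huber_weight(y - X @ alpha, delta)
    return alpha, w
```

In this toy form, a grossly corrupted pixel receives a small weight after a few iterations and stops biasing the coefficients, which is the same effect the feature-weight learning in the text aims for.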
The rest of this paper is organized as follows. Section 2 introduces a general formulation of regression-based classifiers. Section 3 presents the IRGSC classifier for face recognition. Section 4 describes the optimization algorithm of IRGSC. Section 5 analyzes the complexity and convergence of the proposed method. Section 6 reports experiments on three public face databases and compares our results with state-of-the-art methods. Finally, Section 7 concludes the paper.
II. A GENERAL FRAMEWORK FOR REGRESSION BASED
CLASSIFIER
For various classification tasks, different regression-based approaches have been proposed with differing motivations; however, their purposes are often similar in the sense that they all aim to derive a series of representation