联合正则化最近点用于基于图像集的人脸识别资源-CSDN文库

83 浏览量 2021-03-07 05:59:12 上传评论收藏 487KB PDF 举报

人脸识别是计算机视觉和模式识别领域中最重要和基础的问题之一。传统的人脸识别通常假设只有一个查询人脸图像用于识别一个人的身份。尽管在库中每个主题有多个图像，但这仍然是一个挑战，因为需要解决各种变化和变化。基于图像集的人脸识别已经引起了大量的关注，因为它有希望能够克服各种变化。最近，(协同)正则化最近点(C)RNP通过测量在每个图像集中生成的最近点之间的集合间距离，实现了最先进的性能。然而，在计算其与不同图库图像集的最近点之间的距离时，查询集中的最近点会改变，这可能导致错误的图库图像集也可能具有较小的集合间距离；CRNP使用协同表示来克服这个问题，但它并没有显式地最小化集合间距离。为了克服这些问题并充分利用基于最近点的方法的优势，在本文中提出了一个新颖的联合正则化最近点(JRNP)用于基于图像集的人脸识别。在JRNP中，当计算查询集与不同类别图像集之间的距离时，查询集中的最近点保持不变；同时，它显式地最小化了面部图像的集合间距离。提出了一种有效的算法来解决这个问题，然后基于图像集中规则化最近点之间的联合距离进行分类。在基准数据库（例如，Honda/UCSD、CMU Mobo和YouTube数据库）上进行了广泛实验。实验结果清楚地表明，我们的JRNP在基于图像集的人脸识别中领先于性能。本研究论文介绍了联合正则化最近点（Joint Regularized Nearest Points，简称JRNP）用于基于图像集的人脸识别方法。该方法旨在解决传统人脸识别方法中的一些局限性，尤其是在面对图像集合中存在不同表情、姿态和光照变化的人脸识别问题时。在人脸识别的研究中，近年来基于图像集的方法由于其能够更好地应对变化而受到关注。相对于传统的单一图像人脸识别，基于图像集的人脸识别方法通过使用同一人不同图像的集合来提高识别的准确性和鲁棒性。文中提到的RNP方法通过测量每个图像集中生成的最近点之间的集合间距离来实现优秀性能，但是它存在一个缺陷：在计算查询集到不同图库图像集的距离时，查询集中的最近点会发生变化，这可能导致即使错误的图像集也会有较小的集合间距离。针对这个问题，CRNP（协同正则化最近点）方法采用了协同表示技术来解决，然而它没有明确地最小化集合间距离。为了解决这些问题并充分利用基于最近点的方法的优势，本研究提出了一种新的JRNP方法。在JRNP中，查询集的最近点在计算其到不同类别图像集的距离时保持不变。同时，它显式地最小化了面部图像的集合间距离。为了有效解决这一问题，研究者提出了一种高效的算法，分类基于图像集中规则化最近点之间的联合距离。通过对基准数据库进行广泛实验，实验结果表明，提出的JRNP在基于图像集的人脸识别领域取得了领先性能。人脸识别技术的发展不仅仅是识别单张图片上的单一表情，而是朝着能够处理在时间上连续变化的人脸数据集的目标前进。因此，研究者们开发了一系列基于图像集的方法，这些方法通过在库中对同一人的多个图像进行分析，从而实现更为准确和鲁棒的人脸识别。通过对不同图像之间的关系进行建模，这些方法可以捕捉到人脸在变化条件下的稳定性特征，同时抑制那些因变化而产生的非稳定性特征。研究者们通常采用一些数学模型来描述图像间的相似性和差异性，例如通过度量学习方法来计算不同图像集合之间的距离。通过这种方式，人脸识别技术能够在包含有挑战性的条件（如表情、姿态变化和光照变化等）的图像集上得到更好的识别结果。而JRNP正是这样一种旨在改善集合间距离度量，从而提高分类准确率的方法。研究论文还强调了在人脸识别技术中使用基准数据库进行测试的重要性。这些数据库提供了标准化的数据集合，使得不同研究团队之间可以进行公平的性能比较。通过在多个广泛使用的基准数据库上进行实验，研究者可以验证其方法的有效性，并对比其他现有技术的性能。联合正则化最近点（JRNP）方法在基于图像集的人脸识别研究中提供了一种新的技术途径。通过优化最近点的计算方式并最小化集合间距离，该方法显著提升了人脸识别系统的性能。随着技术的不断进步，未来的人脸识别系统有望进一步提高准确性和鲁棒性，以满足日益增长的安全和应用需求。

资源推荐

资源详情

资源评论

Joint Regularized Nearest Points for Image Set based Face Recognition

Meng Yang

, Weiyang Liu

, and Linlin Shen

College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China

School of Electronic & Computer Engineering, Peking University

Abstract—Face recognition based on image set has attracted

much attention due to its promising per-formance to overcome

various variations. Recently, (collaborative) regularized nearest

points (C)RNP has achieved the state-of-art performance by

measuring the between-set distance as the distance between

nearest points generated in each image set. However, the nearest

point of the query set in RNP changes in computing its distance

to nearest points of different gallery image sets, which may result

in that a wrong gallery image set can also has a small between-set

distance; CRNP used collaborative representation to overcome

this issue but it doesn’t explicitly minimize the between-set

distance. In order to solve these issues and fully exploit the

advantages of nearest point based approaches, in this paper a

novel joint regularized nearest points (JRNP) is proposed for face

recognition based on image sets. In JRNP, the nearest point in

the query set keeps the same when computing its distance to the

image sets of different classes; at the same time, it explicitly

minimize the between-set distance of facial images. An efficient

algorithm was proposed to solve this problem, and the

classification is then based on the joint distance between the

regularized nearest points in image sets. Extensive experiments

on benchmark databases were conducted on benchmark

databases (e.g., Honda/UCSD, CMU Mobo, and YouTube

databases). The experimental results clearly show that our JRNP

leads the performance in face recognition based on image sets.

I. INTRODUCTION

Recognizing the objects of interest (e.g., human faces) is

one of the most important and fundamental problems in the

communities of computer vision and pattern recognition.

Although face recognition (FR) has been extensively studied

in the past several decades, the traditional face recognition

usually assumes there is only a single query face image, from

which a human identity is recognized. Although there are

multiple images in the gallery set per subject, it is still a big

challenge to correctly recognize a person’s identity based on

only a single query face image captured in less-

controlled/uncontrolled environments due to different

variations (e.g., lighting, expression, pose, disguise changes)

existing in facial appearance images.

With the wide installation of video cameras and the

developments of large-capacity-storage media, it becomes

very convenient to collect multiple images from video

sequences or photo albums for a known subject and store these

images as the gallery and query image sets. Multiple face

images in the query and gallery set for each subject

incorporates more within-class appearance variations and

provides richer information for face recognition. Compared to

the traditional face recognition with a single query face image,

face recognition based on image sets could achieve more

satisfactory performance in practical face recognition

applications and is more promising framework of face

recognition.

Face recognition based on image sets has been attracting

much attention from researchers over the past decades. The

image sets could either be the consecutive video sequences

with temporal information, or unordered photo album images

collected from web at different times. Compared to video-

based face recognition [1][20-23][38-39], face recognition

based on general image sets, in which the temporal

information is not available, has wider applications. In this

paper we mainly focus on the face recognition problem based

on general image sets. Numerous approaches have been

proposed for this kind of image-set based recognition problem.

One major category of face recognition based on image set

is the parameter model based approaches [39][24-25]. These

parametric model based approaches [39][24-25] firstly

represent each image set by some parametric distribution with

the parameters estimated from the data itself, and then

calculate the between-set distance by measuring the similarity

between these two distributions (e.g., in terms of Kullback-

Leibler divergence [37]). However, the parametric methods

need to solve the difficult parameter estimation problem and

require strong statistical correlations between the gallery and

query sets, which may not exist in practice. To overcome the

shortcomings of parameter model based approaches, recently

Lu et al. [36] directly extracted the multiple order statistics

features from the image set and developed a multi-kernel

metric learning method to combine different order information.

In order to avoid the drawbacks of model-based methods,

non-parametric model-free based approaches were proposed

based on representing an image set as a convex/affine

subspace [3][19][26-28], mixture of subspaces [29-31], or

nonlinear manifolds [4][17][32-33]. In nonlinear-manifold

methods, the manifold of an image set is usually represented

as a combination of local linear subspaces [4][33]. In this

model-free face recognition based on image sets, how to

measure between-set distance is the key problem. A popular

way is to define the between-set distance as the distance

between two “exemplars” (e.g., the mean of samples) chosen

from these two image sets. For instance, Cevikalp et al. [3]

characterized each image set by an affine/convex hull spanned

by its samples, and selected two points (one point in one hull)

with the closest approach as the “exemplars”. Another way of

measuring the between-set distance for non-parametric

approach is to compare the structure of the non-parametric

model. For instance, Canonical correlation analysis (CCA) [9],

which analyzes the principal angles and canonical correlations

between linear subspaces, is widely used in the works of

[4][19][26][27][28][30][31]. Besides, the natural second-order

statistic-covariance matrix was used to represent each image

set in [6], and the image-set based classification was

formulated as classifying points lying on a Riemannian

manifold.

Recently inspired the success of sparse representation on

face recognition [5], Hu et al. [2] proposed a sparse

approximated nearest points (SANP) approach for image-set

based face recognition. By modeling each image set as an

affine hull, Hu et al. selected two points (one point in each

hull) with the closest distance as the sparse approximated

nearest points (SANP), where SANPs were required to be

sparsely represented by the original samples. The final

between-set distance of SANPs is the result of multiplication

of the distance between the found SANPs and the dimension

of the affine hull. Although SANP has achieved a good

performance, its model is a little complex. In order to improve

it, Yang et al. [34] proposed a regularized nearest points (RNP)

method, which modeled each image set as a regularized affine

hull and use the regularized nearest points to measure the

similarity of these two image sets. Following RNP and

collaborative representation based classification [7], Wu et al.

[35] find all the regularized nearest points simultaneously in

the framework of collaborative representation. Although RNP

and CRNP have shown promising performance, RNP finds

different regularized nearest points in the query set when

computing the distance between the query set and different

gallery sets, easily resulting in over-fitting. CRNP didn’t

explicitly minimize the between-set distance, which reduces

its discrimination, because the objective function of CRNP

aims to minimize the distance between the query set and the

entire gallery set.

This paper presents an efficient and effective joint

regularized nearest points (JRNP) method for image-set-based

face recognition. JRNP minimizes the joint representation

model by finding a unique nearest point in the query set and

explicitly minimizing the image set based between-class

distance. An efficient algorithm was proposed to solve this

problem, and the classification is then based on the joint

distance between the regularized nearest points in image sets.

Compared to RNP, the nearest point in the query set keeps the

same for different gallery image sets to avoid over-fitting.

Different from CRNP, JRNP explicitly minimize the between-

set distance to enhance the discrimination of the model. Our

experiments on benchmark image set databases clearly show

that JRNP has achieved better recognition accuracy than the

previous methods, including SANP, RNP and CRNP.

Meanwhile, the proposed RNP also has a very fast speed; e.g.,

it is over 14 times faster than SANP in the CMU Mobo

database [15].

The rest of this paper is organized as follows. Section II

briefly reviews the RNP method in [34] and CRNP method in

[35]. Section III presents the proposed JRNP. Section IV

conducts experiments and Section V concludes the paper.

II. R

ELATED WORK

In this section, two related work, regularized nearest points

(RNP) [34] and collaboratively regularized nearest points

(CRNP) [35], are reviewed. Both of them use the nearest

points generated by each face images set to compute the

distance between different subjects.

Denote

,1 , 2 ,

,,,

iii in

⎡

⎤

⎣

⎦

"X= x x x

as the data matrix of i-th

class, n

is the image number of i-th class, and x

i,k

is the k-th

feature vector of i-th class. Let

,,,

⎡⎤

⎣⎦

"Y=

yy y

be the

data matrix of the query class, with y

as its k-th feature vector.

RNP computed the nearest points class by class. For instance,

the coding phase for

i-th class and the query set is

,12

222

min s.t. 1, 1, ,

ii ik k i

σσ

−==≤≤

∑∑

αβ

αβ α β

(1)

where

i,k

is the k-th entry of

, X

and Y

are the generated

nearest points based on X

and Y, respectively. With the solved

coding coefficients

and

, then the distance between these

two sets is computed as

()

ii ii

∗∗

=+⋅−XYXY

(2)

where ||X||

is the nuclear norm of X, i.e., the sum of the

singular values of X.

Opposite to RNP, CRNP computed the virtual samples for

all classes at the same time

22 2

12 ,

22 2

min s.t. 1, 1

kik

λλ β α

−+ + = =

∑∑∑

αβ

βα α β

(3)

where X=[X

,…,X

] and

;

;…;

], where

is the

sub-coefficient vector associated to X

. Compared to RNP, the

nearest point for each class (e.g., X

for i-th class) and the

nearest point in the query set (i.e., Y

), are computed

simultaneously. After solving the coding coefficients,

(

i=1,…,c) and

, CRNP computed the between-class distance

()

ii ii i

∗∗

=+⋅−XYXY

(4)

Although RNP and CRNP have achieved promising results

on face recognition based on image sets, there are several

issues to be further considered.

The nearest point of the query set in RNP changes when

computing the between-set distance of query set to the gallery

sets of different classes. Since face images have small

between-class variance and big within-class variance, the

distance measured by RNP is easily over-fitting, e.g., the

between-set distance of wrong classes could also be small.

CRNP inherits some merits of collaborative representation

based classification [7], e.g., the across-class collaboration in

representation and competing of different classes in

classification. However, compared to RNP, CRNP didn’t

explicitly minimize the distance between the query set and the

gallery set of different classes. This would reduce the

discrimination ability of CRNP.

III.

JOINT REGULARIZED NEAREST POINT MODEL

In order to overcome the shortcomings of RNP and CRNP,

we proposed a joint regularized nearest points (JRNP) to

preserve the advantages of RNP and CRNP, and overcome

their shortcomings at the same time. In this section, we first

剩余6页未读，继续阅读

评论收藏

内容反馈

weixin_38737630

粉丝: 1
资源: 929

联合正则化最近点用于基于图像集的人脸识别

基于图像集匹配(ISM)的正则化最近点法在视频人脸识别中的应用.pdf

电信设备-基于稀疏正则化的实现多波段人脸图像信息融合的人脸识别方法.zip

hierNetGxE:开发该软件包以适应正则化回归模型，该模型称为hierNet GxE，用于基于层次化套索的基因-环境（GxE）交互作用的联合选择

基于联合正则化的稀疏磁共振图像重构

多光谱低阶结构化字典学习用于人脸识别

强边缘提取网络用于非均匀运动模糊图像盲复原.docx

基于Schatten p-范数的结构受限判别词典学习用于人脸识别

基于双组分分解的混合正则化方法用于部分纹理CS-MR图像重建

基于子空间类标传播和正则判别分析的单标记图像人脸识别.pdf

JointBayesian code

JRC:Joint- collective Representation Classification（JRC）;face recognition; 联合稀疏表示分类方法人脸识别

asm_aam.rar_AAM_Asm_hotunn_matlab 图像处理_yesterdayriz

用于图像分类的多核协作表示

鲁棒的联合稀疏回归与广义正交学习的图像特征选择

基于CORDIC的反正弦和反余弦计算的FPGA实现

使用3DCNN和卷积LSTM进行手势识别学习时空特征

BA无标度网络中的SIR模型

基于三次贝塞尔曲线的类汽车曲率连续路径平滑

基于机器学习的设备剩余寿命预测方法综述

基于FPGA的奇异值和特征值分解的快速实现。

磁悬浮系统自适应模糊PID控制器的设计

基于BP神经网络的人口预测

两轮平衡车的建模与控制研究

最新资源