Adaptive Cascade Regression Model for Robust Face Alignment (CSDN Library resource)


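Before working through the research paper "Adaptive Cascade Regression Model for Robust Face Alignment", it helps to pin down a few concepts: robust face alignment, the adaptive cascade regression model, local features and the occlusion problem, and shape-indexed appearance together with the adaptive shape prior. Together these form the core of the paper.

Within face recognition technology, face alignment is the process of locating and aligning the key points of a face image; its goal is to find the positions of the main facial feature points (landmarks). In practice, because of the environment, illumination changes, and differences in facial expression, face images often suffer from occlusion and similar problems, which reduces the accuracy of landmark detection. Robust face alignment refers to techniques that still align the landmarks accurately when the face image is occluded or disturbed by other complicating factors.

Researchers have proposed many algorithms for robust face alignment, and the cascade regression model is one of the most widely used. A cascade regression model is an iterative procedure that improves landmark localization step by step, and it usually relies on local features to estimate landmark positions. Under occlusion, however, methods that depend on local features degrade and the alignment becomes inaccurate.

To address this, the paper proposes an adaptive cascade regression model. Its main innovation is the use of shape-indexed appearance to estimate the occlusion level of each landmark. In each iteration, every landmark receives a weight according to its occlusion level, which reduces the effect of occlusion on landmark localization. These weights can be viewed as adaptive weights on the shape-indexed features that reduce feature noise. The paper also designs an exemplar-based shape prior to suppress the influence of local image corruption.

The paper notes that face alignment matters because it underpins many face-oriented applications, such as face recognition, expression analysis, face animation, face synthesis, and 3D face modeling. Many landmark localization methods have been proposed over the past two decades; the most popular approach treats the facial landmarks as one whole shape and learns a general face shape model from it.

In the experimental part, the authors run extensive experiments on challenging benchmarks, and the results show that the proposed method outperforms existing state-of-the-art methods in both facial landmark localization and occlusion detection.

The main points can be expanded as follows:

1. Principle and characteristics of the cascade regression model: cascade regression is a stage-by-stage refinement algorithm. It builds a sequence of regressors that progressively refine the predicted landmark positions; each regressor fine-tunes the output of the previous one until the alignment is satisfactory. The model is iterative and adapts well to complex image feature learning tasks.
2. What is new in the adaptive cascade regression model: the paper improves on the conventional cascade regression model in three ways:
   - it introduces shape-indexed appearance to assess how occluded each landmark is and weights the landmarks accordingly;
   - it uses the occlusion-level weights to adjust the shape-indexed features adaptively, reducing the influence of noise and occlusion;
   - it designs an exemplar-based shape prior that helps the model handle locally corrupted images.
3. Role of shape-indexed appearance and the adaptive shape prior:
   - shape-indexed appearance captures how the image looks around the landmarks of the current shape and provides an estimate of occlusion;
   - the adaptive shape prior is built from a large set of exemplar shapes and gives the model a stable basis for decisions when the image data are imperfect.
4. Experiments and validation: the paper evaluates the model on several challenging benchmarks and verifies its robustness and accuracy in complex scenes.

Overall, the paper targets the limitations of conventional methods on occluded images, proposes an adaptive algorithm for face alignment, and supports it experimentally. These points are a useful reference for researchers and developers working on face recognition and computer vision.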


JOURNAL OF LATEX CLASS FILES, VOL. 4, NO. 5, APRIL 2015

Adaptive Cascade Regression Model for Robust Face Alignment

Qingshan Liu, Senior Member, IEEE, Jiankang Deng, Jing Yang, Guangcan Liu, Member, IEEE, and Dacheng Tao, Fellow, IEEE

Abstract—Cascade regression is a popular face alignment approach, and it has achieved good performances on the wild databases. However, it depends heavily on local features in estimating reliable landmark locations and therefore suffers from corrupted images, such as images with occlusion, which often exists in real-world face images. In this paper, we present a new adaptive cascade regression model for robust face alignment. In each iteration, the shape-indexed appearance is introduced to estimate the occlusion level of each landmark, and each landmark is then weighted according to its estimated occlusion level. Also, the occlusion levels of the landmarks act as adaptive weights on the shape-indexed features to decrease the noise on the shape-indexed features. At the same time, an exemplar-based shape prior is designed to suppress the influence of local image corruption. Extensive experiments are conducted on the challenging benchmarks, and the experimental results demonstrate that the proposed method achieves better results than state-of-the-art methods for facial landmark localization and occlusion detection.

Keywords—Robust Face Alignment, Cascade Regression Model, Shape-Indexed Appearance, Adaptive Shape Prior

I. INTRODUCTION

Face alignment has been an active research topic over the last two decades [1], because it potentially has significance for many face-oriented applications, such as face recognition [2], [3], [4], [5], expression analysis [6], [7], face animation [8], face synthesis [9], and 3D face modeling [10], [11]. A large number of facial landmark localization methods have been proposed in the past two decades [12], and the most popular solution is to take the ensemble of facial landmarks as a whole shape and learn a general face shape model from labeled training images [13]. In respect of this shape model, the previous works can be categorized as explicit shape model-based methods and implicit shape model-based methods.

Manuscript received May 6, 2016; revised August 22, 2016 and November 1, 2016; accepted November 26, 2016. This work was supported in part by the National Natural Science Foundation of China under Grant 61532009 and Grant 61272223, in part by the Natural Science Foundation of Jiangsu province under Grant BK2012045, in part by the Australian Research Council under Project DP-140102164 and Project FT-130101457. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Yonggang Shi.

Q. Liu, J. Deng, J. Yang and G. Liu are with the B-DAT Laboratory, the Department of Information and Control, Nanjing University of Information and Technology, Nanjing 210014 China.

D. Tao is with the Center for Quantum Computation and Intelligent Systems, Faculty of Engineering and Information Technology, University of Technology, Sydney, N.S.W. 2007, Australia.

Most early works on this topic address the face alignment problem by employing explicit shape constraints, and they learn a parametric shape model from the labeled training data. Representative works are Active Shape Model (ASM) [14] and Active Appearance Model (AAM) [15], [16], [17], in which the variation in face shape is modeled by Principal Component Analysis (PCA) [14], [15]. Other methods include Markov Random Field (MRF)-based modeling [18], [19], Graph-based model [20], [21], and exemplar-based modeling [22], [21]. In the context of medical image analysis, Zhang [23] developed an Adaptive Shape Composition method (ASC) to model shapes and implicitly incorporate the shape prior constraint effectively by utilizing sparse representation on the shape dictionary. ASC is able to handle non-Gaussian errors, model multi-modal distribution of shapes and recover local details. The problem is efficiently solved by an EM type of framework and an efficient convex optimization algorithm. Inspired by ASC, Liu [24] proposed a dual sparse constrained cascade regression model (DSC-CR) for robust facial landmark localization. During the regressor training, a sparse constraint is incorporated by Lasso [25], which can select the robust features and compress the size of the model. Another sparse shape constraint is incorporated between the regressors to suppress the ambiguity in the local features. Due to the limited capacity of explicit shape models, they tend to underperform on faces that have extreme variations in pose and expression [26].
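As a rough illustration of the explicit, PCA-based shape models mentioned above (in the spirit of ASM/AAM, not a reproduction of either), the sketch below fits a linear shape subspace to flattened landmark vectors and projects a noisy shape back onto it. The function names, the synthetic data, and the assumption that shapes are already aligned to a common frame are all illustrative choices, not taken from the paper.

```python
import numpy as np

def fit_pca_shape_model(shapes, n_components):
    """Fit a linear shape model. shapes: (M, 2N) flattened, pre-aligned shapes."""
    mean_shape = shapes.mean(axis=0)
    centered = shapes - mean_shape
    # Rows of vt are the principal directions of shape variation.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_components]                                  # (k, 2N)
    variances = singular_values[:n_components] ** 2 / (shapes.shape[0] - 1)
    return mean_shape, basis, variances

def project_to_shape_model(shape, mean_shape, basis):
    """Constrain a shape to the learnt subspace (orthogonal projection)."""
    params = basis @ (shape - mean_shape)                      # shape parameters
    return mean_shape + basis.T @ params

# Synthetic data (assumption): 200 shapes of 5 landmarks that vary along 2 hidden modes.
rng = np.random.default_rng(0)
modes = rng.normal(size=(2, 10))
coeffs = rng.normal(size=(200, 2))
shapes = 5.0 + coeffs @ modes

mean_shape, basis, variances = fit_pca_shape_model(shapes, n_components=2)
noisy = shapes[0] + rng.normal(scale=0.5, size=10)
constrained = project_to_shape_model(noisy, mean_shape, basis)
```

Projecting onto a low-dimensional subspace removes perturbations that do not look like plausible shape variation, which is also why such a model can struggle when the true shape lies outside the learnt subspace.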

In recent years, implicit shape constraints have attracted much attention. Their main objective is to learn shape regression functions that directly map the face image to the landmark coordinates without a parametric shape model, and good performances have been achieved on some standard benchmarks [27], [28], as a result of their ability to integrate contextual information and their flexibility in building the relationship between landmark points. There are two popular ways to learn such a regression function. One is based on deep network learning [28], [29], and the cascade regression model is another popular implicit shape model. Our work focuses on the cascade regression model, which aims to learn a series of face shape regressors and combine them in an additive manner to approximate the complex nonlinear mapping between the initial shape and the true shape [27]. However, the cascade regression model is sensitive to large occlusion, because occlusion not only affects the location updates around occluded regions but also has an effect on the location updates in non-occluded regions during shape regressor iterations [30].
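The additive combination of regressors described above can be sketched as a short loop. This is a minimal illustration, assuming linear stage regressors and a caller-supplied feature function; `run_cascade`, `toy_features`, and the dictionary used as an "image" are hypothetical names introduced only for this example.

```python
import numpy as np

def run_cascade(image, initial_shape, regressors, extract_features):
    """Apply the stage regressors additively, starting from the initial shape."""
    shape = np.asarray(initial_shape, dtype=float).copy()
    for R in regressors:
        phi = extract_features(image, shape)   # features indexed by the current shape
        shape = shape + R @ phi                # additive shape update
    return shape

# Toy stand-in (assumption): the "image" just stores the true shape and the
# "feature" is the offset to it. A real descriptor (SIFT, HoG, ...) would be
# computed from pixels around each landmark instead.
def toy_features(image, shape):
    return image["true_shape"] - shape

true_shape = np.array([30.0, 40.0, 70.0, 40.0, 50.0, 80.0])  # 3 landmarks, flattened (x, y)
initial_shape = np.full(6, 50.0)
regressors = [0.5 * np.eye(6) for _ in range(4)]             # each stage halves the error

final_shape = run_cascade({"true_shape": true_shape}, initial_shape, regressors, toy_features)
print(np.abs(final_shape - true_shape).max())
```

Because every stage reads features at the current shape estimate, a corrupted feature at one landmark feeds into the update of all coordinates through `R`, which is the sensitivity to occlusion that the paragraph above points out.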

In this paper, we present a new adaptive cascade regression model for robust face alignment. In contrast to previous works, we first use shape-indexed appearance, the normalised face appearance [15] at the current face shape, to estimate the occlusion level of landmarks in each iteration. Based on the estimated occlusion levels, an adaptive weighting scheme is applied to the shape-indexed features [31] to decrease the influence of corrupted landmarks. An exemplar-based shape prior model is also incorporated to smooth the updated shape adaptively. In contrast to the sparse shape constraint method proposed in [24], the proposed adaptive exemplar-based shape prior utilizes the occlusion level information, and the sparse reconstruction is only performed on the visible landmarks. The proposed method is evaluated on four challenging benchmarks, and the experimental results demonstrate its advantages over state-of-the-art methods for facial landmark localization and occlusion detection.
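One simple way to picture an adaptive weighting of shape-indexed features is to scale each landmark's local descriptor by how visible that landmark is estimated to be. The sketch below assumes occlusion levels in [0, 1] and uses the weight 1 − occlusion level; both the weighting function and the per-landmark feature layout are assumptions made for illustration, since this excerpt does not give the paper's exact formulation.

```python
import numpy as np

def weight_shape_indexed_features(per_landmark_features, occlusion_levels):
    """Down-weight the descriptor of each landmark by its estimated occlusion.

    per_landmark_features: (N, d) array, one local descriptor per landmark.
    occlusion_levels: (N,) values in [0, 1]; 1 means fully occluded (assumed convention).
    Returns the weighted descriptors flattened into one (N * d,) feature vector.
    """
    features = np.asarray(per_landmark_features, dtype=float)
    occlusion = np.clip(np.asarray(occlusion_levels, dtype=float), 0.0, 1.0)
    weights = 1.0 - occlusion                  # hypothetical weighting rule
    return (features * weights[:, None]).reshape(-1)

# Three landmarks with 4-dimensional descriptors: visible, half occluded, fully occluded.
weighted = weight_shape_indexed_features(np.ones((3, 4)), [0.0, 0.5, 1.0])
```

With this rule a fully occluded landmark contributes nothing to the regression input, while visible landmarks pass through unchanged.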

Our contributions are as follows: 1) Shape-indexed appearance is innovatively utilized to estimate the occlusion level for each landmark. 2) Based on the estimated occlusion levels, an adaptive weighting scheme is designed to suppress the influence of noise corruption efficiently. 3) We propose an efficient exemplar-based shape constraint to suppress the influence of local image corruption. 4) We conduct the experiments from face detection to face alignment, and the experimental results on challenging benchmarks show the power of our model for facial landmark localization and occlusion detection.

II. RELATED WORK

Cascade regression is a popular implicit shape model, which relies on shape-indexed local features and stacked regressors. The idea of regression was first proposed to estimate pose in [31]. In [26], an explicit shape regression (ESR) method was designed for facial landmark localization. In [27], a Supervised Descent Method (SDM) was proposed to learn cascade regressors with fast SIFT features, and the cascade regression procedure was interpreted from the perspective of gradient descent. To reduce the influence of inaccurate initializations, Yan [32] utilized the strategies of "learn to rank" and "learn to combine" from multiple hypotheses with a structural SVM framework. The Incremental Parallel Cascade Linear Regression (iPar-CLR) method was proposed in [33], which incrementally updates all the linear regressors in a parallel way instead of the traditional sequential manner. Each level is trained independently, using only the statistics of the previous level, and the generative model is gradually turned into a person-specific model by a recursive linear least-squares method. An $\ell_1$-induced Stagewise Relational Dictionary (SRD) model was proposed in [34] to learn consistent and coherent relationships between face appearance and shape for face images with large variations in view. In recent years, cascade regression has attracted much attention because its effectiveness has been demonstrated by extensive comparison [27], [35], [1].

In cascade regression, shape-indexed features have an important role, and many local feature descriptors have been successfully applied, including Haar wavelets [36], random ferns [37], Local Binary Features (LBF) [35], [38], SIFT [27], and HoG [32]. Since almost all the local feature descriptors are sensitive to occlusion, cascade regression cannot handle face images with large occlusion well. To overcome this shortcoming, Roh et al. [39] utilized over-sufficient facial feature detectors and the RANSAC-based method to infer occlusion. In [40], occlusion was modeled as a sparse outlier; however, the sparse error could occur from either the influence of the occluded landmarks or the perturbation of the visible landmarks. Robust Cascade Pose Regression (RCPR) [41] explicitly predicts the likelihood of landmark occlusion using a fixed occlusion prior on the divided blocks. The occlusion dictionary was deployed in [34] to deal with different kinds of partial face occlusion. In [42], a hierarchical deformable part model was proposed to model the occlusion of facial parts explicitly. Yu [30] proposed an occlusion-robust regression method by forming a consensus from a set of occlusion-specific regressors. In this paper, we propose a new method to estimate the occlusion levels around the landmarks by shape-indexed appearances, which in turn act as adaptive weights on both shape-indexed features and nearest exemplar-based shape constraint.
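To make the idea of an occlusion-aware, exemplar-based shape constraint more concrete, here is a simplified sketch. It swaps the paper's sparse reconstruction for a plain average of the k nearest exemplars, matches exemplars using only the landmarks judged visible, and then lets the prior take over at occluded landmarks. The function name, the visibility convention (1 = visible, 0 = occluded), the blending rule, and the assumption that all shapes share one aligned coordinate frame are illustrative, not the authors' formulation.

```python
import numpy as np

def exemplar_shape_prior(shape, exemplars, visibility, k=3):
    """Smooth a shape with its nearest exemplars, trusting visible landmarks more.

    shape: (N, 2) current landmark estimate.
    exemplars: (K, N, 2) training shapes in the same aligned frame (assumption).
    visibility: (N,) values in [0, 1]; 1 means visible, 0 means occluded.
    """
    visibility = np.asarray(visibility, dtype=float)
    sq_dist = ((exemplars - shape[None]) ** 2).sum(axis=2)   # (K, N) per-landmark distances
    scores = (sq_dist * visibility[None]).sum(axis=1)        # occluded landmarks are ignored
    nearest = np.argsort(scores)[:k]
    prior = exemplars[nearest].mean(axis=0)                  # (N, 2) nearest-neighbour average
    w = visibility[:, None]
    return w * shape + (1.0 - w) * prior                     # blend estimate and prior

base = np.array([[30.0, 40.0], [70.0, 40.0], [50.0, 60.0], [50.0, 80.0]])
exemplars = np.stack([base, base + 10.0, base + 20.0])
current = base.copy()
current[3] = [100.0, 100.0]                                  # corrupted landmark under occlusion
smoothed = exemplar_shape_prior(current, exemplars, visibility=[1, 1, 1, 0], k=1)
```

In this toy case the three visible landmarks select the first exemplar, and the occluded fourth landmark is replaced by that exemplar's position instead of the corrupted estimate.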

III. ADAPTIVE CASCADE REGRESSION MODEL

Cascade regression combines a sequence of regressors in an additive manner, and each regression function $f$ is learnt by minimizing the mean square error [27]:

$$f = \arg\min_{f} \sum_{i=1}^{M} \left\| (S_i^{*} - S_i^{0}) - f(I_i, S_i^{0}) \right\|_2^2, \tag{1}$$

where $M$ is the number of training samples, $I_i$ is the face image, $S_i^{*}$ is the ground truth shape, $S_i^{0}$ is the initialization of face shape, and $f$ is a single step regression function. Due to the complex variation of the human face, one step of regression $f$ is insufficient. Cascade regression combines a series of simple regression functions $f^{t}$, $t = 1, \ldots, T$ to approximate a complex nonlinear mapping between the initial shape $S^{0}$ and the ground truth shape $S^{*}$:

$$R^{t} = \arg\min_{R^{t}} \sum_{i=1}^{M} \left\| (S_i^{*} - S_i^{t-1}) - R^{t}\,\Phi(I_i, S_i^{t-1}) \right\|_2^2, \tag{2}$$

where $\Phi$ is the nonlinear feature descriptor and $R^{t}$, $t = 1, \ldots, T$ is the linear transform matrix which iteratively maps the current feature vectors $\Phi(I_i, S_i^{t-1})$ to the updated landmark location $(S_i^{*} - S_i^{t-1})$. The equation indicates that the displacement of each facial landmark is related to all other fiducial points, so in this way the shape constraint is incorporated implicitly. Since this is a linear least squares problem, $R^{t}$ has a closed-form solution

$$R^{t} = (S^{*} - S^{t-1})\,\Phi(I, S^{t-1})^{\top} \left( \Phi(I, S^{t-1})\,\Phi(I, S^{t-1})^{\top} \right)^{-1},$$

where the quantities written without the sample index $i$ stack all $M$ training samples column-wise.
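The closed-form least-squares solution for one stage can be sketched in NumPy as follows. This is a minimal illustration on synthetic data, assuming that training samples are stacked as columns; the function name `fit_stage_regressor` and the optional `ridge` term (a small regulariser for ill-conditioned features, set to zero by default) are additions for this example and are not part of the text above.

```python
import numpy as np

def fit_stage_regressor(delta_shapes, features, ridge=0.0):
    """Least-squares fit of one linear stage regressor R.

    delta_shapes: (2N, M) regression targets, ground truth minus current shape.
    features: (d, M) feature vectors extracted at the current shapes.
    ridge: optional Tikhonov term (assumption); 0.0 gives plain least squares.
    """
    d = features.shape[0]
    gram = features @ features.T + ridge * np.eye(d)   # (d, d), symmetric
    cross = features @ delta_shapes.T                  # (d, 2N)
    # Solve gram @ R.T = cross instead of forming the inverse explicitly.
    return np.linalg.solve(gram, cross).T              # (2N, d)

# Synthetic check: targets generated by a known linear map are recovered.
rng = np.random.default_rng(0)
num_samples, feat_dim, shape_dim = 50, 6, 4
features = rng.normal(size=(feat_dim, num_samples))
true_R = rng.normal(size=(shape_dim, feat_dim))
delta_shapes = true_R @ features

R_hat = fit_stage_regressor(delta_shapes, features)
```

Training a full cascade would repeat this fit once per stage, updating every training shape with the fitted regressor and re-extracting features before the next stage.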

Although cascade regression achieves good performance in face alignment, it is sensitive to occlusion because it depends heavily on the local features around each landmark. To successfully overcome this issue, we propose an adaptive cascade regression model which utilizes the occlusion level of each landmark $j = 1, \cdots, N$ to adjust the shape-

[The preview ends here; the remaining 11 pages are not shown.]

