Unsupervised Facial Pose Grouping via Gabor
Subspace Affinity and Self-Tuning Spectral
Clustering
Xin Liu
Department of Computer Science and Technology
Huaqiao University, Xiamen, P.R. China
Email: xliu@hqu.edu.cn
Yiu-ming Cheung^{a,b,*}
a) Department of Computer Science, Hong Kong Baptist University, Hong Kong SAR, China
b) United International College, BNU – HKBU, Zhuhai, China
*) Corresponding Author; Email: ymc@comp.hkbu.edu.hk
Abstract—Facial pose grouping plays an important role in
video-based face recognition. In this paper, we present an
unsupervised facial pose grouping approach via Gabor subspace
affinity and self-tuning spectral clustering. First, we utilize a
local normalization method to reduce the impact of uneven
illumination, and then extract discriminative appearance
features via a Gabor wavelet representation. Next, a Gabor
subspace affinity method is presented to compute an affinity
matrix in terms of pairwise similarity, in which facial
frames of the same pose always share larger pairwise
similarities. Finally, we employ the self-tuning spectral clustering
algorithm to partition the affinity matrix, through which the number
of pose groups and the corresponding grouping results are
obtained automatically. Without any label priors, the proposed
approach is able to differentiate well among distinct facial poses
under uneven illumination, and the experimental results
demonstrate its satisfactory performance.
Index Terms—Facial pose grouping; Gabor subspace affinity;
self-tuning spectral clustering
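The self-tuning spectral clustering stage of the pipeline above can be illustrated with a minimal sketch. This is not the authors' implementation: it uses generic feature vectors in place of the Gabor subspace representations, builds the locally scaled affinity of Zelnik-Manor and Perona, and estimates the number of groups from the eigengap of the normalized graph Laplacian (a common heuristic consistent with choosing the group count automatically).

```python
# Minimal sketch (not the authors' code) of self-tuning spectral
# grouping: locally scaled affinity + eigengap-based group counting.
# Gabor subspace features are replaced here by generic vectors in X.
import numpy as np

def self_tuning_affinity(X, k=7):
    """Affinity with local scaling: A_ij = exp(-d_ij^2 / (s_i * s_j)),
    where s_i is the distance from x_i to its k-th nearest neighbor."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Local scale: distance to the k-th nearest neighbor (row 0 of the
    # sorted distances is the self-distance, so index k skips it).
    s = np.sort(D, axis=1)[:, min(k, len(X) - 1)]
    A = np.exp(-(D ** 2) / (np.outer(s, s) + 1e-12))
    np.fill_diagonal(A, 0.0)
    return A

def estimate_num_groups(A, max_k=10):
    """Pick the number of groups from the largest eigengap of the
    symmetrically normalized graph Laplacian."""
    d = A.sum(axis=1)
    d_inv = 1.0 / np.sqrt(np.maximum(d, 1e-12))
    L = np.eye(len(A)) - d_inv[:, None] * A * d_inv[None, :]
    w = np.linalg.eigvalsh(L)[:max_k]   # smallest eigenvalues, ascending
    return int(np.argmax(np.diff(w))) + 1
```

On data with two well-separated groups, the cross-group affinities vanish under local scaling, the Laplacian has two near-zero eigenvalues, and the eigengap heuristic recovers two groups without any label priors.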
I. INTRODUCTION
Face recognition has been an extensive research area in the
computer vision community. As one of the most successful
applications of computer vision in real life, it serves a large
number of commercial and law-enforcement purposes, including
surveillance, security, telecommunications, and human-computer
interaction. In the past, work on face recognition mainly
focused on a single probe face image captured in a relatively
controlled environment, and most successful recognition
approaches required an accurate alignment of the corresponding
feature vectors between the sample images to be compared.
Nevertheless, these single-image-based face recognition methods
have a limited application domain due to their sensitivity to
facial pose variations.
Evidently, as a common problem in the face recognition
community, natural face pose variation, e.g., moderate out-
of-plane head motion, provides important information about
emotional response and subtle facial actions, which should
generally be considered in designing a robust face recognition
algorithm. Under these circumstances, an intuitive way is to
obtain multiple views of the same face for recognition
purposes. Along this line, Li et al. [1] first utilized indepen-
dent component analysis (ICA) to learn a group of view-
specific subspace representations and subsequently performed
face recognition on the pre-learned multi-view face
examples, in which the span of the basis components in each
view-group defined a subspace of faces in that view. Later,
Huang et al. [2] first detected the coordinates of the eyes and
then utilized the k-means algorithm to determine the facial
pose. In addition, Mangai et al. [3] utilized Linear Discriminant
Analysis (LDA) to build a low-dimensional subspace for face
images sampled over a wide range of viewpoints, in which
the Mahalanobis distance was utilized to group the patterns for
subsequent recognition.
Recently, there has been increasing interest in video-
based face recognition because commonly available video
sources are able to provide more significant facial informa-
tion [4]. Intrinsically, recognition in video offers a great op-
portunity to integrate information temporally across the video
sequence, which may help to increase recognition rates
significantly. Along this line, Hadid et al. [5] first performed
unsupervised learning to extract the most representative sam-
ples (called “pose exemplars”) from the raw gallery sequences
and subsequently conducted probabilistic voting to recognize
the individuals in the videos. Volker et al. [6] first learned
a set of exemplars (i.e., cluster centers) incorporating
different poses to summarize the gallery video information,
and then utilized these exemplars as centers to model
probabilistic mixture distributions for recognition purposes.
Similarly, Lee et al. [7] first modeled each registered person with
facial pose variations by a group of low-dimensional appear-
ance manifolds (named pose manifolds) in the ambient image
space and then identified the person via these pose manifolds
efficiently. Later, they further represented the face model by a
complex nonlinear appearance manifold approximated by
a collection of linear subspaces [8], in which each subspace
incorporating the nearby poses was obtained by principal
component analysis (PCA) of frames from the training video
sequence. In addition, Chen et al. [9] first partitioned the
video sequence by a k-means clustering algorithm so that
frames with similar pose and illumination were in one
partition. Then, they learned a sub-dictionary with sparseness
2015 IEEE International Conference on Systems, Man, and Cybernetics
978-1-4799-8697-2/15 $31.00 © 2015 IEEE
DOI 10.1109/SMC.2015.475