旋转不变维降维算法资源-CSDN文库

需积分: 9 41 浏览量 2021-04-30 03:36:03 上传评论收藏 1.47MB PDF 举报

旋转不变维降维算法是一种新型的线性维降维方法，旨在解决传统子空间学习方法在面对数据集中的异常值和图像变化时敏感度较高的问题。该算法使用L2,1范数代替传统方法中的L2范数，目的是提高算法对于图像变化的鲁棒性，从而能够在分类任务中进行更为稳健的图像特征提取。 1. 传统子空间学习方法的局限性在介绍旋转不变维降维算法之前，需要了解传统子空间学习方法存在的一些内在局限性。传统方法通常使用L2范数作为度量，这导致它们对数据中的异常点以及目标对象的图像变化极为敏感。这种敏感性是由于L2范数在面对偏离主要数据流形的异常值时，往往会产生较大的测量误差。因此，这些方法可能无法准确地识别出数据中的主要模式，尤其是当数据受到噪声和不规则变化影响时。 2. L2,1范数与鲁棒性文章中提出的一系列基于L2,1范数的方法，旨在解决上述问题。L2,1范数在算法设计中扮演了关键角色。L2,1范数是L2范数的一种变体，可以对数据矩阵的每一行（即每一个数据样本）应用L2范数，并对结果求和。这种范数计算方式有助于算法在优化过程中忽略掉数据中的噪声和异常值，使算法对数据的异常值和图像变化保持一定的鲁棒性。 3. 算法设计与旋转不变维度减少框架为了进一步提高算法性能，研究者们设计了不同的算法，并提出了一种统一的旋转不变(RI)维度减少框架。该框架在某种程度上扩展了著名的图嵌入算法框架，使其更为通用。在此框架下，算法可以提取出对旋转变化具有不变性的图像特征，这在图像分类等任务中是非常有价值的。 4. 全局最优解文章对提出的算法框架进行了深入分析，证明了在计算并使用数据空间的所有正交投影时，优化问题存在全局最优解。这意味着算法能够在理论上确保达到最佳的降维效果。 5. 实验结果为了验证提出的旋转不变维降维算法的有效性，研究者们在一些流行图像数据集上进行了实验。实验结果表明，该算法在分类性能方面可以与以往基于L2范数的子空间学习算法相媲美，有时甚至能够获得更优异的性能。 6. 关键词本文涉及的关键技术术语包括维降维、图像分类、旋转不变子空间不变性图像特征提取、旋转学习等。这些术语概括了文章研究的主要领域和目标，说明了旋转不变维降维算法的研究背景和意义。旋转不变维降维算法通过引入L2,1范数，并设计出旋转不变的维度减少框架，有效地提高了传统子空间学习方法在面对数据集中的异常值和图像变化时的鲁棒性和分类性能。这一研究为维降维领域提供了新的理论基础和实践工具，对于图像处理、模式识别等领域的技术进步具有重要的推动作用。

资源推荐

资源详情

资源评论

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.

IEEE TRANSACTIONS ON CYBERNETICS 1

Rotational Invariant Dimensionality

Reduction Algorithms

Zhihui Lai, Yong Xu, Member, IEEE, Jian Yang, Linlin Shen, and David Zhang, Fellow, IEEE

Abstract—A common intrinsic limitation of the traditional sub-

space learning methods is the sensitivity to the outliers and the

image variations of the object since they use the L

norm as

the metric. In this paper, a series of methods based on the L

2,1

norm are proposed for linear dimensionality reduction. Since

the L

2,1

-norm based objective function is robust to the image

variations, the proposed algorithms can perform robust image

feature extraction for classiﬁcation. We use different ideas to

design different algorithms and obtain a uniﬁed rotational invari-

ant (RI) dimensionality reduction framework, which extends the

well-known graph embedding algorithm framework to a more

generalized form. We provide the comprehensive analyses to

show the essential properties of the proposed algorithm frame-

work. This paper indicates that the optimization problems have

global optimal solutions when all the orthogonal projections of

the data space are computed and used. Experimental results on

popular image datasets indicate that the proposed RI dimension-

ality reduction algorithms can obtain competitive performance

compared with the previous L

norm based subspace learning

algorithms.

Index Terms—Dimensionality reduction, image classiﬁcation,

image feature extraction, rotational invariant (RI) subspace

learning.

I. INTRODUCTION

EATURE extraction and dimensionality reduction meth-

ods have been paid much attention in past several decades.

Manuscript received May 23, 2015, revised November 11, 2015; accepted

May 24, 2016. This work was supported in part by the Natural Science

Foundation of China under Grant 61573248, Grant 61203376, Grant

61375012, Grant 61272050, Grant 61362031, Grant 61332011, and Grant

61370163, in part by the General Research Fund of Research Grants Council

of Hong Kong under Project 531708, in part by the Science Foundation

of Guangdong Province under Grant 2014A030313556, and in part by

the Shenzhen Municipal Science and Technology Innovation Council under

Grant JCYJ20150324141711637. This paper was recommended by Associate

Editor P. Tino.

Z. Lai is with the College of Computer Science and Software Engineering,

Shenzhen University, Shenzhen 518060, China, and also with the Hong Kong

Polytechnic University, Hong Kong (e-mail: lai_zhi_hui@163.com).

Y. Xu is with the Bio-Computing Research Center and Key Laboratory of

Network Oriented Intelligent Computation, Shenzhen Graduate School,

Harbin Institute of Technology, Shenzhen 518055, China (e-mail:

yongxu@ymail.com).

J. Yang is with the School of Computer Science, Nanjing

University of Science and Technology, Nanjing 210094, China (e-mail:

csjyang@njust.edu.cn).

L. Shen is with the College of Computer Science and Software Engineering,

Shenzhen University, Shenzhen 518060, China (e-mail: llshen@szu.edu.cn).

D. Zhang is with the Biometrics Research Centre, Department of

Computing, Hong Kong Polytechnic University, Hong Kong (e-mail:

csdzhang@comp.polyu.edu.hk).

Color versions of one or more of the ﬁgures in this paper are available

online at http://ieeexplore.ieee.org.

Digital Object Identiﬁer 10.1109/TCYB.2016.2578642

The classical linear dimensionality reduction methods such as

principle component analysis (PCA) [1]–[3] and linear dis-

criminant analysis (LDA) [4] and its variations [5], [6]are

widely used in the ﬁelds of pattern recognition, computer

vision, and data mining. It is known that these classical meth-

ods (i.e., PCA and LDA) only focus on the global structure

of a dataset in dimensionality reduction. With the fast devel-

opment of the manifold learning based techniques [7]–[10],

the local geometry structure has been taken into account in

designing different linear dimensionality reduction methods.

For example, locality preserving projection (LPP, also called

Laplacianfaces) [11] and orthogonal LPP [12] were proposed

for face recognition. Yan et al. [13] proposed a uniﬁed graph

embedding framework for linear and nonlinear dimension-

ality reduction, and marginal ﬁsher analysis (MFA) and its

extension [14] were proposed the for face and gait feature

extraction.

All the above methods, however, use the L

or Frobenius

norm based metric to characterize the scatter of the dataset,

thus these methods are sensitive to the outliers. Recently, other

measurement such as L

norm was widely explored due to its

robustness in different applications. For example, the L

norm

was used in sparse regression [15]–[17], sparse representation

classiﬁer designation [18], [19], subspace learning [20]–[25],

sparse subspace learning [26], [27], and sparse coding for

image representation [28]. In addition, the sparse L

graph

was also used in subspace learning, spectral clustering [29],

and label propagation [30]. But one drawback of these L

norm based methods is that the L

norm terms are just used

as the regularization and the L

or Frobenius norm terms are

still dominant in the optimization problems. Thus, these meth-

ods are still sensitive to the outliers in a certain sense in

dimensionality reduction.

Although various L

norm based subspace learning meth-

ods, such as those in [25] and [31]–[34], have shown promis-

ing performance, these methods still have some unsolved

problems. For example, some of them have very high com-

putational costs in computing the (local) optimal solutions,

and the theoretical relation between the optimal solutions of

norm based methods and the traditional/classical ones was

still unclear. Recently, a new measurement called rotational

invariance (RI) L

norm or L

2,1

norm has attracted much

attention in the ﬁelds of patter recognition and computer

vision [35], multitask learning and tensor factorization [36].

Previous studies show that the pure L

2,1

norm based regres-

sion is more robust than the L

norm regression in pat-

tern recognition [37]–[39], and thus was widely used in

2168-2267

 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.

See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.

2 IEEE TRANSACTIONS ON CYBERNETICS

Fig. 1. Development route of the dimensionality reduction methods

mentioned in this paper.

joint feature selection and subspace learning [40]–[42], image

recognition [43], Web image annotation [44], and multime-

dia data understanding [45]. The brief development route of

the dimensionality reduction methods mentioned in this paper

is shown in Fig. 1.

Robustness is an important issue in feature extraction. One

tractable method is to introduce the robust measurement, i.e.,

replace the Frobenius norm with other norms which are robust

to the outliers. A relative ideal norm for feature extraction

and recognition should contain the following aspects: 1) the

measurement should be robust to the outliers; 2) the derived

model using this measurement is easy to solve; and 3) it is

better to bridge the strong theoretical connections between the

previous methods and the new ones using the introduced norm.

Based on these three aspects, the RI L

norm or L

2,1

norm is

a very suitable candidate owing to its robustness to outliers and

different variations in the dataset [36]–[39], the simplicity for

solving the derived models as indicated in [38], and the close

theoretical connections to previous methods (which will be

shown on Section III of this paper).

This paper focuses on designing the robust linear dimen-

sionality reduction methods using the RI L

norm or L

2,1

norm. The difference between this paper and the previous

works is that this paper not only focuses on a set of concrete

robust subspace learning methods, but also builds a uniﬁed

framework to conclude the proposed methods using the RI L

norm or L

2,1

norm. From the algorithm aspect, the signiﬁcant

difference between the proposed algorithms and the previous

ones is the algorithms presented in this paper can achieve its

robustness automatically or intrinsically (without introducing

any other parameters).

The main contributions of this paper are as follows.

1) We propose four representative RI subspace learning

methods, i.e., RI PCA (RIPCA), RI LDA (RILDA), RI

LPP (RILPP), and RI MFA (RIMFA) for image feature

extraction. Besides these newly proposed algorithms, we

also propose a uniﬁed robust and RI subspace learning

framework. It is shown that the framework proposed in

this paper indeed extends the well-known graph embed-

ding framework proposed in [13] to a more general form

for linear dimensionality reduction.

2) The comprehensive analyses, including the convergence,

computational complexity, and the theoretical connec-

tions between this framework and the previous graph

embedding algorithm framework, are presented to show

the essential properties of the proposed algorithm frame-

work. And more importantly, the optimization problems

derived by the new metric are easy to solve and the

codes are also very easy to implement (the codes can be

downloaded from http://www.scholat.com/laizhihui).

3) Extensive experiments show that the proposed RI sub-

space learning algorithms perform better than the previ-

ous ones for image feature extraction in most cases.

The rest of this paper is organized as follows. In Section II,

four RI subspace learning algorithms are proposed. The the-

oretical analyses of the proposed framework are presented in

Section III. Experiments are carried out in Section IV to test

these RI subspace learning algorithms where the objects in

the databases have different variations, and the conclusions

are given in Section V.

II. P

ROPOSED ALGORITHMS

In this section, some notations are given at ﬁrst and then

the algorithms are presented. Let matrix X = [x

, x

,...,x

]

be the data matrix including all the training samples {x

}

i=1

∈

in its columns. In practice, the feature dimension m is

often very high. The goal of feature extraction is to transform

the data from the originally high-dimensional space to a low-

dimensional one. In other words, sample x ∈ R

should be

transformed into y ∈ R

(d  m)byusing

y = U

x ∈ R

(1)

where U = (u

, u

,...,u

) and u

(i = 1,...,d) is an

m-dimension column vector.

A. Deﬁnitions of Different Norms

For a given matrix A = [a

] ∈ R

n×m

, we denote the ith row

of A by A

. The Frobenius norm of matrix A is deﬁned as











i=1



j=1









i=1



. (2)

It can be seen that the sensitivity of the Frobenius norm comes

from the squared operation, which makes the larger values of

A



signiﬁcantly dominate the ﬁnal result. Differing from

the Frobenius norm, L

-norm of a matrix A is deﬁned as



i=1



j=1



. (3)

The L

2,1

-norm of a matrix is deﬁned as



2,1



i=1









j=1



i=1



. (4)

Since for any rotational matrix R,



2,1



2,1

-norm is RI (this is the reason why the proposed algo-

rithms are called as RI in this paper). As indicated in [38], the

robustness of the L

2,1

-norm or RI L

-norm is originated from

its special deﬁnition, where there is no squared operation. Note

that, if A degrades to be a high-dimensional row vector, its

2,1

-norm or RI L

-norm will degrade to the Frobenius norm.

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.

LAI et al.: RI DIMENSIONALITY REDUCTION ALGORITHMS 3

TABLE I

UMMARY OF THE ALGORITHMS

B. Discussions and the Motivations of This Paper

Yan et al. [13] proposed a general framework to unify the

subspace learning methods, including PCA, LDA, LPP, and

MFA, by using the model

min





−





(5)

s.t. tr



−



= cons (6)

where cons denotes the constant,

and

are the graphs

deﬁned on the dataset, and



.The

optimal solutions of the above problem can be given by the

following generalized eigenequation:



−



U = X



−



U. (7)

Many linear dimensionality reduction methods can be

included in this graph embedding algorithm framework [13].

However, as it is mentioned in Section I, the crucial draw-

back of this framework is its sensitiveness to the outliers

or the variations of the images since it uses the L

Frobenius norm as the metric [35], [36], [38], [39]. The draw-

back of the previous framework (or previous subspace learn-

ing algorithms) inspires us to develop new framework (or

algorithms) which is robust in linear dimensionality reduc-

tion. Motivated by the previous robust subspace learning

algorithms [35], [36], [38], [39], [41], [43], we introduce the

RI L

norm or L

2,1

norm to develop a set of algorithms for

robust linear dimensionality reduction. Four simple but effec-

tive and efﬁcient algorithms (i.e., RIPCA, RILDA, RILPP, and

RIMFA) are ﬁrst proposed and then a uniﬁed framework is

obtained for robust linear dimensionality reduction. For ease

of reading and comparison, all the detailed information of the

four algorithms presented in this paper will be summarized in

Table I and the algorithm steps are shown in Table II.

C. RIPCA

PCA aims to ﬁnd a set of projections that can characterize

the most of the variances of the data points by using the square

norm. But for RIPCA, the RI L

-norm (i.e., L

2,1

-norm) is used

as the measurement among the data points. According to the

deﬁnition of L

2,1

-norm and the formulations presented in [38],

the RI L

-norm total scatter value is deﬁned as follows:



i=1





−¯x







−¯x



···



−¯x





2,1

= tr





= tr





(8)

剩余13页未读，继续阅读

评论收藏

内容反馈

weixin_38736652

粉丝: 1
资源: 938

旋转不变维降维算法

旋转不变维数约简算法

旋转不变算法ESPRIT

旋转不变子空间类算法是空间谱估计中的典型算法

基于LBPV的旋转不变纹理分类算法

旋转不变子空间法，matlab源程序通过反复训练模板能有较高的识别率，用于时频分析算法

LPP.rar_LDA LPP_LPP如何降维_LPP源码_LPP算法_hungcis

基于降维波束空间的实值ESPRIT单基地MIMO雷达测角算法.docx

计算机视觉学习初识LBP算法.ppt

快速四阶累积量旋转不变子空间算法 (2009年)

基于2DGabor的人脸识别改进算法.pdf

LLE_流形学习_LLE数据降维_数据开发_线性局部嵌入算法_

sift特征在三维物体识别中的应用

机器学习与深度学习自测题（选择题）.docx

基于二维图像表示的人脸识别算法研究.pdf

SURF算法.docx

基于不变特征的运动视频序列自动配准算法

结合PCA的尺度不变特征变换(SIFT)算法，wolf_方法计算李雅普诺夫指数.zip

sift算法matlab源码

sift,lbp特征与PCA降维 k-means.pdf

SIFT以及扩展算法总结

计算机视觉学习初识LBP算法 .ppt

计算机视觉学习初识LBP算法PPT学习教案.pptx

matlab实现sift算法匹配

PCASIFT算法C++实现

最新资源