没有合适的资源?快使用搜索试试~ 我知道了~
具有复杂噪声的Tensor鲁棒主成分分析
需积分: 50 1 下载量 2 浏览量
2021-05-08
18:38:53
上传
评论
收藏 3.52MB PDF 举报
温馨提示
RPCA模型在各种应用程序中均取得了良好的性能。 但是,有两个缺陷限制了其有效性。 首先,它是为处理矩阵形式的数据而设计的,在某些实际情况下,它无法利用高阶张量数据的结构信息。 其次,它采用L1范数来处理噪声部分,这使其仅对稀疏噪声有效。 本文提出了一种基于CP分解和高斯混合(MoG)模型数据噪声的张量RPCA模型。 对原始数据使用张量结构使我们能够充分利用固有的先验结构,而MoG是连续分布的任何混合的通用近似值,这使我们的方法能够从很宽的范围内重新获得低维线性子空间。噪音或它们的混合物。 该模型是通过在变分贝叶斯框架下推断出的一种新提出的算法来解决的。 通过对合成数据和真实数据的大量实验,证明了我们的方法相对于现有的最新方法的优越性。
资源推荐
资源详情
资源评论
Tensor RPCA by Bayesian CP Factorization with Complex Noise
Qiong Luo
1,2
, Zhi Han
1∗
, Xi’ai Chen
1,2
, Yao Wang
3
, Deyu Meng
3
, Dong Liang
3
, Yandong Tang
1
1
State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences;
2
University of Chinese Academy of Sciences;
3
Xi’an Jiaotong University
{luoqiong, hanzhi, chenxiai, ytang}@sia.cn, {dymeng, liangdong}@mail.xjtu.edu.cn, yao.s.wang@gmail.com
Abstract
The RPCA model has achieved good performances in
various applications. However, two defects limit its effec-
tiveness. Firstly, it is designed for dealing with data in ma-
trix form, which fails to exploit the structure information of
higher order tensor data in some pratical situations. Sec-
ondly, it adopts L
1
-norm to tackle noise part which makes
it only valid for sparse noise. In this paper, we propose a
tensor RPCA model based on CP decomposition and model
data noise by Mixture of Gaussians (MoG). The use of ten-
sor structure to raw data allows us to make full use of the
inherent structure priors, and MoG is a general approxima-
tor to any blends of consecutive distributions, which makes
our approach capable of regaining the low dimensional lin-
ear subspace from a wide range of noises or their mixture.
The model is solved by a new proposed algorithm inferred
under a variational Bayesian framework. The superiority of
our approach over the existing state-of-the-art approaches
is demonstrated by extensive experiments on both of syn-
thetic and real data.
1. Introduction
In the fields of data analysis, principal component anal-
ysis (PCA) has been a classical and prevalent tool and has
extensive applications [16]. Originally, PCA aims to find
the best L
2
-norm low-rank approximation of a specified
matrix due to its smoothness and has many fast numerical
solvers [9, 24, 25, 26, 35, 41]. But L
2
-norm is only suitable
for Gaussian noise and too susceptible to outliers and gross
noise. To increase the robustness of PCA, a series of works
have been conducted in recent years [12, 17, 13, 19].
Inspired by the improvement of low-rank matrix analy-
sis [4, 5, 30], the robust principal component analysis (RP-
CA) [40] has been proposed for remedying the deficiency of
traditional PCA, in which, a high dimensional observation
matrix is assumed to consist of a low-rank component and
∗
Corresponding author.
a sparse component. Specifically, let Y ∈ R
m×n
be the ob-
servation data matrix, X ∈ R
m×n
be the low-rank matrix,
E ∈ R
m×n
be the sparse noise matrix, and then we can
describe the RPCA as the following optimization problem:
min
X,E
X
∗
+ λE
1
s.t. Y = X + E, (1)
where X
∗
=
r
σ
r
(X) denotes the nuclear norm of
X, σ
r
(X) (r =1, 2, ..., min (m, n)) is the r
th
singular val-
ue of X, E
1
=
ij
|e
ij
| denotes the L
1
-norm of E, and
e
ij
is the element in the i
th
row and j
th
column of E. It has
been proved that if L and S satisfy a certain incoherence
condition, the RPCA can uniquely extract X and E from Y
[6]. RPCA has played an important role in handling vari-
ous problems, including robust matrix recovery [40], face
alignment [27], subspace segmentation [21] and so forth.
Recently, it has been noticed that more and more modern
applications contain data with a higher order tensor struc-
ture, such as background extraction [7], face recognition
and representation [40, 34, 38, 2], structure from motion
[36], object recognition [37] and motion segmentation [39].
Matrices can be viewed as second order tensors, howev-
er, moving from matrices to higher order tensors presents
significant new challenges. A direct way to address these
challenges is to unfold tensors to matrices and then directly
apply the matrix RPCA model. Unfortunately, as recently
pointed out by [7], the multilinear structure is lost in such
matricization and as a result, methods constructed based on
these techniques often lead to suboptimal results. As such,
it is helpful to handle such raw data by using a direct ten-
sor representation, and several researches have been made
in the literatures [11, 20].
Moreover, L
1
-norm and L
2
-norm can characterize spe-
cific Laplace and Gaussian distributions, respectively, but
the real noise is generally not of a particular kind of noise
configurations, as already shown in [42]. Mixture of Gaus-
sians (MoG) is capable to commonly approximate wider
range of distributions due to its universal approximation ca-
pability, and Laplacian and Gaussian are regarded as a spe-
cial case of MoG [3]. It has been demonstrated that MoG
2017 IEEE International Conference on Computer Vision
2380-7504/17 $31.00 © 2017 IEEE
DOI 10.1109/ICCV.2017.537
5029
=
+
+
+
(a) Observation High-
order Tensor
(b) Low-rank Tensor
(c) MoG
Figure 1. TenRPCA-MoG sketch map. (a) is the observation ten-
sor, (b) is the low-rank tensor, (c) represents the complex noise.
can deal with complex noise in multiple computer vision
tasks, like image denoising and recovery [23, 42, 10].
In this paper, we propose a new tensor based RPCA
model with noise modeling by MoG, which is named as
TenRPCA-MoG. As shown in Fig. 1, the new model di-
vides the observation (noisy data) into a low-rank tensor
component (clean data) and the residue (noise), and models
them separately. It has the following contributions: firstly,
it treats raw high-order data as a tensor to reserve the com-
plete structure information, and uses CP tensor factorization
method to replace existing matrix factorization method to
extract low-rank structure in tensor data; secondly, it adopt-
s MoG to model noise which makes it have the ability to
fit a wide range of noises rather than Gaussian or Lapla-
cian noise; thirdly, we formulate the problem as a genera-
tive model under the Bayesian framework and propose an
algorithm based on the variational inference theory to infer
the posterior and effectively solve the problem.
2. Related work
In order to make full use of high-order data structure in-
formation, Cao et al [7] extended the RPCA to the tensor
form based on Tucker decomposition and successfully ap-
plied it to the background extraction. Compared with other
methods, this model can achieve good extraction results at a
very low sampling rate. However, it is designed to deal with
only two types of noises, i.e., Gaussian noise and impulse
noise, such that it is inadequate for more complex noise in
real scenarios.
Meng and De la Torre [23] firstly applied the MoG
to low-rank matrix factorization (LRMF) for adapting to
unknown noise. Consequently, Zhao et al [42] proposed
a RPCA-MoG model which used MoG to model RPCA
noise. Benefiting from powerful approximation capability
of MoG, they successfully applied it to the face modeling
and background subtraction. However, these methods are
designed based on matrix techniques and fail to take the ad-
vantage of structure prior of original data. In order to over-
come such defect, Chen et al [10] developed a low-rank ten-
sor factorization (MoG-WLRTF) model with MoG and got
a good result on image denoising.
The differences and improvements of our model against
Chen [10] and Cao [7] are specified as follows: Chen’s work
is a LRTF model, while our work is a RPCA model under
the Bayesian framework, which has better adaptivity to var-
ious problems and the rank can be automatically confirmed
by the algorithm itself; compared with our model, Cao’s
model lacks a universal noise modeling ability and is only
suitable for background extraction application.
3. Notations
Throughout the paper, lowercase letters (a, b, ···) de-
note scalars and bold lowercase letters denote vectors
(a, b, ···) with elements (a
i
,b
j
, ···). Uppercase letter-
s (A, B, ···) denote the matrices with column vectors
(a
:
i
,b
:
j
, ···) and elements (a
i
j
,b
i
j
, ···). High-order ten-
sors are represented by calligraphic letters (A, B, ···).A
K-mode tensor X∈R
I
1
×I
2
×···×I
K
is rank-one tensor
if it can be written as the out product of K vectors, i.e.,
X = a
1
◦ a
2
◦···◦a
K
.
4. TenRPCA-MoG model
The traditional RPCA model is formulated as Eq. 2. Ten-
sor RPCA has the similiar form as:
min
X ,E
X
∗
+ λE
1
s.t. Y = X + E, (2)
where Y∈R
f×g×m
is the observation data tensor, X∈
R
f×g×m
is the low-rank tensor and E∈R
f×g×m
is the
noise tensor. Here, the L
1
norm is specifically set for deal-
ing with noise under sparse assumption, Laplacian noise,
for example. However, as we introduced before, the real
noise is generally much more complex rather than a simple
Laplacian noise. In order to improve the robustness to com-
plex noise of tensor RPCA, we introduce MoG for noise
modeling and obtain TenRPCA-MoG as:
min
X ,E
X
∗
+ λE
M
s.t. Y = X + E, (3)
where symbol E
M
means that E is modeled with MoG.
As shown in Fig. 1 as a simulation, the first and the second
Gaussians are for describing dense noise, while the third
Gaussian is for approximating Laplacian noise.
In our model, the low- rank factorization of tensor X is
obtained by CP decomposition. Hence, the detailed intro-
duction of our model starts from CP decomposition with its
helpful properties.
4.1. CP decomposition
There are two general tensor decomposition frameworks,
Tucker and CANDECOMP/PARAFAC (CP). CP decompo-
sition can be considered as a higher-order generalization of
the matrix singular value decomposition [8]. It decompos-
es a tensor into a sum of rank-one component tensors [18].
5030
剩余9页未读,继续阅读
资源评论
weixin_38552536
- 粉丝: 6
- 资源: 918
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 微信小程序点餐系统微信小程序开发实战项目源码+数据库+详细文档说明(高分项目)
- Flutter jar包
- 基于微信平台的点餐系统小程序完整源码+文档说明+数据库(高分毕业设计项目)
- 基于微信小程序的点餐系统源码 (源码+详细文档说明)高分毕业设计项目
- 墙壁墙体发霉检测数据集VOC+YOLO格式2359张1类别.zip
- (反编译微信小程序)一键获取微信小程序源码(傻瓜式操作),使用了C#加nodejs制作.zip
- 计算机语言学中单调队列算法的C++实现
- 印度未来水资源需求与干预策略的建模与评估 - 超出人均用水比率模型的应用
- 鲜亮的高景观色彩,专注景观的小程序组件库.zip
- 历史新闻传播模型及其优化研究 - 社交网络与图模型的应用
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功