Sparse Subspace Clustering:
Algorithm, Theory, and Applications
Ehsan Elhamifar, Student Member, IEEE, and René Vidal, Senior Member, IEEE
• E. Elhamifar is with the Department of Electrical Engineering and Computer Science, University of California, Berkeley, USA. E-mail: ehsan@eecs.berkeley.edu.
• R. Vidal is with the Center for Imaging Science and the Department of Biomedical Engineering, The Johns Hopkins University, USA. E-mail: rvidal@cis.jhu.edu.
Abstract—Many real-world problems deal with collections of high-dimensional data, such as images, videos, text and web documents, DNA
microarray data, and more. Often, such high-dimensional data lie close to low-dimensional structures corresponding to several classes or
categories to which the data belong. In this paper, we propose and study an algorithm, called Sparse Subspace Clustering (SSC), to cluster
data points that lie in a union of low-dimensional subspaces. The key idea is that, among the infinitely many possible representations of a data
point in terms of other points, a sparse representation corresponds to selecting a few points from the same subspace. This motivates solving a
sparse optimization program whose solution is used in a spectral clustering framework to infer the clustering of the data into subspaces. Since
solving the sparse optimization program is in general NP-hard, we consider a convex relaxation and show that, under appropriate conditions
on the arrangement of the subspaces and the distribution of the data, the proposed minimization program succeeds in recovering the desired
sparse representations. The proposed algorithm is efficient and can handle data points near the intersections of subspaces. Another key
advantage of the proposed algorithm with respect to the state of the art is that it can deal directly with data nuisances, such as noise,
sparse outlying entries, and missing entries, by incorporating the model of the data into the sparse optimization program. We demonstrate the
effectiveness of the proposed algorithm through experiments on synthetic data as well as the two real-world problems of motion segmentation
and face clustering.
Index Terms—High-dimensional data, intrinsic low-dimensionality, subspaces, clustering, sparse representation, ℓ1-minimization, convex programming, spectral clustering, principal angles, motion segmentation, face clustering.
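For concreteness, the following is a minimal Python sketch of the pipeline the abstract describes: each point is expressed as a sparse combination of the other points, and the resulting coefficients define an affinity for spectral clustering. It substitutes a Lasso-style relaxation solved by iterative soft-thresholding for the exact program studied in the paper; the function names, the regularization weight lam, and the toy data are illustrative assumptions, not part of the paper.

import numpy as np
from scipy.cluster.vq import kmeans2

def lasso_ista(A, b, lam=0.01, n_iters=500):
    # minimize 0.5*||A c - b||^2 + lam*||c||_1 by iterative soft-thresholding
    step = 1.0 / np.linalg.norm(A, 2) ** 2   # 1 / Lipschitz constant of the gradient
    c = np.zeros(A.shape[1])
    for _ in range(n_iters):
        z = c - step * (A.T @ (A @ c - b))   # gradient step on the smooth term
        c = np.sign(z) * np.maximum(np.abs(z) - step * lam, 0.0)
    return c

def ssc(Y, n_clusters, lam=0.01):
    # Y: D x N data matrix whose columns are the data points
    N = Y.shape[1]
    C = np.zeros((N, N))
    for i in range(N):                       # express each point by the others
        idx = [j for j in range(N) if j != i]    # enforces a zero self-coefficient
        C[idx, i] = lasso_ista(Y[:, idx], Y[:, i], lam)
    W = np.abs(C) + np.abs(C).T              # symmetric affinity from |C|
    L = np.diag(W.sum(axis=1)) - W           # unnormalized graph Laplacian
    _, vecs = np.linalg.eigh(L)              # eigenvalues in ascending order
    _, labels = kmeans2(vecs[:, :n_clusters], n_clusters, minit='++')
    return labels

# toy data: 20 points on each of two lines (1-D subspaces) in R^3
rng = np.random.default_rng(0)
Y = np.hstack([np.outer([1.0, 0.0, 0.0], rng.standard_normal(20)),
               np.outer([0.0, 1.0, 1.0], rng.standard_normal(20))])
print(ssc(Y, 2))                             # two blocks of identical labels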
1 INTRODUCTION
High-dimensional data are ubiquitous in many areas of
machine learning, signal and image processing, computer
vision, pattern recognition, bioinformatics, etc. For instance,
images consist of billions of pixels, videos can have millions of
frames, text and web documents are associated with hundreds of
thousands of features, etc. The high-dimensionality of the data not
only increases the computational time and memory requirements
of algorithms, but also adversely affects their performance due to noise and the insufficient number of samples relative to the ambient space dimension, a phenomenon commonly referred to as the
“curse of dimensionality” [1]. However, high-dimensional data
often lie in low-dimensional structures instead of being uniformly
distributed across the ambient space. Recovering low-dimensional
structures in the data helps not only to reduce the computational cost and memory requirements of algorithms, but also to reduce the effect of high-dimensional noise in the data and to improve the
performance of inference, learning, and recognition tasks.
In fact, in many problems, data in a class or category can
be well represented by a low-dimensional subspace of the high-
dimensional ambient space. For example, feature trajectories of
a rigidly moving object in a video [2], face images of a subject
under varying illumination [3], and multiple instances of a handwritten digit with different rotations, translations, and thicknesses
[4] lie in a low-dimensional subspace of the ambient space. As a
result, the collection of data from multiple classes or categories
lies in a union of low-dimensional subspaces. Subspace clustering
(see [5] and references therein) refers to the problem of separating
data according to their underlying subspaces and finds numerous
applications in image processing (e.g., image representation and
compression [6]) and computer vision (e.g., image segmentation
[7], motion segmentation [8], [9], and temporal video segmentation [10]), as illustrated in Figures 1 and 2. Since data in
a subspace are often distributed arbitrarily and not around a
centroid, standard clustering methods [11] that take advantage
of the spatial proximity of the data in each cluster are not in
general applicable to subspace clustering. Therefore, there is a need for clustering algorithms that take into account the multi-subspace structure of the data.
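As a small numerical illustration of this point, consider points drawn from two lines through the origin in the plane: a centroid-based method such as k-means mixes the two lines, while grouping points by their direction recovers them. The data and names below are illustrative, not from the paper.

import numpy as np
from scipy.cluster.vq import kmeans2

rng = np.random.default_rng(1)
t = rng.uniform(-5.0, 5.0, 100)
X = np.vstack([np.outer(t[:50], [1.0, 0.2]),     # 50 points on span{(1, 0.2)}
               np.outer(t[50:], [0.2, 1.0])])    # 50 points on span{(0.2, 1)}
truth = np.repeat([0, 1], 50)

def agreement(labels):                           # label-permutation-invariant accuracy
    acc = (labels == truth).mean()
    return max(acc, 1.0 - acc)

_, km = kmeans2(X, 2, minit='++')                # proximity-based clustering
theta = np.arctan2(X[:, 1], X[:, 0]) % np.pi     # direction of each point, in [0, pi)
by_angle = (theta > np.pi / 4).astype(int)       # split the directions at 45 degrees

print("k-means:", agreement(km))                 # typically near 0.5: the lines get mixed
print("by direction:", agreement(by_angle))      # 1.0: each line has a single direction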
1.1 Prior Work on Subspace Clustering
Existing algorithms can be divided into four main categories: iterative, algebraic, statistical, and spectral clustering-based methods.
Iterative methods. Iterative approaches, such as K-subspaces
[12], [13] and median K-flats [14], alternate between assigning points to subspaces and fitting a subspace to each cluster. The main drawbacks of such approaches are that they generally require knowing the number and dimensions of the subspaces, and that
they are sensitive to initialization.
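For concreteness, here is a minimal sketch of that alternation, assuming linear subspaces through the origin with a known, common dimension d; the function name and parameters are illustrative choices, not taken from [12], [13].

import numpy as np

def k_subspaces(Y, k, d, n_iters=30, seed=0):
    # Y: D x N data matrix; k subspaces of common dimension d (both assumed known)
    rng = np.random.default_rng(seed)
    D, N = Y.shape
    # random orthonormal bases; the method is sensitive to this initialization
    bases = [np.linalg.qr(rng.standard_normal((D, d)))[0] for _ in range(k)]
    labels = np.zeros(N, dtype=int)
    for _ in range(n_iters):
        # assignment step: each point joins the subspace with the smallest
        # orthogonal-projection residual
        resid = np.stack([np.linalg.norm(Y - U @ (U.T @ Y), axis=0) for U in bases])
        labels = resid.argmin(axis=0)
        # fitting step: refit each subspace to its assigned points via a truncated SVD
        for j in range(k):
            Yj = Y[:, labels == j]
            if Yj.shape[1] >= d:
                bases[j] = np.linalg.svd(Yj, full_matrices=False)[0][:, :d]
    return labels, bases

# usage: labels, bases = k_subspaces(Y, k=2, d=1) for a D x N data matrix Y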
Algebraic approaches. Factorization-based algebraic approaches, such as [8], [9], [15], find an initial segmentation by thresholding the entries of a similarity matrix built from the factorization of the data matrix (a sketch of this construction follows this paragraph). These methods are provably correct when
the subspaces are independent, but fail when this assumption
is violated. In addition, they are sensitive to noise and outliers
in the data. Algebraic-geometric approaches, such as Generalized
Principal Component Analysis (GPCA) [10], [16], fit the data
with a polynomial whose gradient at a point gives the normal
vector to the subspace containing that point. While GPCA can deal with subspaces of different dimensions, it is sensitive to noise and outliers, and its complexity grows exponentially with the number and dimensions of the subspaces.
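As a toy illustration of the factorization-based similarity mentioned above, the following minimal Python sketch assumes independent subspaces and noise-free data; the function name, the rank argument, and the threshold are illustrative choices in the spirit of the factorization methods cited above, not a definitive implementation of any of them.

import numpy as np

def shape_interaction_similarity(Y, rank, thresh=1e-8):
    # Y: D x N data matrix; rank = sum of the subspace dimensions
    _, _, Vt = np.linalg.svd(Y, full_matrices=False)
    Vr = Vt[:rank].T                       # N x rank right singular vectors
    Q = np.abs(Vr @ Vr.T)                  # entries vanish across independent subspaces
    return (Q > thresh).astype(int)        # thresholded similarity matrix

# two independent lines in R^3: the similarity is exactly block diagonal
rng = np.random.default_rng(2)
Y = np.hstack([np.outer([1.0, 0.0, 0.0], rng.standard_normal(5)),
               np.outer([0.0, 1.0, 0.0], rng.standard_normal(5))])
print(shape_interaction_similarity(Y, rank=2))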