量子聚类算法（matlab实现）资源-CSDN文库

共8个文件

m：6个

pdf：1个

mat：1个

量子聚类算法（matlab实现）

5星 · 超过95%的资源需积分: 41 111 浏览量 2012-03-09 22:34:21 上传评论 11 收藏 176KB RAR 举报

资源推荐

资源详情

资源评论

收起资源包目录

QC.rar （8个子文件）

fineCluster.m 588B

QCscript.m 1KB

plotClust.m 271B

qc.m 2KB

Algorithm for data clustering in pattern recognition problems based on quantum mechanics.pdf 111KB

spellman-demo.mat 449KB

clustMeasure.m 1KB

graddesc.m 644B

VOLUME 88, N

UMBER 1 PHYSICAL REVIEW LETTERS 7J

ANUARY 2002

Algorithm for Data Clustering in Pattern Recognition Problems Based on Quantum Mechanics

David Horn and Assaf Gottlieb

School of Physics and Astronomy, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University,

Tel Aviv 69978, Israel

(Received 16 July 2001; published 20 December 2001)

We propose a novel clustering method that is based on physical intuition derived from quantum me-

chanics. Starting with given data points, we construct a scale-space probability function. Viewing the

latter as the lowest eigenstate of a Schrödinger equation, we use simple analytic operations to derive a

potential function whose minima determine cluster centers. The method has one parameter, determin-

ing the scale over which cluster structures are searched. We demonstrate it on data analyzed in two

dimensions (chosen from the eigenvectors of the correlation matrix). The method is applicable in higher

dimensions by limiting the evaluation of the Schrödinger potential to the locations of data points.

DOI: 10.1103/PhysRevLett.88.018702 PACS numbers: 89.75.Kd, 02.70. –c, 03.65.Ge, 03.67.Lx

Clustering of data is a well-known problem of pattern

recognition, covered in textbooks such as [1–3]. The prob-

lem we are looking at is deﬁning clusters of data solely by

the proximity of data points to one another. This problem is

one of unsupervised learning, and is in general ill deﬁned.

Solutions to such problems can be based on intuition de-

rived from physics. A good example of the latter is the

algorithm by [4] that is based on associating points with

Potts spins and formulating an appropriate model of sta-

tistical mechanics. We propose an alternative that is also

based on physical intuition, this one being derived from

quantum mechanics.

As an introduction to our approach we start with the

scale-space algorithm by [5] who uses a Parzen-window

estimator [3] of the probability distribution leading to the

data at hand. The estimator is constructed by associating

a Gaussian with each of the

N data points in a Euclidean

space of dimension d and summing over all of them. This

can be represented, up to an overall normalization, by

c共x兲苷

2共x2x

兲

兾2s

, (1)

where x

are the data points. Roberts [5] views the maxima

of this function as determining the locations of cluster

centers.

An alternative, and somewhat related, method is support

vector clustering (SVC) [6] that is based on a Hilbert-space

analysis. In SVC, one deﬁnes a transformation from data

space to vectors in an abstract Hilbert space. SVC pro-

ceeds to search for the minimal sphere surrounding these

states in Hilbert space. We will also associate data points

with states in Hilbert space. Such states may be repre-

sented by Gaussian wave functions, whose sum is c共x兲.

This is the starting point of our quantum clustering (QC)

method. We will search for the Schrödinger potential for

which c共x兲 is a ground state. The minima of the potential

deﬁne our cluster centers.

The Schrödinger potential.—We wish to view c as an

eigenstate of the Schrödinger equation

Hc ⬅

1 V共x兲

∂

c 苷 Ec . (2)

Here we rescaled H and V of the conventional quantum

mechanical equation to leave only one free parameter, s.

For comparison, the case of a single point at x

corre-

sponds to Eq. (2) with V 苷

共x 2 x

兲

and E 苷 d兾2,

thus coinciding with the ground state of the harmonic os-

cillator in quantum mechanics.

Given c for any set of data points we can solve Eq. (2)

for V :

V共x兲苷 E 1

苷 E 2

共x 2 x

兲

2共x2x

兲

兾2s

(3)

Let us furthermore require that minV 苷 0. This sets the

value of

E 苷 2 min

(4)

and determines V 共x兲 uniquely. E has to be positive since

V is a non-negative function. Moreover, since the last term

in Eq. (3) is positive deﬁnite, it follows that

0 , E #

. (5)

We note that c is positive deﬁnite. Hence, being an eigen-

function of the operator H in Eq. (2), its eigenvalue E is the

lowest eigenvalue of H, i.e., it describes the ground state.

All higher eigenfunctions have nodes whose numbers in-

crease as their energy eigenvalues increase. (In quantum

mechanics, where one interprets jcj

as the probability dis-

tribution, all eigenfunctions of H have physical meaning.

Although this approach could be adopted, we have chosen

c as the probability distribution because of the simplicity

of algebraic manipulations.)

Given a set of points deﬁned within some region of

space, we expect V 共x兲 to grow quadratically outside this

代入

VOLUME 88, N

UMBER 1 PHYSICAL REVIEW LETTERS 7J

ANUARY 2002

region, and to exhibit one or several local minima within

the region. We identify these minima with cluster centers,

which seems natural in view of the opposite roles of the

two terms in Eq. (2): Given a potential function, it attracts

the data distribution function

c to its minima, while the

Laplacian drives it away. The diffused character of the

distribution is the balance of the two effects.

As an example we display results for the crab data set

taken from Ripley’s book [7]. These data, given in a

ﬁve-dimensional parameter space, show nice separation

of the four classes contained in them when displayed in

two dimensions spanned by the second and third principal

components [8] (eigenvectors) of the correlation matrix of

the data. The information supplied to the clustering algo-

rithm contains only the coordinates of the data points. We

display the correct classiﬁcation to allow for visual com-

parison of the clustering method with the data. Starting

with s

苷 1兾

2 we see in Fig. 1 that the Parzen proba-

bility distribution, or the wave-function c, has only a

single maximum. Nonetheless, the potential, displayed in

Fig. 2, already shows four minima at the relevant locations.

The overlap of the topographic map of the potential with

the true classiﬁcation is quite amazing. The minima are the

centers of attraction of the potential, and they are clearly

evident although the wave function does not display local

maxima at these points. The fact that

V共x兲苷 E lies above

the range where all valleys merge explains why c共x兲 is

smoothly distributed over the whole domain.

As s is being decreased more minima will appear in

V 共x兲. For the crab data, we ﬁnd two new minima as s

is decreased to one-half. Nonetheless, the previous

minima become deeper and still dominate the scene. The

new minima are insigniﬁcant, in the sense that they lie

at high values (of order E). Classifying data points to

clusters according to their topographic location on the

−3 −2 −1 0 1 2 3

−3

−2

−1

PC3

PC2

FIG. 1. Ripley’s crab data [7] displayed on a plot of their sec-

ond and third principal components with a superimposed topo-

graphic map of Roberts’ probability distribution for s 苷 1兾

surface of V 共x兲, roughly the same clustering assignment

is expected for a range of s values. One important

advantage of quantum clustering is that E sets the scale

on which minima are observed. Thus, we learn from

Fig. 2 that the cores of all 4 clusters can be found at V

values below 0.4E. In comparison, the additional maxima

of c, which start to appear at lower values of s, may lie

much lower than the leading maximum and may be hard

to locate numerically.

Principal component analysis (PCA).— In our example,

data were given in some high-dimensional space and we

analyzed them after deﬁning a projection and a metric,

using the PCA approach. The latter deﬁnes a metric that is

intrinsic to the data, determined by second order statistics.

But, even then, several possibilities exist, leading to non-

equivalent results.

Principal component decomposition can be applied both

to the correlation matrix C

苷具x

典 and to the covari-

ance matrix

苷具具具共x

2 具x典

兲共x

2 具x典

兲典典典苷 C

2 具x典

具x典

(6)

In both cases averaging is performed over all data points,

and the indices indicate spatial coordinates from 1 to d.

The principal components are the eigenvectors of these

matrices. Thus we have two natural bases in which to

represent the data. Moreover, one often renormalizes the

eigenvector projections, dividing them by the square roots

of their eigenvalues. This procedure is known as “whiten-

ing,” leading to a renormalized correlation or covariance

matrix of unity. This is a scale-free representation that

would naturally lead one to start with s 苷 1 in the search

for (higher order) structure of the data.

−2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5

−3

−2

−1

PC3

PC2

FIG. 2. A topographic map of the potential for the crab data

with s 苷 1 兾

2, displaying four minima (denoted by crossed

circles) that are interpreted as cluster centers. The contours of

the topographic map are set at values of V 共x兲兾E 苷 0.2, 0.4, 0.6,

0.8, 1.

018702-2 018702-2

评论收藏

内容反馈

zmdxiangpi

2014-10-13

怎么都是错误啊
小新ss23

2014-04-14

可以运行，希望对学习有帮助
liqiang4113

2014-07-21

聚类效果还可以
哆啦C梦GO

2015-05-26

还好哎，对我毕设有用啊，打算在这基础上改改成为自己的东西
白绝

2014-01-12

可以的，聚类的感觉也不错。

前往

页

gao675597253

粉丝: 23
资源: 41

量子聚类算法（matlab实现）

量子计算算法的matlab实现

量子聚类（matlab实现）

matlab 量子粒子群算法的函数寻优，很详细注解，可运行

编程代码大全中文(超经典)

iFunny阿拉伯语「iFunny Arabic」-crx插件

量子聚类--matlab

量子聚类Matlab工具箱

模式识别算法MATLAB实现

【MATLAB工具箱集锦】- 聚类分析工具箱FuzzyClusteringToolbox.zip

一次性分享一些.NET板网友高频经常索要的源代码

matlab代码

空白文件生成器(EmptyFilesCreator)v1.0英文绿色免费版

代码大全中文版

源代码计算行数计数器

30个数学建模智能算法及MATLAB程序代码.zip

【MATLAB工具箱集锦】- 蚁群算法工具箱.rar

【MATLAB工具箱集锦】- 量子波函数演示工具箱.rar

【MATLAB工具箱集锦】- 脑MRI肿瘤的检测与分类.zip

【MATLAB工具箱集锦】- 海洋要素计算工具箱seawater.rar

01字符串源代码

01126426140 صيانة زانوسي-crx插件

非常小的恶搞电脑代码

代码大全—完整TXT文档

好玩的游戏代码 好玩的游戏代码

【MATLAB工具箱集锦】-鱼群算法工具箱OptimizedAFSAr.zip

【MATLAB工具箱集锦】-遗传算法工具箱.rar

【MATLAB工具箱集锦】-PSORT粒子群优化工具箱.zip

【MATLAB工具箱集锦】- othercolor配色工具包.rar

【MATLAB工具箱集锦】- 元胞自动机.rar

最新资源

好玩的游戏代码好玩的游戏代码