VOLUME 88, N
UMBER 1 PHYSICAL REVIEW LETTERS 7J
ANUARY 2002
Algorithm for Data Clustering in Pattern Recognition Problems Based on Quantum Mechanics
David Horn and Assaf Gottlieb
School of Physics and Astronomy, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University,
Tel Aviv 69978, Israel
(Received 16 July 2001; published 20 December 2001)
We propose a novel clustering method that is based on physical intuition derived from quantum me-
chanics. Starting with given data points, we construct a scale-space probability function. Viewing the
latter as the lowest eigenstate of a Schrödinger equation, we use simple analytic operations to derive a
potential function whose minima determine cluster centers. The method has one parameter, determin-
ing the scale over which cluster structures are searched. We demonstrate it on data analyzed in two
dimensions (chosen from the eigenvectors of the correlation matrix). The method is applicable in higher
dimensions by limiting the evaluation of the Schrödinger potential to the locations of data points.
DOI: 10.1103/PhysRevLett.88.018702 PACS numbers: 89.75.Kd, 02.70. –c, 03.65.Ge, 03.67.Lx
Clustering of data is a well-known problem of pattern
recognition, covered in textbooks such as [1–3]. The prob-
lem we are looking at is defining clusters of data solely by
the proximity of data points to one another. This problem is
one of unsupervised learning, and is in general ill defined.
Solutions to such problems can be based on intuition de-
rived from physics. A good example of the latter is the
algorithm by [4] that is based on associating points with
Potts spins and formulating an appropriate model of sta-
tistical mechanics. We propose an alternative that is also
based on physical intuition, this one being derived from
quantum mechanics.
As an introduction to our approach we start with the
scale-space algorithm by [5] who uses a Parzen-window
estimator [3] of the probability distribution leading to the
data at hand. The estimator is constructed by associating
a Gaussian with each of the
N data points in a Euclidean
space of dimension d and summing over all of them. This
can be represented, up to an overall normalization, by
c共x兲 苷
X
i
e
2共x2x
i
兲
2
兾2s
2
, (1)
where x
i
are the data points. Roberts [5] views the maxima
of this function as determining the locations of cluster
centers.
An alternative, and somewhat related, method is support
vector clustering (SVC) [6] that is based on a Hilbert-space
analysis. In SVC, one defines a transformation from data
space to vectors in an abstract Hilbert space. SVC pro-
ceeds to search for the minimal sphere surrounding these
states in Hilbert space. We will also associate data points
with states in Hilbert space. Such states may be repre-
sented by Gaussian wave functions, whose sum is c共x兲.
This is the starting point of our quantum clustering (QC)
method. We will search for the Schrödinger potential for
which c共x兲 is a ground state. The minima of the potential
define our cluster centers.
The Schrödinger potential.—We wish to view c as an
eigenstate of the Schrödinger equation
Hc ⬅
μ
2
s
2
2
=
2
1 V共x兲
∂
c 苷 Ec . (2)
Here we rescaled H and V of the conventional quantum
mechanical equation to leave only one free parameter, s.
For comparison, the case of a single point at x
1
corre-
sponds to Eq. (2) with V 苷
1
2s
2
共x 2 x
1
兲
2
and E 苷 d兾2,
thus coinciding with the ground state of the harmonic os-
cillator in quantum mechanics.
Given c for any set of data points we can solve Eq. (2)
for V :
V共x兲 苷 E 1
s
2
2
=
2
c
c
苷 E 2
d
2
1
1
2s
2
c
X
i
共x 2 x
i
兲
2
e
2共x2x
i
兲
2
兾2s
2
.
(3)
Let us furthermore require that minV 苷 0. This sets the
value of
E 苷 2 min
s
2
2
=
2
c
c
(4)
and determines V 共x兲 uniquely. E has to be positive since
V is a non-negative function. Moreover, since the last term
in Eq. (3) is positive definite, it follows that
0 , E #
d
2
. (5)
We note that c is positive definite. Hence, being an eigen-
function of the operator H in Eq. (2), its eigenvalue E is the
lowest eigenvalue of H, i.e., it describes the ground state.
All higher eigenfunctions have nodes whose numbers in-
crease as their energy eigenvalues increase. (In quantum
mechanics, where one interprets jcj
2
as the probability dis-
tribution, all eigenfunctions of H have physical meaning.
Although this approach could be adopted, we have chosen
c as the probability distribution because of the simplicity
of algebraic manipulations.)
Given a set of points defined within some region of
space, we expect V 共x兲 to grow quadratically outside this
018702-1 0031-9007兾02兾88(1)兾018702(4)$15.00 © 2001 The American Physical Society 018702-1
代入
- 1
- 2
- 3
前往页