k-means clustering (MATLAB implementation)


Uploaded 2017-11-30 09:38:29, 2 KB ZIP


k-means.zip (2 files)

k-means

kmeans_test.m 1KB

myKmeans.m 2KB

% This is my implementation of the k-means algorithm, taken straight from
% the pseudocode description of the same algorithm in the book
% 'Introduction to Information Retrieval' by Manning, Schütze
% and Raghavan.
%
% Inputs:  X       one input vector per row
%          K       number of clusters
%          maxIter maximum number of iterations
%          TOL     error tolerance used as a stopping threshold
% Outputs: C       cluster centers, one per row
%          I       cluster index of each row of X
%          iter    number of iterations performed
function [C, I, iter] = myKmeans(X, K, maxIter, TOL)

    % number of vectors in X and their dimension
    [vectors_num, dim] = size(X);

    % compute a random permutation of all input vectors
    R = randperm(vectors_num);

    % construct indicator vector (each entry is the cluster index
    % of the corresponding point in X)
    I = zeros(vectors_num, 1);

    % construct centers matrix
    C = zeros(K, dim);

    % take the first K points in the random permutation as the center seeds
    for k = 1:K
        C(k, :) = X(R(k), :);
    end

    % iteration count
    iter = 0;

    % recompute the clustering until the mean error falls below TOL
    % or the maximum number of iterations has been reached
    while true

        % assignment step: find the closest center for every point
        for n = 1:vectors_num
            % find closest center (Euclidean distance) to current input point
            minIdx = 1;
            minVal = norm(X(n, :) - C(minIdx, :), 2);
            for j = 2:K
                dist = norm(X(n, :) - C(j, :), 2);
                if dist < minVal
                    minIdx = j;
                    minVal = dist;
                end
            end
            % assign point to the closest center
            I(n) = minIdx;
        end

        % update step: each center becomes the mean of its assigned points
        for k = 1:K
            members = (I == k);
            if any(members)   % an empty cluster keeps its previous center
                % the explicit dimension keeps single-member clusters correct
                C(k, :) = mean(X(members, :), 1);
            end
        end

        % compute error: mean Euclidean distance from each point to its center
        RSS_error = 0;
        for idx = 1:vectors_num
            RSS_error = RSS_error + norm(X(idx, :) - C(I(idx), :), 2);
        end
        RSS_error = RSS_error / vectors_num;

        % increment iteration
        iter = iter + 1;

        % check stopping criteria
        if RSS_error < TOL
            break;
        end
        if iter >= maxIter
            break;
        end
    end

    disp(['k-means took ' int2str(iter) ' iterations']);
end
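The two inner steps of the loop in myKmeans.m (assigning each point to its closest center, then recomputing the centers) can also be written without the per-point loop. The following is only a minimal sketch of one such iteration, not part of the uploaded files: the toy data `X`, the starting centers `C`, and the helper names `D`, `delta` and `members` are all hypothetical, and squared Euclidean distance is assumed.

```matlab
% Hypothetical toy data: two well-separated groups of 2-D points.
X = [0 0; 0 1; 1 0; 10 10; 10 11; 11 10];
% Hypothetical starting centers, one row per cluster.
C = [0 0; 10 10];

K = size(C, 1);
N = size(X, 1);

% Assignment step: squared Euclidean distance from every point to every center.
D = zeros(N, K);
for k = 1:K
    delta = X - repmat(C(k, :), N, 1);  % N-by-dim differences to center k
    D(:, k) = sum(delta .^ 2, 2);       % row-wise squared distance
end
[~, I] = min(D, [], 2);                 % I(n) = index of the closest center

% Update step: each center becomes the mean of its assigned points.
for k = 1:K
    members = (I == k);
    if any(members)                     % an empty cluster keeps its center
        C(k, :) = mean(X(members, :), 1);
    end
end

disp(I');
disp(C);
```

With this toy data the first three points end up in cluster 1 and the last three in cluster 2, and the centers move to the mean of each group. Passing the dimension explicitly to `mean` keeps the update correct when a cluster holds a single point.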
