FEC.rar: FEC clustering, heuristic algorithm, heuristic clustering, complex network algorithm resource (CSDN Library)


Uploaded 2022-09-20 16:42:10 · 142 views · 2 KB · RAR

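In IT, clustering is an unsupervised learning method that groups the objects or samples of a data set by their similarity. FEC (expanded on this page as "Fast and Effective Clustering") is a heuristic clustering algorithm aimed in particular at complex-network problems. A heuristic algorithm relies on experience or a simplified model to find a near-optimal solution; it does not guarantee the global optimum. FEC is one such strategy, intended to solve clustering problems quickly and effectively.

As described here, the core idea of FEC combines features of hierarchical clustering and center-based clustering and builds the cluster structure iteratively. It first selects initial centers, then assigns each data point to the cluster of its nearest center. The centers are updated as the algorithm proceeds, until a preset stopping condition is met, such as reaching a target number of clusters or a threshold on within-cluster dissimilarity.

For complex networks, the stated advantage of FEC is efficiency on large-scale and high-dimensional data. A complex network usually contains many nodes and edges, and the nodes may be related in intricate ways. FEC is meant to identify tightly connected groups quickly, which helps in understanding the structure and properties of the network, for example its community structure and core nodes.

The steps of FEC are roughly as follows:

1. **Initialization**: select some of the data points as initial cluster centers.
2. **Assignment**: assign each data point to the cluster of its closest center.
3. **Update**: recompute the center of each cluster, usually as the geometric or weighted mean of its points.
4. **Merge**: merge nearby clusters according to a preset strategy, such as a distance threshold.
5. **Repeat the steps above** until the stopping condition is met.

In practice, FEC may be combined with other techniques, such as spectral clustering or density-based clustering, to improve the result. For example, network properties such as node degree or betweenness centrality can be used as weights so that the clustering better reflects the intrinsic structure of the network.

The "FEC.m" file in the archive is a MATLAB implementation of the FEC algorithm. MATLAB is a programming environment widely used for numerical computation, data analysis, and algorithm development, and it suits this kind of implementation well. Note that the FEC.m listing on this page does not maintain cluster centers as the step list above suggests: it orders the vertices by iterated transition probabilities, chooses a cut position, and recursively bisects the network (functions `NCMA`, `PL`, and `FC`), with the tolerance `err` as its only parameter. Reading the code is therefore the reliable way to learn the details of the algorithm and to adapt or extend it for a specific network-analysis task.

In summary, FEC is a heuristic clustering algorithm designed for complex-network problems. It is presented as efficient on large data sets and as applicable to data mining, social network analysis, and bioinformatics. Understanding it helps in recognizing structural patterns in complex networks and supports later analysis and decision-making.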
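The five steps listed above can be sketched as a short MATLAB script. This is only an illustration of the center-based loop as this description words it, under stated assumptions: the toy data `X`, the initial center count `k`, the merge threshold `mergeTol`, and the midpoint merge rule are all hypothetical choices, and the script is not the routine found in the bundled FEC.m.

```matlab
% Illustrative sketch of the five listed steps (NOT the code in FEC.m).
% X, k, mergeTol, maxIter and the midpoint merge rule are assumptions.
X = [0 0; 0 1; 1 0; 10 10; 10 11; 11 10];   % toy data, one point per row
k = 3;                                      % number of initial centers
mergeTol = 2.0;                             % merge centers closer than this
maxIter = 50;

% 1. Initialization: take every other data point as a center
%    (deterministic here; a random choice is equally possible).
C = X(1:2:2*k-1, :);

n = size(X, 1);
labels = zeros(n, 1);
for it = 1:maxIter
    % 2. Assignment: nearest center by squared Euclidean distance.
    m = size(C, 1);
    D2 = zeros(n, m);
    for c = 1:m
        delta = X - repmat(C(c,:), n, 1);
        D2(:, c) = sum(delta.^2, 2);
    end
    [~, newLabels] = min(D2, [], 2);

    % 3. Update: move each non-empty cluster's center to the member mean.
    for c = 1:m
        members = (newLabels == c);
        if any(members)
            C(c,:) = mean(X(members,:), 1);
        end
    end

    % 4. Merge: fuse the first pair of centers closer than mergeTol.
    merged = false;
    for a = 1:m-1
        for b = a+1:m
            if norm(C(a,:) - C(b,:)) < mergeTol
                C(a,:) = (C(a,:) + C(b,:)) / 2;   % simple midpoint
                C(b,:) = [];
                merged = true;
                break;
            end
        end
        if merged
            break;
        end
    end

    % 5. Repeat until nothing was merged and the labels are stable.
    if ~merged && isequal(newLabels, labels)
        break;
    end
    labels = newLabels;
end
disp(labels');
```

On this toy input the loop starts with three centers, merges the two that sit in the lower-left group, and ends with two clusters. The merge threshold controls how aggressively clusters are fused, so it would need tuning on real data.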


FEC.rar (1 file)

FEC.m 7KB

function main()
%%% experimental networks used in the paper %%%%%%%%%%%%%%%%%%%%%%%%
% W = load('football.gra');
% W = load('wkarate.gra');
% W = load('dolphin.gra');
% W = load('4,32,16,0.5.gra');
% this is the random network generated by computer,
% which includes 4 communities, each containing 32 vertices,
% and each vertex has 16 edges, among which 7.5 are inter-community edges
% W = load('30,30,20,0.6.gra');
% this is the random network generated by computer,
% which includes 30 communities, each containing 30 vertices,
% and each vertex has 20 edges, among which 8 are inter-community edges
W = load('0.4.gra');
% W = load('15,40,20,0.7.gra');
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
W = sparse(W);
% W = sym(W); % for the FEC, no need for symmetry
close all;

% show the original adjacency matrix
show_matrix(W,'The original adjacency matrix');
disp('press Enter to continue...');
pause;
disp('mining the communities...');

% control error, the only parameter required by this algorithm
err = 0.1^3;

%% initializing the hierarchy matrix
H = sparse(length(W),length(W));
id = 1; % id denotes the community number

%% initializing the name array
for c = 1:length(W)
    W_names(c) = c;
end

%% main body: mining the communities
tic;
[new_W,new_W_names,H,id] = NCMA(W,W_names,H,id,err);
disp(['The computation took ' num2str(toc) ' seconds']);

% output results: keep only the leading non-empty columns of H
for c = 1:length(H)
    s = sum(abs(H(:,c)));
    if s==0
        break;
    end
end
if c ==length(H)
    H = H(:,1:c);
else
    if c>1
        H = H(:,1:c-1);
    else
        H = zeros(length(H),1);
    end
end
for r = 1:length(new_W_names)
    new_H(r,:) = H(new_W_names(r),:);
end
n = length(new_W_names);
% flection maps an original vertex number to its new position
for r = 1:n
    flection(new_W_names(r)) = r;
end
A = sparse(n,n);
[i,j,V] = find(W);
for r = 1:length(i)
    x = flection(i(r));
    y = flection(j(r));
    A(x,y) = V(r);
end
show_matrix(new_H,'The community hierarchy structure');
WH = [new_W new_H];
show_matrix(WH,'The cleaned transfered matrix with community hierarchy');
WH = [A new_H];
show_matrix(WH,'The transfered matrix with community hierarchy');

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Recursively split the network into two parts and record the hierarchy in H
function [new_W,new_W_names,H,id] = NCMA(W,W_names,H,id,err)
[pos, EP1,EP2,new_W, new_W_names]= PL(W,W_names,err);
if (EP1 < 0.5) && (EP2 < 0.5) % stop criterion
    W_A = new_W(1:pos,1:pos);
    W_A_names = new_W_names(1:pos);
    H = makeH(H, id, W_A_names, 1, pos);
    id = id +1;
    if length(W_A) > 1
        [W_A,W_A_names,H,id]= NCMA(W_A,W_A_names,H,id,err);
    end
    W_B = new_W(pos+1:length(new_W),pos+1:length(new_W));
    W_B_names = new_W_names(pos+1:length(new_W));
    H = makeH(H, id, W_B_names, pos+1, length(new_W));
    id = id +1;
    if length(W_B) > 1
        [W_B,W_B_names,H,id] = NCMA(W_B,W_B_names,H,id,err);
    end
    new_W = zeros(length(new_W),length(new_W));
    new_W(1:pos,1:pos) = W_A;
    new_W(pos+1:length(new_W),pos+1:length(new_W)) = W_B;
    new_W_names = [W_A_names W_B_names];
end

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Write community id into the first free column of H for each named vertex
function H = makeH(H, id, names, first, last)
for c = first:last
    v = names(c-first+1);
    col = find_first_col(H, v);
    H(v,col) = id;
end

% Return the first column of row v in H that is still zero
function col = find_first_col(H,v)
for col = 1:length(H)
    if H(v,col) == 0
        break;
    end
end

% Order the vertices by iterated transfer probabilities, then find the cut.
% (The tolerance argument is named tol so it does not shadow the built-in error.)
function [pos, EP1, EP2, new_W, new_W_names]= PL(W,W_names,tol)
n = length(W);
% compute the degree vector
[i,j,V] = find(W);
NNZ = length(i);
D = zeros(1,n);
for r = 1:NNZ
    D(i(r)) = D(i(r)) + V(r);
end
% compute the one-step transfer probability matrix
P = sparse(W);
for r = 1:NNZ
    x = i(r);
    y = j(r);
    w = V(r);
    if D(x)~= 0
        P(x,y) = w/D(x);
    end
end
% select the vertex with maximum degree as the sink
% (sink and pi_sink are computed here but not used further below)
[s_D, IX]=sort(D);
sink = IX(n);
% computing the limit probability pi_sink
pi_sink = D(sink) / sum(D);
R0 = rand(n,1);
R1 = R0;
err = 1;
steps = 0;
while err> tol && steps <300
    S = zeros(n,1);
    for r = 1:NNZ % check r-th element W(i,j) = v
        S(i(r)) = S(i(r)) + R0(j(r))*V(r);
    end
    for r = 1:n
        if D(r) ~= 0
            R1(r) = S(r) / D(r);
        else
            R1(r) = 0;
        end
    end
    err = sum(abs(R1 - R0));
    steps = steps +1;
    R0 = R1;
end
% sort the L-transfer probabilities
[SR, IX] = sort(R1);
% transfer the original adjacency matrix
% compute the flection pos
for r = 1:n
    flection(IX(r)) = r;
    new_W_names(r) = W_names(IX(r));
end
new_W = sparse(n,n);
new_P = sparse(n,n);
[i1,j1,VP] = find(P);
for r = 1:NNZ
    x = flection(i(r));
    y = flection(j(r));
    new_W(x,y) = V(r);
    new_P(x,y) = VP(r);
end
% finding the pos that satisfies the pcut criterion
[pos, EP1, EP2] = FC(new_P);

% Find the cut position with minimum pcut in the reordered matrix B
function [pos, EP1, EP2] = FC(B)
n = length(B);
[i,j,V] = find(B);
NNZ = length(i);
if n<=1
    pos = 1;
    EP1 = 1;
    EP2 = 1;
    return;
end
% computing the trapping probability for each position, top-down
S1 = zeros(n,1);
T1 = zeros(n,1);
for r = 1:NNZ
    x = i(r);
    y = j(r);
    w = V(r);
    if y<=x
        S1(x) = S1(x) + w;
    else
        S1(y) = S1(y) + w;
    end
end
for r = 2:n
    S1(r) = S1(r-1)+S1(r);
    T1(r) = S1(r)/r;
end
T1 = T1(1:n-1);
% computing the trapping probability for each position, bottom-up
S2 = zeros(n,1);
T2 = zeros(n,1);
for r = NNZ:-1:1
    x = i(r);
    y = j(r);
    w = V(r);
    if y<=x
        S2(y) = S2(y) + w;
    else
        S2(x) = S2(x) + w;
    end
end
for r = n-1:-1:1
    S2(r) = S2(r+1)+S2(r);
    T2(r) = S2(r)/(n-r+1);
end
T2 = T2(2:n);
% computing the pcut for each position
for pos = 1:n-1
    E1(pos) = 1 - T1(pos);
    E2(pos) = 1 - T2(pos);
end
Pcut = zeros(n-1,1);
for pos =1:n-1
    Pcut(pos) = 2-T1(pos)-T2(pos);
end
% finding the minimum pcut
[pcut, pos] = min(Pcut);
EP1 = 1 - T1(pos);
EP2 = 1 - T2(pos);

% Display a matrix as a colored image in a new figure
function show_matrix(A,str)
figure(...
    'Color',[1 1 1],...
    'Name',str);
clf;
imagesc(A);
colorbar;

% Zero the diagonal and mirror every entry across it
function W = sym(W)
for i=1:length(W)
    W(i,i) = 0;
end
[i,j,V] = find(W);
for r = 1:length(i)
    W(j(r),i(r)) = V(r);
end
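
To see what the iteration inside `PL` does, it can help to run it on a graph small enough to check by hand. The script below is a minimal sketch under stated assumptions: the toy graph (two triangles joined by one edge), the variable names, and the fixed start vector are illustrative choices, whereas the listing starts from `rand(n,1)` and loops over the nonzero entries one by one. The update rule and the tolerance follow the listing.

```matlab
% Sketch of the PL iteration on a toy graph (assumed example, not from FEC.m).
% Two triangles {1,2,3} and {4,5,6} joined by the single edge 3-4.
edges = [1 2; 1 3; 2 3; 4 5; 4 6; 5 6; 3 4];
n = 6;
W = sparse([edges(:,1); edges(:,2)], [edges(:,2); edges(:,1)], 1, n, n);

D = full(sum(W, 2));       % degree of each vertex (no zero degrees here)
R = [1; 0; 0; 0; 0; 0];    % fixed start for repeatability (listing uses rand)
tol = 0.1^3;               % same value as err in main()

for steps = 1:300
    % Vectorized form of R1(r) = sum_j W(r,j)*R0(j) / D(r)
    Rnew = (W * R) ./ D;
    change = sum(abs(Rnew - R));
    R = Rnew;
    if change <= tol       % same stopping rule as the while loop in PL
        break;
    end
end

% Sorting the converged values gives the vertex order used for the cut.
[~, IX] = sort(R);
disp(IX');
```

With this start vector, the vertices of one triangle should end up adjacent in the sorted order, ahead of those of the other. That grouping is what lets `FC` search for a single cut position in the reordered matrix. The iteration is stopped by the tolerance before all values become equal, so the remaining differences still reflect the two groups.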
