算法一类支持向量机OC-SVM（2）_matlab中使用ocsvm资源-CSDN文库

共7个文件

py：3个

png：3个

xlsx：1个

22 浏览量 2024-03-13 10:19:38 上传评论收藏 64KB ZIP 举报

支持向量机（SVM，Support Vector Machine）是一种在机器学习领域广泛应用的监督学习模型，尤其在分类和回归任务中表现出色。它通过构建最大边距超平面来划分数据，使得不同类别的样本尽可能地被分隔开来，同时保证边界与最近样本点的距离最大化。而一类支持向量机（One-Class SVM，简称OC-SVM）是SVM的一个变种，它主要用于异常检测或无监督学习场景，即只需要训练一个类别的数据。标题“算法一类支持向量机OC-SVM（2）”暗示我们将探讨的是OC-SVM在实际应用中的优化方法。OC-SVM通常用于检测数据集中的异常点或者模式识别，因为它可以学习一个能够最好地拟合一类数据的决策边界，对于那些远离这个边界的点，我们可以认为它们可能是异常的。蜂群算法是一种受到自然界中昆虫群体行为启发的全局优化技术，例如蚂蚁算法、粒子群优化算法等。这些算法通常由大量的简单个体组成，每个个体在搜索空间中随机移动，寻找最优解。在OC-SVM的优化过程中，蜂群算法可以用来调整模型的参数，如核函数的参数、惩罚系数C等，以达到更好的分类效果。在压缩包文件中提到的"py"可能是指包含Python源代码的文件，这些代码可能实现了使用蜂群算法优化OC-SVM的过程。Python是一种广泛用于科学计算和数据分析的语言，其丰富的库如Scikit-Learn提供了便捷的支持向量机实现，而研究人员或开发者可以在此基础上利用各种优化算法进行改进。在OC-SVM的实现中，通常包括以下步骤： 1. 数据预处理：对数据进行清洗、归一化或标准化，以便于模型学习。 2. 初始化：设置蜂群算法的参数，如种群规模、迭代次数、个体的初始位置等。 3. 建立模型：使用Scikit-Learn等库构建OC-SVM模型，并选择合适的核函数，如线性、多项式或高斯核（RBF）。 4. 优化过程：运行蜂群算法，更新模型参数，寻找最佳的C和核函数参数组合。 5. 训练模型：使用优化后的参数训练OC-SVM模型。 6. 预测与评估：用训练好的模型对新数据进行预测，评估模型性能，如计算误报率（False Positive Rate, FPR）和真正率（True Positive Rate, TPR）等指标。通过这种优化方法，OC-SVM可以更好地适应复杂的数据分布，提高异常检测的准确性和鲁棒性。同时，Python源代码的分享也方便了其他研究者和开发人员复现实验结果或在自己的项目中应用这些技术。一类支持向量机OC-SVM是一种强大的异常检测工具，结合蜂群算法的优化可以进一步提升其性能。通过Python代码，我们可以深入理解OC-SVM的工作原理，学习如何利用优化算法调整模型参数，从而在实际问题中得到更优的解决方案。

资源推荐

资源详情

资源评论

收起资源包目录

蜂群算法优化一类支持向量机.zip （7个子文件）

abc-ocsvm.py 6KB

ocsvm.py 2KB

draw.py 725B

Figure_2.png 18KB

data.xlsx 112KB

Figure_3.png 18KB

Figure_1.png 15KB

#abc-svm Demo1 import numpy as np from sklearn.datasets import make_blobs from sklearn.svm import OneClassSVM from sklearn.metrics import roc_auc_score from sklearn.model_selection import train_test_split # 蜂群算法类 class ArtificialBeeColony: def __init__(self, fitness_func, param_ranges,kernel_options, population_size=30, max_generations=1): self.fitness_func = fitness_func self.param_ranges = param_ranges self.kernel_options = kernel_options # 添加kernel选项 self.population_size = population_size self.max_generations = max_generations self.population = self.initialize_population() self.best_solution = None self.best_fitness = -np.inf def initialize_population(self): population = [] for _ in range(self.population_size): solution = {} for param_name, (low, high, step) in self.param_ranges.items(): if isinstance(low, list): # Handle categorical variables solution[param_name] = np.random.choice(low) else: solution[param_name] = low + step * np.random.rand() * (high - low) # 随机选择一个kernel solution['kernel'] = np.random.choice(self.kernel_options) population.append(solution) return population def update_population(self, new_solutions): self.population = new_solutions current_best = max(new_solutions, key=lambda x: self.fitness_func(x)) current_fitness = self.fitness_func(current_best) if current_fitness > self.best_fitness: self.best_fitness = current_fitness self.best_solution = current_best def optimize(self): for generation in range(self.max_generations): new_population = [] for solution in self.population: new_solution = self.search_for_new_solution(solution) new_population.append(new_solution) self.update_population(new_population) print(f"Generation {generation + 1}: Best Fitness = {self.best_fitness}") return self.best_solution, self.best_fitness def search_for_new_solution(self, solution): new_solution = {} for param_name, value in solution.items(): if param_name == 'kernel': # 对于kernel参数，从kernel_options列表中随机选择一个值 new_solution[param_name] = np.random.choice(self.kernel_options) else: low, high, step = self.param_ranges[param_name] if isinstance(low, list): # Handle categorical variables new_value = np.random.choice(low) else: new_value = value + step * (np.random.rand() - 0.5) * (high - low) new_value = max(low, min(new_value, high)) # Ensure value is within bounds new_solution[param_name] = new_value return new_solution import pandas as pd # 创建数据集，路径需要修改成你自己的路径 # data = make_blobs(n_samples=500, centers=1, n_features=2, random_state=42) # 数据excel 读取 orignal = pd.read_excel('C:/Users/11003189/Desktop/py/data.xlsx', sheet_name='点') # 读取数据 print(orignal) col =['X','Y'] col1 =['Unnamed: 3'] data =orignal[col] target =orignal[col1] array =target.to_numpy() print(array) from sklearn.metrics import r2_score # 适应度函数 def fitness_function(params): # 选取X_train 为正类 # X_train, X_test, _, y_test = train_test_split(X, y, test_size=0.3, random_state=42) # y_test = np.where(y_test == 0, -1, 1) # 将标签转换为-1和1 nu = params['nu'] kernel = params['kernel'] gamma = params['gamma'] # 创建和训练One-Class SVM模型 ocsvm = OneClassSVM(nu=nu, kernel=kernel, gamma=gamma) # ocsvm.fit(X_train) ocsvm.fit(data,target) # 预测测试集并计算ROC AUC分数作为适应度值 # y_pred = ocsvm.predict(X_test) predictions = ocsvm.predict(data) # fitness =r2_score(y_test, y_pred) count =0 for index, element in enumerate(predictions): if(element ==array[index][0]): count=count+1 return count # return fitness # 参数搜索范围 param_ranges = { 'nu': (0.1, 0.9, 0.1), # nu 的搜索范围，步长为 0.1 # 'kernel': ['rbf', 'linear'], # 核函数的选择列表 'gamma': (0.01, 1, 0.1) # gamma 的搜索范围，步长为 0.1 } kernel_options = ['rbf', 'linear'] # 可能的kernel值列表 # 创建并运行人工蜂群算法 abc = ArtificialBeeColony(fitness_function, param_ranges, kernel_options) best_solution, best_fitness = abc.optimize() # Best Solution: {'nu': 0.10587043848153328, 'gamma': 0.2575143995541765, 'kernel': 'rbf'} # Best Fitness: 1.0 # 输出最佳解和最佳适应度 print(f"Best Solution: {best_solution}") print(f"Best Fitness: {best_fitness}") print(best_solution['nu']) print(best_solution['gamma']) print(best_solution['kernel']) # 最优模型创建 ocsvm = OneClassSVM(nu=best_solution['nu'], kernel=best_solution['kernel'], gamma=best_solution['gamma']) ocsvm.fit(data,target) predictions = ocsvm.predict(data) # 绘图 import matplotlib.pyplot as plt import numpy as np # 预测数据生成图 colors =np.where(predictions == -1, 0, 1)# 0黑色 1亮色 # print(colors) col2 =['X'] col3 =['Y'] x= orignal[col2] y= orignal[col3] fig, ax = plt.subplots() scatter = ax.scatter(x, y, c=colors) plt.colorbar(scatter) # 显示颜色条 plt.show()# 如果不预览就不用打开 # plt.savefig('predictions.png') # 真实数据生成图 colors =np.where(target == -1, 0, 1)# 0黑色 1亮色 # print(colors) col2 =['X'] col3 =['Y'] x= orignal[col2] y= orignal[col3] fig, ax = plt.subplots() scatter = ax.scatter(x, y, c=colors) plt.colorbar(scatter) # 显示颜色条 plt.show()# 如果不预览就不用打开 # plt.savefig('predictions.png')

评论收藏

内容反馈