from keras import activations
from keras import backend as K
from keras.layers import Layer
"""
压缩函数,使用0.5替代hinton论文中的1,如果是1,所有的向量的范数都将被缩小。
如果是0.5,小于0.5的范数将缩小,大于0.5的将被放大
"""
def squash(x, axis=-1):
    squared_norm = K.sum(K.square(x), axis, keepdims=True) + K.epsilon()  # ||x||^2
    scale = K.sqrt(squared_norm) / (0.5 + squared_norm)                   # ||x|| / (0.5 + ||x||^2)
    return scale * x
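# Sanity check of the scaling above (illustrative values): an input capsule of
# norm 1.0 comes out with norm 1.0 / (0.5 + 1.0) ~= 0.67, one of norm 0.3 comes
# out with norm 0.09 / (0.5 + 0.09) ~= 0.15, and the output norm stays below 1.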
# Define our own softmax instead of K.softmax, because K.softmax does not let
# us choose the axis to normalise over.
def softmax(x, axis=-1):
    # Subtract the per-axis max before exponentiating for numerical stability.
    ex = K.exp(x - K.max(x, axis=axis, keepdims=True))
    return ex / K.sum(ex, axis=axis, keepdims=True)
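# Example (illustrative): for routing logits b of shape
# [batch, num_capsule, input_num_capsule], softmax(b, 1) normalises the
# coupling coefficients over the num_capsule axis, as used in call() below.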
# Margin loss: takes y_true and y_pred and returns the per-sample loss, so it
# can be passed directly to model.compile()/fit().
def margin_loss(y_true, y_pred):
    lamb, margin = 0.5, 0.1
    return K.sum(
        y_true * K.square(K.relu(1 - margin - y_pred)) +
        lamb * (1 - y_true) * K.square(K.relu(y_pred - margin)),
        axis=-1)
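# With margin = 0.1 and lamb = 0.5 this is the paper's margin loss
#   L_k = T_k * max(0, 0.9 - ||v_k||)^2 + 0.5 * (1 - T_k) * max(0, ||v_k|| - 0.1)^2,
# where y_pred holds the capsule lengths ||v_k||.  For example:
#   model.compile(optimizer='adam', loss=margin_loss, metrics=['accuracy'])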
class Capsule(Layer):
def __init__(self,
num_capsule,
dim_capsule,
routings=3,
share_weights=True,
activation='squash',
**kwargs):
        super(Capsule, self).__init__(**kwargs)  # pass standard Layer kwargs (e.g. name) up to the base class
self.num_capsule = num_capsule
self.dim_capsule = dim_capsule
self.routings = routings
self.share_weights = share_weights
        if activation == 'squash':
            self.activation = squash
        else:
            # Look up a standard Keras activation by name (e.g. 'relu').
            self.activation = activations.get(activation)
    # Create the layer's trainable weights.
    def build(self, input_shape):
        # input_shape: [batch, input_num_capsule, input_dim_capsule]
        input_dim_capsule = input_shape[-1]
        if self.share_weights:
            # One transformation matrix shared by all input capsules:
            # [1, input_dim_capsule, num_capsule * dim_capsule]
            self.kernel = self.add_weight(
                name='capsule_kernel',
                shape=(1, input_dim_capsule,
                       self.num_capsule * self.dim_capsule),
                initializer='glorot_uniform',
                trainable=True)
        else:
            # A separate transformation matrix for every input capsule.
            input_num_capsule = input_shape[-2]
self.kernel = self.add_weight(
name='capsule_kernel',
shape=(input_num_capsule, input_dim_capsule,
self.num_capsule * self.dim_capsule),
initializer='glorot_uniform',
trainable=True)
        super(Capsule, self).build(input_shape)  # mark the layer as built (required by Layer)
    # Forward pass (core logic): transform the input capsules, then run dynamic routing.
def call(self, inputs):
if self.share_weights:
#inputs: [batch, input_num_capsule, input_dim_capsule]
#kernel: [1, input_dim_capsule, num_capsule*dim_capsule]
#hat_inputs: [batch, input_num_capsule, num_capsule*dim_capsule]
hat_inputs = K.conv1d(inputs, self.kernel)
        else:
            # Unshared weights: local_conv1d applies a different kernel at each
            # input-capsule position.
            hat_inputs = K.local_conv1d(inputs, self.kernel, [1], [1])
batch_size = K.shape(inputs)[0]
input_num_capsule = K.shape(inputs)[1]
hat_inputs = K.reshape(hat_inputs,
(batch_size, input_num_capsule,
self.num_capsule, self.dim_capsule))
#hat_inputs: [batch, input_num_capsule, num_capsule, dim_capsule]
hat_inputs = K.permute_dimensions(hat_inputs, (0, 2, 1, 3))
#hat_inputs: [batch, num_capsule, input_num_capsule, dim_capsule]
b = K.zeros_like(hat_inputs[:, :, :, 0])
#b: [batch, num_capsule, input_num_capsule]
        # Dynamic routing by agreement.
        for i in range(self.routings):
            c = softmax(b, 1)  # coupling coefficients, normalised over the output capsules
            o = self.activation(K.batch_dot(c, hat_inputs, [2, 2]))
            if K.backend() == 'theano':
                o = K.sum(o, axis=1)
            if i < self.routings - 1:
                # Agreement between the candidate outputs and the prediction
                # vectors updates the routing logits.
                agreement = K.batch_dot(o, hat_inputs, [2, 3])
                if K.backend() == 'theano':
                    agreement = K.sum(agreement, axis=1)
                b += agreement

        return o
    def compute_output_shape(self, input_shape):  # tell Keras the shape this layer produces
        return (input_shape[0], self.num_capsule, self.dim_capsule)
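

# ---------------------------------------------------------------------------
# Minimal usage sketch (not part of the original layer code; the input size,
# filter counts and capsule sizes below are illustrative assumptions, e.g.
# 28x28x1 images with 10 classes).
# ---------------------------------------------------------------------------
if __name__ == '__main__':
    from keras.layers import Input, Conv2D, Reshape, Lambda
    from keras.models import Model

    inputs = Input(shape=(28, 28, 1))
    x = Conv2D(64, (3, 3), activation='relu')(inputs)
    x = Conv2D(64, (3, 3), activation='relu')(x)
    # Group the conv features into "primary capsules": [batch, n_capsules, 128]
    x = Reshape((-1, 128))(x)
    # 10 output capsules of dimension 16, refined by 3 routing iterations
    x = Capsule(num_capsule=10, dim_capsule=16, routings=3)(x)
    # The class score is the length (L2 norm) of each output capsule vector
    outputs = Lambda(lambda z: K.sqrt(K.sum(K.square(z), axis=-1)))(x)

    model = Model(inputs=inputs, outputs=outputs)
    model.compile(optimizer='adam', loss=margin_loss, metrics=['accuracy'])
    model.summary()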