Tensorflow实现简单CNN资源-CSDN文库

共7个文件

gz：5个

py：2个

需积分: 2 61 浏览量 2018-07-26 17:18:26 上传评论收藏 26.44MB RAR 举报

在本文中，我们将深入探讨如何使用TensorFlow框架构建一个简单的卷积神经网络（CNN），用于对MNIST手写数字识别数据集进行分类。MNIST数据集是机器学习领域的一个经典基准，包含60,000个训练样本和10,000个测试样本，每个样本都是28x28像素的灰度图像。我们需要导入必要的库，包括TensorFlow、numpy以及MNIST数据集。在`CNNModel.py`中，你可能会看到类似以下的代码段： ```python import tensorflow as tf from tensorflow.keras.datasets import mnist from tensorflow.keras.utils import to_categorical # 加载MNIST数据 (x_train, y_train), (x_test, y_test) = mnist.load_data() ``` 接下来，数据预处理是至关重要的。由于CNN模型期望输入是三维张量（通道数、高度、宽度），所以需要将28x28的图像转换为4D张量，同时归一化像素值到0-1之间。这可以在`CNNTrainandTest.py`中完成： ```python x_train = x_train.reshape(-1, 28, 28, 1).astype('float32') / 255 x_test = x_test.reshape(-1, 28, 28, 1).astype('float32') / 255 # 将类别标签转换为one-hot编码 y_train = to_categorical(y_train, 10) y_test = to_categorical(y_test, 10) ``` 然后，我们构建CNN模型。在这个例子中，模型包含4个卷积-池化层，一个全连接层，ReLU作为激活函数，以及dropout防止过拟合。模型定义可能如下所示： ```python model = tf.keras.models.Sequential([ tf.keras.layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)), tf.keras.layers.MaxPooling2D((2, 2)), tf.keras.layers.Conv2D(64, (3, 3), activation='relu'), tf.keras.layers.MaxPooling2D((2, 2)), tf.keras.layers.Conv2D(64, (3, 3), activation='relu'), tf.keras.layers.MaxPooling2D((2, 2)), tf.keras.layers.Conv2D(64, (3, 3), activation='relu'), tf.keras.layers.Flatten(), tf.keras.layers.Dense(128, activation='relu'), tf.keras.layers.Dropout(0.5), tf.keras.layers.Dense(10, activation='softmax') ]) ``` 模型编译后，我们可以设置优化器（如Adam）、损失函数（如交叉熵）和评估指标（如准确率），然后开始训练： ```python model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy']) history = model.fit(x_train, y_train, epochs=10, validation_data=(x_test, y_test)) ``` 我们可以使用训练好的模型对测试集进行预测，并查看模型的性能。在`CNNTrainandTest.py`中，你可能会找到类似以下的代码： ```python test_loss, test_acc = model.evaluate(x_test, y_test, verbose=2) print('\nTest accuracy:', test_acc) ``` 通过这个简单的CNN模型，我们可以达到相当高的准确率，通常超过98%，这展示了CNN在图像分类任务上的强大能力。同时，这个例子也展示了TensorFlow在构建深度学习模型时的灵活性和易用性。

资源推荐

资源详情

资源评论

收起资源包目录

DeepLearning.rar （7个子文件）

CNNModel.py 6KB

CNNTrainandTest.py 3KB

MNIST_data

t10k-images-idx3-ubyte.gz 1.57MB

mnist.pkl.gz 15.42MB

train-labels-idx1-ubyte.gz 28KB

train-images-idx3-ubyte.gz 9.45MB

t10k-labels-idx1-ubyte.gz 4KB

''' CNNModel.py 含4层卷积池化层，一层全连接层，激活函数ReLU，采用dropout和softmax函数做分类器 2018/7/18完成搭建，2018/7/19实现文件调用 -------copyright@GCN------- ''' import tensorflow as tf from tensorflow.examples.tutorials.mnist import input_data sess = tf.InteractiveSession() # 函数申明 def weight_variable(shape): # 正态分布，标准差为0.1，默认最大为1，最小为-1，均值为0 initial = tf.truncated_normal(shape, stddev=0.1) return tf.Variable(initial) def bias_variable(shape): # 创建一个结构为shape矩阵也可以说是数组shape声明其行列，初始化所有值为0.1 initial = tf.constant(0.1, shape == shape) return tf.Variable(initial) def conv2d(x, W): # 卷积遍历各方向步数为1，SAME：边缘外自动补0，遍历相乘 # padding 一般只有两个值 return tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME') def max_pool_2x2(x): # 池化卷积结果（conv2d）池化层采用kernel大小为2*2，步数也为2，SAME：周围补0，取最大值。数据量缩小了4倍 # x 是 CNN 第一步卷积的输出量，其shape必须为[batch, height, weight, channels]; # ksize 是池化窗口的大小， shape为[batch, height, weight, channels] # stride 步长，一般是[1，stride， stride，1] return tf.nn.max_pool(x, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME') def deep_CNN(xs): # x_image又把xs reshape成了28*28*1的形状，灰色图片的通道是1.作为训练时的input，-1代表图片数量不定 x_image = tf.reshape(xs, [-1, 28, 28, 1]) # 搭建网络 # 第一层卷积池化 # 第一二参数值得卷积核尺寸大小，即patch w_conv1 = weight_variable([5, 5, 1, 32]) b_conv1 = bias_variable([32]) # 32个偏置值 h_conv1 = tf.nn.relu(conv2d(x_image, w_conv1)+b_conv1) # 得到28*28*32 #h_conv1 = tf.nn.sigmoid(conv2d(x_image, w_conv1)+b_conv1) h_pool1 = max_pool_2x2(h_conv1) # 得到14*14*32 # 第二层卷积池化 # 第三个参数是图像通道数，第四个参数是卷积核的数目，代表会出现多少个卷积特征图像; w_conv2 = weight_variable([5, 5, 32, 64]) b_conv2 = bias_variable([64]) h_conv2 = tf.nn.relu(conv2d(h_pool1, w_conv2)+b_conv2) # 得到14*14*64 #h_conv2 = tf.nn.sigmoid(conv2d(h_pool1, w_conv2)+b_conv2) h_pool2 = max_pool_2x2(h_conv2) # 得到7*7*64 # 添加一层卷积池化层 w_conv3 = weight_variable([5, 5, 64, 128]) b_conv3 = bias_variable([128]) h_conv3 = tf.nn.relu(conv2d(h_pool2, w_conv3)+b_conv3) # 得到7*7*128 #h_conv3 = tf.nn.sigmoid(conv2d(h_pool1, w_conv2)+b_conv2) h_pool3 = max_pool_2x2(h_conv3) # 得到4*4*128 # 第四层全连接层 w_fc1 = weight_variable([4*4*128, 1024]) b_fc1 = bias_variable([1024]) # 将第三层卷积池化结果reshape成只有一行7*7*128个数据 # [n_samples, 4, 4, 128] == [n_samples, 4 * 4 * 128] h_pool3_flat = tf.reshape(h_pool3, [-1, 4*4*128]) # -1 表示不知道该填什么数字合适的情况下，可以选择 # 卷积操作，结果是1*1*1024，单行乘以单列等于1*1矩阵，matmul实现最基本的矩阵相乘 # 不同于tf.nn.conv2d的遍历相乘，自动认为是前行向量后列向量 h_fc1 = tf.nn.relu(tf.matmul(h_pool3_flat, w_fc1)+ b_fc1) #h_fc1 = tf.nn.sigmoid(tf.matmul(h_pool2_flat, w_fc1)+ b_fc1) # 对卷积结果执行dropout操作 keep_prob = tf.placeholder(tf.float32) h_fc1_dropout = tf.nn.dropout(h_fc1, keep_prob) # tf.nn.dropout(x, keep_prob, noise_shape=None, seed=None, name=None) # 第二个参数keep_prob: 设置神经元被选中的概率,在初始化时keep_prob是一个占位符 # 第五层输出操作 # 二维张量，1*1024矩阵卷积，共10个卷积，对应ys长度为10 w_fc2 = weight_variable([1024, 10]) b_fc2 = bias_variable([10]) y_conv = tf.nn.softmax(tf.matmul(h_fc1_dropout, w_fc2)+b_fc2) return y_conv, keep_prob # 最后返回softmax层的输出和神经元被选中的概率（dropout） ''' def main(): # 读取数据集 mnist = input_data.read_data_sets("MNIST_data/", one_hot=True) # 定义输入输出结构 # tf.placeholder可以理解为形参，用于定义过程，执行时再赋值 # dtype 是数据类型，常用的是tf.float32,tf.float64等数值类型 # shape是数据形状，默认None表示输入图片的数量不定，28*28图片分辨率 xs = tf.placeholder(tf.float32, [None, 28 * 28]) # 类别是0-9总共10个类别，对应输出分类结果 ys = tf.placeholder(tf.float32, [None, 10]) y_conv, keep_prob = deep_CNN(xs) # 定义损失函数 # cross_entropy = tf.reduce_mean(-tf.reduce_sum(ys*tf.log(y_conv), reduction_indices=[1])) cross_entropy = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=ys, logits=y_conv)) # AdamOptimizer通过使用动量（参数的移动平均数）来改善传统梯度下降，促进超参数动态调整 train_step = tf.train.AdamOptimizer(1e-4).minimize(cross_entropy) # 训练验证 # tf.equal(A, B)是对比这两个矩阵或者向量的相等的元素，如果相等返回True，否则返回False，返回的值的矩阵维度和A是一样的 correct_prediction = tf.equal(tf.argmax(y_conv, 1), tf.argmax(ys, 1)) # print(correct_prediction) # tf.arg_max(input, axis=None, name=None, dimension=None) 是对矩阵按行或列计算最大值（axis：0表示按列，1表示按行） accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32)) # 数据类型转换 print("start train") saver = tf.train.Saver() sess.run(tf.global_variables_initializer()) '''''' for i in range(1000): batch_x, batch_y = mnist.train.next_batch(50) if i % 100 == 0: train_accuracy = accuracy.eval(feed_dict={xs: batch_x, ys: batch_y, keep_prob: 1.0}) print('step:%d, training accuracy %g' % (i, train_accuracy)) train_step.run(feed_dict={xs: batch_x, ys: batch_y, keep_prob: 0.5}) saver.save(sess, "input_data/model") saver.restore(sess, "input_data/model") # 测试 print(accuracy.eval({xs: mnist.test.images, ys: mnist.test.labels, keep_prob: 1.0})) if __name__ == '__main__': main() '''

评论收藏

内容反馈