mnist.rar: MNIST handwritten digit recognition with Python deep learning and TensorFlow


Tags: mnist, Python deep learning, handwritten digit recognition

5 stars (above 95% of resources) · 48 views · Uploaded 2022-09-20 23:40:25 · 2 KB RAR

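The "mnist.rar" in the title is an archive containing a Python file that builds a deep learning model for the MNIST dataset. MNIST is a classic machine learning dataset used to train and test handwritten digit recognition models. It contains 60,000 training samples and 10,000 test samples, each a 28x28-pixel grayscale image of a handwritten digit.

The description refers to handwritten digit recognition with deep learning. Deep learning is a family of machine learning techniques built on multi-layer neural networks, loosely inspired by the brain, and it works particularly well on complex data such as images and speech. In this project Python is the programming language and TensorFlow is the deep learning framework used to build, train, and deploy the neural network.

The "Python deep learning" tag indicates that the whole project is written in Python, which is widely used in data science and machine learning because of its rich scientific computing libraries and readability. TensorFlow is imported as an ordinary Python library and provides the tools needed to build neural networks.

The "handwritten digit recognition with TensorFlow" part of the title means that TensorFlow is used to build a model that recognizes the handwritten digits in MNIST. A project of this kind usually involves the following steps:

1. Data preprocessing: load the MNIST dataset, normalize the images, and convert them to a format suitable as network input.
2. Model construction: design a deep learning model, such as a convolutional neural network (CNN) or a fully connected network, that can extract features from the images.
3. Training: iteratively optimize the model on the training set, adjusting the weights to minimize the loss function.
4. Validation and evaluation: check performance on a validation set to guard against overfitting, then measure generalization on the test set.
5. Prediction: use the trained model to recognize new handwritten digit images.

The file name "mnist.py" suggests the project's main script, but the module docstring says otherwise: the file is used by the various "fully_connected_*.py" scripts and is not meant to be run on its own. It covers only the graph-building part of steps 2 to 4, through four functions: inference() builds a fully connected network with two ReLU hidden layers and a linear output layer that produces logits for the 10 digit classes, loss() adds the cross-entropy loss, training() adds the gradient descent update, and evaluation() counts correct predictions. Data loading, preprocessing, and the training loop are not part of this file. The code also uses the pre-1.0 TensorFlow API (for example tf.scalar_summary), so it needs porting before it will run on current TensorFlow releases.

In summary, the project uses Python and TensorFlow to recognize handwritten digits with deep learning, and "mnist.py" holds the model definition, loss, training op, and evaluation op. For beginners it is a good practice project: it helps in understanding the basic principles and workflow of deep learning and builds familiarity with both Python and TensorFlow.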
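The preprocessing step described above can be sketched in plain Python. This is a minimal illustration, not code from the archive: the function name preprocess and the sample image are hypothetical, and it assumes raw pixels arrive as integers in the range 0 to 255 arranged as 28 rows of 28 values.

```python
IMAGE_SIZE = 28
IMAGE_PIXELS = IMAGE_SIZE * IMAGE_SIZE  # 784 input values per image


def preprocess(image_rows):
    """Flatten a 28x28 grid of 0-255 pixel values into 784 floats in [0, 1]."""
    if len(image_rows) != IMAGE_SIZE or any(
            len(row) != IMAGE_SIZE for row in image_rows):
        raise ValueError("expected a 28x28 image")
    # Row-major flattening: pixel (r, c) lands at index r * IMAGE_SIZE + c.
    return [pixel / 255.0 for row in image_rows for pixel in row]


# Hypothetical input: a blank image with one fully bright pixel at row 3, column 5.
blank = [[0] * IMAGE_SIZE for _ in range(IMAGE_SIZE)]
blank[3][5] = 255
flat = preprocess(blank)
```

The flattened length matches IMAGE_PIXELS in mnist.py, which is the width of the images input that inference() multiplies by the first weight matrix.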


mnist.rar (1 file)

mnist.py 5KB

# Copyright 2015 The TensorFlow Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================

"""Builds the MNIST network.

Implements the inference/loss/training pattern for model building.

1. inference() - Builds the model as far as is required for running the network
forward to make predictions.
2. loss() - Adds to the inference model the layers required to generate loss.
3. training() - Adds to the loss model the Ops required to generate and
apply gradients.

This file is used by the various "fully_connected_*.py" files and not meant to
be run.
"""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import math

import tensorflow as tf

# The MNIST dataset has 10 classes, representing the digits 0 through 9.
NUM_CLASSES = 10

# The MNIST images are always 28x28 pixels.
IMAGE_SIZE = 28
IMAGE_PIXELS = IMAGE_SIZE * IMAGE_SIZE


def inference(images, hidden1_units, hidden2_units):
  """Build the MNIST model up to where it may be used for inference.

  Args:
    images: Images placeholder, from inputs().
    hidden1_units: Size of the first hidden layer.
    hidden2_units: Size of the second hidden layer.

  Returns:
    softmax_linear: Output tensor with the computed logits.
  """
  # Hidden 1
  with tf.name_scope('hidden1'):
    weights = tf.Variable(
        tf.truncated_normal([IMAGE_PIXELS, hidden1_units],
                            stddev=1.0 / math.sqrt(float(IMAGE_PIXELS))),
        name='weights')
    biases = tf.Variable(tf.zeros([hidden1_units]),
                         name='biases')
    hidden1 = tf.nn.relu(tf.matmul(images, weights) + biases)
  # Hidden 2
  with tf.name_scope('hidden2'):
    weights = tf.Variable(
        tf.truncated_normal([hidden1_units, hidden2_units],
                            stddev=1.0 / math.sqrt(float(hidden1_units))),
        name='weights')
    biases = tf.Variable(tf.zeros([hidden2_units]),
                         name='biases')
    hidden2 = tf.nn.relu(tf.matmul(hidden1, weights) + biases)
  # Linear
  with tf.name_scope('softmax_linear'):
    weights = tf.Variable(
        tf.truncated_normal([hidden2_units, NUM_CLASSES],
                            stddev=1.0 / math.sqrt(float(hidden2_units))),
        name='weights')
    biases = tf.Variable(tf.zeros([NUM_CLASSES]),
                         name='biases')
    logits = tf.matmul(hidden2, weights) + biases
  return logits


def loss(logits, labels):
  """Calculates the loss from the logits and the labels.

  Args:
    logits: Logits tensor, float - [batch_size, NUM_CLASSES].
    labels: Labels tensor, int32 - [batch_size].

  Returns:
    loss: Loss tensor of type float.
  """
  labels = tf.to_int64(labels)
  cross_entropy = tf.nn.sparse_softmax_cross_entropy_with_logits(
      logits, labels, name='xentropy')
  loss = tf.reduce_mean(cross_entropy, name='xentropy_mean')
  return loss


def training(loss, learning_rate):
  """Sets up the training Ops.

  Creates a summarizer to track the loss over time in TensorBoard.

  Creates an optimizer and applies the gradients to all trainable variables.

  The Op returned by this function is what must be passed to the
  `sess.run()` call to cause the model to train.

  Args:
    loss: Loss tensor, from loss().
    learning_rate: The learning rate to use for gradient descent.

  Returns:
    train_op: The Op for training.
  """
  # Add a scalar summary for the snapshot loss.
  tf.scalar_summary(loss.op.name, loss)
  # Create the gradient descent optimizer with the given learning rate.
  optimizer = tf.train.GradientDescentOptimizer(learning_rate)
  # Create a variable to track the global step.
  global_step = tf.Variable(0, name='global_step', trainable=False)
  # Use the optimizer to apply the gradients that minimize the loss
  # (and also increment the global step counter) as a single training step.
  train_op = optimizer.minimize(loss, global_step=global_step)
  return train_op


def evaluation(logits, labels):
  """Evaluate the quality of the logits at predicting the label.

  Args:
    logits: Logits tensor, float - [batch_size, NUM_CLASSES].
    labels: Labels tensor, int32 - [batch_size], with values in the
      range [0, NUM_CLASSES).

  Returns:
    A scalar int32 tensor with the number of examples (out of batch_size)
    that were predicted correctly.
  """
  # For a classifier model, we can use the in_top_k Op.
  # It returns a bool tensor with shape [batch_size] that is true for
  # the examples where the label is in the top k (here k=1)
  # of all logits for that example.
  correct = tf.nn.in_top_k(logits, labels, 1)
  # Return the number of true entries.
  return tf.reduce_sum(tf.cast(correct, tf.int32))
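To make the arithmetic behind loss() and evaluation() concrete, the following plain-Python sketch reproduces what those two functions compute for a small batch, without TensorFlow. It is an illustration only: the helper names and the three-class sample batch are hypothetical, and the tie handling (a label that ties for the largest logit counts as correct) is an assumption about how top-1 matching treats equal scores.

```python
import math


def sparse_softmax_cross_entropy(logits, label):
    """Cross-entropy for one example: -log(softmax(logits)[label])."""
    peak = max(logits)  # subtract the max before exponentiating, for stability
    log_sum_exp = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_sum_exp - logits[label]


def mean_loss(batch_logits, labels):
    """Average the per-example losses, mirroring tf.reduce_mean in loss()."""
    losses = [sparse_softmax_cross_entropy(z, y)
              for z, y in zip(batch_logits, labels)]
    return sum(losses) / len(losses)


def count_correct(batch_logits, labels):
    """Count examples whose label holds the largest logit (top-1 match)."""
    # Assumption: a label tied for the maximum is treated as correct.
    return sum(1 for z, y in zip(batch_logits, labels) if z[y] == max(z))


# Hypothetical batch of 3 examples with 3 classes (MNIST itself has 10).
batch_logits = [[2.0, 0.5, 0.1], [0.1, 0.2, 3.0], [1.0, 1.5, 0.2]]
labels = [0, 2, 0]
batch_loss = mean_loss(batch_logits, labels)
num_correct = count_correct(batch_logits, labels)  # 2: the third example is wrong
```

Because the labels are integer class indices rather than one-hot vectors, the loss only needs the logit at the label position, which is why the "sparse" variant of the cross-entropy op is used in loss().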