Notes on Convolutional Neural Networks
Jake Bouvrie
Center for Biological and Computational Learning
Department of Brain and Cognitive Sciences
Massachusetts Institute of Technology
Cambridge, MA 02139
November 22, 2006
1 Introduction
This document discusses the derivation and implementation of convolutional neural networks (CNNs) [3, 4], followed by a few straightforward extensions. Convolutional neural networks involve many more connections than weights; the architecture itself realizes a form of regularization. In addition, a convolutional network automatically provides some degree of translation invariance. This particular kind of neural network assumes that we wish to learn filters, in a data-driven fashion, as a means to extract features describing the inputs. The derivation we present is specific to two-dimensional data and convolutions, but can be extended without much additional effort to an arbitrary number of dimensions.
We begin with a description of classical backpropagation in fully connected networks, followed by a derivation of the backpropagation updates for the filtering and subsampling layers in a 2D convolutional neural network. Throughout the discussion, we emphasize efficiency of the implementation, and give small snippets of MATLAB code to accompany the equations. The importance of writing efficient code when it comes to CNNs cannot be overstated. We then turn to the topic of learning how to combine feature maps from previous layers automatically, and consider, in particular, learning sparse combinations of feature maps.
Disclaimer: This rough note could contain errors, exaggerations, and false claims.
2 Vanilla Back-propagation Through Fully Connected Networks
In the typical convolutional neural networks you might find in the literature, the early stages consist of alternating convolution and sub-sampling operations, while the last stage of the architecture is a generic multi-layer network: the last few layers (closest to the outputs) are fully connected 1-dimensional layers. When you're ready to pass the final 2D feature maps as inputs to the fully connected 1-D network, it is often convenient to just concatenate all the features present in all the output maps into one long input vector, and we're back to vanilla backpropagation. The standard backprop algorithm will be described before going on to specialize the algorithm to the case of convolutional networks (see e.g. [1] for more details).
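As a concrete illustration of the concatenation step, consider the following MATLAB sketch. The variable names here are assumptions for the sake of the example: maps is a cell array in which maps{j} holds the j-th 2D output map of the final convolutional stage, and fv is the resulting feature vector.

    % Flatten every final-stage output map into one long feature vector.
    % maps is an assumed cell array; maps{j} is the j-th 2D feature map.
    fv = [];
    for j = 1:numel(maps)
        fv = [fv; maps{j}(:)];   % column-wise vectorization, then stack
    end
    % fv can now be fed as input to the fully connected 1-D network.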
2.1 Feedforward Pass
In the derivation that follows, we will consider the squared-error loss function. For a multiclass
problem with c classes and N training examples, this error is given by
$$E^N = \frac{1}{2} \sum_{n=1}^{N} \sum_{k=1}^{c} \left( t_k^n - y_k^n \right)^2 .$$
Here $t_k^n$ is the $k$-th dimension of the $n$-th pattern's corresponding target (label), and $y_k^n$ is similarly the value of the $k$-th output layer unit in response to the $n$-th input pattern. For multiclass classification problems, the targets will typically be organized as a "one-of-$c$" code where the $k$-th element of $t^n$ is positive if the pattern $x^n$ belongs to class $k$.