Helix Vol. 8(4): 3465-3469
Copyright © 2018 Helix ISSN 2319-5592 (Online)
Application of Interpolation Pooling in Convolutional Neural Networks
1Gaihua Wang, *2Guoliang Yuan, 3Meng Lv, 4WenZhou Liu
1Hubei Collaborative Innovation Centre for High-efficiency Utilization of Solar Energy, Hubei University of Technology, Wuhan 430068, China
1,2,3,4School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan 430068, China
Email: guoliang_yuan@hotmail.com
Received: 22nd March 2018, Accepted: 6th April 2018, Published: 30th June 2018
Abstract
In existing convolutional neural networks, most pooling operations are max pooling or mean pooling, which lose important feature information when processing feature maps. Here we report interpolation pooling, which overcomes this problem by retaining more of the effective information in the feature maps. Interpolation pooling takes into account the 4×4 block of known pixels nearest to the interpolation point: the closer a pixel is to the point being interpolated, the larger its weight in the calculation. We apply the method to different convolutional neural networks, such as LeNet-5 and pyramid convolutional neural networks, and find that it converges faster and achieves higher accuracy than traditional pooling methods.
Keywords: Interpolation Pooling, Image Classification, Convolutional Networks
1. Introduction
In recent years, deep learning has attracted the attention of many scholars. It has been shown to have deeper and wider applications than traditional shallow learning networks, including visual recognition, speech recognition and natural language processing. In 2006, Hinton[1] proposed deep belief nets, a deep learning method that broke through the bottleneck in the development of BP neural networks.
Among all kinds of deep learning methods, convolutional neural networks (CNNs) have been the most extensively studied. CNNs consist of three types of layers: convolutional, pooling and fully connected layers[2]. In convolutional layers, the convolution kernel is shared across all spatial positions, which reduces the complexity of the model and makes the network easier to train[3]. Pooling is an important concept in CNNs, with common variants including max pooling, mean pooling and mixed pooling. A pooling layer reduces the computational load by reducing the spatial size of the feature maps passed between convolutional layers. In 2012, Krizhevsky et al. proposed the AlexNet[4] model, which showed significant improvements. AlexNet is similar to LeNet-5[3], but with a deeper structure.
Simonyan et al.[5] proposed the VGG network based on AlexNet, and showed that increasing network depth helps to improve the accuracy of image classification. By increasing the depth, the network can better approximate the objective function, increase non-linearity, and obtain a better representation of the features. However, this also increases the complexity of the network and makes it more difficult to optimize. To solve the degradation problem that arises with increasing CNN depth, He et al.[6] proposed ResNet, which won the 2015 ILSVRC championship. ResNet maps low-level features directly to higher layers of the network; it is eight times as deep as VGG and 20 times faster than AlexNet. Szegedy et al.[7] proposed the Inception
module by observing and optimizing the network structure; it reduces network complexity by replacing larger convolution kernels with 1×1 convolution kernels. GoogLeNet[7], built from Inception modules, has only 1/12th the training parameters of AlexNet, yet improves image classification accuracy on ImageNet. In 2017, Saining Xie et al.[8] proposed the ResNeXt network structure based on ResNet. ResNeXt improves accuracy without increasing the complexity of the parameters, while reducing the number of hyper-parameters. At the same time, a variety of methods[9-11] have been proposed to overcome the difficulties encountered in training deep CNNs.
All the methods mentioned above improve the depth, activation functions or convolution kernels of CNNs; in these models, max pooling or mean pooling is used. Max pooling simply selects the maximum value in the pooling region as the final response, which makes it sensitive to noise. Mean pooling takes the average value of the pooling region, which effectively reduces the impact of noise, but it smooths the image and leads to the loss of high-frequency information[12]. In this paper, to address these problems of max pooling and mean pooling in CNNs, interpolation pooling is proposed to optimize network efficiency.
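The contrasting behaviors of max and mean pooling described above can be illustrated with a minimal NumPy sketch (the function names and the toy feature map are illustrative, not from the paper):

```python
import numpy as np

def max_pool2x2(x):
    """2x2 max pooling with stride 2 on a 2-D feature map."""
    h, w = x.shape
    return x[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

def mean_pool2x2(x):
    """2x2 mean pooling with stride 2 on a 2-D feature map."""
    h, w = x.shape
    return x[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

# A feature map with one noisy spike: max pooling propagates the spike,
# while mean pooling dilutes it together with all high-frequency detail.
fmap = np.array([[1., 2., 0., 0.],
                 [3., 9., 0., 0.],   # 9.0 is a noise spike
                 [0., 0., 5., 6.],
                 [0., 0., 7., 8.]])
print(max_pool2x2(fmap))   # top-left output = 9.0 (noise dominates)
print(mean_pool2x2(fmap))  # top-left output = 3.75 (spike diluted)
```

The top-left 2×2 region shows both failure modes at once: max pooling returns the noise value 9.0 as the region's response, while mean pooling returns 3.75, smoothing away the spike along with the genuine high-frequency content.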
Interpolation pooling mainly uses the method of image interpolation: the 16 nearest known pixels are used to compute each pixel value of the output feature map, thereby achieving the goal of scaling the feature map.
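As a hedged sketch of this idea, the following downsamples a feature map using the 4×4 neighbourhood around each target point, with weights that decay with distance. The paper's exact weighting function is not given in this section, so the standard cubic-convolution (Keys) kernel is assumed here; the names `cubic_kernel` and `interp_pool` are illustrative:

```python
import numpy as np

def cubic_kernel(d, a=-0.5):
    """Cubic-convolution weight for a sample at signed distance d (Keys kernel)."""
    d = abs(d)
    if d < 1:
        return (a + 2) * d**3 - (a + 3) * d**2 + 1
    if d < 2:
        return a * d**3 - 5 * a * d**2 + 8 * a * d - 4 * a
    return 0.0

def interp_pool(x, scale=2):
    """Downsample a 2-D feature map by `scale`, computing each output pixel
    from the 4x4 nearest known pixels, weighted by distance."""
    h, w = x.shape
    out_h, out_w = h // scale, w // scale
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            sy, sx = i * scale + 0.5, j * scale + 0.5   # source position
            y0, x0 = int(np.floor(sy)), int(np.floor(sx))
            acc = wsum = 0.0
            for m in range(-1, 3):            # 4 rows around the point
                for n in range(-1, 3):        # 4 columns around the point
                    yy = min(max(y0 + m, 0), h - 1)   # clamp at borders
                    xx = min(max(x0 + n, 0), w - 1)
                    wgt = cubic_kernel(sy - (y0 + m)) * cubic_kernel(sx - (x0 + n))
                    acc += wgt * x[yy, xx]
                    wsum += wgt
            out[i, j] = acc / wsum            # normalize the weights
    return out
```

Because the weights are normalized, a constant feature map is reproduced exactly, and nearby pixels contribute more than distant ones, which matches the distance-based weighting described above.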
DOI 10.29042/2018-3465-3469