ICIoU Loss Function for Convolutional Neural Networks


Received July 13, 2021, accepted July 22, 2021, date of publication July 26, 2021, date of current version August 3, 2021.

Digital Object Identifier 10.1109/ACCESS.2021.3100414

ICIoU: Improved Loss Based on Complete Intersection Over Union for Bounding Box Regression

XUFEI WANG (1,2) AND JEONGYOUNG SONG (Member, IEEE)

(1) Key Laboratory of Industrial Automation, School of Mechanical Engineering, Shaanxi University of Technology, Hanzhong 723000, China

(2) Department of Computer Engineering, Pai Chai University, Daejeon 35345, South Korea

Corresponding author: Jeongyoung Song (jysong@pcu.ac.kr)

This work was supported in part by the Shaanxi Provincial Key Laboratory of Industrial Automation Research Program under Grant 18JS020.

ABSTRACT Object detectors based on convolutional neural networks (CNNs) are widely used in computer vision because of their simplicity and efficiency. The average accuracy of a CNN-based detector is greatly affected by its loss function, and the precision of the localization term in that loss is the main factor affecting the result. Based on the complete intersection over union (CIoU) loss function, an improved penalty function is proposed to improve localization accuracy. Specifically, the algorithm considers the match between the predicted and ground-truth bounding boxes more comprehensively, using the proportional relationship between the aspect ratios of the two boxes. For the case in which the two bounding boxes have the same aspect ratio, the factors through which the prediction box still influences localization accuracy are also considered. In this way, the penalty function is strengthened and the localization accuracy of the network model is improved. This loss function is called Improved CIoU (ICIoU). Experiments on the Udacity, PASCAL VOC, and MS COCO datasets with the one-stage object detector YOLOv4 demonstrate the effectiveness of ICIoU in improving localization accuracy. Compared with CIoU, the proposed ICIoU improved average precision (AP) by 0.57% and AP75 by 0.12% on Udacity, AP by 0.26% and AP75 by 1.28% on PASCAL VOC, and AP by 0.06% and AP75 by 0.65% on MS COCO.

INDEX TERMS Bounding box regression, localization accuracy, loss function, object detection.
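
For orientation, the baseline CIoU loss that the abstract builds on can be sketched as follows. This is the commonly cited CIoU formulation (1 - IoU, plus a normalized center-distance term, plus an aspect-ratio term weighted by alpha), not the ICIoU penalty itself, whose formula does not appear in this excerpt. The function name `ciou_loss`, the `(x1, y1, x2, y2)` box layout, and the `eps` stabilizer are assumptions made for illustration.

```python
import math


def ciou_loss(pred, gt, eps=1e-9):
    """Baseline CIoU loss for two boxes given as (x1, y1, x2, y2).

    Illustrative sketch only: names and box layout are assumptions,
    and this is plain CIoU, not the ICIoU penalty of the paper.
    """
    px1, py1, px2, py2 = pred
    gx1, gy1, gx2, gy2 = gt
    pw, ph = px2 - px1, py2 - py1
    gw, gh = gx2 - gx1, gy2 - gy1

    # Intersection over union of the two boxes.
    iw = max(0.0, min(px2, gx2) - max(px1, gx1))
    ih = max(0.0, min(py2, gy2) - max(py1, gy1))
    inter = iw * ih
    union = pw * ph + gw * gh - inter
    iou = inter / (union + eps)

    # Squared distance between the two box centers.
    rho2 = ((px1 + px2) / 2 - (gx1 + gx2) / 2) ** 2 \
         + ((py1 + py2) / 2 - (gy1 + gy2) / 2) ** 2

    # Squared diagonal of the smallest box enclosing both boxes.
    cw = max(px2, gx2) - min(px1, gx1)
    ch = max(py2, gy2) - min(py1, gy1)
    c2 = cw ** 2 + ch ** 2

    # Aspect-ratio consistency term v and its trade-off weight alpha.
    v = (4 / math.pi ** 2) * (
        math.atan(gw / (gh + eps)) - math.atan(pw / (ph + eps))
    ) ** 2
    alpha = v / ((1 - iou) + v + eps)

    return 1 - iou + rho2 / (c2 + eps) + alpha * v
```

Note that when the two boxes share the same aspect ratio, `v` is zero and the aspect-ratio penalty vanishes regardless of how the box sizes differ, which is the situation the abstract singles out.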

I. INTRODUCTION

Object detection is one of the key problems in computer vision. In recent years, convolutional neural networks (CNNs) have been increasingly applied in the field of computer vision [1]–[11]. When CNNs are used for object detection, a loss function is indispensable, whether the task is treated as a regression or a classification problem. Loss functions estimate the degree of inconsistency between the value predicted by a model and the real value. The main task of model training in the present work is to use an optimization method to find the model parameters that minimize the loss function. The loss function determines what the optimal value of the model is, so the performance of different object detectors is affected by the loss function. The loss function generally consists of a bounding box regression term and a classification term. The loss calculation for bounding box regression is the key step in object localization, multiobject detection, target tracking, and instance-level segmentation. In multiobject detection, deep CNNs predict the bounding boxes of candidate objects better than traditional region proposal methods. These networks include one-stage object detectors, such as the YOLO series [3]–[6] and the single shot multibox detector (SSD) [9]; two-stage object detectors, such as the regions with CNN features (R-CNN) series [11]–[14]; and even multistage object detectors, such as cascade R-CNN [15]. In these networks, intersection over union (IoU) loss has become the most popular evaluation measure for bounding box regression compared with focal loss and L_n-norm (e.g., L_1, L_2) loss [16], [17]. However, the IoU algorithm cannot detect the bounding box

The associate editor coordinating the review of this manuscript and approving it for publication was Sudipta Roy.
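
The IoU measure and the L_n-norm alternative named in the introduction can be illustrated with a minimal sketch. The helper names `iou`, `iou_loss`, and `ln_norm_loss`, the `(x1, y1, x2, y2)` box layout, and the use of 1 - IoU as the loss are assumptions chosen for illustration; they are not taken from the paper.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region (zero if the boxes are disjoint).
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def iou_loss(box_a, box_b):
    """A common IoU-based regression loss: 1 - IoU."""
    return 1.0 - iou(box_a, box_b)


def ln_norm_loss(box_a, box_b, p=2):
    """L_p distance between the raw box coordinates (p=1 or p=2 are typical)."""
    return sum(abs(a - b) ** p for a, b in zip(box_a, box_b)) ** (1.0 / p)


if __name__ == "__main__":
    pred, gt = (0, 0, 2, 2), (1, 1, 3, 3)
    print(iou(pred, gt), iou_loss(pred, gt), ln_norm_loss(pred, gt, p=1))
```

The L_n-norm loss works on the four coordinates independently, whereas the IoU loss depends on the overlap of the two boxes as regions.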


This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

VOLUME 9, 2021
