Rank & Sort Loss for Object Detection and Instance Segmentation

Kemal Oksuz, Baris Can Cam, Emre Akbas*, Sinan Kalkan*
Dept. of Computer Engineering, Middle East Technical University, Ankara, Turkey
{kemal.oksuz, can.cam, eakbas, skalkan}@metu.edu.tr

*Equal contribution for senior authorship.
Abstract
We propose Rank & Sort (RS) Loss, a ranking-based loss function to train deep object detection and instance segmentation methods (i.e. visual detectors). RS Loss supervises the classifier, a sub-network of these methods, to rank each positive above all negatives as well as to sort positives among themselves with respect to (wrt.) their localisation qualities (e.g. Intersection-over-Union, IoU). To tackle the non-differentiable nature of ranking and sorting, we reformulate the incorporation of error-driven update with backpropagation as Identity Update, which enables us to model our novel sorting error among positives. With RS Loss, we significantly simplify training: (i) thanks to our sorting objective, the positives are prioritized by the classifier without an additional auxiliary head (e.g. for centerness, IoU, mask-IoU); (ii) due to its ranking-based nature, RS Loss is robust to class imbalance, and thus no sampling heuristic is required; and (iii) we address the multi-task nature of visual detectors using tuning-free task-balancing coefficients. Using RS Loss, we train seven diverse visual detectors only by tuning the learning rate, and show that it consistently outperforms baselines: e.g. our RS Loss improves (i) Faster R-CNN by ∼3 box AP and aLRP Loss (a ranking-based baseline) by ∼2 box AP on the COCO dataset, and (ii) Mask R-CNN with repeat factor sampling (RFS) by 3.5 mask AP (∼7 AP for rare classes) on the LVIS dataset; and it also outperforms all counterparts. Code is available at: https://github.com/kemaloksuz/RankSortLoss.
1. Introduction
Owing to their multi-task (e.g. classification, box regression, mask prediction) nature, object detection and instance segmentation methods rely on loss functions of the form:
$$\mathcal{L}^{VD} = \sum_{k \in \mathcal{K}} \sum_{t \in \mathcal{T}} \lambda_t^k \mathcal{L}_t^k, \qquad (1)$$
which combines $\mathcal{L}_t^k$, the loss function for task $t$ on stage $k$ (e.g. $|\mathcal{K}| = 2$ for Faster R-CNN [32] with RPN and R-CNN), weighted by a hyper-parameter $\lambda_t^k$.
[Figure 1: two panels plotting classification logits over anchor IDs 0–7. (a) Ranking positives (+) above negatives (−), with binary labels 1 1 0 0 1 0 1 0 and a target ranking over the positives. (b) Rank&Sort Loss: rank (+) above (−) and sort (+) wrt their IoU labels, with target ranking 0, 4, 1, 6 over the anchor IDs.]
Figure 1. A ranking-based classification loss vs. RS Loss. (a) Enforcing positives to be ranked above negatives provides a useful training objective; however, it ignores the ordering among positives. (b) Our RS Loss, in addition to ranking positives above negatives, aims to sort positives wrt. their continuous IoUs (positives: a green tone based on its label; negatives: orange). We propose Identity Update (Section 3), a reformulation of error-driven update with backpropagation, to tackle these ranking and sorting operations, which are difficult to optimize due to their non-differentiable nature.
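To make the difference between the two objectives in Figure 1 concrete, the following minimal sketch derives the RS target ordering for a toy set of anchors. The IoU values here are our own illustrative choices, picked so that the result matches the target ranking 0, 4, 1, 6 shown in panel (b); they are not numbers from the paper.

```python
import numpy as np

# Hypothetical toy anchors in the spirit of Figure 1.
labels = np.array([1, 1, 0, 0, 1, 0, 1, 0])                  # binary fg/bg labels
ious = np.array([0.9, 0.6, 0.0, 0.0, 0.8, 0.0, 0.4, 0.0])    # localisation quality (illustrative)

pos = np.flatnonzero(labels == 1)  # anchor IDs 0, 1, 4, 6

# (a) A plain ranking objective only demands that every positive outscore
#     every negative; any ordering among the positives is acceptable.
# (b) The RS objective additionally sorts positives by descending IoU, so
#     the classification score itself reflects localisation quality.
rs_target_order = pos[np.argsort(-ious[pos])]
print(rs_target_order)  # [0 4 1 6]: anchor 0 (IoU 0.9) first, anchor 6 (IoU 0.4) last
```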
In such formulations, the number of hyper-parameters can easily exceed 10 [27], with additional hyper-parameters arising from task-specific imbalance problems [28] (e.g. the positive-negative imbalance in the classification task) and from cascaded architectures (e.g. HTC [7] employs 3 R-CNNs with different $\lambda_t^k$). Thus, although such loss functions have led to unprecedented successes, they require tuning, which is time-consuming, leads to sub-optimal solutions and makes fair comparison of methods challenging.
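To make the form of Eq. (1) concrete, here is a minimal sketch of how such a combined loss is assembled. The stage and task names and all numeric values are hypothetical, chosen only for illustration; real detectors would plug in their own per-stage, per-task losses.

```python
import torch

def combined_loss(losses, weights):
    """Minimal sketch of Eq. (1): L^VD = sum_k sum_t lambda_t^k * L_t^k.

    `losses` and `weights` are nested dicts indexed by stage k and task t.
    """
    return sum(weights[k][t] * losses[k][t]
               for k in losses for t in losses[k])

# A hypothetical two-stage detector (|K| = 2, e.g. RPN and R-CNN) with
# classification and box-regression tasks; the lambda values below are
# exactly the hyper-parameters that RS Loss aims to make tuning-free.
losses = {
    "rpn":  {"cls": torch.tensor(0.8), "box": torch.tensor(0.5)},
    "rcnn": {"cls": torch.tensor(1.2), "box": torch.tensor(0.7)},
}
weights = {"rpn":  {"cls": 1.0, "box": 1.0},
           "rcnn": {"cls": 1.0, "box": 2.0}}
print(combined_loss(losses, weights))  # tensor(3.9000): the scalar L^VD
```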
Recently proposed ranking-based loss functions, namely “Average Precision (AP) Loss” [6] and “average Localisation Recall Precision (aLRP) Loss” [27], offer two important advantages over the classical score-based functions (e.g. Cross-entropy Loss and Focal Loss [22]): (1) They directly optimize the performance measure (e.g. AP), thereby providing consistency between training and evaluation objectives. This also reduces the number of hyper-parameters, as the performance measure (e.g. AP) does not typically have any hyper-parameters. (2) They are robust to class imbalance.
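As a reminder of what these losses target, the sketch below computes AP from the ranking induced by classifier scores; the scores and labels are illustrative. Note that AP depends on the scores only through their ordering, which is exactly what makes such measures non-differentiable and motivates error-driven updates.

```python
import numpy as np

def average_precision(scores, labels):
    """AP of the ranking induced by `scores` (labels: 1 = positive, 0 = negative)."""
    order = np.argsort(-scores)                # rank detections by decreasing score
    ranked = labels[order]
    hits = np.cumsum(ranked)                   # positives found up to each rank
    ranks_of_pos = np.flatnonzero(ranked == 1) + 1
    return (hits[ranked == 1] / ranks_of_pos).mean()

scores = np.array([0.9, 0.3, 0.8, 0.2, 0.7])   # illustrative values
labels = np.array([1, 0, 1, 0, 0])
print(average_precision(scores, labels))        # 1.0: both positives ranked on top
```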