Pose Guided RGBD Feature Learning for 3D Object Pose Estimation
Vassileios Balntas†, Andreas Doumanoglou∗, Caner Sahin∗, Juil Sock∗, Rigas Kouskouridas‡, Tae-Kyun Kim∗
†University of Oxford, UK    ‡Scape Technologies, UK    ∗Imperial College London, UK
balntas@robots.ox.ac.uk {a.doumanoglou12,c.sahin14,ju-il.sock08,tk.kim}@ic.ac.uk rigas@scape.io
Abstract
In this paper we examine the effects of using object poses as guidance for learning robust features for 3D object pose estimation. Previous works have focused on learning feature embeddings based on metric learning with triplet comparisons, and rely only on the qualitative distinction of similar and dissimilar pose labels. In contrast, we consider the exact pose differences between the training samples, and aim to learn embeddings such that the distances in the pose label space are proportional to the distances in the feature space. However, since it is less desirable to force the pose-feature correlation when objects are symmetric, we discuss the use of weights that reflect object symmetry when measuring the pose distances. Furthermore, end-to-end pose regression is investigated and is shown to further boost the discriminative power of feature learning, improving pose recognition accuracies. Experimental results show that the features learnt under pose guidance are significantly more discriminative than those learned in the traditional way, outperforming state-of-the-art works. Finally, we measure the generalisation capacity of pose-guided feature learning in previously unseen scenes containing objects under different occlusion levels, and show that it adapts well to novel tasks.
1. Introduction
Detecting objects and estimating their 3D pose is a very challenging task, since severe occlusions, background clutter and large scale changes dramatically affect the performance of any contemporary solution. State-of-the-art methods make use of Hough Forests for casting patch votes in the 3D space [27, 7], or train CNNs to either perform classification into the quantized 3D pose space [13] or regress the object position [14] from local patches.
Another approach to the 3D object pose estimation problem involves transforming the initial problem into a nearest neighbour matching one, where extracted feature descriptors are matched with a set of templates via nearest neighbour search [9]. End-to-end deep networks for feature-based nearest neighbour matching entail training a classification network with a classifier layer which is subsequently removed, while the penultimate layer serves as a feature descriptor [24]. Direct feature learning for discrete object classes with deep neural networks [26, 12] has demonstrated successful results by using siamese and triplet networks optimised for discriminative embeddings. The latter are learned in a way that ensures that features extracted from samples belonging to the same class are close in the learned embedding space, while samples from different classes are further apart. Wohlhart and Lepetit [29] adapted this framework to the problem of learning feature descriptors for 3D object pose estimation, by sampling the qualitative relation of pose similarity and forming triplets consisting of similar and dissimilar poses.

(Work done while VB and RK were at Imperial College London.)
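The difference between the qualitative triplet comparison of [29] and the quantitative, pose-guided objective advocated here can be illustrated with a minimal NumPy sketch. The function names, the margin value, and the scalar symmetry weight w are hypothetical illustrations, not the exact losses of this work, which are defined later in the paper:

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.01):
    """Qualitative objective: the embedding of a similar-pose sample
    (positive) must lie closer to the anchor than that of a
    dissimilar-pose sample (negative), by at least a fixed margin."""
    d_pos = np.sum((anchor - positive) ** 2)  # squared feature distance
    d_neg = np.sum((anchor - negative) ** 2)
    return max(0.0, margin + d_pos - d_neg)   # hinge on the ordering only

def pose_guided_loss(feat_a, feat_b, pose_dist, w=1.0):
    """Quantitative objective (sketch): penalise deviation of the
    feature-space distance from the pose-space distance, so that the
    two become proportional. `w` stands in for a symmetry-aware
    weighting of the pose distance (hypothetical)."""
    d_feat = np.linalg.norm(feat_a - feat_b)
    return (d_feat - w * pose_dist) ** 2
```

The triplet term only constrains an ordering of distances, so any embedding that separates dissimilar poses by more than the margin is equally good; the pose-guided term instead ties each pairwise feature distance to the actual pose difference, and for symmetric objects w can down-weight pose directions along which appearance does not change.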
It is apparent that moving from the continuous space of 3D object poses to the qualitative one of similar and dissimilar pose pairs leads to inevitable information loss. To this end, in this paper we are interested in creating a feature learning framework that directly uses the object pose in the optimization process. Our key idea is that, by using the pose labels in the feature learning layer, we can devise a learning procedure that has inherent knowledge of the final goal (i.e. 3D pose estimation), thus allowing for a switch from a qualitative optimization (similar and dissimilar poses) to a quantitative one (directly computed distance in the pose space). In the proposed learning framework, the pose-feature correlation is established with the adjusted distances in the pose space. Direct 3D pose regression seems challenging due to ambiguities in the appearance/pose space, the continuous nature of the multi-dimensional output space, and the discrepancy between synthetic data used for training and real data used for testing. However, training an end-to-end pose regression network can still facilitate feature learning. Similar to [24], while evaluating our system's performance, we remove the regression layer and use the feature layer for nearest neighbour matching. The regression term along with the pose-guided feature one further improves the