【免费】POEMReconstructingHandinaPointEmbedded.pdf资源-CSDN文库

需积分: 0 126 浏览量更新于2024-11-19 收藏 1.29MB PDF 举报

多视图立体三维重建MVS论文

POEM: Reconstructing Hand in a Point Embedded Multi-view Stereo

Lixin Yang

1,2

Jian Xu

Licheng Zhong

Xinyu Zhan

Zhicheng Wang

Kejian Wu

Cewu Lu

1,2†

Shanghai Jiao Tong University

Shanghai Qi Zhi Institute

Nreal

{siriusyang, zlicheng, kelvin34501, lucewu}@sjtu.edu.cn

{jianxu, kejian}@nreal.ai chgggo@gmail.com

Abstract

Enable neural networks to capture 3D geometrical-

aware features is essential in multi-view based vision tasks.

Previous methods usually encode the 3D information of

multi-view stereo into the 2D features. In contrast, we

present a novel method, named POEM, that directly oper-

ates on the 3D POints Embedded in the Multi-view stereo

for reconstructing hand mesh in it. Point is a natural form

of 3D information and an ideal medium for fusing fea-

tures across views, as it has different projections on dif-

ferent views. Our method is thus in light of a simple yet

effective idea, that a complex 3D hand mesh can be rep-

resented by a set of 3D points that 1) are embedded in

the multi-view stereo, 2) carry features from the multi-view

images, and 3) encircle the hand. To leverage the power

of points, we design two operations: point-based feature

fusion and cross-set point attention mechanism. Evalua-

tion on three challenging multi-view datasets shows that

POEM outperforms the state-of-the-art in hand mesh re-

construction. Code and models are available for research

at github.com/lixiny/POEM

1. Introduction

Hand mesh reconstruction plays a central role in the ﬁeld

of augmented and mixed reality, as it can not only deliver

realistic experiences for the users in gaming but also sup-

port applications involving teleoperation, communication,

education, and ﬁtness outside of gaming. Many signiﬁcant

efforts have been made for the monocular 3D hand mesh

reconstruction [1, 5, 7, 9, 31, 32]. However, it still strug-

gles to produce applicable results, mainly for these three

reasons. (1) Depth ambiguity. Recovery of the absolute

position in a monocular camera system is an ill-posed prob-

lem. Hence, previous methods [9, 31, 54] only recovered

the hand vertices relative to the wrist (i.e. root-relative).

(2) Unknown perspectives. The shape of the hand’s 2D

†

Cewu Lu is the corresponding author, the member of Qing Yuan Re-

search Institute and MoE Key Lab of Artiﬁcial Intelligence, AI Institute,

Shanghai Jiao Tong University, China and Shanghai Qi Zhi institute.

Figure 1. Intersection area of N cameras’ frustum spaces. The

gray dots represent the point cloud P aggregated from N frustums.

Our method: POEM, standing for the point embedded multi-view

stereo, focuses on the dark area scatted with gray dots.

projection is highly dependent on the camera’s perspec-

tive model (i.e. camera intrinsic matrix). However, the

monocular-based methods usually suggest a weak perspec-

tive projection [1, 27], which is not accurate enough to re-

cover the hand’s 3D structure. (3) Occlusion. The occlu-

sion between the hand and its interacting objects also chal-

lenges the accuracy of the reconstruction [32]. These issues

limit monocular-based methods from practical application,

in which the absolute and accurate position of the hand sur-

face is required for interacting with our surroundings.

Our paper is thus focusing on reconstructing hands from

multi-view images. Motivation comes from two aspects.

First, the issues mentioned above can be alleviated by lever-

aging the geometrical consistency among multi-view im-

ages. Second, the prospered multi-view hand-object track-

ing setups [2, 4, 49, 55] and VR headsets bring us an urgent

demand and direct application of multi-view hand recon-

struction in real-time. A common practice of multi-view 3D

pose estimation follows a two-stage design. It ﬁrst estimates

the 2D key points of the skeleton in each view and then

back-project them to 3D space through several 2D-to-3D

lifting methods, e.g. algebraic triangulation [17,18,39], Pic-

torial Structures Model (PSM) [33, 38], 3D CNN [18, 43],

21108

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

DOI 10.1109/CVPR52729.2023.02022

Authorized licensed use limited to: Institute of Software. Downloaded on November 08,2024 at 02:43:11 UTC from IEEE Xplore. Restrictions apply.

下载后可阅读完整内容，剩余9页未读，立即下载

身份认证购VIP最低享 7 折!

30元优惠券

资源推荐

资源评论

GL_Rain

粉丝: 3149
资源: 36

POEM Reconstructing Hand in a Point Embedded.pdf

最新资源

POEM Reconstructing Hand in a Point Embedded.pdf

基于POEM_SLPP的人脸识别算法.pdf

Poem_Rabindranath Tagore.pdf

基于MBC和POEM特征的人脸识别方法.pdf

Symphonic Poem No.2, Op.65钢琴曲谱双手数字简谱钢琴曲谱.pdf

Poem人教版高二英语李芳芳.pptx

高一英语测试题附答案.pdf

常见的词性转换.pdf

dreawear网页制作实验指导.pdf

牛津英语模块Unit词汇PPT课件.pptx

大学英语四级考前必看：690个高频词汇整理.pdf

Heres a werid string. Its basicly just a poem I wrote that f

河南省林州市第一中学高中英语Unit2Poems写作指导_如何写诗评新人教版选修6

python文件和目录操作方法大全.pdf

PEP小学英语六上英语Unit,3知识点、考点梳理.pdf

百度AI攻略：智能写诗.pdf

胸外科临床路径(2019年版).pdf

基于序列到序列神经网络模型的古诗自动生成方法.pdf

(精品)最美的60句宋词：宋词中的名句精选.pdf

高中英语Revision(unit1—5ofBook6)人教版第六册.pdf

a_poem-0.12.3-py2.py3-none-any.whl

4BModule2单词词组列表[借鉴].pdf

【Python学习笔记】第七章 字符串.pdf

poem.db sqllite格式 唐诗宋词都在这里了

巴克莱-美股-医疗保健行业-美国生命科学与诊断Jack工具包（特别假日版）——第4卷第52期-27-12页.pdf

LSTM_poem.rar

Sort Poems.cpp

最新资源

【Python学习笔记】第七章字符串.pdf

poem.db sqllite格式唐诗宋词都在这里了