Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

Yao Yao¹∗  Zixin Luo¹  Shiwei Li¹  Tianwei Shen¹  Tian Fang²†  Long Quan¹

¹The Hong Kong University of Science and Technology
{yyaoag, zluoag, slibc, tshenaa, quan}@cse.ust.hk
²Shenzhen Zhuke Innovation Technology (Altizure)
fangtian@altizure.com
Abstract
Deep learning has recently demonstrated excellent performance for multi-view stereo (MVS). However, one major limitation of current learned MVS approaches is scalability: the memory-consuming cost volume regularization makes learned MVS difficult to apply to high-resolution scenes. In this paper, we introduce a scalable multi-view stereo framework based on the recurrent neural network. Instead of regularizing the entire 3D cost volume in one go, the proposed Recurrent Multi-view Stereo Network (R-MVSNet) sequentially regularizes the 2D cost maps along the depth direction via the gated recurrent unit (GRU). This dramatically reduces the memory consumption and makes high-resolution reconstruction feasible. We first show the state-of-the-art performance achieved by the proposed R-MVSNet on the recent MVS benchmarks. Then, we further demonstrate the scalability of the proposed method on several large-scale scenarios, where previous learned approaches often fail due to the memory constraint. Code is available at https://github.com/YoYo000/MVSNet.
1. Introduction
Multi-view stereo (MVS) aims to recover the dense representation of a scene given multi-view images and calibrated cameras. While traditional methods [24, 10, 29, 9] have achieved excellent reconstruction performance, recent works [14, 13, 30] show that learned approaches are able to produce results comparable to the traditional state of the art. In particular, MVSNet [30] proposed a deep architecture for depth map estimation, which significantly boosts reconstruction completeness and overall quality.
One of the key advantages of learning-based MVS is the cost volume regularization, where most networks apply multi-scale 3D CNNs [14, 15, 30] to regularize the 3D cost volume. However, this step is extremely memory-expensive: it operates on 3D volumes, and the memory requirement grows cubically with the model resolution (Fig. 1 (d)). Consequently, current learned MVS algorithms can hardly be scaled up to high-resolution scenarios.

∗Intern at Shenzhen Zhuke Innovation Technology (Altizure).
†Corresponding author.
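To make the cubic growth concrete, here is a back-of-the-envelope memory estimate. All sizes, the channel count, and the helper name are hypothetical, chosen only for illustration and not taken from the paper; the point is that doubling the model resolution doubles height, width, and depth together, multiplying memory by eight.

```python
# Rough memory estimate for a full 3D cost volume, as regularized in one
# go by multi-scale 3D CNNs (all sizes are hypothetical, for illustration).
def cost_volume_bytes(height, width, depth, channels=32, dtype_bytes=4):
    """Bytes needed to hold one H x W x D x C float32 cost volume."""
    return height * width * depth * channels * dtype_bytes

# Doubling the model resolution doubles H, W, and D at once,
# so memory grows cubically: 2 x 2 x 2 = 8x per doubling.
small = cost_volume_bytes(256, 256, 128)
large = cost_volume_bytes(512, 512, 256)
print(large / small)  # 8.0
```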
Recent works on deep learning for 3D also acknowledge this problem. OctNet [23] and O-CNN [27] exploit the sparsity in 3D data and introduce the octree structure to 3D CNNs. SurfaceNet [14] and DeepMVS [13] apply an engineered divide-and-conquer strategy to the MVS reconstruction. MVSNet [30] builds the cost volume upon the reference camera frustum to decouple the reconstruction into smaller problems of per-view depth map estimation. However, when it comes to high-resolution 3D reconstruction (e.g., volume size > 512³ voxels), these methods either fail or take a long time to process.
To this end, we present a novel scalable multi-view stereo framework, dubbed R-MVSNet, based on the recurrent neural network. The proposed network is built upon the MVSNet architecture [30], but regularizes the cost volume in a sequential manner using the convolutional gated recurrent unit (GRU) rather than 3D CNNs. With the sequential processing, the online memory requirement of the algorithm is reduced from cubic to quadratic in the model resolution (Fig. 1 (c)). As a result, R-MVSNet is applicable to high-resolution 3D reconstruction with unlimited depth-wise resolution.
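The sequential scheme can be sketched as follows. This is a toy NumPy version, not the paper's implementation: it uses per-pixel (1×1) gate weights where R-MVSNet uses spatial convolutions, and all sizes and random weights are made up for illustration. The key property it demonstrates is that only one 2D hidden state (plus the current 2D cost map) is ever resident, no matter how many depth planes are swept.

```python
import numpy as np

rng = np.random.default_rng(0)

def gru_step(cost_map, hidden, Wz, Wr, Wh):
    """One GRU-style update on a 2D cost map, simplified to per-pixel
    (1x1) gate weights; R-MVSNet uses convolutional gates instead."""
    x = np.stack([cost_map, hidden], axis=-1)        # concat input and state
    z = 1.0 / (1.0 + np.exp(-(x @ Wz).squeeze(-1)))  # update gate
    r = 1.0 / (1.0 + np.exp(-(x @ Wr).squeeze(-1)))  # reset gate
    xh = np.stack([cost_map, r * hidden], axis=-1)
    h_tilde = np.tanh((xh @ Wh).squeeze(-1))         # candidate state
    return (1.0 - z) * hidden + z * h_tilde          # gated combination

H, W, D = 64, 80, 192                 # hypothetical map size and depth planes
Wz, Wr, Wh = (rng.standard_normal((2, 1)) for _ in range(3))
hidden = np.zeros((H, W))             # the only persistent state is 2D
for d in range(D):                    # sweep front-to-back along depth
    cost_map = rng.standard_normal((H, W))  # stand-in for one cost slice
    hidden = gru_step(cost_map, hidden, Wz, Wr, Wh)
print(hidden.shape)  # (64, 80)
```

Because the loop touches one depth plane at a time, peak memory scales with H × W (quadratic in resolution) rather than H × W × D, which is what allows an effectively unlimited number of depth planes.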
We first evaluate R-MVSNet on the DTU [1], Tanks and Temples [17] and ETH3D [25] datasets, where our method produces results comparable to or even better than the state-of-the-art MVSNet [30]. Next, we demonstrate the scalability of the proposed method on several large-scale scenarios with a detailed analysis of the memory consumption. R-MVSNet is far more efficient in GPU memory than other methods and is the first learning-based approach applicable to such wide-depth-range scenes, e.g., the advanced set of the Tanks and Temples dataset [17].
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
978-1-7281-3293-8/19/$31.00 ©2019 IEEE
DOI 10.1109/CVPR.2019.00567