Real-time RGB-D image stitching using multiple Kinects for improved field of view
Hengyu Li¹, Hang Liu¹*, Ning Cao¹, Yan Peng¹, Shaorong Xie¹, Jun Luo¹ and Yu Sun²
Abstract
This paper addresses two problems of Kinect-style RGB-D sensors: defective depth maps and a limited field of view (FOV). An anisotropic diffusion (AD) based hole-filling method is proposed to recover invalid depth data in the depth map. The FOV of a Kinect-style RGB-D sensor is extended by stitching depth and color images from multiple RGB-D sensors. Because each depth map is aligned with its color image, the registration data calculated by registering the color images can be used to stitch the depth and color images into panoramic depth and color images concurrently in real time. Experimental results show that the proposed stitching method generates an RGB-D panorama with no invalid depth data and little distortion in real time, and that it can be extended to incorporate more RGB-D sensors to construct a panoramic RGB-D image with up to a 360-degree FOV.
Keywords
depth image stitching, RGB-D panorama, improved field of view, depth map hole filling, Kinect, image registration
Introduction
Depth information is an important complement to visual (RGB) information in computer vision applications. Traditional depth sensors include time-of-flight cameras, laser range scanners, structured light scanners and binocular cameras. Another type is the infrared (IR) based sensor, such as the Microsoft Kinect, which generates a depth map by matching dots in the IR image against a pre-calibrated IR pattern (Zhang (2012)). Compared with laser scanners and binocular cameras, the Kinect costs much less and generates reliable depth maps at much higher speed (Smisek et al. (2013), Zug et al. (2012)). The Kinect has been widely used as
the primary 3D sensor for computer vision applications like
detection, segmentation and recognition of objects (Gupta et
al. (2014), Shahroudy et al. (2016)), 3D modeling (Henry et
al. (2013), Barron and Malik (2013)) and SLAM (Whelan et
al. (2015)). However, a main limitation when applying the Kinect in these applications is its narrow field of view (FOV), which restricts how much of a scene can be covered (Zug et al. (2012), Han et al. (2013)). The depth camera of the Kinect has a horizontal FOV of 57°, much smaller than the 240° FOV of the Hokuyo URG-04LX-UG01, a laser scanner with similar maximum range and accuracy (Zug et al. (2012)).
To extend the sensing area of a single Kinect, multiple Kinects have been used in 3D reconstruction (Tong et al. (2012), Alexiadis et al. (2013)) and 3D detection (Susanto et al. (2012), Asteriadis et al. (2013), Morato et al. (2014)). In these works, the Kinects were placed facing the same object or observing the same scene, to cover all sides of the model and to avoid depth shadows caused by occlusion. Instead of facing inward, the Kinects can also be placed facing outward to extend the limited FOV through image stitching, which is the purpose of our work. Song et al. (2015) extended the FOV with a pre-calibrated, rotated top-bottom arrangement of two Kinects: the pair of depth maps was perspectively transformed to a common frontal flat reference frame, using the homography between the depth maps, to form a panoramic depth map. Though the depth maps can be stitched seamlessly, the resulting depth panorama suffers from considerable distortion, because for larger fields of view a flat representation cannot be maintained without excessively stretching pixels near the border of the image (Szeliski (2006)).
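For illustration, the following sketch (ours, not Song et al.'s implementation; the homography values are hypothetical) warps one depth map into the frame of another with a known 3 × 3 homography H and composites them on a flat reference plane. Nearest-neighbor interpolation is used so that depth values are not blended across object boundaries.

```python
import numpy as np
import cv2

# Hypothetical 3x3 homography mapping the second depth map into the
# frame of the first (values for illustration only).
H = np.array([[1.0, 0.02, 300.0],
              [0.0, 1.00,  -5.0],
              [0.0, 0.00,   1.0]])

def stitch_flat(depth1, depth2, H, pano_width):
    """Warp depth2 onto depth1's flat reference plane and composite."""
    h, w = depth1.shape
    pano = np.zeros((h, pano_width), dtype=depth1.dtype)
    pano[:, :w] = depth1
    # Nearest-neighbor keeps raw depth values intact; linear interpolation
    # would blend foreground and background depths at object boundaries.
    warped = cv2.warpPerspective(depth2, H, (pano_width, h),
                                 flags=cv2.INTER_NEAREST)
    # Fill only pixels still empty (0 encodes invalid/unobserved depth).
    hole = (pano == 0) & (warped > 0)
    pano[hole] = warped[hole]
    return pano
```

Because every pixel is forced onto a single flat plane, content far from the reference view gets stretched, which is exactly the distortion noted above.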
To generate a depth panorama with large FOV and little distortion, a cylindrical or spherical projection is usually chosen: each input image is warped onto a cylindrical or spherical surface according to an estimated 3 × 3 camera matrix or homography (Szeliski (2006)).
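As a minimal sketch of cylindrical warping under a pinhole model, assuming a single known focal length f (in pixels) and the image center as principal point, a pixel (x, y) measured from the center maps to (f·atan(x/f), f·y/√(x² + f²)) on the cylinder:

```python
import numpy as np
import cv2

def warp_cylindrical(img, f):
    """Warp an image onto a cylinder of radius f (focal length in pixels).

    Forward model: x' = f*atan(x/f), y' = f*y/sqrt(x^2 + f^2).
    Implemented as the inverse map required by cv2.remap.
    """
    h, w = img.shape[:2]
    cx, cy = w / 2.0, h / 2.0
    ys, xs = np.indices((h, w), dtype=np.float32)
    theta = (xs - cx) / f           # angle around the cylinder axis
    height = (ys - cy) / f          # normalized height on the cylinder
    # Invert the forward model: cylinder pixel -> source image pixel.
    x_src = (f * np.tan(theta) + cx).astype(np.float32)
    y_src = (height * f / np.cos(theta) + cy).astype(np.float32)
    # Use INTER_NEAREST instead when warping depth maps.
    return cv2.remap(img, x_src, y_src, interpolation=cv2.INTER_LINEAR)
```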
This estimation problem is well addressed by Brown and Lowe (2007), where the camera matrix is estimated and refined from matched SIFT features between the input color images. However, since depth maps lack SIFT features to extract, this estimation method cannot be applied directly to depth map registration.
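This transfer can be sketched as follows (an illustrative sketch, not the paper's exact implementation): the homography is estimated from SIFT matches between the two color images, and the same matrix then registers the depth maps, provided each depth map is already pixel-aligned with its color image. Note that cv2.SIFT_create requires OpenCV 4.4+ (earlier versions ship SIFT in opencv-contrib).

```python
import numpy as np
import cv2

def homography_from_color(color1, color2):
    """Estimate the homography mapping color2 into color1 via SIFT + RANSAC."""
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(color1, None)
    kp2, des2 = sift.detectAndCompute(color2, None)
    matches = cv2.BFMatcher().knnMatch(des2, des1, k=2)
    # Lowe's ratio test keeps only distinctive matches.
    good = [m for m, n in matches if m.distance < 0.75 * n.distance]
    src = np.float32([kp2[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp1[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    return H

# Because each depth map is pixel-aligned with its color image, the same H
# registers the depth maps (nearest-neighbor to preserve raw depth values):
# H = homography_from_color(color1, color2)
# depth2_reg = cv2.warpPerspective(depth2, H, out_size, flags=cv2.INTER_NEAREST)
```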
In this paper, we found that by aligning each depth map with its color image, the registration matrix estimated from the color images can also be used to register the depth maps; the problem of registering depth maps is thus transformed into the problem of registering color images. We also found that if the scene around the cameras does not change much, the registration matrices do not need to be updated
¹School of Mechatronic Engineering and Automation, Shanghai University, 200072, China
²Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, ON, M5S 3G8, Canada
Corresponding author:
Hang Liu
Email: liuhang shu@126.com