Real-time Augmented Reality with Occlusion Handling Based on RGBD Images
Xiaozhi Guo, Chen Wang, Yue Qi
State Key Laboratory of Virtual Reality Technology and Systems, Beihang University
Beijing 100191, China
Beihang University Qingdao Research Institute
Qingdao 266100, China
xiaozhi@buaa.edu.cn, vr_wangchen@buaa.edu.cn, qy@buaa.edu.cn
Abstract—Augmented Reality (AR) is one of the latest developments in human-computer interaction technology. It aims to create the illusion that virtual objects are seamlessly fused with the real world. A typical AR system requires two basic components: three-dimensional registration and real-virtual fusion. Occlusion handling is crucial for visual realism. To improve visual realism, we present a real-time system architecture for occlusion handling. The architecture is based on RGBD images and consists of three parts: a real-time camera tracking system, a 3D reconstruction system, and an AR fusion system. Specifically, we use a two-pass scheme to execute the AR system. The first pass tracks camera poses at video rate, which allows the reconstruction results to be updated and visualized during scanning. The second pass runs simultaneously and handles occlusion between the virtual objects and the real scene according to the camera pose. Finally, the rendered virtual objects and the color images are fused to generate the AR content. Our results indicate that this method is stable and precise for occlusion handling and can effectively improve realism in AR systems.
Keywords-augmented reality; occlusion handling; scene reconstruction; RGBD
I. INTRODUCTION
Augmented Reality (AR) has been an active research topic for a long time [1]. It combines computer-generated virtual objects with the real scene. By orchestrating the virtual and real worlds into a whole, an AR system can convey more semantic meaning than either world alone. The primary challenge in generating convincing augmented reality is to project 3D models onto a user's view of the real world and create a sustained spatial illusion that the virtual and real scenes coexist.
For a system to meet Azuma's definition of an augmented reality system [2], it must fulfill two fundamental requirements: occlusion handling and 3D registration. Both play a very important role in a convincing augmented reality system. Current AR systems can generally be divided into two groups according to their tracking method [3]. The first group is based on sensor tracking technology, in which the rotation and position of the camera are acquired from accelerometer, GPS, and compass data; the cost of such systems is generally high. The second group is based on computer vision technology, in which markers or scene features are tracked by means of computer vision, for example approaches based on QR codes [4], edge snapping [5], 3D line segments [6], and convex polygon markers [7].
Since Davison [8] and other researchers introduced real-time visual simultaneous localization and mapping (SLAM), it has received wide attention in the field of augmented reality [9][10][11].
One of the main problems of current augmented reality is the lack of reliable depth information: virtual 3D objects are simply overlaid on real-world imagery [12][13]. Such an overlay is unconvincing when displaying data in three dimensions because the occlusion between real and computer-generated objects is not addressed. Hauck et al. utilized a single depth image to handle occlusion [14], but its performance suffers in fine detail due to unstable depth values (Figure 1). As we can see, occlusion handling is a key issue for AR realism.
To handle the occlusion between virtual objects and the real scene, and to improve the accuracy of camera tracking, we adopt a computer vision method for camera tracking and a model-based approach for occlusion handling. In this paper, we propose a robust marker-less AR architecture based on RGBD images.
II. SYSTEM OVERVIEW
The goal of our system is to build a convincing model-based augmented reality system that handles occlusion correctly and tracks the camera precisely. To achieve this goal, we reconstruct the real scene while simultaneously tracking the camera. We take advantage of the Kinect sensor, a conventional, low-cost RGBD camera.
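As a hedged capture sketch (assuming an OpenCV build with the OpenNI2 backend enabled; not necessarily the capture path of our implementation), registered depth and color frames can be grabbed from a Kinect as follows:

```python
import cv2

# Open the Kinect through OpenCV's OpenNI2 backend (requires an
# OpenCV build compiled with OpenNI2 support).
cap = cv2.VideoCapture(cv2.CAP_OPENNI2)
if not cap.isOpened():
    raise RuntimeError("Kinect sensor not found")

while cap.grab():
    # Depth map: uint16 image in millimeters; color: 8-bit BGR.
    ok_d, depth = cap.retrieve(flag=cv2.CAP_OPENNI_DEPTH_MAP)
    ok_c, color = cap.retrieve(flag=cv2.CAP_OPENNI_BGR_IMAGE)
    if not (ok_d and ok_c):
        continue
    # ... feed (color, depth) into tracking and reconstruction ...
```

The depth map arrives as a 16-bit image in millimeters, which is the input to the pre-processing described below.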
In our system, we adopt a two-pass scheme. The first pass performs camera tracking in real time and allows the reconstruction results to be updated and visualized during the scanning process. The second pass handles the occlusion between the virtual objects and the real scene model according to the camera pose, and fuses the rendering result with the color image.
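As an illustration of the depth-test compositing in this second pass, the following minimal sketch (a simplification under stated assumptions, not our full renderer) keeps a virtual pixel only where the virtual surface lies in front of the reconstructed scene surface. Here virtual_rgb, virtual_depth, and scene_depth are assumed to be off-screen renderings produced at the tracked camera pose:

```python
import numpy as np

def composite_with_occlusion(color_image, virtual_rgb, virtual_depth,
                             scene_depth, no_hit=np.inf):
    """Fuse a rendered virtual object into the real color frame.

    color_image   : HxWx3 real RGB frame from the sensor
    virtual_rgb   : HxWx3 rendering of the virtual objects
    virtual_depth : HxW depth of that rendering (no_hit = empty pixel)
    scene_depth   : HxW depth of the reconstructed scene model,
                    rendered from the tracked camera pose
    """
    # Pixels where a virtual object was actually rendered.
    hit = virtual_depth < no_hit
    # Per-pixel depth test: the virtual surface is visible only
    # where it lies in front of the reconstructed real surface.
    visible = hit & (virtual_depth < scene_depth)
    out = color_image.copy()
    out[visible] = virtual_rgb[visible]
    return out
```

Because scene_depth comes from the reconstructed model rather than the raw sensor frame, the per-pixel comparison stays stable over time, which is the motivation for the model-based approach.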
The data processing in our system consists of three major components. The flowchart of our system is shown in Fig. 2.
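As a concrete illustration of the pre-processing step in the camera tracking component described next, the sketch below denoises a raw depth frame and back-projects it into a point cloud. The pinhole intrinsics fx, fy, cx, cy and the filter parameters are assumed, illustrative values:

```python
import cv2
import numpy as np

def depth_to_point_cloud(raw_depth_mm, fx, fy, cx, cy):
    """Filter a raw depth frame and back-project it to 3D.

    raw_depth_mm : HxW uint16 depth image in millimeters (0 = invalid;
                   such pixels map to the origin and can be masked later)
    fx, fy, cx, cy : pinhole intrinsics of the depth camera
    Returns an HxWx3 array of points in the camera frame (meters).
    """
    depth = raw_depth_mm.astype(np.float32) / 1000.0  # mm -> m
    # Edge-preserving bilateral filter: smooths sensor noise while
    # keeping depth discontinuities at object boundaries sharp.
    depth = cv2.bilateralFilter(depth, d=5, sigmaColor=0.03, sigmaSpace=4.5)
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    # Inverse pinhole projection: p = z * K^-1 * (u, v, 1)^T
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.dstack((x, y, depth))
```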
• Camera tracking. Camera tracking runs from the start of the system. We utilize a fast bilateral filter to pre-process the raw depth image, as sketched above. Then, we exploit the camera parameters to transform the depth image into a point cloud in 3D space, and