Digital Humans in the Integrated Physical-Digital World (IPhD).pdf


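### Digital Humans in the Integrated Physical-Digital World (IPhD): A Quality Comparison of Highly Realistic Digital Humans in 3DoF and 6DoF

#### Abstract Overview

As virtual reality (VR) and augmented reality (AR) become widespread, this paper examines how 3D reconstruction is used in such applications. It compares the visual quality of highly realistic digital humans, represented as point clouds, under two VR viewing conditions: 3 degrees of freedom (3DoF) and 6 degrees of freedom (6DoF).

#### Background and Motivation

Advances in capture, media processing, and 3D rendering have made VR/AR applications popular with the mass market. In this new media landscape, point clouds are increasingly common because they are simple, flexible, and suitable for real-time applications, in particular for reconstructing humans in social virtual reality.

#### Point Clouds in Brief

A point cloud is a set of discrete 3D points, each carrying a position and, optionally, color or intensity. Point clouds can be used to build 3D models of complex objects and are widely used in autonomous driving, robot navigation, 3D scanning, and film visual effects. Their simplicity and flexibility make them well suited to fast scene reconstruction and real-time interaction.

#### Objectives and Method

The study evaluates how compression distortions affect the visual quality of digital humans represented as point clouds. Specifically, it compares the upcoming point cloud compression standard against an octree-based anchor codec. It also tests two VR viewing conditions (3DoF and 6DoF) to understand how interacting in the virtual space affects the perception of quality.

#### Viewing Conditions and Experimental Design

- **3DoF (Three Degrees of Freedom)**: the user can rotate their view of the model in the virtual environment but cannot change position.
- **6DoF (Six Degrees of Freedom)**: the user can both rotate and move forward, backward, left, right, up, and down, which allows freer exploration of the space.

Participants rated the quality of the point cloud digital humans under both conditions. The study reports quantitative subjective ratings along with empirical findings.

#### Main Findings and Contributions

Perceived visual quality depends strongly on the tested content, and current data sets may not be sufficient to comprehensively evaluate compression solutions. The paper also discusses shortcomings in how point cloud encoding solutions handle visually lossless compression.

#### Conclusion and Future Directions

To the best of the authors' knowledge, this is the first user quality evaluation of dynamic point clouds in VR, and it offers insight into the visual quality of digital humans under different degrees of freedom. Future work could explore how to optimize point cloud compression to improve user experience, especially in interactive VR applications.

#### Key Terms and Fields

- **Human-centered computing**
- **Human computer interaction (HCI)**
- **HCI design and evaluation methods**
- **User studies**
- **Interaction paradigms**
- **Virtual reality**

In summary, the paper contributes to VR and AR technology and applications, and gives researchers in related fields useful research directions and technical reference points.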


Comparing the Quality of Highly Realistic Digital Humans in 3DoF and 6DoF: A Volumetric Video Case Study

Shishir Subramanyam, Jie Li, Irene Viola, Pablo Cesar

CWI, Amsterdam, The Netherlands

Figure 1: Users Evaluating Realistic Digital Humans in 6DoF (left) and 3DoF (right)

ABSTRACT

Virtual Reality (VR) and Augmented Reality (AR) applications have seen a drastic increase in commercial popularity. Different representations have been used to create 3D reconstructions for AR and VR. Point clouds are one such representation characterized by their simplicity and versatility, making them suitable for real-time applications, such as reconstructing humans for social virtual reality. In this study, we evaluate how the visual quality of digital humans, represented using point clouds, is affected by compression distortions. We compare the performance of the upcoming point cloud compression standard against an octree-based anchor codec. Two different VR viewing conditions enabling 3 and 6 degrees of freedom are tested, to understand how interacting in the virtual space affects the perception of quality. To the best of our knowledge, this is the first work performing user quality evaluation of dynamic point clouds in VR; in addition, contributions of the paper include quantitative data and empirical findings. Results highlight how perceived visual quality is affected by the tested content, and how current data sets might not be sufficient to comprehensively evaluate compression solutions. Moreover, shortcomings in how point cloud encoding solutions handle visually-lossless compression are discussed.
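The abstract refers to an octree-based anchor codec without describing it on this page. As a generic illustration of the idea behind octree geometry coding (not the codec evaluated in the paper), the sketch below quantizes points to a voxel grid and writes one occupancy byte per octree node, breadth first. The function names `voxelize`, `encode_occupancy`, and `decode_occupancy` are hypothetical, and color coding and entropy coding are left out.

```python
def voxelize(points, depth):
    """Quantize (x, y, z) floats to integer voxels on a 2**depth grid."""
    n = 1 << depth
    lo = [min(p[i] for p in points) for i in range(3)]
    # Largest side of the bounding box; fall back to 1.0 for a single point.
    extent = max(max(p[i] for p in points) - lo[i] for i in range(3)) or 1.0
    return {
        tuple(min(int((p[i] - lo[i]) / extent * n), n - 1) for i in range(3))
        for p in points
    }


def encode_occupancy(voxels, depth):
    """Emit one byte per occupied node; bit c is set if child c is non-empty."""
    stream = bytearray()
    nodes = [sorted(voxels)]  # the root node holds every voxel
    for level in range(depth):
        shift = depth - 1 - level  # most significant bit first
        next_nodes = []
        for node in nodes:
            children = [[] for _ in range(8)]
            for (x, y, z) in node:
                c = (((x >> shift) & 1) << 2) | (((y >> shift) & 1) << 1) | ((z >> shift) & 1)
                children[c].append((x, y, z))
            byte = 0
            for c, child in enumerate(children):
                if child:
                    byte |= 1 << c
                    next_nodes.append(child)
            stream.append(byte)
        nodes = next_nodes
    return bytes(stream)


def decode_occupancy(stream, depth):
    """Rebuild the voxel set by replaying the occupancy bytes."""
    nodes = [(0, 0, 0)]
    it = iter(stream)
    for _ in range(depth):
        next_nodes = []
        for (x, y, z) in nodes:
            byte = next(it)
            for c in range(8):
                if byte & (1 << c):
                    next_nodes.append(
                        (2 * x + ((c >> 2) & 1), 2 * y + ((c >> 1) & 1), 2 * z + (c & 1))
                    )
        nodes = next_nodes
    return set(nodes)


points = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (0.9, 0.1, 0.5)]
voxels = voxelize(points, depth=3)
stream = encode_occupancy(voxels, depth=3)
assert decode_occupancy(stream, depth=3) == voxels
```

The octree depth sets the quantization step, so a shallower tree gives a smaller stream and coarser geometry. That is one way such a codec can trade bit-rate against distortion.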

Index Terms: Human-centered computing—Human computer interaction (HCI)—HCI design and evaluation methods—User studies; Interaction paradigms—Virtual reality

INTRODUCTION

Recent advances in capturing, media processing, and 3D rendering technologies make VR/AR applications popular for mass consumption [34]. In this new media landscape, point clouds are becoming commonplace due to their simplicity and versatility. Still, the size of dense point clouds is significant (a frame of roughly 1M points takes around 19-20 MBytes), so they need to be compressed before transmission. This paper provides an exhaustive quality comparison between different encoding configurations of digital humans, represented as point clouds. By investigating the differences in quality, we provide insights about how to optimise the delivery for both downloading and real-time communication. One key novelty of this paper is to study the quality based on realistic consumption conditions, in 3 and 6 Degrees of Freedom (DoF) scenarios.
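To put the quoted frame size in perspective, the following back-of-the-envelope sketch turns it into a per-point cost and an uncompressed stream bit-rate. The 19-20 MByte and 1M-point figures come from the text. The 30 frames per second rate and the convention 1 MByte = 10^6 bytes are assumptions made only for this illustration.

```python
def implied_bytes_per_point(frame_megabytes, num_points):
    """Average storage per point, taking 1 MByte = 10**6 bytes (assumption)."""
    return frame_megabytes * 1e6 / num_points


def raw_bitrate_mbps(frame_megabytes, fps):
    """Uncompressed stream bit-rate in Mbit/s for a given frame rate."""
    return frame_megabytes * 8 * fps


NUM_POINTS = 1_000_000  # "roughly 1M points" per frame (from the text)
ASSUMED_FPS = 30        # assumption: the frame rate is not stated in this excerpt

for size_mb in (19, 20):
    per_point = implied_bytes_per_point(size_mb, NUM_POINTS)
    bitrate = raw_bitrate_mbps(size_mb, ASSUMED_FPS)
    print(f"{size_mb} MB/frame: {per_point:.0f} bytes/point, {bitrate:.0f} Mbit/s raw")
```

Under these assumptions the raw stream would need several Gbit/s, which is why compression is needed before transmission.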

e-mail: {S.Subramanyam, Jie.Li, Irene.Viola, P.S.Cesar}@cwi.nl

Avatars are a core part of VR applications like social communication [28], sports training [21], or healthcare [20]. A major line of scientific work has focused on how to make such avatars more realistic, interactive, and autonomous [10, 24, 33]. In this paper, we focus instead on point clouds as a suitable representation for digital humans based on tele-portation principles [25]. In this case, the research problem is not so much how to render and animate them to make them look more realistic, but how to transport them optimally. Given current advances in technology, real-time delivery of point clouds is becoming a realistic alternative, drawing the attention of the research community [23] and industry [32] to encoding and transmission. Still, given the massive number of points per representation, decisions need to be taken regarding the delivery (type of encoder, bit-rate) to ensure an acceptable quality of experience depending on the viewing conditions (3DoF, 6DoF). This is the core research question this paper answers.
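The difference between the two viewing conditions can be sketched in a few lines. In 6DoF the tracked head position and rotation both drive the virtual camera, while in 3DoF only the rotation does and the position stays fixed. This is a minimal illustration of the concept, not the rendering setup used in the study. `HeadPose`, `viewer_pose`, and the fixed `anchor` position are hypothetical names.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class HeadPose:
    position: tuple  # (x, y, z) in metres
    rotation: tuple  # (yaw, pitch, roll) in degrees


def viewer_pose(tracked, mode, anchor=(0.0, 0.0, 0.0)):
    """Return the camera pose for a viewing condition.

    6DoF: rotation and translation both follow the headset.
    3DoF: rotation follows the headset, position is pinned to `anchor`.
    """
    if mode == "6DoF":
        return tracked
    if mode == "3DoF":
        return replace(tracked, position=anchor)
    raise ValueError(f"unknown viewing condition: {mode!r}")


tracked = HeadPose(position=(0.4, 1.6, -0.2), rotation=(30.0, -5.0, 0.0))
print(viewer_pose(tracked, "6DoF"))
print(viewer_pose(tracked, "3DoF"))
```

In the 3DoF case a user cannot step closer to the digital human to inspect it, which is one reason the two conditions might lead to different quality judgements.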

Contributions of the paper are two-fold: 1) It provides a first evaluation of the quality of highly realistic digital humans represented as dynamic point clouds in immersive viewing conditions. Existing protocols [5, 7, 8, 40, 42] did not consider the dynamic nature of the point clouds, focused on one type of data set, and did not take into account VR viewing conditions; 2) It provides quantitative subjective results about the perceived quality of the contents, along with qualitative insights on what is important for users in interacting with digital humans in VR. Such results will help in better configuring the network conditions for the delivery of point clouds for real-time transmission, and have implications for ongoing research and standardisation work regarding the underlying compression technology.

In particular, this paper studies this current and relevant area of research by proposing 1) a new evaluation protocol, including the work to create dynamic point clouds for evaluation, and 2) quality of experience results. These results are based on an experiment with 52 participants, evaluating 72 stimuli based on eight dynamic point cloud sequences. Each point cloud sequence was compressed at four bit-rates, using two types of compression techniques. These 72 stimuli were evaluated in two viewing conditions (3DoF and 6DoF). The data gathered include rating scores, presence questionnaires, simulator sickness reports, and time spent watching the content. The results indicate that, while bit-rate savings can be obtained by choosing one compression solution over another, visually lossless compression has not been fully achieved by the algorithms under evaluation, even at rather large bit-rates. Moreover, the choice of content can have an impact on how users rate its quality, influencing the discriminating power of the selected protocol.
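The stimulus count can be checked by enumerating the design. Eight sequences, two codecs, and four bit-rates give 64 compressed stimuli, while the text reports 72. The sketch below assumes the remaining eight are the uncompressed originals shown as references. That is an assumption for illustration only, since this excerpt does not state it. The sequence, codec, and bit-rate labels are placeholders.

```python
from itertools import product

sequences = [f"seq{i}" for i in range(1, 9)]  # eight dynamic point cloud sequences
codecs = ["codec_A", "codec_B"]               # placeholder labels for the two codecs
bitrates = ["R1", "R2", "R3", "R4"]           # placeholder labels for the four bit-rates

# Every sequence encoded with every codec at every bit-rate.
compressed = list(product(sequences, codecs, bitrates))

# ASSUMPTION: one uncompressed reference per sequence accounts for the
# difference between 64 and the 72 stimuli reported in the text.
references = [(seq, "reference", None) for seq in sequences]

stimuli = compressed + references
print(len(compressed), len(references), len(stimuli))  # 64 8 72
```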


2020 IEEE Conference on Virtual Reality and 3D User Interfaces (VR)

DOI 10.1109/VR46266.2020.00-73
