人工智能-深度学习-基于深度学习的着装表演机器人研究及设计.pdf资源-CSDN文库

版权申诉

109 浏览量 2022-06-27 06:00:16 上传评论收藏 3.26MB PDF 举报

资源推荐

资源详情

资源评论

基于深度学习的着装表演机器人研究及设计

估计出人体的 2D 姿态，然后通过推理，在 2D 姿态的基础上获得 3D

人体骨骼序列。在第一阶段，本文采用自上而下的思路，将服装检测

部分的服装定位信息扩展为人体检测的定位信息。使用空间变换网络

对检测部分进行仿射，使目标人体姿态中心与图像中心重合。然后采

用当前效果最好的单人姿态估计网络 HRnet 识别 2D 人体姿态。在第

二阶段，采用时域空洞卷积网络从 2D 人体姿态推理出 3D 骨架序列。

为了解决现有 3D 人体姿态数据集不足的问题，本文引入了半监督的

训练方法，实现了高精度、长时间、稳定的 3D 人体骨骼序列提取。

（3）动作模仿和服装模仿在动画表演机器人系统中的验证：该

部分将前两个部分的研究成果相结合应用到表演机器人当中。首先采

用 Marvelous Designer 软件为表演机器人制作服装并将其放入服装库，

然后根据服装检测部分获取的演示者服装类别为表演机器人从服装

库中挑选服装，最后将人体姿态估计部分提取到的 3D 人体骨骼序列

转为表演机器人可读取的欧拉角序列，以一定数据格式传递给机器人，

最终实现了表演机器人的同类别服装更换以及动作模仿。本文通过大

量实验证明了本文方法的有效性，并通过和其他方法对比展示了本文

方法的优越性。

关键词：深度学习，服装检测，3D 姿态估计，表演机器人

万方数据

基于深度学习的着装表演机器人研究及设计

III

RESEARCH AND DESIGN OF PERFORMANCE ROBOT WITH

DRESS BASED ON DEEP LEARNING

ABSTRACT

In recent years, with the huge success of deep learning in many fields, robot researchers tend

to combine deep learning with robot research in order to give robots more capabilities of

perceptions and analysis. More and more robots leave the laboratories and factories to go into

people's daily lives.

Among these robots, the performance robot can "autonomously" perform and interact with

people for serving human. And the quality of performance service depends largely on the quality

of the 3D bone driving sequence. However, the existing acquisition of 3D bone sequences mainly

depends on the 3D human motion capture equipments or motion designers. These methods are

costly, making the research of intelligent performance robots relatively difficult. What’s more,

most of the existing work about performance robots ignore the effects of clothing on robots. In

order to solve the problems above, this thesis researches and designs the dressing categories and

acquisition methods of 3D bone sequence of performing robots based on deep learning. The

research work in this thesis is divided into the following three parts:

(1) Clothing detection and classification for performance robots: This section mainly detects

and classifies the clothing of the presenters in the scene, so that the robot can choose the

corresponding clothing type according to the detection results. And then this thesis’ target tracking

the clothing on presenters in videos, which is helpful for human pose estimation later. This thesis

made a dataset for the clothing detection task, and then improved the target detection network

YOLO v3 according to the application scenario and the characteristics of the dataset. In the later

work, this thesis changed the multi-scale process, introduced down sampling and pruned the net so

that the improved network will be used in this thesis’ task with a better performance.

(2) 3D bone sequence acquisition: This part consists of two stages. The 2D pose of the human

body is estimated from the scene videos, and then the 3D human bone sequence is obtained based

on the 2D pose through inference. In the first stage, this thesis used a top-down approach. First,

expand the clothing positioning information of the clothing detection part into the positioning

information of human body detection. Then, the spatial transformation network is used to affine

万方数据

                                                        基于深度学习的着装表演机器人研究及设计 
V 
 
目录 
 
 
摘  要.............................................................................................................................. I 
ABSTRACT ................................................................................................................. III 
第 1 章  绪论 .............................................................................................................. 1 
1 表演机器人的研究背景和意义....................................................................... 1 
2 表演机器人的国内外研究现状....................................................................... 2 
2.1 国外研究现状......................................................................................... 2 
2.2 国内研究现状......................................................................................... 3 
3 论文主要内容................................................................................................... 4 
4 论文结构安排................................................................................................... 5 
第 2 章  表演机器人的功能要求分析 ...................................................................... 7 
1 系统功能要求分析........................................................................................... 7 
2 表演机器人系统设计中的关键技术............................................................... 8 
3 本章小结......................................................................................................... 10 
第 3 章  基于 YOLO V3 的服装检测 ...................................................................... 11 
1  引言................................................................................................................ 11 
2 YOLO v3 实现原理 ....................................................................................... 12 
3 YOLO v3 网络改进 ....................................................................................... 17 
3.1  基于 K-means 的先验框优化 ............................................................. 18 
3.2  多尺度优化.......................................................................................... 19 
3.3  基于下采样的并行网络优化.............................................................. 20 
3.4  模型剪枝.............................................................................................. 21 
4  实验结果及分析............................................................................................ 23 
4.1  实验平台.............................................................................................. 23 
4.2  实验数据集.......................................................................................... 23 
4.3  数据预处理.......................................................................................... 23 
4.4  评价指标.............................................................................................. 24 
4.5  结果分析.............................................................................................. 25 
5  本章小结........................................................................................................ 27 
第 4 章  人体姿态估计 ............................................................................................ 28 
1  引言................................................................................................................ 28 
2 2D 人体姿态估计 ........................................................................................... 30 
2.1 AlphaPose 实现原理 ............................................................................ 32 
2.2  基于 AlphaPose 框架的 2D 人体姿态估计 ....................................... 32 
2.3  实验及结果分析.................................................................................. 34 
3 3D 骨骼序列获取 ........................................................................................... 41 
3.1  时域空洞卷积实现原理...................................................................... 42 
3.2  基于时域空洞卷积的 3D 骨骼序列获取 ........................................... 44 
4  本章小结........................................................................................................ 47 
第 5 章  着装表演机器人关键技术的验证 ............................................................ 48 
万方数据