人体姿态估计论文（openpose)_YOLOv7-Pose姿态估计代码+权重资源-CSDN文库

人体姿态估计

需积分: 50 130 浏览量 2017-12-18 22:15:58 上传评论 6 收藏 8.08MB PDF 举报

资源详情

资源评论

Realtime Multi-Person 2D Pose Estimation using Part Afﬁnity Fields

∗

Zhe Cao Tomas Simon Shih-En Wei Yaser Sheikh

The Robotics Institute, Carnegie Mellon University

{zhecao,shihenw}@cmu.edu {tsimon,yaser}@cs.cmu.edu

Abstract

We present an approach to efﬁciently detect the 2D pose

of multiple people in an image. The approach uses a non-

parametric representation, which we refer to as Part Afﬁnity

Fields (PAFs), to learn to associate body parts with individ-

uals in the image. The architecture encodes global con-

text, allowing a greedy bottom-up parsing step that main-

tains high accuracy while achieving realtime performance,

irrespective of the number of people in the image. The ar-

chitecture is designed to jointly learn part locations and

their association via two branches of the same sequential

prediction process. Our method placed ﬁrst in the inaugu-

ral COCO 2016 keypoints challenge, and signiﬁcantly ex-

ceeds the previous state-of-the-art result on the MPII Multi-

Person benchmark, both in performance and efﬁciency.

1. Introduction

Human 2D pose estimation—the problem of localizing

anatomical keypoints or “parts”—has largely focused on

ﬁnding body parts of individuals [8, 4, 3, 21, 33, 13, 25, 31,

6, 24]. Inferring the pose of multiple people in images, es-

pecially socially engaged individuals, presents a unique set

of challenges. First, each image may contain an unknown

number of people that can occur at any position or scale.

Second, interactions between people induce complex spa-

tial interference, due to contact, occlusion, and limb articu-

lations, making association of parts difﬁcult. Third, runtime

complexity tends to grow with the number of people in the

image, making realtime performance a challenge.

A common approach [23, 9, 27, 12, 19] is to employ

a person detector and perform single-person pose estima-

tion for each detection. These top-down approaches di-

rectly leverage existing techniques for single-person pose

estimation [17, 31, 18, 28, 29, 7, 30, 5, 6, 20], but suffer

from early commitment: if the person detector fails–as it

is prone to do when people are in close proximity–there is

no recourse to recovery. Furthermore, the runtime of these

∗

Video result: https://youtu.be/pW6nZXeWlGM

Figure 1. Top: Multi-person pose estimation. Body parts belong-

ing to the same person are linked. Bottom left: Part Afﬁnity Fields

(PAFs) corresponding to the limb connecting right elbow and right

wrist. The color encodes orientation. Bottom right: A zoomed in

view of the predicted PAFs. At each pixel in the ﬁeld, a 2D vector

encodes the position and orientation of the limbs.

top-down approaches is proportional to the number of peo-

ple: for each detection, a single-person pose estimator is

run, and the more people there are, the greater the computa-

tional cost. In contrast, bottom-up approaches are attractive

as they offer robustness to early commitment and have the

potential to decouple runtime complexity from the number

of people in the image. Yet, bottom-up approaches do not

directly use global contextual cues from other body parts

and other people. In practice, previous bottom-up meth-

ods [22, 11] do not retain the gains in efﬁciency as the ﬁ-

nal parse requires costly global inference. For example, the

seminal work of Pishchulin et al. [22] proposed a bottom-up

approach that jointly labeled part detection candidates and

associated them to individual people. However, solving the

integer linear programming problem over a fully connected

graph is an NP-hard problem and the average processing

time is on the order of hours. Insafutdinov et al. [11] built

on [22] with stronger part detectors based on ResNet [10]

and image-dependent pairwise scores, and vastly improved

the runtime, but the method still takes several minutes per

image, with a limit on the number of part proposals. The

pairwise representations used in [11], are difﬁcult to regress

precisely and thus a separate logistic regression is required.

arXiv:1611.08050v2 [cs.CV] 14 Apr 2017

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余8页未读，立即下载

评论收藏

内容反馈

人体姿态估计论文（open pose)

评论0

最新资源

人体姿态估计论文（open pose)

评论0

最新资源

相关推荐

轻量级人体姿势估计.pytorch：在PyTorch中快速准确的人体姿势估计。 包含“ CPU上的实时2D多人姿势估计：轻量级OpenPose”的实现

使用python语言的超轻量openpose源码，能实时对人体进行姿态检测，下载可直接运行

Human-Pose-Estimation-Papers:2D＆3D人体姿势估计

人体姿态估计的环境配置所需文件openpose

openpose源文章

OpenPose笔记PPT

openpose最新下载

Python-图像分类目标检测姿态估计分割的Pytorch实现

人体姿态估计的强大算法

OpenPose v1.3的release文件

基于OpenPose的人体睡姿识别实现与研究.pdf

人体姿态检测

Python-包含手和身体姿势估计openpose的pytorch实现

Openpose-pytorch开源项目用于姿态检测、人体关键点识别

openpos 人体姿势识别模型pose_iter_440000 pose_iter_116000 102000集合

Python-MobilePose是一个轻量级的基于PyTorch实现的单人姿态估计框架

姿态检测算法参考文献

图像和视频中基于部件检测器的人体姿态估计

pose estimate 姿态估计

opencv做的姿态检测项目

opepose opencv_DNN人体姿态检测模型

开源姿态识别openpose-1.5.1源码

openpose 1.4.0

Openpose15个点的一个json数据文件（其中包含三人的数据仅含pose）

openpose1.7.0

darknet+openpose

Openpose的简单使用

最新《深度学习人体姿态估计》综述论文

OpenPose_1.1.0_201707

轻量级OpenSSL安装文件

轻量级人体姿势估计.pytorch：在PyTorch中快速准确的人体姿势估计。包含“ CPU上的实时2D多人姿势估计：轻量级OpenPose”的实现