# Real-time 3D Multi-person Pose Estimation Demo
This repository contains a 3D multi-person pose estimation demo in PyTorch. The Intel OpenVINO™ backend can be used for fast inference on CPU. The demo is based on the [Lightweight OpenPose](https://arxiv.org/pdf/1811.12004.pdf) and [Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB](https://arxiv.org/pdf/1712.03453.pdf) papers. It detects the 2D coordinates of up to 18 keypoint types (ears, eyes, nose, neck, shoulders, elbows, wrists, hips, knees, and ankles), as well as their 3D coordinates. The network was trained on the [MS COCO](http://cocodataset.org/#home) and [CMU Panoptic](http://domedb.perception.cs.cmu.edu/) datasets and achieves 100 mm MPJPE (mean per joint position error) on the CMU Panoptic subset.
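For reference, MPJPE is the average Euclidean distance between predicted and ground-truth joint positions. A minimal sketch of the metric (the function name and array layout are illustrative, not part of this repository):
```python
import numpy as np

def mpjpe(predicted, ground_truth):
    """Mean per joint position error.

    Both arrays are assumed to have shape (num_joints, 3) with 3D joint
    coordinates in millimeters.
    """
    return np.linalg.norm(predicted - ground_truth, axis=1).mean()
```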
<p align="center">
<img src="data/human_pose_estimation_3d_demo.jpg" />
</p>
> The major part of this work was done by [Mariia Ageeva](https://github.com/marrmar), when she was the :top::rocket::fire: intern at Intel.
## Table of Contents
* [Requirements](#requirements)
* [Prerequisites](#prerequisites)
* [Pre-trained model](#pre-trained-model)
* [Running](#running)
* [Inference with OpenVINO](#inference-openvino)
* [Inference with TensorRT](#inference-tensorrt)
## Requirements
* Python 3.5 (or above)
* CMake 3.10 (or above)
* C++ Compiler (g++ or MSVC)
* OpenCV 4.0 (or above)
> [Optional] [Intel OpenVINO](https://software.intel.com/en-us/openvino-toolkit) for fast inference on CPU.
> [Optional] [NVIDIA TensorRT](https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html) for fast inference on Jetson.
## Prerequisites
1. Install requirements:
```
pip install -r requirements.txt
```
2. Build `pose_extractor` module:
```
python setup.py build_ext
```
3. Add build folder to `PYTHONPATH`:
```
export PYTHONPATH=pose_extractor/build/:$PYTHONPATH
```
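To quickly check that the build and `PYTHONPATH` steps worked, try importing the module from Python (a minimal sanity check, assuming the extension was built under its default `pose_extractor` name):
```python
# Run from the repository root after the steps above; a successful import
# prints the path of the compiled extension.
import pose_extractor

print(pose_extractor.__file__)
```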
## Pre-trained model <a name="pre-trained-model"/>
The pre-trained model checkpoint can be downloaded from BaiduDisk:
* Link: https://pan.baidu.com/s/1JGXieFyHwmzcXRX-6h4qjA
* Extraction code: 6666
## Running
To run the demo, pass the path to the pre-trained checkpoint and a camera ID (or a path to a video file):
```
python demo.py --model human-pose-estimation-3d.pth --video 0
```
> The camera can capture the scene from different viewpoints, so for correct scene visualization, please pass the camera extrinsics and focal length with the `--extrinsics` and `--fx` options respectively (a sample extrinsics file can be found in the `data` folder). If no camera parameters are provided, the demo uses default ones.
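For reference, a minimal sketch of reading such an extrinsics file is shown below. It assumes the JSON stores a 3x3 rotation matrix `R` and a translation vector `t`, as in the sample shipped in the `data` folder; treat the field names as assumptions and check the sample file for the authoritative format.
```python
import json

import numpy as np

# Assumed layout of the sample extrinsics file (data/extrinsics.json):
# a rotation matrix "R" and a translation vector "t" describing the camera pose.
with open('data/extrinsics.json') as f:
    extrinsics = json.load(f)

R = np.array(extrinsics['R'], dtype=np.float32)
t = np.array(extrinsics['t'], dtype=np.float32)
print(R.shape, t.shape)
```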
## Inference with OpenVINO <a name="inference-openvino"/>
To run with OpenVINO, the checkpoint first needs to be converted to OpenVINO format:
1. Set OpenVINO environment variables:
```
source <OpenVINO_INSTALL_DIR>/bin/setupvars.sh
```
2. Convert checkpoint to ONNX:
```
python scripts/convert_to_onnx.py --checkpoint-path human-pose-estimation-3d.pth
```
3. Convert to OpenVINO format:
```
python <OpenVINO_INSTALL_DIR>/deployment_tools/model_optimizer/mo.py --input_model human-pose-estimation-3d.onnx --input=data --mean_values=data[128.0,128.0,128.0] --scale_values=data[255.0,255.0,255.0] --output=features,heatmaps,pafs
```
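The `--mean_values` and `--scale_values` flags bake the demo's input preprocessing into the generated IR: each input value is transformed as `(value - mean) / scale` per channel. A quick numeric check of what that mapping does to 8-bit pixel values:
```python
import numpy as np

# Pixel values 0, 128 and 255 under the (x - 128) / 255 normalization above
# end up roughly in the [-0.5, 0.5] range, i.e. ≈ [-0.502, 0.0, 0.498].
pixels = np.array([0.0, 128.0, 255.0])
print((pixels - 128.0) / 255.0)
```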
To run the demo with OpenVINO inference, pass the `--use-openvino` option and specify the device to infer on:
```
python demo.py --model human-pose-estimation-3d.xml --device CPU --use-openvino --video 0
```
## Inference with TensorRT <a name="inference-tensorrt"/>
To run with TensorRT, it has to be installed first. Please follow the [official guide](https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html); the following steps worked for me:
1. [Install](https://developer.nvidia.com/cuda-downloads) CUDA 11.1.
2. [Install](https://developer.nvidia.com/cudnn) cuDNN 8 (runtime library, then developer).
3. Install `nvidia-tensorrt`:
```
python -m pip install nvidia-pyindex
pip install nvidia-tensorrt==7.2.1.6
```
4. [Install](https://github.com/NVIDIA-AI-IOT/torch2trt) `torch2trt`.
Then convert the checkpoint to TensorRT format:
```
python scripts/convert_to_trt.py --checkpoint-path human-pose-estimation-3d.pth
```
> TensorRT does not support dynamic reshaping of the network input size.
> Make sure to set a proper network input height and width with the `--height` and `--width` options during conversion (otherwise there will be no detections).
> The default values work for typical 16:9 video (1280x720, 1920x1080).
> You can check the network input size with `print(scaled_img.shape)` in `demo.py`.
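The conversion relies on the `torch2trt` package installed above. A minimal sketch of such a conversion is shown below; the model class, the plain `state_dict` checkpoint handling, and the 256x448 input size are assumptions for illustration only, and `scripts/convert_to_trt.py` with its `--height`/`--width` options remains the authoritative path:
```python
import torch
from torch2trt import torch2trt

# Assumption: the network class and a plain state_dict checkpoint; the real
# loading logic lives in scripts/convert_to_trt.py.
from models.with_mobilenet import PoseEstimationWithMobileNet

net = PoseEstimationWithMobileNet().cuda().eval()
net.load_state_dict(torch.load('human-pose-estimation-3d.pth', map_location='cuda'))

# TensorRT engines are built for a fixed input size, so pick the height/width
# that match your video (256x448 here is only an illustrative choice).
dummy_input = torch.randn(1, 3, 256, 448).cuda()
net_trt = torch2trt(net, [dummy_input])

torch.save(net_trt.state_dict(), 'human-pose-estimation-3d-trt.pth')
```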
To run the demo with TensorRT inference, pass the `--use-tensorrt` option:
```
python demo.py --model human-pose-estimation-3d-trt.pth --use-tensorrt --video 0
```
I have observed a ~10x network inference speedup on an RTX 2060 (compared with the default PyTorch 1.6.0+cu101 inference).