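AI project materials: a ball-game recognition and tracking tool that extends real-time pedestrian analysis based on the PaddlePaddle (飞桨) deep learning framework (.zip resource, CSDN Library)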

906 files in total

py: 429

pyc: 314

txt: 41

Graduation project

Course project

Project materials

Artificial intelligence

151 views · uploaded 2024-02-03 13:27:47 · 176.99MB ZIP

人工智能项目资料-基于飞桨深度学习框架的实时行人分析进行功能扩展的球赛识别追踪工具.zip (906 files)

._human_pp_humansegv1_server_512x512_inference_model_with_softmax 212B

mute.bmp 13KB

volumn.bmp 13KB

316.bmp 1KB

620.bmp 1KB

212.bmp 1KB

224.bmp 1KB

610.bmp 1KB

612.bmp 1KB

214.bmp 1KB

622.bmp 1KB

318.bmp 1KB

624.bmp 1KB

132.bmp 1KB

430.bmp 1KB

632.bmp 1KB

828.bmp 1KB

trajectory.cc 15KB

pipeline.cc 13KB

tracker.cc 10KB

jde_predictor.cc 8KB

postprocess.cc 7KB

preprocess_op.cc 6KB

main.cc 6KB

sde_predictor.cc 2KB

predictor.cc 1KB

yaml-cpp.cmake 962B

lapjv.cpp 9KB

MainWindow.cpp 1KB

QWVideoWidget.cpp 854B

main.cpp 172B

args.data 2KB

FLAGS 1018B

c1.gif 18.15MB

mot.gif 14.11MB

c2.gif 12.76MB

attribute.gif 8.35MB

fight_demo.gif 5.26MB

player_ocr.gif 4.59MB

calling.gif 4.39MB

action.gif 3.77MB

smoking.gif 3.33MB

highlight.gif 2.06MB

boat.gif 1.59MB

football.gif 1.55MB

ski.gif 1.3MB

team_clas.gif 1.04MB

ball.gif 912KB

team_clas_1.gif 901KB

football1.gif 832KB

golf.gif 487KB

237.GIF 1000B

001.GIF 336B

.gitignore 47B

ui_MainWindow.h 9KB

trajectory.h 8KB

pipeline.h 5KB

preprocess_op.h 5KB

config_parser.h 4KB

predictor.h 4KB

sde_predictor.h 3KB

jde_predictor.h 3KB

tracker.h 2KB

postprocess.h 2KB

lapjv.h 2KB

utils.h 1KB

MainWindow.h 911B

QWVideoWidget.h 476B

23.ico 25KB

Movie Clip.ico 25KB

Recycle Bin empty.ico 25KB

5.ico 25KB

Audio CD.ico 23KB

Wave Sound.ico 23KB

22.ico 22KB

3.jpg 24KB

2.jpg 2KB

1.jpg 2KB

110.JPG 726B

LICENSE 34KB

README_cn.md 23KB

README.md 22KB

action_en.md 20KB

action.md 19KB

README_en.md 17KB

README.md 14KB

README.md 12KB

QUICK_STARTED.md 11KB

README.md 10KB

README_en.md 8KB

README.md 8KB

README_cn.md 7KB

mot_en.md 7KB

mot.md 7KB

README.md 7KB

attribute_en.md 6KB

attribute.md 6KB

README.md 6KB

paper.md 4KB

mtmct_en.md 4KB

English | [简体中文](README_cn.md)

# PP-HumanSeg

**Content**
- 1 Introduction
- 2 News
- 3 PP-HumanSeg Models
- 4 Quick Start
- 5 Training and Finetuning
- 6 Deployment

## 1 Introduction

Human segmentation is a high-frequency application in the field of image segmentation. Generally, human segmentation can be classified into portrait segmentation and general human segmentation.

For both portrait segmentation and general human segmentation, PaddleSeg releases the PP-HumanSeg models, which have **good performance in accuracy, inference speed and robustness**. PP-HumanSeg models can be deployed to products at zero cost without training, and they also support fine-tuning to achieve better performance.

The following are demonstration videos (the videos are large, so loading may be slightly slow). We provide full-process application guides from training to deployment, as well as video streaming segmentation and background replacement tutorials. Based on Paddle.js, you can experience the effects of [Portrait Snapshot](https://paddlejs.baidu.com/humanseg), [Video Background Replacement and Barrage Penetration](https://www.paddlepaddle.org.cn/paddlejs).

<p align="center">
<img src="https://github.com/juncaipeng/raw_data/blob/master/images/portrait_bg_replace_1.gif" height="200">
<img src="https://github.com/LutaoChu/transfer_station/raw/master/conference.gif" height="200">
</p>

## 2 News

- [2022-7] Release PP-HumanSeg V2 models. **The inference speed of portrait segmentation model is increased by 45.5%, mIoU is increased by 3.03%, and the visualization result is better**. The general human segmentation models also improve in accuracy and inference speed.
- [2022-1] The human segmentation paper [PP-HumanSeg](./paper.md) was published at the WACV 2022 Workshop, and the Connectivity Learning (SCL) method and the large-scale video conferencing dataset ([PP-HumanSeg-14K](./paper.md)) were open-sourced.
- [2021-7] Baidu Video Conference can realize one-second joining on the web side. The virtual background function adopts our portrait segmentation model to realize real-time background replacement and background blur, which protects user privacy and makes meetings more fun.
- [2021-7] Release PP-HumanSeg V1 models, which include a portrait segmentation model and three general human segmentation models.

<p align="center">
<img src="https://user-images.githubusercontent.com/30695251/149886667-f47cab88-e81a-4fd7-9f32-fbb34a5ed7ce.png" height="200">
<img src="https://user-images.githubusercontent.com/30695251/149887482-d1fcd5d3-2cce-41b5-819b-bfc7126b7db4.png" height="200">
</p>

## 3 PP-HumanSeg Models

### 3.1 Portrait Segmentation Models

We release self-developed portrait segmentation models for real-time applications such as mobile video and web conferences. These models can be directly integrated into products at zero cost.

PP-HumanSegV1-Lite portrait segmentation model: It has good performance in accuracy and model size, and the model architecture is in [url](../../configs/pp_humanseg_lite/).

PP-HumanSegV2-Lite portrait segmentation model: **The inference speed is increased by 45.5%, mIoU is increased by 3.03%, and the visualization result is better** compared to the V1 model. These improvements rely on the following innovations.

* Higher segmentation accuracy: We use the super lightweight models ([url](../../configs/mobileseg/)) released in PaddleSeg recently. We choose MobileNetV3 as the backbone and design a multi-scale feature aggregation model.
* Faster inference speed: We reduce the input resolution, which reduces the inference time and increases the receptive field.
* Better robustness: Based on the idea of transfer learning, we first pretrain the model on a large general human segmentation dataset, and then finetune it on a small portrait segmentation dataset.
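As a rough illustration of the "faster inference speed" point, the sketch below compares the pixel counts of the two best input shapes given for the portrait models in this section (398x224 for V1-Lite, 256x144 for V2-Lite). The helper name `pixel_count` is introduced here for illustration only, and pixel count is just a proxy for per-frame work, not a measured benchmark.

```python
def pixel_count(shape: str) -> int:
    """Parse a 'WIDTHxHEIGHT' string such as '256x144' and return width * height."""
    width, height = (int(value) for value in shape.lower().split("x"))
    return width * height


# Best input shapes taken from the portrait model table in this section.
v1_pixels = pixel_count("398x224")
v2_pixels = pixel_count("256x144")

# Fraction of the V1 input area that the V2 model has to process per frame.
ratio = v2_pixels / v1_pixels

print(f"V1: {v1_pixels} px, V2: {v2_pixels} px, ratio: {ratio:.3f}")
```

Under these shapes the V2 model sees about 41% as many pixels per frame as V1, which is consistent with the direction of the reported speedup, although the actual inference time also depends on the network architecture.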
| Model Name | Best Input Shape | mIoU(%) | Inference Time on Arm CPU(ms) | Model Size(MB) | Config File | Links |
| --- | --- | --- | --- | --- | --- | --- |
| PP-HumanSegV1-Lite | 398x224 | 93.60 | 29.68 | 2.3 | [cfg](./configs/portrait_pp_humansegv1_lite.yml) | [Checkpoint](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/portrait_pp_humansegv1_lite_398x224_pretrained.zip) \| [Inference Model (Argmax)](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/portrait_pp_humansegv1_lite_398x224_inference_model.zip) \| [Inference Model (Softmax)](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/portrait_pp_humansegv1_lite_398x224_inference_model_with_softmax.zip) |
| PP-HumanSegV2-Lite | 256x144 | 96.63 | 15.86 | 5.4 | [cfg](./configs/portrait_pp_humansegv2_lite.yml) | [Checkpoint](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/portrait_pp_humansegv2_lite_256x144_smaller/portrait_pp_humansegv2_lite_256x144_pretrained.zip) \| [Inference Model (Argmax)](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/portrait_pp_humansegv2_lite_256x144_smaller/portrait_pp_humansegv2_lite_256x144_inference_model.zip) \| [Inference Model (Softmax)](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/portrait_pp_humansegv2_lite_256x144_smaller/portrait_pp_humansegv2_lite_256x144_inference_model_with_softmax.zip) |

<details><summary>Note:</summary>

* Test the segmentation accuracy (mIoU): We test the above models on the [PP-HumanSeg-14K](./paper.md) dataset with the best input shape.
* Test the inference time: Use [PaddleLite](https://www.paddlepaddle.org.cn/lite), Xiaomi 9 (Snapdragon 855 CPU), single thread, the best input shape.
* For the best input shape, the ratio of width to height is 16:9, which matches the cameras of mobile phones and laptops.
* The checkpoint is the pretrained weight, which is used for finetuning.
* The inference model is used for deployment.
* Inference Model (Argmax): The last operation of the inference model is argmax, so the output has a single channel.
* Inference Model (Softmax): The last operation of the inference model is softmax, so the output has two channels.

</details>

<details><summary>Usage:</summary>

* The portrait segmentation model can be directly integrated into products at zero cost.
* Mobile phones can be held in landscape or portrait orientation. Rotate the image so that the person is always upright before running the model.

</details>

### 3.2 General Human Segmentation Models

For the general human segmentation task, we first build a large human segmentation dataset, then train the SOTA models in PaddleSeg on it, and finally release several general human segmentation models.

PP-HumanSegV2-Lite general human segmentation model: It uses the super lightweight models ([url](../../configs/mobileseg/)) released in PaddleSeg recently. Compared to the V1 model, the mIoU is improved by 6.5%.

PP-HumanSegV2-Mobile general human segmentation model: It uses the self-developed [PP-LiteSeg](../../configs/pp_liteseg/) model. Compared to the V1 model, the mIoU is improved by 1.49% and the inference time is reduced by 5.7%.
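Each released model comes in an Argmax and a Softmax inference variant, which differ in output channels as noted in section 3.1. The NumPy sketch below shows one way post-processing could handle both and then composite a new background. The array layouts, the convention that label/channel 1 means "person", and the function names `to_foreground_mask` and `replace_background` are assumptions made for illustration, not the project's actual API.

```python
import numpy as np


def to_foreground_mask(output: np.ndarray, threshold: float = 0.5) -> np.ndarray:
    """Return a boolean (H, W) person mask from a single-image model output.

    Assumed layouts (hypothetical, not confirmed by the source):
      - Argmax model:  (H, W) or (1, H, W) integer labels, 1 = person.
      - Softmax model: (2, H, W) probabilities, channel 1 = person.
    """
    output = np.asarray(output)
    if output.ndim == 3 and output.shape[0] == 2:
        # Two-channel softmax output: threshold the person probability.
        return output[1] > threshold
    if output.ndim == 3 and output.shape[0] == 1:
        # Single-channel argmax output with a leading channel axis.
        output = output[0]
    if output.ndim == 2:
        return output == 1
    raise ValueError(f"unexpected output shape: {output.shape}")


def replace_background(image: np.ndarray, background: np.ndarray,
                       mask: np.ndarray) -> np.ndarray:
    """Keep `image` pixels where mask is True, use `background` elsewhere."""
    return np.where(mask[..., None], image, background)


# Toy 1x2 "image": left pixel is background, right pixel is person.
probs = np.array([[[0.9, 0.2]], [[0.1, 0.8]]])   # softmax-style, shape (2, 1, 2)
labels = probs.argmax(axis=0)                     # argmax-style, shape (1, 2)

mask_soft = to_foreground_mask(probs)
mask_arg = to_foreground_mask(labels)

image = np.full((1, 2, 3), 255, dtype=np.uint8)   # white frame
background = np.zeros((1, 2, 3), dtype=np.uint8)  # black replacement background
composite = replace_background(image, background, mask_soft)
```

The Softmax variant keeps per-pixel probabilities, so a caller can tune the threshold or blend edges softly, while the Argmax variant returns hard labels and needs less post-processing.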
| Model Name | Best Input Shape | mIoU(%) | Inference Time on ARM CPU(ms) | Inference Time on Nvidia GPU(ms) | Config File | Links |
| --- | --- | --- | --- | --- | --- | --- |
| PP-HumanSegV1-Lite | 192x192 | 86.02 | 12.3 | - | [cfg](./configs/human_pp_humansegv1_lite.yml) | [Checkpoint](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/human_pp_humansegv1_lite_192x192_pretrained.zip) \| [Inference Model (Argmax)](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/human_pp_humansegv1_lite_192x192_inference_model.zip) \| [Inference Model (Softmax)](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/human_pp_humansegv1_lite_192x192_inference_model_with_softmax.zip) |
| PP-HumanSegV2-Lite | 192x192 | 92.52 | 15.3 | - | [cfg](./configs/human_pp_humansegv2_lite.yml) | [Checkpoint](https://paddleseg.bj.bcebos.com/dygraph/pp_humanseg_v2/human_pp_humansegv2_lite_192x192_pretrained.zip) \| [Inference Model (Argmax)](https
