# [HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation (CVPR 2020)](https://arxiv.org/abs/1908.10357)
## News
* \[2020/07/05\] [A very nice blog](https://towardsdatascience.com/overview-of-human-pose-estimation-neural-networks-hrnet-higherhrnet-architectures-and-faq-1954b2f8b249) from Towards Data Science introducing HRNet and HigherHRNet for human pose estimation.
* \[2020/03/12\] Support train/test on the CrowdPose dataset.
* \[2020/02/24\] HigherHRNet is accepted to CVPR2020!
* \[2019/11/23\] Code and models for [HigherHRNet](https://arxiv.org/abs/1908.10357) are now released!
* \[2019/08/27\] HigherHRNet is now on [ArXiv](https://arxiv.org/abs/1908.10357). We will also release code and models, stay tuned!
## Introduction
This is the official code of [HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation](https://arxiv.org/abs/1908.10357).
Bottom-up human pose estimation methods have difficulty predicting the correct pose for small persons due to challenges in scale variation. In this paper, we present **HigherHRNet**: a novel bottom-up human pose estimation method for learning scale-aware representations using high-resolution feature pyramids. Equipped with multi-resolution supervision for training and multi-resolution aggregation for inference, the proposed approach is able to solve the scale variation challenge in *bottom-up multi-person* pose estimation and localize keypoints more precisely, especially for small persons. The feature pyramid in HigherHRNet consists of feature map outputs from HRNet and higher-resolution outputs upsampled through a transposed convolution. HigherHRNet outperforms the previous best bottom-up method by 2.5% AP for medium persons on COCO test-dev, showing its effectiveness in handling scale variation. Furthermore, HigherHRNet achieves a new state-of-the-art result on COCO test-dev (70.5% AP) without using refinement or other post-processing techniques, surpassing all existing bottom-up methods. HigherHRNet even surpasses all top-down methods on CrowdPose test (67.6% AP), suggesting its robustness in crowded scenes.
![Illustrating the architecture of the proposed Higher-HRNet](/figures/arch_v2.png)
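For a concrete picture of the high-resolution head, the following is a minimal PyTorch sketch of the idea only, not the repo's actual module (see `lib/` for the real implementation); the class name, channel sizes, and layer layout here are illustrative. The 1/4-resolution HRNet features predict heatmaps, and a transposed convolution over the concatenated features and heatmaps produces a 1/2-resolution map that is supervised as well:
```
import torch
import torch.nn as nn

class HigherResolutionHead(nn.Module):
    """Illustrative sketch; not the repo's actual head."""
    def __init__(self, in_channels=32, num_joints=17):
        super(HigherResolutionHead, self).__init__()
        # predict heatmaps at the backbone's 1/4 resolution
        self.low_res_head = nn.Conv2d(in_channels, num_joints, kernel_size=1)
        # transposed convolution doubles the spatial resolution; its input is
        # the backbone features concatenated with the predicted heatmaps
        self.deconv = nn.ConvTranspose2d(in_channels + num_joints, in_channels,
                                         kernel_size=4, stride=2, padding=1)
        self.high_res_head = nn.Conv2d(in_channels, num_joints, kernel_size=1)

    def forward(self, feats):
        low = self.low_res_head(feats)                    # 1/4 resolution
        up = self.deconv(torch.cat([feats, low], dim=1))  # 1/2 resolution
        high = self.high_res_head(up)
        # both scales are supervised in training (multi-resolution supervision)
        # and combined at inference (multi-resolution aggregation)
        return low, high

# e.g. a 512x512 input gives 128x128 HRNet features:
# low, high = HigherResolutionHead()(torch.randn(1, 32, 128, 128))
# low: (1, 17, 128, 128), high: (1, 17, 256, 256)
```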
## Main Results
### Results on COCO val2017 without multi-scale test
| Method | Backbone | Input size | #Params | GFLOPs | AP | AP .5 | AP .75 | AP (M) | AP (L) |
|--------------------|----------|------------|---------|--------|-------|-------|--------|--------|--------|
| HigherHRNet | HRNet-w32 | 512 | 28.6M | 47.9 | 67.1 | 86.2 | 73.0 | 61.5 | 76.1 |
| HigherHRNet | HRNet-w32 | 640 | 28.6M | 74.8 | 68.5 | 87.1 | 74.7 | 64.3 | 75.3 |
| HigherHRNet | HRNet-w48 | 640 | 63.8M | 154.3 | 69.9 | 87.2 | 76.1 | 65.4 | 76.4 |
### Results on COCO val2017 *with* multi-scale test
| Method | Backbone | Input size | #Params | GFLOPs | AP | AP .5 | AP .75 | AP (M) | AP (L) |
|--------------------|----------|------------|---------|--------|-------|-------|--------|--------|--------|
| HigherHRNet | HRNet-w32 | 512 | 28.6M | 47.9 | 69.9 | 87.1 | 76.0 | 65.3 | 77.0 |
| HigherHRNet | HRNet-w32 | 640 | 28.6M | 74.8 | 70.6 | 88.1 | 76.9 | 66.6 | 76.5 |
| HigherHRNet | HRNet-w48 | 640 | 63.8M | 154.3 | 72.1 | 88.4 | 78.2 | 67.8 | 78.3 |
### Results on COCO test-dev2017 without multi-scale test
| Method | Backbone | Input size | #Params | GFLOPs | AP | AP .5 | AP .75 | AP (M) | AP (L) |
|--------------------|----------|------------|---------|--------|-------|-------|--------|--------|--------|
| OpenPose\* | - | - | - | - | 61.8 | 84.9 | 67.5 | 57.1 | 68.2 |
| Hourglass | Hourglass | 512 | 277.8M | 206.9 | 56.6 | 81.8 | 61.8 | 49.8 | 67.0 |
| PersonLab | ResNet-152 | 1401 | 68.7M | 405.5 | 66.5 | 88.0 | 72.6 | 62.4 | 72.3 |
| PifPaf | - | - | - | - | 66.7 | - | - | 62.4 | 72.9 |
| Bottom-up HRNet | HRNet-w32 | 512 | 28.5M | 38.9 | 64.1 | 86.3 | 70.4 | 57.4 | 73.9 |
| **HigherHRNet** | HRNet-w32 | 512 | 28.6M | 47.9 | 66.4 | 87.5 | 72.8 | 61.2 | 74.2 |
| **HigherHRNet** | HRNet-w48 | 640 | 63.8M | 154.3 | **68.4** | **88.2** | **75.1** | **64.4** | **74.2** |
### Results on COCO test-dev2017 *with* multi-scale test
| Method | Backbone | Input size | #Params | GFLOPs | AP | AP .5 | AP .75 | AP (M) | AP (L) |
|--------------------|----------|------------|---------|--------|-------|-------|--------|--------|--------|
| Hourglass | Hourglass | 512 | 277.8M | 206.9 | 63.0 | 85.7 | 68.9 | 58.0 | 70.4 |
| Hourglass\* | Hourglass | 512 | 277.8M | 206.9 | 65.5 | 86.8 | 72.3 | 60.6 | 72.6 |
| PersonLab | ResNet-152 | 1401 | 68.7M | 405.5 | 68.7 | 89.0 | 75.4 | 64.1 | 75.5 |
| **HigherHRNet** | HRNet-w48 | 640 | 63.8M | 154.3 | **70.5** | **89.3** | **77.2** | **66.6** | **75.8** |
### Results on CrowdPose test
| Method | AP | AP .5 | AP .75 | AP (E) | AP (M) | AP (H) |
|--------------------|-------|-------|--------|--------|--------|--------|
| Mask-RCNN | 57.2 | 83.5 | 60.3 | 69.4 | 57.9 | 45.8 |
| AlphaPose | 61.0 | 81.3 | 66.0 | 71.2 | 61.4 | 51.1 |
| SPPE | 66.0 | 84.2 | 71.5 | 75.5 | 66.3 | 57.4 |
| OpenPose | - | - | - | 62.7 | 48.7 | 32.3 |
| **HigherHRNet** | 65.9 | 86.4 | 70.6 | 73.3 | 66.5 | 57.9 |
| **HigherHRNet+** | **67.6** | **87.4** | **72.6** | **75.8** | **68.1** | **58.9** |
*Note: + indicates using multi-scale test.*
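Multi-scale test (the + rows above) resizes the input to several scales, runs the network on each copy, and aggregates the resulting heatmaps before decoding keypoints. The sketch below is a minimal illustration of that idea only; the repo's actual inference pipeline also handles flip testing and associative-embedding grouping, and the function name and scales here are assumptions:
```
import torch
import torch.nn.functional as F

def multi_scale_heatmaps(model, image, scales=(0.5, 1.0, 2.0)):
    """image: (1, 3, H, W) tensor; returns heatmaps averaged at (H, W)."""
    h, w = image.shape[2:]
    avg = None
    for s in scales:
        resized = F.interpolate(image, scale_factor=s, mode='bilinear',
                                align_corners=False)
        heatmaps = model(resized)  # assumed to return a (1, K, h', w') tensor
        heatmaps = F.interpolate(heatmaps, size=(h, w), mode='bilinear',
                                 align_corners=False)
        avg = heatmaps if avg is None else avg + heatmaps
    return avg / len(scales)
```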
## Environment
The code is developed using Python 3.6 on Ubuntu 16.04; NVIDIA GPUs are needed. It is developed and tested with 4 NVIDIA P100 GPU cards. Other platforms or GPU cards are not fully tested.
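As a quick sanity check (not part of the repo) that PyTorch can see your GPUs before training:
```
# Quick check that CUDA GPUs are visible to PyTorch
import torch

assert torch.cuda.is_available(), "CUDA GPUs are required"
num = torch.cuda.device_count()
print(f"{num} GPU(s) visible:",
      [torch.cuda.get_device_name(i) for i in range(num)])
```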
## Quick start
### Installation
1. Install PyTorch >= v1.1.0 following the [official instructions](https://pytorch.org/).
   - **Tested with PyTorch v1.4.0**
2. Clone this repo; we'll call the directory that you cloned ${POSE_ROOT}.
3. Install dependencies:
```
pip install -r requirements.txt
```
4. Install [COCOAPI](https://github.com/cocodataset/cocoapi):
```
# COCOAPI=/path/to/clone/cocoapi
git clone https://github.com/cocodataset/cocoapi.git $COCOAPI
cd $COCOAPI/PythonAPI
# Install into global site-packages
make install
# Alternatively, if you do not have permissions or prefer
# not to install the COCO API into global site-packages
python3 setup.py install --user
```
Note that instructions like `# COCOAPI=/path/to/clone/cocoapi` indicate that you should pick a path where you'd like to have the software cloned and then set an environment variable (`COCOAPI` in this case) accordingly.
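As an optional sanity check that the COCOAPI installed correctly, you can run the keypoint evaluator from Python; the annotation and results paths below are placeholders, and `summarize()` prints the AP / AP .5 / AP .75 / AP (M) / AP (L) metrics reported in the tables above:
```
# Optional check that pycocotools imports and evaluates keypoints correctly.
# The annotation and results file paths are placeholders.
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

coco_gt = COCO('data/coco/annotations/person_keypoints_val2017.json')
coco_dt = coco_gt.loadRes('results/keypoints_val2017_results.json')

evaluator = COCOeval(coco_gt, coco_dt, iouType='keypoints')
evaluator.evaluate()
evaluator.accumulate()
evaluator.summarize()
```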
5. Install [CrowdPoseAPI](https://github.com/Jeff-sjtu/CrowdPose) in exactly the same way as COCOAPI.
   - **There is a bug in the CrowdPoseAPI; please revert https://github.com/Jeff-sjtu/CrowdPose/commit/785e70d269a554b2ba29daf137354103221f479e**
6. Init the output (training model output directory) and log (TensorBoard log directory) directories:
```
mkdir output
mkdir log
```
Your directory tree should look like this:
```
${POSE_ROOT}
├── data
├── experiments
├── lib
├── log
├── models
├── output
├── tools
├── README.md
└── requirements.txt
```
7. Download pretrained models from our model zoo ([GoogleDrive](https://drive.google.com/open?id=1bdXVmYrSynPLSk5lptvgyQ8fhziobD50) or [OneDrive](https://1drv.ms/f/s!AhIXJn_J-blW4AwKRMklXVzndJT0))
```
${POSE_ROOT}
`-- models
`-- pytorch
|-- imagenet