pytorch-yolov3详细可运行_cuda12.6对应的pytorch资源-CSDN文库

共59个文件

txt：12个

py：11个

pyc：11个

需积分: 42 111 浏览量 2020-11-04 22:48:13 上传评论 6 收藏 328.74MB RAR 举报

《PyTorch-YOLOv3深度学习目标检测框架详解》 YOLO（You Only Look Once）是一种实时目标检测系统，以其高效的性能和强大的准确性在计算机视觉领域备受瞩目。YOLOv3是其系列的第三个版本，由Joseph Redmon、Ali Farhadi等人于2018年提出，在YOLOv1和YOLOv2的基础上进行了优化，尤其在小目标检测和类别多样性上有了显著提升。本篇文章将深入探讨如何在PyTorch框架下实现YOLOv3，并分享一个可运行的项目实例。 YOLOv3的核心改进在于引入了多尺度预测，通过不同大小的检测框来捕获不同尺寸的目标，这显著提升了对小物体的检测能力。此外，YOLOv3还采用了 DarkNet-53 模型作为基础网络，这是一种深度残差网络，可以更有效地进行特征学习。YOLOv3的损失函数也进行了调整，包括分类损失、定位损失以及空闲框损失，以更好地平衡检测的精度和召回率。在PyTorch环境中实现YOLOv3，我们需要做以下几件事： 1. **构建网络结构**：按照YOLOv3的架构设计网络，包括输入层、卷积层、批量归一化层、激活层、锚框设置等，确保每个模块都能正确地完成其功能。 2. **训练过程**：准备数据集，通常包括标注好的图像和对应的边界框信息。使用数据加载器加载数据，定义损失函数，然后使用优化器进行模型参数的更新。训练过程中，可以采用多GPU并行计算来加速训练。 3. **推理阶段**：训练完成后，我们可以用预训练模型进行推理。在PyTorch中，通过forward函数将输入图像传递到模型，模型会返回预测的边界框和置信度。 4. **后处理**：预测结果通常包含大量的边界框，需要通过非极大值抑制（NMS）算法去除冗余的预测框，只保留最有可能的检测结果。 5. **可视化**：为了直观展示YOLOv3的检测效果，我们可以将预测的边界框绘制到原始图像上。在提供的“pytorch-yolov3”压缩包中，包含了完整的YOLOv3实现，包括模型结构、训练脚本、配置文件等。运行示例展示了如何加载模型、进行预测和显示结果。你可以参考其中的代码来理解YOLOv3的工作原理，并根据自己的需求进行修改和扩展。通过这个项目，你可以深入理解YOLOv3的架构和实现细节，同时也为其他深度学习目标检测任务提供了借鉴。如果你在使用过程中遇到任何问题，欢迎提问，我们将共同探讨解决方案。 PyTorch-YOLOv3项目提供了一个直观的学习平台，帮助开发者理解和应用目标检测技术。无论是对于研究还是实际应用，这个项目都是一个宝贵的资源，它将理论知识与实践操作紧密结合，让你在实战中提升自己的技能。

资源推荐

资源详情

资源评论

收起资源包目录

pytorch-yolov3.rar （59个子文件）

pytorch-yolov3

requirements.txt 90B

data

temp

valid

images

000615.jpg 639KB

000616.jpg 966KB

000614.jpg 776KB

labels

000616.txt 81B

000615.txt 148B

000614.txt 322B

classes.names 10B

train.txt 270B

train

images

000000000165.jpg 163KB

000000000110.jpg 151KB

000000000192.jpg 163KB

000000000036.jpg 175KB

000000000113.jpg 146KB

000000000086.jpg 192KB

labels

000000000113.txt 122B

000000000165.txt 119B

000000000192.txt 160B

000000000110.txt 161B

000000000086.txt 43B

000000000036.txt 62B

valid.txt 117B

video.py 7KB

LICENSE 34KB

models.py 15KB

utils

augmentations.py 207B

__init__.py 0B

datasets.py 5KB

utils.py 14KB

__pycache__

parse_config.cpython-36.pyc 1KB

utils.cpython-37.pyc 10KB

parse_config.cpython-37.pyc 1KB

datasets.cpython-37.pyc 5KB

augmentations.cpython-37.pyc 441B

utils.cpython-36.pyc 9KB

logger.cpython-36.pyc 1KB

__init__.cpython-36.pyc 146B

__init__.cpython-37.pyc 141B

augmentations.cpython-36.pyc 440B

datasets.cpython-36.pyc 5KB

parse_config.py 1KB

logger.py 673B

README.md 6KB

config

yolov3-tiny-shoulders.cfg 2KB

create_custom_model.sh 8KB

yolov3-tiny.cfg 2KB

shoulders.data 108B

yolov3.cfg 8KB

custom.data 99B

coco.data 115B

test.py 4KB

weights

download_weights.sh 303B

yolov3-tiny.weights 33.79MB

yolov3.weights 236.52MB

detect.py 5KB

checkpoint

yolov3_ckpt_165.pth 25.79MB

yolov3_ckpt_195.pth 25.79MB

yolov3_ckpt_190.pth 25.79MB

train.py 7KB

# PyTorch-YOLOv3 A minimal PyTorch implementation of YOLOv3, with support for training, inference and evaluation. ## Installation ##### Clone and install requirements $ git clone https://github.com/eriklindernoren/PyTorch-YOLOv3 $ cd PyTorch-YOLOv3/ $ sudo pip3 install -r requirements.txt ##### Download pretrained weights $ cd weights/ $ bash download_weights.sh ##### Download COCO $ cd data/ $ bash get_coco_dataset.sh ## Test Evaluates the model on COCO test. $ python3 test.py --weights_path weights/yolov3.weights | Model | mAP (min. 50 IoU) | | ----------------------- |:-----------------:| | YOLOv3 608 (paper) | 57.9 | | YOLOv3 608 (this impl.) | 57.3 | | YOLOv3 416 (paper) | 55.3 | | YOLOv3 416 (this impl.) | 55.5 | ## Inference Uses pretrained weights to make predictions on images. Below table displays the inference times when using as inputs images scaled to 256x256. The ResNet backbone measurements are taken from the YOLOv3 paper. The Darknet-53 measurement marked shows the inference time of this implementation on my 1080ti card. | Backbone | GPU | FPS | | ----------------------- |:--------:|:--------:| | ResNet-101 | Titan X | 53 | | ResNet-152 | Titan X | 37 | | Darknet-53 (paper) | Titan X | 76 | | Darknet-53 (this impl.) | 1080ti | 74 | $ python3 detect.py --image_folder data/samples/ <img src="assets/giraffe.png" width="480"\> <img src="assets/dog.png" width="480"\> <img src="assets/traffic.png" width="480"\> <img src="assets/messi.png" width="480"\> ## Train ``` $ train.py [-h] [--epochs EPOCHS] [--batch_size BATCH_SIZE] [--gradient_accumulations GRADIENT_ACCUMULATIONS] [--model_def MODEL_DEF] [--data_config DATA_CONFIG] [--pretrained_weights PRETRAINED_WEIGHTS] [--n_cpu N_CPU] [--img_size IMG_SIZE] [--checkpoint_interval CHECKPOINT_INTERVAL] [--evaluation_interval EVALUATION_INTERVAL] [--compute_map COMPUTE_MAP] [--multiscale_training MULTISCALE_TRAINING] ``` #### Example (COCO) To train on COCO using a Darknet-53 backend pretrained on ImageNet run: ``` $ python3 train.py --data_config config/coco.data --pretrained_weights weights/darknet53.conv.74 ``` #### Training log ``` ---- [Epoch 7/100, Batch 7300/14658] ---- +------------+--------------+--------------+--------------+ | Metrics | YOLO Layer 0 | YOLO Layer 1 | YOLO Layer 2 | +------------+--------------+--------------+--------------+ | grid_size | 16 | 32 | 64 | | loss | 1.554926 | 1.446884 | 1.427585 | | x | 0.028157 | 0.044483 | 0.051159 | | y | 0.040524 | 0.035687 | 0.046307 | | w | 0.078980 | 0.066310 | 0.027984 | | h | 0.133414 | 0.094540 | 0.037121 | | conf | 1.234448 | 1.165665 | 1.223495 | | cls | 0.039402 | 0.040198 | 0.041520 | | cls_acc | 44.44% | 43.59% | 32.50% | | recall50 | 0.361111 | 0.384615 | 0.300000 | | recall75 | 0.222222 | 0.282051 | 0.300000 | | precision | 0.520000 | 0.300000 | 0.070175 | | conf_obj | 0.599058 | 0.622685 | 0.651472 | | conf_noobj | 0.003778 | 0.004039 | 0.004044 | +------------+--------------+--------------+--------------+ Total Loss 4.429395 ---- ETA 0:35:48.821929 ``` #### Tensorboard Track training progress in Tensorboard: * Initialize training * Run the command below * Go to http://localhost:6006/ ``` $ tensorboard --logdir='logs' --port=6006 ``` ## Train on Custom Dataset #### Custom model Run the commands below to create a custom model definition, replacing `<num-classes>` with the number of classes in your dataset. ``` $ cd config/ # Navigate to config dir $ bash create_custom_model.sh <num-classes> # Will create custom model 'yolov3-custom.cfg' ``` #### Classes Add class names to `data/custom/classes.names`. This file should have one row per class name. #### Image Folder Move the images of your dataset to `data/custom/images/`. #### Annotation Folder Move your annotations to `data/custom/labels/`. The dataloader expects that the annotation file corresponding to the image `data/custom/images/train.jpg` has the path `data/custom/labels/train.txt`. Each row in the annotation file should define one bounding box, using the syntax `label_idx x_center y_center width height`. The coordinates should be scaled `[0, 1]`, and the `label_idx` should be zero-indexed and correspond to the row number of the class name in `data/custom/classes.names`. #### Define Train and Validation Sets In `data/custom/train.txt` and `data/custom/valid.txt`, add paths to images that will be used as train and validation data respectively. #### Train To train on the custom dataset run: ``` $ python3 train.py --model_def config/yolov3-custom.cfg --data_config config/custom.data ``` Add `--pretrained_weights weights/darknet53.conv.74` to train using a backend pretrained on ImageNet. ## Credit ### YOLOv3: An Incremental Improvement _Joseph Redmon, Ali Farhadi_ **Abstract** We present some updates to YOLO! We made a bunch of little design changes to make it better. We also trained this new network that’s pretty swell. It’s a little bigger than last time but more accurate. It’s still fast though, don’t worry. At 320 × 320 YOLOv3 runs in 22 ms at 28.2 mAP, as accurate as SSD but three times faster. When we look at the old .5 IOU mAP detection metric YOLOv3 is quite good. It achieves 57.9 AP50 in 51 ms on a Titan X, compared to 57.5 AP50 in 198 ms by RetinaNet, similar performance but 3.8× faster. As always, all the code is online at https://pjreddie.com/yolo/. [[Paper]](https://pjreddie.com/media/files/papers/YOLOv3.pdf) [[Project Webpage]](https://pjreddie.com/darknet/yolo/) [[Authors' Implementation]](https://github.com/pjreddie/darknet) ``` @article{yolov3, title={YOLOv3: An Incremental Improvement}, author={Redmon, Joseph and Farhadi, Ali}, journal = {arXiv}, year={2018} } ```

评论收藏

内容反馈