YOLOv3pytorch版源代码_yolov3源码下载资源-CSDN文库

共41个文件

py：12个

jpg：10个

png：4个

YOLOv3

pytorch

5星 · 超过95%的资源需积分: 50 146 浏览量 2022-06-25 15:23:20 上传评论 5 收藏 2.69MB ZIP 举报

YOLOv3是一种高效且准确的目标检测算法，其全称为"You Only Look Once"的第三版本。这个算法在计算机视觉领域有着广泛的应用，特别是在实时对象检测上。YOLOv3是YOLO系列的改进版本，相较于之前的YOLOv1和YOLOv2，它在保持快速检测速度的同时，显著提高了检测精度。 YOLOv3的设计理念是通过单次网络前向传播就能同时预测图像中的多个物体，避免了传统目标检测方法中繁琐的多阶段流程。它采用了一种基于网格的检测机制，每个网格负责预测几个边界框（bounding boxes），并对应多个类别概率。这种设计使得YOLOv3能够处理不同尺度的物体，尤其是在小物体检测方面有所提升。在YOLOv3的实现中，pytorch是一个常用的深度学习框架，它提供了灵活的神经网络构建工具和高效的GPU加速计算。PyTorch-YOLOv3-master是一个包含YOLOv3在pytorch环境下的完整实现的项目，它包括模型定义、训练脚本、数据预处理和后处理等所有必要的组件。在该项目中，开发者通常会遇到以下几个关键知识点： 1. **模型结构**：YOLOv3采用了DarkNet-53作为基础网络，这是一个深度卷积神经网络，用于特征提取。然后，通过一系列的卷积层、池化层和上采样层生成不同尺度的检测结果。 2. **锚框（Anchor Boxes）**：YOLOv3使用了预先定义的一组大小和比例不同的锚框，每个网格预测这些锚框对应的物体位置和类别概率，从而能更好地适应不同尺寸和形状的物体。 3. **损失函数**：YOLOv3的损失函数综合考虑了分类误差、定位误差以及背景预测的惩罚，它包括分类损失、坐标损失和置信度损失。 4. **数据预处理**：数据集通常需要进行归一化、缩放和标注转换，以便于网络训练。例如，PASCAL VOC或COCO数据集常被用于YOLOv3的训练。 5. **训练与优化**：训练过程中，通常使用Adam或SGD优化器，调整学习率、权重衰减等参数以优化模型性能。此外，还需要定期保存模型权重，以便于模型验证和后续的微调。 6. **推理与部署**：训练完成后，可以将模型部署到实际应用中。在pytorch环境中，通常会将模型转换为torchscript或ONNX格式，以支持跨平台的推理服务。 7. **评估指标**：对于检测效果的评估，常见的指标有平均精度（mAP）、平均召回率（mAR）等，它们可以帮助分析模型在不同类别和IoU阈值下的表现。掌握以上知识点，开发者不仅能理解YOLOv3的工作原理，还能有效地利用PyTorch实现和优化YOLOv3模型，将其应用于实际的图像检测任务。通过阅读和调试PyTorch-YOLOv3-master项目，可以深入学习目标检测技术，并对深度学习框架PyTorch有更深入的理解。

资源详情

资源评论

资源推荐

收起资源包目录

PyTorch-YOLOv3-master.zip （41个子文件）

PyTorch-YOLOv3-master

models.py 12KB

config

custom.data 99B

create_custom_model.sh 8KB

yolov3.cfg 8KB

yolov3-tiny.cfg 2KB

coco.data 115B

data

coco.names 625B

custom

valid.txt 29B

images

train.jpg 111KB

classes.names 6B

train.txt 29B

labels

train.txt 34B

get_coco_dataset.sh 1KB

samples

messi.jpg 124KB

room.jpg 83KB

street.jpg 100KB

field.jpg 111KB

herd_of_horses.jpg 130KB

dog.jpg 160KB

giraffe.jpg 374KB

person.jpg 77KB

eagle.jpg 139KB

test.py 8KB

train.py 9KB

assets

dog.png 342KB

giraffe.png 405KB

traffic.png 312KB

messi.png 258KB

LICENSE 34KB

detect.py 10KB

.gitignore 116B

weights

download_weights.sh 408B

README.md 6KB

utils

transforms.py 3KB

loss.py 10KB

augmentations.py 644B

utils.py 12KB

logger.py 790B

datasets.py 4KB

__init__.py 0B

parse_config.py 1KB

# PyTorch-YOLOv3 A minimal PyTorch implementation of YOLOv3, with support for training, inference and evaluation. ## Installation ##### Clone and install requirements $ git clone https://github.com/eriklindernoren/PyTorch-YOLOv3 $ cd PyTorch-YOLOv3/ $ sudo pip3 install -r requirements.txt ##### Download pretrained weights $ cd weights/ $ bash download_weights.sh ##### Download COCO $ cd data/ $ bash get_coco_dataset.sh ## Test Evaluates the model on COCO test. $ python3 test.py --weights weights/yolov3.weights | Model | mAP (min. 50 IoU) | | ----------------------- |:-----------------:| | YOLOv3 608 (paper) | 57.9 | | YOLOv3 608 (this impl.) | 57.3 | | YOLOv3 416 (paper) | 55.3 | | YOLOv3 416 (this impl.) | 55.5 | ## Inference Uses pretrained weights to make predictions on images. Below table displays the inference times when using as inputs images scaled to 256x256. The ResNet backbone measurements are taken from the YOLOv3 paper. The Darknet-53 measurement marked shows the inference time of this implementation on my 1080ti card. | Backbone | GPU | FPS | | ----------------------- |:--------:|:--------:| | ResNet-101 | Titan X | 53 | | ResNet-152 | Titan X | 37 | | Darknet-53 (paper) | Titan X | 76 | | Darknet-53 (this impl.) | 1080ti | 74 | $ python3 detect.py --images data/samples/ <img src="assets/giraffe.png" width="480"\> <img src="assets/dog.png" width="480"\> <img src="assets/traffic.png" width="480"\> <img src="assets/messi.png" width="480"\> ## Train For argument descriptions have a lock at `python3 train.py --help` #### Example (COCO) To train on COCO using a Darknet-53 backend pretrained on ImageNet run: ``` $ python3 train.py --data config/coco.data --pretrained_weights weights/darknet53.conv.74 ``` #### Training log ``` ---- [Epoch 7/100, Batch 7300/14658] ---- +------------+--------------+--------------+--------------+ | Metrics | YOLO Layer 0 | YOLO Layer 1 | YOLO Layer 2 | +------------+--------------+--------------+--------------+ | grid_size | 16 | 32 | 64 | | loss | 1.554926 | 1.446884 | 1.427585 | | x | 0.028157 | 0.044483 | 0.051159 | | y | 0.040524 | 0.035687 | 0.046307 | | w | 0.078980 | 0.066310 | 0.027984 | | h | 0.133414 | 0.094540 | 0.037121 | | conf | 1.234448 | 1.165665 | 1.223495 | | cls | 0.039402 | 0.040198 | 0.041520 | | cls_acc | 44.44% | 43.59% | 32.50% | | recall50 | 0.361111 | 0.384615 | 0.300000 | | recall75 | 0.222222 | 0.282051 | 0.300000 | | precision | 0.520000 | 0.300000 | 0.070175 | | conf_obj | 0.599058 | 0.622685 | 0.651472 | | conf_noobj | 0.003778 | 0.004039 | 0.004044 | +------------+--------------+--------------+--------------+ Total Loss 4.429395 ---- ETA 0:35:48.821929 ``` #### Tensorboard Track training progress in Tensorboard: * Initialize training * Run the command below * Go to http://localhost:6006/ ``` $ tensorboard --logdir='logs' --port=6006 ``` Storing the logs on a slow drive possibly leads to a significant training speed decrease. You can adjust the log directory using `--logdir <path>` when running `tensorboard` or the `train.py`. ## Train on Custom Dataset #### Custom model Run the commands below to create a custom model definition, replacing `<num-classes>` with the number of classes in your dataset. ``` $ cd config/ # Navigate to config dir $ bash create_custom_model.sh <num-classes> # Will create custom model 'yolov3-custom.cfg' ``` #### Classes Add class names to `data/custom/classes.names`. This file should have one row per class name. #### Image Folder Move the images of your dataset to `data/custom/images/`. #### Annotation Folder Move your annotations to `data/custom/labels/`. The dataloader expects that the annotation file corresponding to the image `data/custom/images/train.jpg` has the path `data/custom/labels/train.txt`. Each row in the annotation file should define one bounding box, using the syntax `label_idx x_center y_center width height`. The coordinates should be scaled `[0, 1]`, and the `label_idx` should be zero-indexed and correspond to the row number of the class name in `data/custom/classes.names`. #### Define Train and Validation Sets In `data/custom/train.txt` and `data/custom/valid.txt`, add paths to images that will be used as train and validation data respectively. #### Train To train on the custom dataset run: ``` $ python3 train.py --model config/yolov3-custom.cfg --data config/custom.data ``` Add `--pretrained_weights weights/darknet53.conv.74` to train using a backend pretrained on ImageNet. ## Credit ### YOLOv3: An Incremental Improvement _Joseph Redmon, Ali Farhadi_ **Abstract** We present some updates to YOLO! We made a bunch of little design changes to make it better. We also trained this new network that’s pretty swell. It’s a little bigger than last time but more accurate. It’s still fast though, don’t worry. At 320 × 320 YOLOv3 runs in 22 ms at 28.2 mAP, as accurate as SSD but three times faster. When we look at the old .5 IOU mAP detection metric YOLOv3 is quite good. It achieves 57.9 AP50 in 51 ms on a Titan X, compared to 57.5 AP50 in 198 ms by RetinaNet, similar performance but 3.8× faster. As always, all the code is online at https://pjreddie.com/yolo/. [[Paper]](https://pjreddie.com/media/files/papers/YOLOv3.pdf) [[Project Webpage]](https://pjreddie.com/darknet/yolo/) [[Authors' Implementation]](https://github.com/pjreddie/darknet) ``` @article{yolov3, title={YOLOv3: An Incremental Improvement}, author={Redmon, Joseph and Farhadi, Ali}, journal = {arXiv}, year={2018} } ```