Lab 3 Part 2 Report
Group Members:
Carolyn Cui 100399399
Haolin Li 218600849
Robert Palermo
Introduction
In our exploration of object detection models, we aim to evaluate the performance of existing popular models and, in addition,
to compare the process of single-node deep learning with that of distributed deep learning.
The topic that best suits our needs in this foray into object detection models is stop signs. Our motivation for choosing this topic is
that stop sign detection, along with general street sign detection, remains a key challenge that any fully or semi-automated car must
overcome to be viable for the market.
Nearly 700,000 accidents occur around stop signs each year in the U.S., of which more than 80% are preventable. Numerous
solutions already exist, and their performance continues to improve. Understandably, given the scrutiny and safety concerns
involved, these models must be rigorously trained, tested, and fine-tuned. The rollout of AI-assisted technology in cars benefits
not only day-to-day drivers but also disabled, elderly, and otherwise impaired drivers, and we inch ever closer to fully automated
rider experiences as these models improve. As such, we decided that this topic is doable within the time limit, familiar to our
group, and gives us firsthand experience of the challenges that accompany an object detection task.
Due to familiarity and ease of use, we decided to use the YOLO family of models, known for their accuracy and real-time
performance. This makes them well suited to tasks that require both high accuracy and speed, such as ours: detecting stop signs.
Dataset Curation
The dataset was created last year from photos taken around UC Merced, the Bay Area, and San Diego. Additional images are
screenshots courtesy of Google Street View. Totaling around 70 images, our dataset includes nighttime photos, stop signs at varying
distances, and other street signs (e.g., do not enter). Roboflow was used to annotate the dataset.
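As a sketch of how an annotated Roboflow dataset can be pulled into a training environment in YOLO format, the snippet below uses the Roboflow Python package; the API key, workspace, project, and version identifiers are placeholders rather than our actual values:

```python
from roboflow import Roboflow  # pip install roboflow

# Placeholder credentials and identifiers; substitute your own.
rf = Roboflow(api_key="YOUR_API_KEY")
project = rf.workspace("your-workspace").project("stop-sign-detection")

# Download a given dataset version with YOLOv5-format annotations.
dataset = project.version(1).download("yolov5")
print(dataset.location)  # local path to the exported images and labels
```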
We acknowledge that, for object detection, this dataset is on the smaller end. Though the optimal dataset size depends on the
nature of the data and how accurate we want the model to be, roughly 150 images would be preferred for a more usable model.
We opted for a lower number due to time and resource constraints.
Single-Node
For all single-node training, we decided to use the YOLO models. The original YOLO, and all versions through YOLOv3, were
created by Joseph Redmon and built on the Darknet neural network framework. Alexey Bochkovskiy continued Redmon's work
with YOLOv4, which boasts higher accuracy but is not necessarily faster.
Branching off from there, we chose YOLOv5 because of its existing documentation for both single-node and distributed training;
its adjacency in version numbering also makes it a natural point of comparison against YOLOv4.
All training was conducted through Google Colab with a provided Tesla T4 GPU.
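As a quick sanity check, which is not part of the training itself, the GPU assigned by Colab can be confirmed from a notebook cell with PyTorch:

```python
import torch

# Verify that Colab assigned a CUDA-capable GPU to this runtime.
assert torch.cuda.is_available(), "No GPU assigned; check the Colab runtime type."
print(torch.cuda.get_device_name(0))  # e.g., "Tesla T4"
```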
YOLOv4
Overview
As this initial application of YOLOv4 was one of our first experiences with anything related to AI/ML, the approach and evaluation
could be better, and some metrics may be missing.
Training
Though a model was provided, we made adjustments to suit our needs better. Our model runs for up to 2,000 iterations, with some
values predetermined as required by Darknet, such as a learning rate of 0.001 and a batch size of 64. We settled on 32 subdivisions,
meaning the 64-image batch is split into 32 mini-batches, so two images are evaluated simultaneously.
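For reference, the hyperparameters above correspond to lines in the [net] section of the Darknet .cfg file. This is a sketch showing only the values we mention, not our complete configuration:

```
[net]
batch=64
subdivisions=32
learning_rate=0.001
max_batches=2000
```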