yolov8系列--SalientfeatureextractorbasedonyoloV8.zip资源-CSDN文库

共16个文件

py：6个

md：4个

pt：2个

需积分: 5 129 浏览量 2024-02-24 21:46:27 上传评论收藏 94.53MB ZIP 举报

标题中的“yolov8系列--Salient feature extractor based on yoloV8”指的是一个基于YOLOv8的显著特征提取方法。YOLO（You Only Look Once）是一种目标检测算法，它以其高效和实时性而闻名。YOLOv8是YOLO系列的最新版本，可能在速度、精度或新特性上有所提升。而“显著特征提取”通常是指从图像中识别出最重要或最具有代表性的部分，这对于目标检测、图像识别和理解等任务至关重要。在YOLOv8中，显著特征提取可能涉及到以下知识点： 1. **网络架构**：YOLOv8可能会采用更先进的网络设计，如卷积神经网络（CNN）、残差块、注意力机制等，以增强对图像特征的学习能力。 2. **特征金字塔网络（FPN）**：YOLO系列经常利用FPN来捕获不同尺度的目标，FPN通过多尺度特征融合提高目标检测性能。 3. **注意力机制**：例如SE模块（Squeeze-and-Excitation）或CBAM（Convolutional Block Attention Module），可以增强模型对重要特征的响应，抑制不相关信息。 4. **锚框（Anchor Boxes）**：YOLO系列使用预定义的锚框来预测不同大小和比例的目标，YOLOv8可能优化了锚框的设计，使其更适合各种形状和尺寸的目标。 5. **损失函数**：YOLOv8可能采用了改进的损失函数，以平衡分类和定位的权重，提高训练效果。 6. **训练策略**：包括数据增强、批标准化、学习率调度等，以优化模型的泛化能力。 7. **优化器**：如Adam、SGD等，用于调整模型参数更新的方式。 8. **模型量化与剪枝**：为了实现更高的效率，YOLOv8可能涉及模型量化，将浮点运算转化为整数运算，以及模型剪枝，减少计算量而不显著影响性能。 9. **实时性和效率**：YOLO系列强调实时性，YOLOv8可能通过优化计算流程、减少计算复杂度等方式进一步提高了运行速度。 10. **应用场景**：显著特征提取在自动驾驶、视频监控、无人机、医学影像分析等领域有广泛应用，YOLOv8可能特别适合这些对实时性有高要求的场景。文件名称“kwan1120”看起来像是一个用户或日期的标识，但没有更多信息无法直接关联到技术细节。在实际项目中，这可能是代码库、日志文件或者训练记录的名称。 YOLOv8在显著特征提取方面可能包含了多种技术进步和创新，这些技术不仅提升了模型的性能，还可能优化了模型的运行效率，使其更适合实时应用。如果你深入研究这个项目，将有机会了解到更多关于深度学习、目标检测以及特征提取的前沿知识。

资源推荐

资源详情

资源评论

收起资源包目录

yolov8系列--Salient feature extractor based on yoloV8.zip （16个子文件）

kwan1120

extract.py 13KB

.github

ISSUE_TEMPLATE

feature_request.md 595B

bug_report.md 834B

utils

convert_to_yolo.py 2KB

split_test_to_val.py 1KB

move_img_to_dir.py 761B

masks_to_yoloTXT.py 2KB

stream_from_cam.py 3KB

docs

images

field6.gif 22.87MB

CODE_OF_CONDUCT.md 5KB

requirements.txt 57B

models

salient_extract_m.pt 52.22MB

salient_extract_n.pt 25.32MB

.gitignore 186B

README.md 5KB

COPYING 32KB

# Salient Extract Yolov8 based feature extraction model. The goal of this work is to develop a set of tools that enables users to easily collect training data from the field. Whether it be plants, minerals or other objects. The features of interest can then be extracted by the salient extraction model and superimposed on divers backgrounds. Creating synthetic dataset only comprised of images taken in the real world. This project is in very early development stages, expect bugs and frequent updates. ![example](docs/images/field6.gif) ## Getting started (Linux): Check your GPU install / drivers (optional): nvidia-smi Your output should look something like this: +-----------------------------------------------------------------------------+ | NVIDIA-SMI 525.60.11 Driver Version: 525.60.11 CUDA Version: 12.0 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 NVIDIA GeForce ... Off | 00000000:07:00.0 On | N/A | | 25% 38C P8 24W / 260W | 1236MiB / 11264MiB | 5% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ Install: git clone git@github.com:andrewjouffray/salient-extract.git cd salient-extract pip install -r requirements.txt Run: python extract.py --model models/salient_extract_n.pt --input yourVideo.mp4 --output output.mp4 --smooth --stitch ## Access the models (YoloV8) name | mAP@50 | location --- | --- | --- salient_n | 0.829 | models/salient_extract_n.pt salient_s | 0.877 | [patreon.com/SalientExtractAi (Free)](https://patreon.com/SalientExtractAi) salient_m | 0.723 | models/salient_extract_m.pt salient_m2 | 0.874 | [patreon.com/SalientExtractAi (Free)](https://patreon.com/SalientExtractAi) salient_l | 0.888 | [patreon.com/SalientExtractAi (Tier2)](https://patreon.com/SalientExtractAi) salient_x | 0.811 | [patreon.com/SalientExtractAi (Tier2)](https://patreon.com/SalientExtractAi) salient_x2 | 0.899 | [patreon.com/SalientExtractAi (Tier2)](https://patreon.com/SalientExtractAi) ## Access the Datasets (YoloV8 format): Some of them are __free__, so no need to subscribe to my Patreon to gain access. name | # of images | location --- | --- | --- synthetic salient objects | 120,000 + | [patreon.com/SalientExtractAi (Tier3)](https://patreon.com/SalientExtractAi) sample synthetic dataset | 355 | [patreon.com/SalientExtractAi (Free)](https://patreon.com/SalientExtractAi) validation | 204 | [patreon.com/SalientExtractAi (Free)](https://patreon.com/SalientExtractAi) MSRA_10K (yolo) | 10,000 | [patreon.com/SalientExtractAi (Free)](https://patreon.com/SalientExtractAi) ## How it works: This salient feature extractor is based on the yolov8-seg model, trained on synthetic data comprised of salient objects in a focused foreground superimposed over random blurred and in-focus background images. Therefore the model has a strong bias for in-focused objects, that are not your hands. ## arguments **--model -m:** example: `--model models/salient_extract_n.pt`. This is the path to the model you want to use. **--input -i:** example: `--input yourVideo.mp4`. This is the path to the video that you want to extract features out of. **--output -o:** example: `--output output.mp4`. This is the path and filename that you want to use to save all the extracted feature. It must be a video **--smooth -s:** If you use the `--smooth` flag, the script will stack the masks of every 4 frames together, attempting to compensate for the Jitteriness of the detection masks. This means that the cutouts might not be as accurate if the object moves a lot in the frame. **--stitch -t:** If you use the `--stitch` flag, the script will stitch the input frame, prediction frame and cut-out mask frames side by side. Omitting this flag will just output the cut-out frames. ## Current limitations: - Need for in-focus foreground and out-of-focus background - jitteriness of the detection masks - mask inaccuracies when using smooth mode - struggles with objects that are not one uniform mass ## generate semi-synthetic data: You can use the sister-project to salient extract [Composite Image Generator](https://github.com/andrewjouffray/Composite-Image-Generator), to generate images using the "copy paste" method. Note: this project needs a bit of work and a few updates. ## Special thanks Although this specific project has been developed on my own free time using my own resources. I would like to thank Dr. Rakesh Kaundal and the [KAABiL Lab](http://bioinfo.usu.edu/) for providing hardware and assistance in the development of early models that eventually led me to start this project.

评论收藏

内容反馈