![](assets/logo.png)
# Fast Segment Anything
[[`Paper`](https://arxiv.org/pdf/2306.12156.pdf)] [[`🤗HuggingFace Demo`](https://huggingface.co/spaces/An-619/FastSAM)] [[`Colab demo`](https://colab.research.google.com/drive/1oX14f6IneGGw612WgVlAiy91UHwFAvr9?usp=sharing)] [[`Replicate demo & API`](https://replicate.com/casia-iva-lab/fastsam)] [~~[`OpenXLab Demo`](https://openxlab.org.cn/apps/detail/zxair/FastSAM)~~] [[`Model Zoo`](#model-checkpoints)] [[`BibTeX`](#citing-fastsam)] [[`Video Demo`](https://youtu.be/yHNPyqazYYU)]
![FastSAM Speed](assets/head_fig.png)
The **Fast Segment Anything Model (FastSAM)** is a CNN-based Segment Anything Model trained on only 2% of the SA-1B dataset published by the SAM authors. FastSAM achieves performance comparable to SAM at **50× higher run-time speed**.
![FastSAM design](assets/Overview.png)
**Updates**
- **`2023/11/28`** Recommendation: [Semantic FastSAM](https://github.com/KBH00/Semantic-Fast-SAM), which adds semantic class labels to FastSAM. Thanks to [KBH00](https://github.com/KBH00/Semantic-Fast-SAM) for this valuable contribution.
- **`2023/09/11`** Release [Training and Validation Code](https://github.com/CASIA-IVA-Lab/FastSAM/releases).
- **`2023/08/17`** Release [OpenXLab Demo](https://openxlab.org.cn/apps/detail/zxair/FastSAM). Thanks to the OpenXLab Team for their help.
- **`2023/07/06`** Added to the [Ultralytics (YOLOv8) Model Hub](https://docs.ultralytics.com/models/fast-sam/). Thanks to [Ultralytics](https://github.com/ultralytics/ultralytics) for their help.
- **`2023/06/29`** Support [text mode](https://huggingface.co/spaces/An-619/FastSAM) in the HuggingFace Space. Thanks a lot to [gaoxinge](https://github.com/gaoxinge) for the help.
- **`2023/06/29`** Release [FastSAM_Awesome_TensorRT](https://github.com/ChuRuaNh0/FastSam_Awsome_TensorRT). Thanks a lot to [ChuRuaNh0](https://github.com/ChuRuaNh0) for providing the TensorRT model of FastSAM.
- **`2023/06/26`** Release [FastSAM Replicate Online Demo](https://replicate.com/casia-iva-lab/fastsam). Thanks a lot to [Chenxi](https://chenxwh.github.io/) for providing this nice demo.
- **`2023/06/26`** Support [points mode](https://huggingface.co/spaces/An-619/FastSAM) in the HuggingFace Space. Better and faster interaction will come soon!
- **`2023/06/24`** Thanks a lot to [Grounding-SAM](https://github.com/IDEA-Research/Grounded-Segment-Anything) for combining Grounding-DINO with FastSAM in [Grounded-FastSAM](https://github.com/IDEA-Research/Grounded-Segment-Anything/tree/main/EfficientSAM).
## Installation
Clone the repository locally:
```shell
git clone https://github.com/CASIA-IVA-Lab/FastSAM.git
```
Create the conda environment. The code requires `python>=3.7`, as well as `pytorch>=1.7` and `torchvision>=0.8`. Please follow the instructions [here](https://pytorch.org/get-started/locally/) to install PyTorch and TorchVision; installing both with CUDA support is strongly recommended.
```shell
conda create -n FastSAM python=3.9
conda activate FastSAM
```
Install the packages:
```shell
cd FastSAM
pip install -r requirements.txt
```
Install CLIP:
```shell
pip install git+https://github.com/openai/CLIP.git
```
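To quickly confirm the environment, a minimal sanity check (assuming PyTorch and TorchVision were installed as described above; CUDA is optional but recommended):
```python
import torch
import torchvision

# The code requires pytorch>=1.7 and torchvision>=0.8.
print("torch:", torch.__version__)
print("torchvision:", torchvision.__version__)

# CUDA support is strongly recommended for the reported run-time speeds.
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```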
## <a name="GettingStarted"></a> Getting Started
First download a [model checkpoint](#model-checkpoints).
Then, run the scripts below to try everything mode and the three prompt modes.
```shell
# Everything mode
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg
```
```shell
# Text prompt
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg --text_prompt "the yellow dog"
```
```shell
# Box prompt (xywh)
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg --box_prompt "[[570,200,230,400]]"
```
```shell
# Points prompt
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg --point_prompt "[[520,360],[620,300]]" --point_label "[1,0]"
```
You can use the following Python code to generate all masks, select masks based on prompts, and visualize the results.
```python
from fastsam import FastSAM, FastSAMPrompt
model = FastSAM('./weights/FastSAM.pt')
IMAGE_PATH = './images/dogs.jpg'
DEVICE = 'cpu'
everything_results = model(IMAGE_PATH, device=DEVICE, retina_masks=True, imgsz=1024, conf=0.4, iou=0.9,)
prompt_process = FastSAMPrompt(IMAGE_PATH, everything_results, device=DEVICE)
# everything prompt
ann = prompt_process.everything_prompt()
# box prompt: boxes are [x1, y1, x2, y2] (default [[0, 0, 0, 0]])
ann = prompt_process.box_prompt(bboxes=[[200, 200, 300, 300]])
# text prompt
ann = prompt_process.text_prompt(text='a photo of a dog')
# point prompt
# points: [[x1, y1], [x2, y2], ...] (default [[0, 0]])
# point_label: one label per point, 0 = background, 1 = foreground (default [0])
ann = prompt_process.point_prompt(points=[[620, 360]], pointlabel=[1])
prompt_process.plot(annotations=ann, output_path='./output/dog.jpg')
```
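Note that the CLI's `--box_prompt` above takes boxes as `[x, y, w, h]`, while `box_prompt()` in the Python API expects `[x1, y1, x2, y2]`. A minimal conversion sketch, continuing from the snippet above (the helper name is ours, not part of the package):
```python
def xywh_to_xyxy(box):
    """Convert an [x, y, w, h] box (CLI --box_prompt format) to [x1, y1, x2, y2]."""
    x, y, w, h = box
    return [x, y, x + w, y + h]

# The CLI example box [570, 200, 230, 400] becomes [570, 200, 800, 600]:
ann = prompt_process.box_prompt(bboxes=[xywh_to_xyxy([570, 200, 230, 400])])
```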
You are also welcome to try our Colab demo: [FastSAM_example.ipynb](https://colab.research.google.com/drive/1oX14f6IneGGw612WgVlAiy91UHwFAvr9?usp=sharing).
## Different Inference Options
We provide various options for different purposes; details are in [MORE_USAGES.md](MORE_USAGES.md).
## Training or Validation
For training from scratch or validation, see the [Training and Validation Code](https://github.com/CASIA-IVA-Lab/FastSAM/releases).
## Web demo
### Gradio demo
- We also provide a UI for testing our method, built with Gradio. You can upload a custom image, select a mode, set the parameters, click the segment button, and get the segmentation result. Currently, the UI supports interaction in 'Everything mode' and 'points mode'; we plan to add support for the other modes in the future. Run the following command in a terminal to launch the demo:
```shell
# Download the pre-trained model to ./weights/FastSAM.pt first
python app_gradio.py
```
- This demo is also hosted on [HuggingFace Space](https://huggingface.co/spaces/An-619/FastSAM).
![HF_Everyhting](assets/hf_everything_mode.png) ![HF_Points](assets/hf_points_mode.png)
### Replicate demo
- The [Replicate demo](https://replicate.com/casia-iva-lab/fastsam) supports all modes; you can try the points/box/text prompts there.
![Replicate-1](assets/replicate-1.png) ![Replicate-2](assets/replicate-2.png) ![Replicate-3](assets/replicate-3.png)
## <a name="Models"></a>Model Checkpoints
Two versions of the model with different sizes are available. Click the links below to download the checkpoint for the corresponding model type.
- **`default` or `FastSAM`: [YOLOv8x based Segment Anything Model](https://drive.google.com/file/d/1m1sjY4ihXBU1fZXdQ-Xdj-mDltW-2Rqv/view?usp=sharing) | [Baidu Cloud (pwd: 0000).](https://pan.baidu.com/s/18KzBmOTENjByoWWR17zdiQ?pwd=0000)**
- `FastSAM-s`: [YOLOv8s based Segment Anything Model.](https://drive.google.com/file/d/10XmSj6mmpmRb8NhXbtiuO9cTTBwR_9SV/view?usp=sharing)
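Both checkpoints are used the same way; a minimal sketch switching to the lighter `FastSAM-s` model (the `./weights/FastSAM-s.pt` path is our assumption; place the downloaded file wherever you prefer):
```python
from fastsam import FastSAM

# FastSAM-s is the smaller YOLOv8s-based variant; the API is identical to FastSAM.
model = FastSAM('./weights/FastSAM-s.pt')
results = model('./images/dogs.jpg', device='cpu', retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)
```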
## Results
All results were tested on a single NVIDIA GeForce RTX 3090.
### 1. Inference time
Running speed under different numbers of point prompts (ms).
| method | params | 1 | 10 | 100 | E(16x16) | E(32x32\*) | E(64x64) |
|:------------------:|:--------:|:-----:|:-----:|:-----:|:----------:|:-----------:|:----------:|
| SAM-H | 0.6G | 446 | 464 | 627 | 852 | 2099 | 6972 |
| SAM-B | 136M | 110 | 125 | 230 | 432 | 1383 | 5417 |
| FastSAM | 68M | 40 | 40 | 40 | 40 | 40 | 40 |
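For reference, a minimal sketch of how a single-image latency of this kind can be measured with the Python API shown above (assuming a CUDA device; the exact benchmark protocol behind the table may differ):
```python
import time
import torch
from fastsam import FastSAM

model = FastSAM('./weights/FastSAM.pt')

# Warm up once so model loading and CUDA initialization are not timed.
model('./images/dogs.jpg', device='cuda', retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)

torch.cuda.synchronize()
start = time.perf_counter()
for _ in range(10):
    model('./images/dogs.jpg', device='cuda', retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)
torch.cuda.synchronize()
print(f"average latency: {(time.perf_counter() - start) / 10 * 1000:.1f} ms")
```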
### 2. Memory usage
| Dataset | Method | GPU Memory (MB) |
| :-------: | :-----: | :-------------: |
| COCO 2017 | FastSAM | 2608 |
| COCO 2017 | SAM-H | 7060 |
| COCO 2017 | SAM-B | 4670 |
### 3. Zero-shot Transfer Experiments
#### Edge Detection
Tested on the BSDS500 dataset.
|method | year| ODS | OIS | AP | R50 |
|:----------:|: