![](assets/logo.png)
# Fast Segment Anything
[[`Paper`](https://arxiv.org/pdf/2306.12156.pdf)] [[`HuggingFace Demo`](https://huggingface.co/spaces/An-619/FastSAM)] [[`Colab demo`](https://colab.research.google.com/drive/1oX14f6IneGGw612WgVlAiy91UHwFAvr9?usp=sharing)] [[`Replicate demo & API`](https://replicate.com/casia-iva-lab/fastsam)] [[`Model Zoo`](#model-checkpoints)] [[`BibTeX`](#citing-fastsam)]
![FastSAM Speed](assets/head_fig.png)
The **Fast Segment Anything Model (FastSAM)** is a CNN-based Segment Anything Model trained using only 2% of the SA-1B dataset published by the SAM authors. FastSAM achieves performance comparable to SAM at **50× higher run-time speed**.
![FastSAM design](assets/Overview.png)
**Updates**
- **`2023/06/29`** Support [text mode](https://huggingface.co/spaces/An-619/FastSAM) in HuggingFace Space. Thanks a lot to [gaoxinge](https://github.com/gaoxinge) for the help.
- **`2023/06/29`** Release [FastSAM_Awesome_TensorRT](https://github.com/ChuRuaNh0/FastSam_Awsome_TensorRT). Thanks a lot to [ChuRuaNh0](https://github.com/ChuRuaNh0) for providing the TensorRT model of FastSAM.
- **`2023/06/26`** Release [FastSAM Replicate Online Demo](https://replicate.com/casia-iva-lab/fastsam). Thanks a lot to [Chenxi](https://chenxwh.github.io/) for providing this nice demo.
- **`2023/06/26`** Support [points mode](https://huggingface.co/spaces/An-619/FastSAM) in HuggingFace Space. Better and faster interaction will come soon!
- **`2023/06/24`** Thanks a lot to [Grounding-SAM](https://github.com/IDEA-Research/Grounded-Segment-Anything) for combining Grounding-DINO with FastSAM in [Grounded-FastSAM](https://github.com/IDEA-Research/Grounded-Segment-Anything/tree/main/EfficientSAM).
## Installation
Clone the repository locally:
```shell
git clone https://github.com/CASIA-IVA-Lab/FastSAM.git
```
Create the conda env. The code requires `python>=3.7`, as well as `pytorch>=1.7` and `torchvision>=0.8`. Please follow the instructions [here](https://pytorch.org/get-started/locally/) to install both PyTorch and TorchVision dependencies. Installing both PyTorch and TorchVision with CUDA support is strongly recommended.
```shell
conda create -n FastSAM python=3.9
conda activate FastSAM
```
Install the packages:
```shell
cd FastSAM
pip install -r requirements.txt
```
Install CLIP:
```shell
pip install git+https://github.com/openai/CLIP.git
```
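FastSAM's text prompt mode relies on CLIP to match masks against the query. A quick smoke test that the package imported correctly (a minimal sketch; the ViT-B/32 weights are downloaded on first use, and the model choice here is only for illustration):
```python
# Optional smoke test for the CLIP install used by text prompt mode.
import clip
import torch

model, preprocess = clip.load("ViT-B/32", device="cpu")
tokens = clip.tokenize(["a photo of a dog"])
with torch.no_grad():
    text_features = model.encode_text(tokens)
print(text_features.shape)  # torch.Size([1, 512]) for ViT-B/32
```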
## <a name="GettingStarted"></a> Getting Started
First download a [model checkpoint](#model-checkpoints).
Then, you can run the scripts to try the everything mode and three prompt modes.
```shell
# Everything mode
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg
```
```shell
# Text prompt
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg --text_prompt "the yellow dog"
```
```shell
# Box prompt (xywh)
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg --box_prompt "[570,200,230,400]"
```
```shell
# Points prompt
python Inference.py --model_path ./weights/FastSAM.pt --img_path ./images/dogs.jpg --point_prompt "[[520,360],[620,300]]" --point_label "[1,0]"
```
You can use the following code to generate all masks, select masks based on prompts, and visualize the results.
```python
from fastsam import FastSAM, FastSAMPrompt

model = FastSAM('./weights/FastSAM.pt')
IMAGE_PATH = './images/dogs.jpg'
DEVICE = 'cpu'

# Generate all masks for the image in a single forward pass.
everything_results = model(IMAGE_PATH, device=DEVICE, retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)
prompt_process = FastSAMPrompt(IMAGE_PATH, everything_results, device=DEVICE)

# everything prompt: keep all masks
ann = prompt_process.everything_prompt()

# box prompt: bbox in [x1, y1, x2, y2] format (default [0, 0, 0, 0])
ann = prompt_process.box_prompt(bbox=[200, 200, 300, 300])

# text prompt
ann = prompt_process.text_prompt(text='a photo of a dog')

# point prompt: points as [[x1, y1], [x2, y2]] (default [[0, 0]]);
# point labels: 1 = foreground, 0 = background (default [0])
ann = prompt_process.point_prompt(points=[[620, 360]], pointlabel=[1])

prompt_process.plot(annotations=ann, output='./output/')
```
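The example above uses the CPU for portability; to run on a GPU, you can select the device dynamically (a small sketch using standard PyTorch calls):
```python
import torch

# Prefer the GPU when one is visible; fall back to CPU otherwise.
DEVICE = 'cuda' if torch.cuda.is_available() else 'cpu'
```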
You are also welcome to try our Colab demo: [FastSAM_example.ipynb](https://colab.research.google.com/drive/1oX14f6IneGGw612WgVlAiy91UHwFAvr9?usp=sharing).
## Different Inference Options
We provide various options for different purposes; details are in [MORE_USAGES.md](MORE_USAGES.md).
## Web demo
### Gradio demo
- We also provide a UI for testing our method, built with Gradio. You can upload a custom image, select the mode, set the parameters, and click the segment button to get the segmentation result. Everything mode and points mode are currently supported for interaction; support for the other modes is planned. Running the following command in a terminal will launch the demo:
```shell
# Download the pre-trained model to "./weights/FastSAM.pt" first
python app_gradio.py
```
- This demo is also hosted on [HuggingFace Space](https://huggingface.co/spaces/An-619/FastSAM).
![HF_Everything](assets/hf_everything_mode.png) ![HF_Points](assets/hf_points_mode.png)
### Replicate demo
- The [Replicate demo](https://replicate.com/casia-iva-lab/fastsam) supports all modes; you can try the points, box, and text modes there.
![Replicate-1](assets/replicate-1.png) ![Replicate-2](assets/replicate-2.png) ![Replicate-3](assets/replicate-3.png)
## <a name="Models"></a>Model Checkpoints
Two versions of the model are available, with different sizes. Click the links below to download the checkpoint for the corresponding model type.
- **`default` or `FastSAM`: [YOLOv8x based Segment Anything Model](https://drive.google.com/file/d/1m1sjY4ihXBU1fZXdQ-Xdj-mDltW-2Rqv/view?usp=sharing) | [Baidu Cloud (pwd: 0000).](https://pan.baidu.com/s/18KzBmOTENjByoWWR17zdiQ?pwd=0000)**
- `FastSAM-s`: [YOLOv8s based Segment Anything Model.](https://drive.google.com/file/d/10XmSj6mmpmRb8NhXbtiuO9cTTBwR_9SV/view?usp=sharing)
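Both checkpoints are loaded the same way through the API from [Getting Started](#GettingStarted); for example (assuming the smaller checkpoint was saved as `./weights/FastSAM-s.pt`, a hypothetical local path):
```python
from fastsam import FastSAM

# The YOLOv8s-based checkpoint trades some accuracy for a smaller footprint.
model = FastSAM('./weights/FastSAM-s.pt')
```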
## Results
All results were measured on a single NVIDIA GeForce RTX 3090.
### 1. Inference time
Running speed under different numbers of point prompts (ms).
| method  | params | 1   | 10  | 100 | E(16×16) | E(32×32*) | E(64×64) |
|:-------:|:------:|:---:|:---:|:---:|:--------:|:---------:|:--------:|
| SAM-H   | 0.6G   | 446 | 464 | 627 | 852      | 2099      | 6972     |
| SAM-B   | 136M   | 110 | 125 | 230 | 432      | 1383      | 5417     |
| FastSAM | 68M    | 40  | 40  | 40  | 40       | 40        | 40       |
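A rough way to reproduce the FastSAM row is to time repeated inference calls after a warm-up run (a sketch built on the API shown in Getting Started, not the paper's benchmarking harness; the image and run count here are arbitrary):
```python
import time
from fastsam import FastSAM

model = FastSAM('./weights/FastSAM.pt')

# Warm-up run so model loading and CUDA initialization are not timed.
model('./images/dogs.jpg', device='cuda', retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)

runs = 10
start = time.perf_counter()
for _ in range(runs):
    model('./images/dogs.jpg', device='cuda', retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)
print(f"avg latency: {(time.perf_counter() - start) / runs * 1000:.1f} ms")
```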
### 2. Memory usage
| Dataset | Method | GPU Memory (MB) |
| :-------: | :-----: | :-------------: |
| COCO 2017 | FastSAM | 2608 |
| COCO 2017 | SAM-H | 7060 |
| COCO 2017 | SAM-B | 4670 |
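Peak GPU memory of a run can be inspected with PyTorch's built-in counters (a minimal sketch; the table's figures come from the authors' own measurement setup and may include overheads these counters do not track):
```python
import torch
from fastsam import FastSAM

torch.cuda.reset_peak_memory_stats()
model = FastSAM('./weights/FastSAM.pt')
model('./images/dogs.jpg', device='cuda', retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)

# Peak memory allocated by tensors (excludes the CUDA context itself).
print(f"peak allocated: {torch.cuda.max_memory_allocated() / 2**20:.0f} MiB")
```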
### 3. Zero-shot Transfer Experiments
#### Edge Detection
Tested on the BSDS500 dataset.
|method | year| ODS | OIS | AP | R50 |
|:----------:|:-------:|:--------:|:--------:|:------:|:-----:|
| HED | 2015| .788 | .808 | .840 | .923 |
| SAM | 2023| .768 | .786 | .794 | .928 |
| FastSAM | 2023| .750 | .790 | .793 | .903 |
#### Object Proposals
##### COCO
| method | AR10 | AR100 | AR1000 | AUC |
| :-------: | :--: | :---: | :-----: | :--: |
| SAM-H E64 | 15.5 | 45.6 | 67.7 | 32.1 |
| SAM-H E32 | 18.5 | 49.5 | 62.5 | 33.7 |
| SAM-B E32 | 11.4 | 39.6 | 59.1 | 27.3 |
| FastSAM | 15.7 | 47.3 | 63.7 | 32.2 |
##### LVIS
bbox AR@1000:

| method                          | all  | small | med. | large |
|:-------------------------------:|:----:|:-----:|:----:|:-----:|
| ViTDet-H (supervised)           | 65.0 | 53.2  | 83.3 | 91.2  |
| *zero-shot transfer methods:*   |      |       |      |       |
| SAM-H E64                       | 52.1 | 36.6  | 75.1 | 88.2  |
| SAM-H E32                       | 50.3 | 33.1  | 76.2 | 89.8  |
| SAM-B E32                       | 45.0 | 29.3  | 68.7 | 80.6  |
| FastSAM                         | 57.1 | 44.3  | 77.1 | 85.3  |
#### Instance Segmentation On COCO 2017
| method | AP | APS | APM | APL |
| :------: | :--: | :--: | :--: | :--: |