Python-Pytorch实现用于图像语义分割UNet

共18个文件

py：13个

pth：1个

txt：1个

Python开发-机器学习

3星 · 超过75%的资源需积分: 45 16 浏览量 2019-08-11 06:35:34 上传评论 12 收藏 47.26MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

Python-Pytorch实现用于图像语义分割UNet.zip （18个子文件）

Pytorch-UNet-master

eval.py 653B

MODEL.pth 51.14MB

submit.py 781B

train.py 4KB

predict.py 5KB

LICENSE 34KB

requirements.txt 53B

.gitignore 62B

dice_loss.py 1KB

README.md 2KB

unet

unet_parts.py 2KB

__init__.py 29B

unet_model.py 911B

utils

crf.py 658B

utils.py 2KB

__init__.py 84B

load.py 1KB

data_vis.py 274B

# Pytorch-UNet ![input and output for a random image in the test dataset](https://framapic.org/OcE8HlU6me61/KNTt8GFQzxDR.png) Customized implementation of the [U-Net](https://arxiv.org/pdf/1505.04597.pdf) in Pytorch for Kaggle's [Carvana Image Masking Challenge](https://www.kaggle.com/c/carvana-image-masking-challenge) from a high definition image. This was used with only one output class but it can be scaled easily. This model was trained from scratch with 5000 images (no data augmentation) and scored a [dice coefficient](https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient) of 0.988423 (511 out of 735) on over 100k test images. This score is not quite good but could be improved with more training, data augmentation, fine tuning, playing with CRF post-processing, and applying more weights on the edges of the masks. The model used for the last submission is stored in the `MODEL.pth` file, if you wish to play with it. The data is available on the [Kaggle website](https://www.kaggle.com/c/carvana-image-masking-challenge/data). ## Usage **Note : Use Python 3** ### Prediction You can easily test the output masks on your images via the CLI. To see all options: `python predict.py -h` To predict a single image and save it: `python predict.py -i image.jpg -o output.jpg` To predict a multiple images and show them without saving them: `python predict.py -i image1.jpg image2.jpg --viz --no-save` You can use the cpu-only version with `--cpu`. You can specify which model file to use with `--model MODEL.pth`. ### Training `python train.py -h` should get you started. A proper CLI is yet to be added. ## Warning In order to process the image, it is split into two squares (a left on and a right one), and each square is passed into the net. The two square masks are then merged again to produce the final image. As a consequence, the height of the image must be strictly superior than half the width. Make sure the width is even too. ## Dependencies This package depends on [pydensecrf](https://github.com/lucasb-eyer/pydensecrf), available via `pip install`. ## Notes on memory The model has be trained from scratch on a GTX970M 3GB. Predicting images of 1918*1280 takes 1.5GB of memory. Training takes approximately 3GB, so if you are a few MB shy of memory, consider turning off all graphical displays. This assumes you use bilinear up-sampling, and not transposed convolution in the model.

评论收藏

内容反馈