这是通过级联文本笔画检测和擦除来删除场景文本的最小实现。这个github存储库用于研究场景文本擦除的图像修复。谢谢：）.zip资源-CSDN文库

共22个文件

png：10个

py：6个

db：2个

版权申诉

102 浏览量 2023-03-21 23:59:13 上传评论收藏 1.19MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

这是通过级联文本笔画检测和擦除来删除场景文本的最小实现。这个github存储库用于研究场景文本擦除的图像修复。谢谢：）.zip （22个子文件）

SceneTextRemover-pytorch-main

losses.py 525B

doc

model.png 280KB

dataset_example.png 222KB

epoch5.PNG 94KB

Thumbs.db 12KB

text.png 1KB

epoch10.PNG 138KB

epoch120.PNG 89KB

epoch1.PNG 139KB

epoch30.PNG 134KB

epoch50.PNG 83KB

back.png 14KB

LICENSE 1KB

network.py 8KB

dataset.py 2KB

modules.py 4KB

requirements.txt 35B

.gitignore 2KB

train.py 8KB

create_dataset.py 4KB

results

show

Thumbs.db 18KB

README.md 3KB

# Scene Text Remover Pytorch Implementation This is a minimal implementation of [Scene text removal via cascaded text stroke detection and erasing](https://arxiv.org/pdf/2011.09768.pdf). This github repository is for studying on image in-painting for scene text erasing. Thank you :) ## Requirements Python 3.7 or later with all [requirements.txt](./requirements.txt) dependencies installed, including `torch>=1.6`. To install run: ``` $ pip install -r requirements.txt ``` ## Model Summary ![model architecture](./doc/model.png) This model has u-net sub modules. `Gd` detects text stroke image `Ms` with `I` and `M`. `G'd` detects more precise text stroke `M's`. Similarly, `Gr` generates text erased image `Ite`, and `G'r` generates more precise output `I'te`. ## Custom Dictionary Not to be confused, I renamed the names. `I` : Input Image (with text) `Mm` : Text area mask (`M` in the model) `Ms` : Text stroke mask; output of `Gd` `Ms_` : Text stroke mask; output of `G'd` `Msgt` : Text stroke mask ; ground truth `Ite` : Text erased image; output of `Gr` `Ite_` : Text erased image; output of `G'r` `Itegt`: Text erased image; ground truth ## Prepare Dataset You need to prepare background images in `backs` directory and text binary images in `font_mask` directory. ![background image, text image example](./doc/back.png) [part of background image sample, text binary image sample] Executing `python create_dataset.py` will automatically generate `I`, `Itegt`, `Mm`, `Msgt` data. (If you already have `I`, `Itegt`, `Mm`, `Msgt`, you can skip this section) ``` ├─dataset │ ├─backs │ │ # background images │ └─font_mask │ │ # text binary images │ └─train │ │ └─I │ │ └─Itegt │ │ └─Mm │ │ └─Msgt │ └─val │ └─I │ └─Itegt │ └─Mm │ └─Msgt ``` I generated my dataset with 709 background images and 2410 font mask. I used 17040 pairs for training and 4260 pairs for validation. ![](./doc/dataset_example.png) Thanks for helping me gathering background images [sina-Kim]([sina-Kim (github.com)](https://github.com/sina-Kim)). ## Train All you need to do is: ``` python python train.py ``` ## Result From the left `I`, `Itegt`, `Ite`, `Ite_`, `Msgt`, `Ms`, `Ms_` * Epoch 2 ![](./doc/epoch1.PNG) * Epoch 5 ![](./doc/epoch5.PNG) * Epoch 10 ![](./doc/epoch10.PNG) * Epoch 30 ![](./doc/epoch30.PNG) * Epoch 50 ![](./doc/epoch50.PNG) * Epoch 120 ![](./doc/epoch120.PNG) These are not good enough for real task. I think the reason is lack of dataset and simplicity. But, it was a good experience for me to implement the paper. ## Issue If you are having a trouble to run this code, please use issue tab. Thank you.

评论收藏

内容反馈

版权申诉