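Generative Adversarial Networks - Generating Interactive Images with Generative Adversarial Networks - Project Source Code + Step-by-Step Tutorial - Quality Hands-On Project.zip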

58 files in total

py: 42 files

png: 5 files

sh: 4 files


129 views · Uploaded 2024-10-17 22:02:40 · 9.31MB ZIP


生成对抗网络_通过生成对抗网络生成交互式图像_附项目源码+流程教程_优质项目实战

iGAN_script.py 4KB

lib

utils.py 4KB

__init__.py 0B

inits.py 3KB

ilsvrc_2012_mean.npy 1.5MB

AlexNet.py 5KB

rng.py 442B

theano_utils.py 530B

updates.py 8KB

activations.py 2KB

ops.py 4KB

HOGNet.py 3KB

image_save.py 2KB

costs.py 883B

html.py 2KB

generate_samples.py 2KB

constrained_opt_theano.py 5KB

datasets

scripts

download_hdf5_dataset.sh 300B

constrained_opt.py 9KB

train_dcgan

train_dcgan_utils.py 7KB

batchnorm_dcgan.py 4KB

train_script.sh 516B

get_num_images.py 480B

create_hdf5.py 4KB

upgrade_model.py 1KB

load.py 2KB

batchnorm_predict_z.py 4KB

train_dcgan_config.py 3KB

train_predict_z.py 7KB

train_dcgan.py 6KB

README.md 5KB

pack_model.py 1KB

model_def

dcgan_theano.py 9KB

__init__.py 0B

dcgan_theano_config.py 638B

models

scripts

download_dcgan_model.sh 214B

download_alexnet.sh 223B

pics

script_result.png 461KB

input_color.png 417B

input_edge.png 388B

predict.jpg 116KB

shoes_test.png 9KB

input_color_mask.png 302B

demo_teaser.jpg 151KB

ui_intro.jpg 93KB

demo.gif 7.88MB

iGAN_main.py 4KB

__init__.py 0B

ui_recorder.py 2KB

ui_color.py 1KB

ui_warp.py 3KB

gui_draw.py 12KB

gui_vis.py 4KB

ui_sketch.py 1KB

gui_design.py 7KB

save_result.py 2KB

README.md 10KB

iGAN_predict.py 7KB

## Interactive Image Generation via Generative Adversarial Networks

<img src='pics/demo.gif' width=320> <img src='pics/demo_teaser.jpg' width=800>

Given a few user strokes, our system can produce photo-realistic samples that best satisfy the user edits in real time. Our system is based on deep generative models such as Generative Adversarial Networks ([GAN](https://arxiv.org/abs/1406.2661)) and [DCGAN](https://github.com/Newmu/dcgan_code). The system serves the following two purposes:

* An intelligent drawing interface for automatically generating images inspired by the color and shape of the brush strokes.
* An interactive visual debugging tool for understanding and visualizing deep generative models. By interacting with the generative model, a developer can understand what visual content the model can produce, as well as the limitations of the model.

Please cite our paper if you find this code useful in your research. (Contact: Jun-Yan Zhu, junyanz at mit dot edu)

## Getting started

* Install the Python libraries (see [Requirements](#requirements)).
* Download the code.
* Download the model.
  (See `Model Zoo` for details):
``` bash
bash ./models/scripts/download_dcgan_model.sh outdoor_64
```
* Run the Python script:
``` bash
THEANO_FLAGS='device=gpu0, floatX=float32, nvcc.fastmath=True' python iGAN_main.py --model_name outdoor_64
```

## Requirements

The code is written in Python2 and requires the following 3rd party libraries:

* numpy
* [OpenCV](http://opencv.org/)
```bash
sudo apt-get install python-opencv
```
* [Theano](https://github.com/Theano/Theano)
```bash
sudo pip install --upgrade --no-deps git+git://github.com/Theano/Theano.git
```
* [PyQt4](https://wiki.python.org/moin/PyQt4): more details on Qt installation can be found [here](http://www.saltycrane.com/blog/2008/01/how-to-install-pyqt4-on-ubuntu-linux/)
```bash
sudo apt-get install python-qt4
```
* [Qdarkstyle](https://github.com/ColinDuquesnoy/QDarkStyleSheet)
```bash
sudo pip install qdarkstyle
```
* [dominate](https://github.com/Knio/dominate)
```bash
sudo pip install dominate
```
* GPU + CUDA + cuDNN: The code is tested on GTX Titan X + CUDA 7.5 + cuDNN 5. Here are the tutorials on how to install [CUDA](http://www.r-tutor.com/gpu-computing/cuda-installation/cuda7.5-ubuntu) and [cuDNN](http://askubuntu.com/questions/767269/how-can-i-install-cudnn-on-ubuntu-16-04). A decent GPU is required to run the system in real time. [**Warning**] If you run the program on a GPU server, you need to use remote desktop software (e.g., VNC), which may introduce display artifacts and latency problems.

## Python3

For `Python3` users, you need to replace `pip` with `pip3`:

* PyQt4 with Python3:
``` bash
sudo apt-get install python3-pyqt4
```
* OpenCV3 with Python3: see the installation [instructions](http://www.pyimagesearch.com/2015/07/20/install-opencv-3-0-and-python-3-4-on-ubuntu/).

## Interface:

<img src='pics/ui_intro.jpg' width=800>

#### Layout

* Drawing Pad: This is the main window of our interface. A user can apply different edits via our brush tools, and the system will display the generated image.
  Check/uncheck the `Edits` button to display/hide user edits.
* Candidate Results: a display showing thumbnails of all the candidate results (e.g., different modes) that fit the user edits. A user can click a mode (highlighted by a green rectangle), and the drawing pad will show this result.
* Brush Tools: `Coloring Brush` for changing the color of a specific region; `Sketching Brush` for outlining the shape; `Warping Brush` for modifying the shape more explicitly.
* Slider Bar: drag the slider bar to explore the interpolation sequence between the initial result (i.e., a randomly generated image) and the current result (e.g., an image that satisfies the user edits).
* Control Panel: `Play`: play the interpolation sequence; `Fix`: use the current result as additional constraints for further editing; `Restart`: restart the system; `Save`: save the result to a webpage; `Edits`: check the box if you would like to show the edits on top of the generated image.

#### User interaction

* `Coloring Brush`: right-click to select a color; hold left-click to paint; scroll the mouse wheel to adjust the width of the brush.
* `Sketching Brush`: hold left-click to sketch the shape.
* `Warping Brush`: We recommend you first use coloring and sketching before the warping brush. Right-click to select a square region; hold left-click to drag the region; scroll the mouse wheel to adjust the size of the square region.
* Shortcuts: P for `Play`; F for `Fix`; R for `Restart`; S for `Save`; E for `Edits`; Q for quitting the program.
* Tooltips: when you move the cursor over a button, the system will display the tooltip of the button.

## Model Zoo:

Download the Theano DCGAN model (e.g., outdoor_64). Before using our system, please check out the random real images vs. DCGAN generated samples to see what kind of images a model can produce.
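
As a rough illustration of how a model name maps to its hosted files, the sketch below builds the model URL and the two sample-image URLs (real vs. DCGAN) for a given name. The URL patterns are copied from the links listed in this section; the helper names `model_url` and `sample_urls` are hypothetical and not part of the iGAN code, and what `download_dcgan_model.sh` does internally is not shown here.

```python
# Hypothetical helpers (not part of iGAN) mirroring the URL patterns used in this README.
BASE_URL = 'http://efrosgans.eecs.berkeley.edu/iGAN'


def model_url(model_name, model_type='dcgan_theano'):
    """URL of a pre-trained model, e.g. .../theano_dcgan/outdoor_64.dcgan_theano."""
    return '%s/models/theano_dcgan/%s.%s' % (BASE_URL, model_name, model_type)


def sample_urls(model_name):
    """URLs of the 'real' and 'DCGAN generated' sample sheets for a model."""
    real = '%s/samples/%s_real.png' % (BASE_URL, model_name)
    fake = '%s/samples/%s_dcgan.png' % (BASE_URL, model_name)
    return real, fake


if __name__ == '__main__':
    # Model names as listed in the Model Zoo.
    for name in ['outdoor_64', 'church_64', 'handbag_64', 'shoes_64', 'hed_shoes_64']:
        print(name, model_url(name), *sample_urls(name))
```

Opening the two sample URLs side by side is a quick way to judge what a model can and cannot produce before loading it into the interface.
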
``` bash
bash ./models/scripts/download_dcgan_model.sh outdoor_64
```
* [outdoor_64.dcgan_theano](http://efrosgans.eecs.berkeley.edu/iGAN/models/theano_dcgan/outdoor_64.dcgan_theano) (64x64): trained on 150K landscape images from the MIT [Places](http://places.csail.mit.edu/) dataset [[Real](http://efrosgans.eecs.berkeley.edu/iGAN/samples/outdoor_64_real.png) vs. [DCGAN](http://efrosgans.eecs.berkeley.edu/iGAN/samples/outdoor_64_dcgan.png)].
* [church_64.dcgan_theano](http://efrosgans.eecs.berkeley.edu/iGAN/models/theano_dcgan/church_64.dcgan_theano) (64x64): trained on 126K church images from the [LSUN](http://lsun.cs.princeton.edu/2016/) challenge [[Real](http://efrosgans.eecs.berkeley.edu/iGAN/samples/church_64_real.png) vs. [DCGAN](http://efrosgans.eecs.berkeley.edu/iGAN/samples/church_64_dcgan.png)].
* [handbag_64.dcgan_theano](http://efrosgans.eecs.berkeley.edu/iGAN/models/theano_dcgan/handbag_64.dcgan_theano) (64x64): trained on 137K handbag images downloaded from Amazon [[Real](http://efrosgans.eecs.berkeley.edu/iGAN/samples/handbag_64_real.png) vs. [DCGAN](http://efrosgans.eecs.berkeley.edu/iGAN/samples/handbag_64_dcgan.png)].
* [shoes_64.dcgan_theano](http://efrosgans.eecs.berkeley.edu/iGAN/models/theano_dcgan/shoes_64.dcgan_theano) (64x64): trained on 50K shoe images collected by [Yu and Grauman](http://vision.cs.utexas.edu/projects/finegrained/utzap50k/) [[Real](http://efrosgans.eecs.berkeley.edu/iGAN/samples/shoes_64_real.png) vs. [DCGAN](http://efrosgans.eecs.berkeley.edu/iGAN/samples/shoes_64_dcgan.png)].
* [hed_shoes_64.dcgan_theano](http://efrosgans.eecs.berkeley.edu/iGAN/models/theano_dcgan/hed_shoes_64.dcgan_theano) (64x64): trained on 50K shoe sketches (computed by [HED](https://github.com/s9xie/hed)) [[Real](http://efrosgans.eecs.berkeley.edu/iGAN/samples/hed_shoes_64_real.png) vs. [DCGAN](http://efrosgans.eecs.berkeley.edu/iGAN/samples/hed_shoes_64_dcgan.png)].
  (Use this model with the `--shadow` flag.)

We provide a simple script to generate samples from a pre-trained DCGAN model. You can run this script to test whether Theano, CUDA, and cuDNN are configured properly before running our interface.
```bash
THEANO_FLAGS='device=gpu0, floatX=float32, nvcc.fastmath=True' python generate_samples.py --model_name outdoor_64 --output_image outdoor_64_dcgan.png
```

## Command line arguments:

Type `python iGAN_main.py --help` for a complete list of the arguments. Here we discuss some important arguments:

* `--model_name`: the name of the model (e.g., outdoor_64, shoes_64, etc.)
* `--model_type`: currently only supports dcgan_theano.
* `--model_file`: the file that stores the generative model; if not specified, `model_file='./models/%s.%s' % (model_name, model_type)`
* `--top_k`: the number of candidate results being displayed
* `--average`: show an average image in the main window. Inspired by [AverageExplorer](https://www.cs.cmu.edu/~junyanz/projects/averageExplorer/), the average image is a weighted average of multiple generated results, with the weights reflecting user-indi
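
The `--model_file` fallback described in the list above can be sketched with `argparse`. This is a minimal illustration and not the actual contents of `iGAN_main.py`: the flag names come from this README, while the function name `parse_args`, the default values (for example `top_k=16`), and the help strings are assumptions.

```python
import argparse


def parse_args(argv=None):
    # Flag names are taken from the README; defaults and help text are assumed.
    parser = argparse.ArgumentParser(description='illustrative iGAN-style arguments')
    parser.add_argument('--model_name', default='outdoor_64',
                        help='name of the model, e.g. outdoor_64 or shoes_64')
    parser.add_argument('--model_type', default='dcgan_theano',
                        help='the README lists dcgan_theano as the only supported type')
    parser.add_argument('--model_file', default=None,
                        help='file that stores the generative model')
    parser.add_argument('--top_k', type=int, default=16,
                        help='number of candidate results displayed (assumed default)')
    parser.add_argument('--average', action='store_true',
                        help='show an average image in the main window')
    args = parser.parse_args(argv)
    if args.model_file is None:
        # Fallback stated in the README when --model_file is not given.
        args.model_file = './models/%s.%s' % (args.model_name, args.model_type)
    return args


if __name__ == '__main__':
    args = parse_args(['--model_name', 'shoes_64'])
    print(args.model_file)
```

Resolving the path after parsing, rather than in the `default=` of `--model_file`, lets the fallback depend on whatever `--model_name` and `--model_type` the user actually passed.
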
