from __future__ import print_function
try:
import argparse
import os
import numpy as np
from torch.autograd import Variable
from torch.autograd import grad as torch_grad
import torch
import torchvision
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader
from torchvision import datasets
import torchvision.transforms as transforms
from torchvision.utils import save_image
from itertools import chain as ichain
except ImportError as e:
print(e)
    raise
os.makedirs("images", exist_ok=True)
parser = argparse.ArgumentParser(description="ClusterGAN Training Script")
parser.add_argument("-n", "--n_epochs", dest="n_epochs", default=200, type=int, help="Number of epochs")
parser.add_argument("-b", "--batch_size", dest="batch_size", default=64, type=int, help="Batch size")
parser.add_argument("-i", "--img_size", dest="img_size", type=int, default=28, help="Size of image dimension")
parser.add_argument("-d", "--latent_dim", dest="latent_dim", default=30, type=int, help="Dimension of latent space")
parser.add_argument("-l", "--lr", dest="learning_rate", type=float, default=0.0001, help="Learning rate")
parser.add_argument("-c", "--n_critic", dest="n_critic", type=int, default=5, help="Number of training steps for discriminator per iter")
parser.add_argument("-w", "--wass_flag", dest="wass_flag", action='store_true', help="Flag for Wasserstein metric")
args = parser.parse_args()
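# Example invocation (illustrative sketch; replace the script name with this file's
# actual name, and add -w to enable the Wasserstein/WGAN-GP objective):
#   python clustergan.py -n 200 -b 64 -i 28 -d 30 -l 0.0001 -c 5 -w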
# Sample a random latent space vector
def sample_z(shape=64, latent_dim=10, n_c=10, fix_class=-1, req_grad=False):
assert (fix_class == -1 or (fix_class >= 0 and fix_class < n_c) ), "Requested class %i outside bounds."%fix_class
Tensor = torch.cuda.FloatTensor
# Sample noise as generator input, zn
zn = Variable(Tensor(0.75*np.random.normal(0, 1, (shape, latent_dim))), requires_grad=req_grad)
######### zc, zc_idx variables with grads, and zc to one-hot vector
# Pure one-hot vector generation
zc_FT = Tensor(shape, n_c).fill_(0)
    # zc_idx: long tensor of length `shape` that will hold one cluster index per sample
zc_idx = torch.empty(shape, dtype=torch.long)
if (fix_class == -1):
        # Fill with random integers in the range [0, n_c - 1]
zc_idx = zc_idx.random_(n_c).cuda()
        # Put a 1 at the sampled index in each row to build the one-hot encoding
zc_FT = zc_FT.scatter_(1, zc_idx.unsqueeze(1), 1.)
else:
zc_idx[:] = fix_class
zc_FT[:, fix_class] = 1
zc_idx = zc_idx.cuda()
zc_FT = zc_FT.cuda()
zc = Variable(zc_FT, requires_grad=req_grad)
# Return components of latent space variable
return zn, zc, zc_idx
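
# Illustrative usage of sample_z (a sketch, not part of the training loop; assumes a
# CUDA device is available and the default n_c = 10 clusters):
#   zn, zc, zc_idx = sample_z(shape=args.batch_size, latent_dim=args.latent_dim, n_c=10)
#   zn.shape -> (batch_size, latent_dim); zc.shape -> (batch_size, n_c); zc_idx holds the class index per row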
def calc_gradient_penalty(netD, real_data, generated_data):
# GP strength
LAMBDA = 10
b_size = real_data.size()[0]
# Calculate interpolation
alpha = torch.rand(b_size, 1, 1, 1)
alpha = alpha.expand_as(real_data)
alpha = alpha.cuda()
interpolated = alpha * real_data.data + (1 - alpha) * generated_data.data
interpolated = Variable(interpolated, requires_grad=True)
interpolated = interpolated.cuda()
# Calculate probability of interpolated examples
prob_interpolated = netD(interpolated)
# Calculate gradients of probabilities with respect to examples
gradients = torch_grad(outputs=prob_interpolated, inputs=interpolated,
grad_outputs=torch.ones(prob_interpolated.size()).cuda(),
create_graph=True, retain_graph=True)[0]
# Gradients have shape (batch_size, num_channels, img_width, img_height),
# so flatten to easily take norm per example in batch
gradients = gradients.view(b_size, -1)
# Derivatives of the gradient close to 0 can cause problems because of
# the square root, so manually calculate norm and add epsilon
gradients_norm = torch.sqrt(torch.sum(gradients ** 2, dim=1) + 1e-12)
# Return gradient penalty
return LAMBDA * ((gradients_norm - 1) ** 2).mean()
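
# Illustrative use of the gradient penalty inside a critic (discriminator) update
# (a sketch; `discriminator`, `real_imgs`, and `gen_imgs` are placeholder names assumed
# to exist in the training loop, not defined here):
#   grad_penalty = calc_gradient_penalty(discriminator, real_imgs, gen_imgs)
#   d_loss = discriminator(gen_imgs).mean() - discriminator(real_imgs).mean() + grad_penalty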
# Weight Initializer
def initialize_weights(net):
for m in net.modules():
if isinstance(m, nn.Conv2d):
m.weight.data.normal_(0, 0.02)
m.bias.data.zero_()
elif isinstance(m, nn.ConvTranspose2d):
m.weight.data.normal_(0, 0.02)
m.bias.data.zero_()
elif isinstance(m, nn.Linear):
m.weight.data.normal_(0, 0.02)
m.bias.data.zero_()
# Softmax function
def softmax(x):
return F.softmax(x, dim=1)
class Reshape(nn.Module):
"""
Class for performing a reshape as a layer in a sequential model.
"""
def __init__(self, shape=[]):
super(Reshape, self).__init__()
self.shape = shape
def forward(self, x):
return x.view(x.size(0), *self.shape)
def extra_repr(self):
        # (Optional) Extra information about this module; it shows up when an
        # instance of this class is printed.
return 'shape={}'.format(
self.shape
)
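
# Minimal sketch of Reshape in isolation (illustrative only):
#   layer = Reshape((128, 7, 7))
#   out = layer(torch.randn(4, 128 * 7 * 7))   # out.shape == (4, 128, 7, 7)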
class Generator_CNN(nn.Module):
"""
CNN to model the generator of a ClusterGAN
    Input is a vector from representation space of dimension z_dim
    Output is a vector from image space of dimension X_dim
"""
# Architecture : FC1024_BR-FC7x7x128_BR-(64)4dc2s_BR-(1)4dc2s_S
def __init__(self, latent_dim, n_c, x_shape, verbose=False):
super(Generator_CNN, self).__init__()
self.name = 'generator'
self.latent_dim = latent_dim
self.n_c = n_c
self.x_shape = x_shape
self.ishape = (128, 7, 7)
self.iels = int(np.prod(self.ishape))
self.verbose = verbose
self.model = nn.Sequential(
# Fully connected layers
torch.nn.Linear(self.latent_dim + self.n_c, 1024),
nn.BatchNorm1d(1024),
nn.LeakyReLU(0.2, inplace=True),
torch.nn.Linear(1024, self.iels),
nn.BatchNorm1d(self.iels),
nn.LeakyReLU(0.2, inplace=True),
# Reshape to 128 x (7x7)
Reshape(self.ishape),
# Upconvolution layers
nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1, bias=True),
nn.BatchNorm2d(64),
nn.LeakyReLU(0.2, inplace=True),
nn.ConvTranspose2d(64, 1, 4, stride=2, padding=1, bias=True),
nn.Sigmoid()
)
initialize_weights(self)
if self.verbose:
print("Setting up {}...\n".format(self.name))
print(self.model)
def forward(self, zn, zc):
        # Concatenate noise and one-hot class vectors along dim 1, e.g. (64, 30 + 10)
z = torch.cat((zn, zc), 1)
x_gen = self.model(z)
# Reshape for output
x_gen = x_gen.view(x_gen.size(0), *self.x_shape)
return x_gen
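
# Illustrative generator call (a sketch; assumes CUDA and 10 clusters, consistent with
# the defaults used above):
#   generator = Generator_CNN(args.latent_dim, 10, (1, args.img_size, args.img_size)).cuda()
#   zn, zc, zc_idx = sample_z(shape=args.batch_size, latent_dim=args.latent_dim, n_c=10)
#   gen_imgs = generator(zn, zc)   # -> (batch_size, 1, 28, 28)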
class Encoder_CNN(nn.Module):
"""
CNN to model the encoder of a ClusterGAN
    Input is a vector X from image space of dimension X_dim
    Output is a vector z from representation space of dimension z_dim
"""
def __init__(self, latent_dim, n_c, verbose=False):
super(Encoder_CNN, self).__init__()
self.name = 'encoder'
self.channels = 1
self.latent_dim = latent_dim
self.n_c = n_c
self.cshape = (128, 5, 5)
self.iels = int(np.prod(self.cshape))
self.lshape = (self.iels,)
self.verbose = verbose
self.model = nn.Sequential(
# Convolutional layers
nn.Conv2d(self.channels, 64, 4, stride=2, bias=True),
nn.LeakyReLU(0.2, inplace=True),
nn.Conv2d(64, 128, 4, stride=2, bias=True),
nn.LeakyReLU(0.2, inplace=True),
# Flatten
Reshape(self.lshape),
# Fully connected layers
torch.nn.Linear(self.iels, 1024),
nn.LeakyReLU(0.2, inplace=True),
torch.nn.Linear(1024, latent_dim + n_c)
)
initialize_weights(self)
        if self.verbose:
            print("Setting up {}...\n".format(self.name))
            print(self.model)