CycleGAN is a variant of a generative adversarial network that was introduced to perform image translation from domain X to domain Y without a paired set of training examples. In a previous blog, I have already described CycleGAN in detail. In this blog, we will implement CycleGAN to translate apple images to orange images and vice versa with the help of the Keras library. Here are some recommended blogs that you should read before implementing CycleGAN:
- Cycle-Consistent Generative Adversarial Networks (CycleGAN)
- Image to Image Translation Using Conditional GAN
- Implementation of Image-to-image translation using conditional GAN
Load and Preprocess the Dataset
Unlike other image-translation algorithms, CycleGAN does not require a paired dataset. Hence, we will use two sets of images here: one consisting of apple images and the other of orange images, with no pairing between them. Here are some images from the dataset:
You can download the dataset from this link, or run the following commands from your terminal.
```bash
wget https://people.eecs.berkeley.edu/~taesung_park/CycleGAN/datasets/apple2orange.zip
unzip apple2orange.zip
```
The dataset consists of four folders: trainA, trainB, testA, and testB. The 'A' folders contain apple images and the 'B' folders contain orange images. The training set has approximately 1000 images of each type and the test set has approximately 200 images of each type.
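If you want to verify the folder layout and image counts after unzipping, here is a minimal check (this assumes the archive was extracted to /content/apple2orange, the same location used in the loading code below):

```python
import os

data_root = '/content/apple2orange'   # same location used in the loading code below
for folder in ['trainA', 'trainB', 'testA', 'testB']:
    print(folder, len(os.listdir(os.path.join(data_root, folder))))
```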
So, let’s first import all the required libraries:
```python
import cv2
import os
from tqdm import tqdm
from keras.layers import (BatchNormalization, Reshape, Dense, Input, LeakyReLU, Conv2D,
                          Conv2DTranspose, Concatenate, ReLU, Dropout, ZeroPadding2D)
from keras.models import Model
from keras.initializers import RandomNormal
from keras.optimizers import Adam
import numpy as np
import time
```
The dataset is already partly preprocessed, since all images have the same size, (256, 256, 3). The remaining preprocessing steps we are going to use are normalization and random flipping: every image is normalized to the range [-1, 1] and randomly flipped horizontally. Here is the code:
```python
def load_img(file_path):
    img = cv2.imread(file_path)
    # randomly flip the image horizontally
    if np.random.rand() > 0.5:
        img = cv2.flip(img, 1)
    # normalize pixel values to [-1, 1]
    img = (img / 127.5) - 1
    return img
```
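As a quick sanity check on the loader (again assuming the dataset was unzipped to /content/apple2orange, the path used in the next snippet):

```python
sample_dir = '/content/apple2orange/trainA'
sample = load_img(os.path.join(sample_dir, os.listdir(sample_dir)[0]))
print(sample.shape, sample.min(), sample.max())   # expect (256, 256, 3) with values in [-1, 1]
```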
Now load the training images from the directories into lists.
```python
train_a = []
train_b = []

trainA_path = r'/content/apple2orange/trainA'
for files in tqdm(os.listdir(trainA_path)):
    file_path = os.path.join(trainA_path, files)
    input_img = load_img(file_path)
    train_a.append(input_img)

trainB_path = r'/content/apple2orange/trainB'
for files in tqdm(os.listdir(trainB_path)):
    file_path = os.path.join(trainB_path, files)
    input_img = load_img(file_path)
    train_b.append(input_img)

train_a = np.array(train_a)
train_b = np.array(train_b)
```
Build the Generator
The network architecture that I have used is very similar to the one used in image-to-image translation with conditional GAN. The major difference is the loss function: CycleGAN introduces two additional losses, cycle-consistency loss and identity loss.
Here the generator network is a U-Net architecture: an encoder-decoder model with skip connections between the encoder and decoder. We will use two generator networks. One translates from apple to orange (G: X -> Y) and the other translates from orange to apple (F: Y -> X). Each generator network consists of an encoder and a decoder. Each encoder block consists of three layers (Conv -> BatchNorm -> LeakyReLU), and each decoder block consists of four layers (Transposed Conv -> BatchNorm -> Dropout -> ReLU). The generator takes an image as input and outputs a generated image; both images have a size of (256, 256, 3). Here is the code:
```python
def generator():
    image_input = Input(shape=(256, 256, 3))

    # Encoder Network
    conv_1 = Conv2D(64, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(image_input)
    act_1 = LeakyReLU(alpha=0.2)(conv_1)

    conv_2 = Conv2D(128, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_1)
    batch_norm_2 = BatchNormalization(momentum=0.8)(conv_2)
    act_2 = LeakyReLU(alpha=0.2)(batch_norm_2)

    conv_3 = Conv2D(256, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_2)
    batch_norm_3 = BatchNormalization(momentum=0.8)(conv_3)
    act_3 = LeakyReLU(alpha=0.2)(batch_norm_3)

    conv_4 = Conv2D(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_3)
    batch_norm_4 = BatchNormalization(momentum=0.8)(conv_4)
    act_4 = LeakyReLU(alpha=0.2)(batch_norm_4)

    conv_5 = Conv2D(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_4)
    batch_norm_5 = BatchNormalization(momentum=0.8)(conv_5)
    act_5 = LeakyReLU(alpha=0.2)(batch_norm_5)

    conv_6 = Conv2D(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_5)
    batch_norm_6 = BatchNormalization(momentum=0.8)(conv_6)
    act_6 = LeakyReLU(alpha=0.2)(batch_norm_6)

    conv_7 = Conv2D(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_6)
    batch_norm_7 = BatchNormalization()(conv_7)
    act_7 = LeakyReLU(alpha=0.2)(batch_norm_7)

    conv_8 = Conv2D(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_7)
    batch_norm_8 = BatchNormalization(momentum=0.8)(conv_8)
    act_8 = LeakyReLU(alpha=0.2)(batch_norm_8)

    # Decoder Network and skip connections with encoder
    convt_1 = Conv2DTranspose(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_8)
    batch_normt_1 = BatchNormalization(momentum=0.8)(convt_1)
    drop_1 = Dropout(0.5)(batch_normt_1)
    actt_1 = ReLU()(drop_1)
    concat_1 = Concatenate()([actt_1, act_7])

    convt_2 = Conv2DTranspose(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_1)
    batch_normt_2 = BatchNormalization(momentum=0.8)(convt_2)
    drop_2 = Dropout(0.5)(batch_normt_2)
    actt_2 = ReLU()(drop_2)
    concat_2 = Concatenate()([actt_2, act_6])

    convt_3 = Conv2DTranspose(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_2)
    batch_normt_3 = BatchNormalization(momentum=0.8)(convt_3)
    drop_3 = Dropout(0.5)(batch_normt_3)
    actt_3 = ReLU()(drop_3)
    concat_3 = Concatenate()([actt_3, act_5])

    convt_4 = Conv2DTranspose(512, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_3)
    batch_normt_4 = BatchNormalization(momentum=0.8)(convt_4)
    actt_4 = ReLU()(batch_normt_4)
    concat_4 = Concatenate()([actt_4, act_4])

    convt_5 = Conv2DTranspose(256, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_4)
    batch_normt_5 = BatchNormalization(momentum=0.8)(convt_5)
    actt_5 = ReLU()(batch_normt_5)
    concat_5 = Concatenate()([actt_5, act_3])

    convt_6 = Conv2DTranspose(128, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_5)
    batch_normt_6 = BatchNormalization(momentum=0.8)(convt_6)
    actt_6 = ReLU()(batch_normt_6)
    concat_6 = Concatenate()([actt_6, act_2])

    convt_7 = Conv2DTranspose(64, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_6)
    batch_normt_7 = BatchNormalization(momentum=0.8)(convt_7)
    actt_7 = ReLU()(batch_normt_7)
    concat_7 = Concatenate()([actt_7, act_1])

    outputs = Conv2DTranspose(3, 4, strides=2, use_bias=False, activation='tanh', kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(concat_7)

    gen_model = Model(image_input, outputs)
    # gen_model.summary()
    return gen_model
```
```python
genA = generator()   # G: X -> Y (apple to orange)
genB = generator()   # F: Y -> X (orange to apple)
```
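As a quick sanity check, the generators should map a (256, 256, 3) input back to a (256, 256, 3) output:

```python
print(genA.output_shape)   # expected: (None, 256, 256, 3)
```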
Build the Discriminator
The discriminator network is a PatchGAN, pretty similar to the one used in the code for image-to-image translation with conditional GAN. Two discriminators will be used here. One discriminates between real orange images and images generated by generator A, and the other discriminates between real apple images and images generated by generator B.
This PatchGAN is nothing but a convolutional network. The difference between a PatchGAN and a normal convolutional network is that, instead of producing a single scalar output, it generates an NxN array. Each element of this NxN array maps to a patch of the input image, and the values are then averaged to classify the whole image as real or fake.
```python
def discriminator():
    img_inp = Input(shape=(256, 256, 3))

    conv_1 = Conv2D(64, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(img_inp)
    act_1 = LeakyReLU(alpha=0.2)(conv_1)

    conv_2 = Conv2D(128, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_1)
    batch_norm_2 = BatchNormalization(momentum=0.8)(conv_2)
    act_2 = LeakyReLU(alpha=0.2)(batch_norm_2)

    conv_3 = Conv2D(256, 4, strides=2, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02), padding='same')(act_2)
    batch_norm_3 = BatchNormalization(momentum=0.8)(conv_3)
    act_3 = LeakyReLU(alpha=0.2)(batch_norm_3)

    zero_pad = ZeroPadding2D()(act_3)
    conv_4 = Conv2D(512, 4, strides=1, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02))(zero_pad)
    batch_norm_4 = BatchNormalization(momentum=0.8)(conv_4)
    act_4 = LeakyReLU(alpha=0.2)(batch_norm_4)

    zero_pad_1 = ZeroPadding2D()(act_4)
    outputs = Conv2D(1, 4, strides=1, use_bias=False, kernel_initializer=RandomNormal(mean=0., stddev=0.02))(zero_pad_1)

    disc_model = Model(img_inp, outputs)
    # disc_model.summary()
    return disc_model
```
```python
discA = discriminator()
discB = discriminator()
discA.summary()
```
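To see where the patch size used later (disc_patch) comes from: the three stride-2 convolutions reduce the 256x256 input to 32x32, and the two unpadded 4x4 convolutions (each preceded by zero padding) give 34 -> 31 and 33 -> 30, so the discriminator outputs a 30x30x1 patch map. A quick check:

```python
print(discA.output_shape)   # expected: (None, 30, 30, 1)
```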
Combined Network
Now we will create a combined network to train the generator models. Here both discriminators will be non-trainable. To train the generator networks, we will also use the cycle-consistency loss and the identity loss.
Cycle consistency says that if we translate an English sentence into a French sentence and then translate it back into English, we should arrive at the original sentence. To calculate the cycle-consistency loss, first pass input image A to generator A, then pass the predicted output to generator B, and compute the loss between the image reconstructed by generator B and the original input image A. The same applies when image B is passed through generator B and then generator A.
In the case of identity loss: generator A is trained to translate images from domain A into images that look like domain B. The identity loss ensures that if we instead pass an image from domain B to generator A, it should still produce an image from domain B, i.e. return it essentially unchanged (and likewise for generator B with domain-A images). A rough sketch of these two losses is shown below, followed by the code for the combined model.
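Here is a rough standalone sketch in plain numpy (not part of the Keras model below, which implements these terms via its 'mae' outputs) of what the two extra losses measure:

```python
def cycle_consistency_loss(real_a, reconstructed_a):
    # reconstructed_a would be genB(genA(real_a)); mean absolute error, like the 'mae' loss below
    return np.mean(np.abs(real_a - reconstructed_a))

def identity_loss(real_b, same_b):
    # same_b would be genA(real_b); generator A should leave domain-B images unchanged
    return np.mean(np.abs(real_b - same_b))
```

With that intuition in place, here is the code for the combined model: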
```python
def combined():
    inputA = Input(shape=(256, 256, 3))
    inputB = Input(shape=(256, 256, 3))

    # translated images
    gen_imgB = genA(inputA)   # apple -> orange
    gen_imgA = genB(inputB)   # orange -> apple

    # for cycle consistency
    reconstruct_imgA = genB(gen_imgB)
    reconstruct_imgB = genA(gen_imgA)

    # identity mapping
    gen_orig_imgB = genA(inputB)
    gen_orig_imgA = genB(inputA)

    # freeze the discriminators while training the generators
    discA.trainable = False
    discB.trainable = False

    valid_imgA = discA(gen_imgA)
    valid_imgB = discB(gen_imgB)

    comb_model = Model([inputA, inputB],
                       [valid_imgA, valid_imgB, reconstruct_imgA, reconstruct_imgB,
                        gen_orig_imgA, gen_orig_imgB])
    # comb_model.summary()
    return comb_model
```
```python
comb_model = combined()
```
Loss, Optimizer and Compile the Models
Here we use MSE loss for the discriminator networks and for the adversarial outputs of the combined model, and MAE loss for the cycle-consistency and identity outputs of the generators. The optimizer used here is Adam. The batch size for the network is 1 and the total number of epochs is 200.
```python
optimizer = Adam(0.0002, 0.5)

# compile the discriminators as trainable models; combined() set them to
# non-trainable, which should only apply to the combined model compiled below
discA.trainable = True
discB.trainable = True
discA.compile(loss='mse', optimizer=optimizer, metrics=['accuracy'])
discB.compile(loss='mse', optimizer=optimizer, metrics=['accuracy'])

# keep the discriminators frozen inside the combined model
discA.trainable = False
discB.trainable = False
comb_model.compile(loss=['mse', 'mse', 'mae', 'mae', 'mae', 'mae'],
                   loss_weights=[1, 1, 10, 10, 1, 1],
                   optimizer=optimizer)

# output shape of the PatchGAN discriminator
disc_patch = (30, 30, 1)
epochs = 200

# labels for real and fake patches
valid = np.ones((1,) + disc_patch)
fake = np.zeros((1,) + disc_patch)
```
Train the Network
- Generate an image from generator A using an image from domain A. Similarly, generate an image from generator B using an image from domain B.
- Train discriminator A on a batch using images from domain A as real images and images generated by generator B as fake images.
- Train discriminator B on a batch using images from domain B as real images and images generated by generator A as fake images.
- Train the generators on a batch using the combined model.
- Repeat steps 1 to 4 for every image in the training dataset, and then repeat this process for 200 epochs.
```python
def train():
    for j in range(epochs):
        t1 = time.time()
        # iterate over the unpaired training images (the two sets may differ in size)
        for i in range(min(len(train_a), len(train_b))):
            img_a = np.expand_dims(train_a[i], axis=0)
            img_b = np.expand_dims(train_b[i], axis=0)

            img_b_gen = genA.predict(img_a)   # fake orange
            img_a_gen = genB.predict(img_b)   # fake apple

            # train discriminator A
            dA_real_loss = discA.train_on_batch(img_a, valid)
            dA_fake_loss = discA.train_on_batch(img_a_gen, fake)

            # train discriminator B
            dB_real_loss = discB.train_on_batch(img_b, valid)
            dB_fake_loss = discB.train_on_batch(img_b_gen, fake)

            # train the generators through the combined model
            g_loss = comb_model.train_on_batch([img_a, img_b],
                                               [valid, valid, img_a, img_b, img_a, img_b])

        print('time taken for one epoch', time.time() - t1)
        print(j, i, dA_real_loss, dA_fake_loss, dB_real_loss, dB_fake_loss, g_loss)
```
```python
train()
```
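Once training has run for a while, you can translate a test image and save the result. Here is a minimal sketch, reusing load_img from above (note that it may randomly flip the input) and assuming the dataset location used earlier; the output filename is just an example:

```python
# translate one apple test image into an orange-style image with genA
test_dir = '/content/apple2orange/testA'
test_img = load_img(os.path.join(test_dir, os.listdir(test_dir)[0]))
pred = genA.predict(np.expand_dims(test_img, axis=0))[0]

# undo the [-1, 1] normalization before writing the image to disk
pred = ((pred + 1) * 127.5).astype(np.uint8)
cv2.imwrite('translated_orange.jpg', pred)   # example output filename
```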
Hope you enjoyed reading.
If you have any doubts or suggestions, please feel free to ask and I will do my best to help or improve myself. Good-bye until next time.