Category Archives: GAN

Implementation of GANs to generated Handwritten Digits

In the previous blog, we studied about GANs, now in this blog, we will implement GANs to generate MNIST digits dataset.

In the generative adversarial networks, both generator and discriminator are trained simultaneously. Both networks can overpower each other if not trained properly. If discriminator is trained more than it will easily detect fake and real image then the generator will not able to generate real-looking images. And if the generator is trained heavily then discriminator will not be able to classify between real and fake images. We can solve this problem by properly setting the learning rate for both networks.

When we train discriminator we do not train generator and when we train generator we do not train discriminator. This makes the generator to train properly. Now, let’s look into the code for each part on the GAN network.

Discriminator Network:

We are using MNIST digits dataset which is having an image shape of (28, 28, 1). Since the image size is small we can use MLP network for discriminator instead of using convolutional layers. To do this first we need to reshape input into a single vector of size (784, 1). Then I have applied three dense layers of 512, 256 and 128 hidden units in each layers.

def discriminator(self):

    input_disc = Input(shape = (784,))
    hidden1 = Dense(512, activation = 'relu')(input_disc)
    hidden2 = Dense(256, activation = 'relu')(hidden1)
    hidden3 = Dense(128, activation = 'relu')(hidden2)
    output = Dense(1, activation = 'sigmoid')(hidden3)
    disc_model = Model(input_disc, output)
    disc_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])
    print(disc_model.summary())

    return disc_model

def discriminator(self):

input_disc = Input(shape = (784,))

hidden1 = Dense(512, activation = 'relu')(input_disc)

hidden2 = Dense(256, activation = 'relu')(hidden1)

hidden3 = Dense(128, activation = 'relu')(hidden2)

output = Dense(1, activation = 'sigmoid')(hidden3)

disc_model = Model(input_disc, output)

disc_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])

print(disc_model.summary())

return disc_model

Generator Network:

To create generator network we will first take random noise as input with the shape of (100, 1). Then I have used three hidden layers with shape of 256, 512 and 1024. The output of the generator network is then reshaped to (28, 28, 1). I have batch normalization in each hidden layer. Batch normalization improves the quality of the trained model and also stabilizes the training process.

def generator(self):

    input_gen = Input(shape = (self.latent_dim,))
    hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))
    hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))
    hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))
    output = Dense(784, activation='tanh')(hidden3)
    reshaped_output = Reshape((28, 28, 1))(output)
    gen_model = Model(input_gen, reshaped_output)
    gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)
    print(gen_model.summary())


    return gen_model

def generator(self):

input_gen = Input(shape = (self.latent_dim,))

hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))

hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))

hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))

output = Dense(784, activation='tanh')(hidden3)

reshaped_output = Reshape((28, 28, 1))(output)

gen_model = Model(input_gen, reshaped_output)

gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)

print(gen_model.summary())

return gen_model

Combined Model:

To train the generator we need to create a combined model where we do not train the discriminator model. In combined model random noise is being given as input to the generator network and the output image is then passed through the discriminator network to get the label. Here I have flagged discriminator model as non-trainable.

def combined(self):

    inputs = Input(shape = (self.latent_dim,)) 
    gen_img = self.generator_model(inputs)
    gen_img = Reshape((784,))(gen_img)
    self.discriminator_model.trainable = False
    outs = self.discriminator_model(gen_img)
    comb_model = Model(inputs, outs)
    comb_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])
    print(comb_model.summary())

    return comb_model

def combined(self):

inputs = Input(shape = (self.latent_dim,))

gen_img = self.generator_model(inputs)

gen_img = Reshape((784,))(gen_img)

self.discriminator_model.trainable = False

outs = self.discriminator_model(gen_img)

comb_model = Model(inputs, outs)

comb_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])

print(comb_model.summary())

return comb_model

Training the GAN network:

Training a GAN network requires careful hyper-parameters tuning. If the model is not trained carefully it will not converge to produce good results. We will use the following steps to train this GAN network:

Firstly we will normalize input dataset (MNIST images).
Train the discriminator with real images (from MNIST dataset)
Sample same number of noise vectors to predict the output from generator network (Generator is not trained here).
Train the discriminator network with images generated in the previous step.
Take new random samples to train the generator with a combined model without training discriminator.
Repeat from step 2-5 for some number of iterations. I have trained it for 30000 iterations.

def train(self):

    train_data = (self.x_train.astype(np.float32) - 127.5) / 127.5

    for i in range(self.iterations):

        batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))
        batch_x = train_data[batch_indx]
        batch_x = batch_x.reshape((-1, 784))

        input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))
        gen_outs = self.generator_model.predict(input_noise)
        gen_outs = gen_outs.reshape((-1, 784))

        real_loss = self.discriminator_model.train_on_batch(batch_x, np.ones((self.half_batch_size,1)))
        fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))

        disc_loss = 0.5*np.add(fake_loss,real_loss)

        full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))
        gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))

        print(i, disc_loss, gan_loss)

def train(self):

train_data = (self.x_train.astype(np.float32) - 127.5) / 127.5

for i in range(self.iterations):

batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))

batch_x = train_data[batch_indx]

batch_x = batch_x.reshape((-1, 784))

input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))

gen_outs = self.generator_model.predict(input_noise)

gen_outs = gen_outs.reshape((-1, 784))

real_loss = self.discriminator_model.train_on_batch(batch_x, np.ones((self.half_batch_size,1)))

fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))

disc_loss = 0.5*np.add(fake_loss,real_loss)

full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))

gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))

print(i, disc_loss, gan_loss)

Take a look into the generated images from this GAN network.

Here is the full code.

from keras.layers import Input, Dense, Reshape, BatchNormalization
from keras.models import Model
from keras.optimizers import Adam
from keras.datasets import mnist
import numpy as np

class GAN():
    def __init__(self):

        (self.x_train, self.y_train), (self.x_test, self.y_test) = mnist.load_data()
        self.batch_size = 128
        self.half_batch_size = 64
        self.latent_dim = 100
        self.iterations = 30000
        self.optimizer = Adam(0.0002, 0.5)
        self.generator_model = self.generator() 
        self.discriminator_model = self.discriminator()
        self.combined_model = self.combined()
        

    def generator(self):
        
        input_gen = Input(shape = (self.latent_dim,))
        hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))
        hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))
        hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))
        output = Dense(784, activation='tanh')(hidden3)
        reshaped_output = Reshape((28, 28, 1))(output)
        gen_model = Model(input_gen, reshaped_output)
        gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)
        print(gen_model.summary())
        
        
        return gen_model
    
    def discriminator(self):
        
        input_disc = Input(shape = (784,))
        hidden1 = Dense(512, activation = 'relu')(input_disc)
        hidden2 = Dense(256, activation = 'relu')(hidden1)
        hidden3 = Dense(128, activation = 'relu')(hidden2)
        output = Dense(1, activation = 'sigmoid')(hidden3)
        disc_model = Model(input_disc, output)
        disc_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])
        print(disc_model.summary())
        
        return disc_model
    
    def combined(self):
        
        inputs = Input(shape = (self.latent_dim,)) 
        gen_img = self.generator_model(inputs)
        gen_img = Reshape((784,))(gen_img)
        self.discriminator_model.trainable = False
        outs = self.discriminator_model(gen_img)
        comb_model = Model(inputs, outs)
        comb_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])
        print(comb_model.summary())
        
        return comb_model
    
    def train(self):
        
        train_data = (self.x_train.astype(np.float32) - 127.5) / 127.5
        
        for i in range(self.iterations):
            
            batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))
            batch_x = train_data[batch_indx]
            batch_x = batch_x.reshape((-1, 784))
            
            input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))
            gen_outs = self.generator_model.predict(input_noise)
            gen_outs = gen_outs.reshape((-1, 784))
            
            fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))
            real_loss = self.discriminator_model.train_on_batch(batch_x, np.ones((self.half_batch_size,1)))
            
            disc_loss = 0.5*np.add(fake_loss,real_loss)
            
            full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))
            gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))
            
            print(i, disc_loss, gan_loss)
            
# training the network
gan = GAN()
gan.train()


# generating new images from trained network
import matplotlib.pyplot as plt

r, c = 5, 5
noise = np.random.normal(0, 1, (r * c, 100))

gen_imgs = gan.generator_model.predict(noise)

# Rescale images 0 - 1
gen_imgs = 0.5 * gen_imgs + 0.5

fig, axs = plt.subplots(r, c)
cnt = 0
for i in range(r):
    for j in range(c):
        axs[i,j].imshow(gen_imgs[cnt, :,:,0], cmap='gray')
        axs[i,j].axis('off')
        cnt += 1
        
plt.show()
fig.savefig("mnist.png")
plt.close()

100

101

102

103

104

105

106

107

108

109

110

111

112

from keras.layers import Input, Dense, Reshape, BatchNormalization

from keras.models import Model

from keras.optimizers import Adam

from keras.datasets import mnist

import numpy as np

class GAN():

def __init__(self):

(self.x_train, self.y_train), (self.x_test, self.y_test) = mnist.load_data()

self.batch_size = 128

self.half_batch_size = 64

self.latent_dim = 100

self.iterations = 30000

self.optimizer = Adam(0.0002, 0.5)

self.generator_model = self.generator()

self.discriminator_model = self.discriminator()

self.combined_model = self.combined()

def generator(self):

input_gen = Input(shape = (self.latent_dim,))

hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))

hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))

hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))

output = Dense(784, activation='tanh')(hidden3)

reshaped_output = Reshape((28, 28, 1))(output)

gen_model = Model(input_gen, reshaped_output)

gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)

print(gen_model.summary())

return gen_model

def discriminator(self):

input_disc = Input(shape = (784,))

hidden1 = Dense(512, activation = 'relu')(input_disc)

hidden2 = Dense(256, activation = 'relu')(hidden1)

hidden3 = Dense(128, activation = 'relu')(hidden2)

output = Dense(1, activation = 'sigmoid')(hidden3)

disc_model = Model(input_disc, output)

disc_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])

print(disc_model.summary())

return disc_model

def combined(self):

inputs = Input(shape = (self.latent_dim,))

gen_img = self.generator_model(inputs)

gen_img = Reshape((784,))(gen_img)

self.discriminator_model.trainable = False

outs = self.discriminator_model(gen_img)

comb_model = Model(inputs, outs)

comb_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])

print(comb_model.summary())

return comb_model

def train(self):

train_data = (self.x_train.astype(np.float32) - 127.5) / 127.5

for i in range(self.iterations):

batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))

batch_x = train_data[batch_indx]

batch_x = batch_x.reshape((-1, 784))

input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))

gen_outs = self.generator_model.predict(input_noise)

gen_outs = gen_outs.reshape((-1, 784))

fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))

real_loss = self.discriminator_model.train_on_batch(batch_x, np.ones((self.half_batch_size,1)))

disc_loss = 0.5*np.add(fake_loss,real_loss)

full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))

gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))

print(i, disc_loss, gan_loss)

# training the network

gan = GAN()

gan.train()

# generating new images from trained network

import matplotlib.pyplot as plt

r, c = 5, 5

noise = np.random.normal(0, 1, (r * c, 100))

gen_imgs = gan.generator_model.predict(noise)

# Rescale images 0 - 1

gen_imgs = 0.5 * gen_imgs + 0.5

fig, axs = plt.subplots(r, c)

cnt = 0

for i in range(r):

for j in range(c):

axs[i,j].imshow(gen_imgs[cnt, :,:,0], cmap='gray')

axs[i,j].axis('off')

cnt += 1

plt.show()

fig.savefig("mnist.png")

plt.close()

Hope you enjoy reading.

If you have any doubt/suggestion please feel free to ask and I will do my best to help or improve myself. Good-bye until next time.

An Introduction to Generative Adversarial Networks (GANs)

Implementing semi-supervised Learning using GANs

Implementing Semi-Supervised GAN

Now we will implement a semi-supervised GAN using MNIST digits dataset. If you want to implement a simple GAN you can follow this blog: Implementation of GANs to generated Handwritten Digits.

MNIST digits dataset consists of 60000 training images from which we will only use 1000 labeled images and rest as unlabeled images. We will select random 1000 labeled images containing 100 images for each class. Let’s see the code for this:

def sample_1000(self, x, y):

    x_1000 = []
    y_1000 = []
    for i in range(10):
        x_i = x[y==i]
        ix = np.random.randint(0, len(x_i), 100)
        [x_1000.append(x_i[j]) for j in ix]
        [y_1000.append(i) for j in ix]

    return x_1000, y_1000

def sample_1000(self, x, y):

x_1000 = []

y_1000 = []

for i in range(10):

x_i = x[y==i]

ix = np.random.randint(0, len(x_i), 100)

[x_1000.append(x_i[j]) for j in ix]

[y_1000.append(i) for j in ix]

return x_1000, y_1000

Discriminator in SGAN

For this semi-supervised GAN model, we will create two discriminator models both of them share weights of every layer but have different output layers. One model will be the binary classifier model (discriminate between real and fake images) and another will be multi-class classifier model (predicts labels for the input image). Let’s see the code for this:

def discriminator(self):
        
    input_disc = Input(shape = (784,))
    hidden1 = Dense(512, activation = 'relu')(input_disc)
    hidden2 = Dense(256, activation = 'relu')(hidden1)
    hidden3 = Dense(128, activation = 'relu')(hidden2)
    output = Dense(1, activation = 'sigmoid')(hidden3)
    output2 = Dense(10, activation = 'softmax', name = 'classification_layer')(hidden3)
    disc_model = Model(input_disc, output)
    disc_model_2 = Model(input_disc, output2)
    disc_model.compile(loss=['binary_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])
    disc_model_2.compile(loss=['categorical_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])
    print(disc_model.summary())
    print(disc_model_2.summary())

    return disc_model, disc_model_2

def discriminator(self):

input_disc = Input(shape = (784,))

hidden1 = Dense(512, activation = 'relu')(input_disc)

hidden2 = Dense(256, activation = 'relu')(hidden1)

hidden3 = Dense(128, activation = 'relu')(hidden2)

output = Dense(1, activation = 'sigmoid')(hidden3)

output2 = Dense(10, activation = 'softmax', name = 'classification_layer')(hidden3)

disc_model = Model(input_disc, output)

disc_model_2 = Model(input_disc, output2)

disc_model.compile(loss=['binary_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])

disc_model_2.compile(loss=['categorical_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])

print(disc_model.summary())

print(disc_model_2.summary())

return disc_model, disc_model_2

Generator in SGAN

Generator in this SGAN is a simple multi-layer neural network having three hidden layers with units 512, 256 and 128. The output layer is having a shape of the original image (28, 28,1). Input to the generator will we random noise of vector size 100. Here is the code.

def generator(self):

    input_gen = Input(shape = (self.latent_dim,))
    hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))
    hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))
    hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))
    output = Dense(784, activation='tanh')(hidden3)
    reshaped_output = Reshape((28, 28, 1))(output)
    gen_model = Model(input_gen, reshaped_output)
    gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)
    print(gen_model.summary())


    return gen_model

def generator(self):

input_gen = Input(shape = (self.latent_dim,))

hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))

hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))

hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))

output = Dense(784, activation='tanh')(hidden3)

reshaped_output = Reshape((28, 28, 1))(output)

gen_model = Model(input_gen, reshaped_output)

gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)

print(gen_model.summary())

return gen_model

Training the model

Training this model will consist of the following steps:

Sample both label and unlabeled data from the MNIST dataset, also normalize and make labels of data into categorical form.
Train the multi-class discriminator model with labeled real images (take a batch from images)
Train the binary-class discriminator model with unlabeled real images (take a batch from images)
Sample noise of vector size 100 and train the binary-class discriminator model with fake images generated by generator network.
Sample noise of vector size 100 and train the combined model to train the generator network.
Repeat steps from 2-5 for some number of iterations. I have trained it for 10000 iterations.

In the above training steps, you can see that we are training multi-class discriminator and binary-class discriminator in different steps. But actually they are sharing weights of the same network except for the output layer (As I have mentioned earlier).

Also, Binary-class discriminator is trained two times in every iteration, one with real images taken from the dataset and another with fake images generated from the generative network. While multi-class discriminator is trained once in each iteration, only with real labeled images. This is because multi-class labels are not available for generated images.

def train(self):

    train_data, train_data_y = self.sample_1000(self.x_train, self.y_train)
    train_data = ((np.array(train_data).astype(np.float32))-127.5)/127.5
    train_data_y = to_categorical(train_data_y)

    all_train_data = ((np.array(self.x_train).astype(np.float32))-127.5)/127.5
    all_train_data_y = to_categorical(self.y_train)

    for j in range(self.iterations):

        batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))
        batch_x = train_data[batch_indx]
        batch_x = batch_x.reshape((-1, 784))
        batch_y = train_data_y[batch_indx]


        batch_indx_total = np.random.randint(0, all_train_data.shape[0], size = (self.half_batch_size))
        batch_x_total = all_train_data[batch_indx_total]
        batch_x_total = batch_x_total.reshape((-1, 784))
        batch_y_total = all_train_data_y[batch_indx_total]


        input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))
        gen_outs = self.generator_model.predict(input_noise)
        gen_outs = gen_outs.reshape((-1, 784))

        classi_loss = self.classification_model.train_on_batch(batch_x, batch_y)
        real_loss1 = self.discriminator_model.train_on_batch(batch_x_total, np.ones((self.half_batch_size,1)))
        fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))     


        full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))
        gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))

        if j%1000 == 0:
            test_data = ((self.x_test.astype(np.float32)-127.5)/127.5).reshape((-1, 784))
            test_results = self.classification_model.predict(test_data)
            test_results_argmax = np.argmax(test_results, axis = 1)

            count = 0
            for i in range(len(test_results_argmax)):
                if test_results_argmax[i] == self.y_test[i]:
                    count += 1
            print("Accuracy After", j,"iterations: ", (count/len(test_data))*100)

def train(self):

train_data, train_data_y = self.sample_1000(self.x_train, self.y_train)

train_data = ((np.array(train_data).astype(np.float32))-127.5)/127.5

train_data_y = to_categorical(train_data_y)

all_train_data = ((np.array(self.x_train).astype(np.float32))-127.5)/127.5

all_train_data_y = to_categorical(self.y_train)

for j in range(self.iterations):

batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))

batch_x = train_data[batch_indx]

batch_x = batch_x.reshape((-1, 784))

batch_y = train_data_y[batch_indx]

batch_indx_total = np.random.randint(0, all_train_data.shape[0], size = (self.half_batch_size))

batch_x_total = all_train_data[batch_indx_total]

batch_x_total = batch_x_total.reshape((-1, 784))

batch_y_total = all_train_data_y[batch_indx_total]

input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))

gen_outs = self.generator_model.predict(input_noise)

gen_outs = gen_outs.reshape((-1, 784))

classi_loss = self.classification_model.train_on_batch(batch_x, batch_y)

real_loss1 = self.discriminator_model.train_on_batch(batch_x_total, np.ones((self.half_batch_size,1)))

fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))

full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))

gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))

if j%1000 == 0:

test_data = ((self.x_test.astype(np.float32)-127.5)/127.5).reshape((-1, 784))

test_results = self.classification_model.predict(test_data)

test_results_argmax = np.argmax(test_results, axis = 1)

count = 0

for i in range(len(test_results_argmax)):

if test_results_argmax[i] == self.y_test[i]:

count += 1

print("Accuracy After", j,"iterations: ", (count/len(test_data))*100)

I have also tested the SGAN model with 10000 test dataset provided by MNIST after every 1000 iteration. Here is the result of that.

Now you can see that I have trained this SGAN model with only 1000 labeled images and it gives an accuracy of about 94.8%, that is quite nice.

Give me the full code!

from keras.layers import Input, Dense, Reshape, BatchNormalization
from keras.models import Model
from keras.optimizers import Adam
from keras.datasets import mnist
from keras.utils import to_categorical
import numpy as np

class GAN():
    def __init__(self):

        (self.x_train, self.y_train), (self.x_test, self.y_test) = mnist.load_data()
        self.batch_size = 100
        self.half_batch_size = 50
        self.latent_dim = 100
        self.iterations = 10000
        self.optimizer = Adam(0.0002, 0.5)
        self.generator_model = self.generator() 
        self.discriminator_model, self.classification_model = self.discriminator()
        self.combined_model = self.combined()
        

    def generator(self):
        
        input_gen = Input(shape = (self.latent_dim,))
        hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))
        hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))
        hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))
        output = Dense(784, activation='tanh')(hidden3)
        reshaped_output = Reshape((28, 28, 1))(output)
        gen_model = Model(input_gen, reshaped_output)
        gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)
        print(gen_model.summary())
        
        
        return gen_model
    
    def discriminator(self):
        
        input_disc = Input(shape = (784,))
        hidden1 = Dense(512, activation = 'relu')(input_disc)
        hidden2 = Dense(256, activation = 'relu')(hidden1)
        hidden3 = Dense(128, activation = 'relu')(hidden2)
        output = Dense(1, activation = 'sigmoid')(hidden3)
        output2 = Dense(10, activation = 'softmax', name = 'classification_layer')(hidden3)
        disc_model = Model(input_disc, output)
        disc_model_2 = Model(input_disc, output2)
        disc_model.compile(loss=['binary_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])
        disc_model_2.compile(loss=['categorical_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])
        print(disc_model.summary())
        print(disc_model_2.summary())
        
        return disc_model, disc_model_2
    
    def combined(self):
        
        inputs = Input(shape = (self.latent_dim,)) 
        gen_img = self.generator_model(inputs)
        gen_img = Reshape((784,))(gen_img)
        self.discriminator_model.trainable = False
        outs = self.discriminator_model(gen_img)
        comb_model = Model(inputs, outs)
        comb_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])
        print(comb_model.summary())
        
        return comb_model
    
    def sample_1000(self, x, y):
        
        x_1000 = []
        y_1000 = []
        for i in range(10):
            x_i = x[y==i]
            ix = np.random.randint(0, len(x_i), 100)
            [x_1000.append(x_i[j]) for j in ix]
            [y_1000.append(i) for j in ix]
            
        return x_1000, y_1000
    
    def train(self):
        
        train_data, train_data_y = self.sample_1000(self.x_train, self.y_train)
        train_data = ((np.array(train_data).astype(np.float32))-127.5)/127.5
        train_data_y = to_categorical(train_data_y)
        
        all_train_data = ((np.array(self.x_train).astype(np.float32))-127.5)/127.5
        all_train_data_y = to_categorical(self.y_train)
        
        for j in range(self.iterations):
            
            batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))
            batch_x = train_data[batch_indx]
            batch_x = batch_x.reshape((-1, 784))
            batch_y = train_data_y[batch_indx]
            
            
            batch_indx_total = np.random.randint(0, all_train_data.shape[0], size = (self.half_batch_size))
            batch_x_total = all_train_data[batch_indx_total]
            batch_x_total = batch_x_total.reshape((-1, 784))
            batch_y_total = all_train_data_y[batch_indx_total]
            
            
            input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))
            gen_outs = self.generator_model.predict(input_noise)
            gen_outs = gen_outs.reshape((-1, 784))
            
            classi_loss = self.classification_model.train_on_batch(batch_x, batch_y)
            real_loss1 = self.discriminator_model.train_on_batch(batch_x_total, np.ones((self.half_batch_size,1)))
            fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))     
        
            
            full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))
            gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))
            
            if j%1000 == 0:
                test_data = ((self.x_test.astype(np.float32)-127.5)/127.5).reshape((-1, 784))
                test_results = self.classification_model.predict(test_data)
                test_results_argmax = np.argmax(test_results, axis = 1)
                
                count = 0
                for i in range(len(test_results_argmax)):
                    if test_results_argmax[i] == self.y_test[i]:
                        count += 1
                print("Accuracy After", j,"iterations: ", (count/len(test_data))*100)
            
            
gan = GAN()
gan.train()

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

from keras.layers import Input, Dense, Reshape, BatchNormalization

from keras.models import Model

from keras.optimizers import Adam

from keras.datasets import mnist

from keras.utils import to_categorical

import numpy as np

class GAN():

def __init__(self):

(self.x_train, self.y_train), (self.x_test, self.y_test) = mnist.load_data()

self.batch_size = 100

self.half_batch_size = 50

self.latent_dim = 100

self.iterations = 10000

self.optimizer = Adam(0.0002, 0.5)

self.generator_model = self.generator()

self.discriminator_model, self.classification_model = self.discriminator()

self.combined_model = self.combined()

def generator(self):

input_gen = Input(shape = (self.latent_dim,))

hidden1 = BatchNormalization(momentum=0.8)(Dense(256, activation = 'relu')(input_gen))

hidden2 = BatchNormalization(momentum=0.8)(Dense(512, activation = 'relu')(hidden1))

hidden3 = BatchNormalization(momentum=0.8)(Dense(1024, activation = 'relu')(hidden2))

output = Dense(784, activation='tanh')(hidden3)

reshaped_output = Reshape((28, 28, 1))(output)

gen_model = Model(input_gen, reshaped_output)

gen_model.compile(loss='binary_crossentropy', optimizer=self.optimizer)

print(gen_model.summary())

return gen_model

def discriminator(self):

input_disc = Input(shape = (784,))

hidden1 = Dense(512, activation = 'relu')(input_disc)

hidden2 = Dense(256, activation = 'relu')(hidden1)

hidden3 = Dense(128, activation = 'relu')(hidden2)

output = Dense(1, activation = 'sigmoid')(hidden3)

output2 = Dense(10, activation = 'softmax', name = 'classification_layer')(hidden3)

disc_model = Model(input_disc, output)

disc_model_2 = Model(input_disc, output2)

disc_model.compile(loss=['binary_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])

disc_model_2.compile(loss=['categorical_crossentropy'], optimizer=self.optimizer, metrics=['accuracy'])

print(disc_model.summary())

print(disc_model_2.summary())

return disc_model, disc_model_2

def combined(self):

inputs = Input(shape = (self.latent_dim,))

gen_img = self.generator_model(inputs)

gen_img = Reshape((784,))(gen_img)

self.discriminator_model.trainable = False

outs = self.discriminator_model(gen_img)

comb_model = Model(inputs, outs)

comb_model.compile(loss='binary_crossentropy', optimizer=self.optimizer, metrics=['accuracy'])

print(comb_model.summary())

return comb_model

def sample_1000(self, x, y):

x_1000 = []

y_1000 = []

for i in range(10):

x_i = x[y==i]

ix = np.random.randint(0, len(x_i), 100)

[x_1000.append(x_i[j]) for j in ix]

[y_1000.append(i) for j in ix]

return x_1000, y_1000

def train(self):

train_data, train_data_y = self.sample_1000(self.x_train, self.y_train)

train_data = ((np.array(train_data).astype(np.float32))-127.5)/127.5

train_data_y = to_categorical(train_data_y)

all_train_data = ((np.array(self.x_train).astype(np.float32))-127.5)/127.5

all_train_data_y = to_categorical(self.y_train)

for j in range(self.iterations):

batch_indx = np.random.randint(0, train_data.shape[0], size = (self.half_batch_size))

batch_x = train_data[batch_indx]

batch_x = batch_x.reshape((-1, 784))

batch_y = train_data_y[batch_indx]

batch_indx_total = np.random.randint(0, all_train_data.shape[0], size = (self.half_batch_size))

batch_x_total = all_train_data[batch_indx_total]

batch_x_total = batch_x_total.reshape((-1, 784))

batch_y_total = all_train_data_y[batch_indx_total]

input_noise = np.random.normal(0, 1, size=(self.half_batch_size, 100))

gen_outs = self.generator_model.predict(input_noise)

gen_outs = gen_outs.reshape((-1, 784))

classi_loss = self.classification_model.train_on_batch(batch_x, batch_y)

real_loss1 = self.discriminator_model.train_on_batch(batch_x_total, np.ones((self.half_batch_size,1)))

fake_loss = self.discriminator_model.train_on_batch(gen_outs, np.zeros((self.half_batch_size,1)))

full_batch_input_noise = np.random.normal(0, 1, size=(self.batch_size, 100))

gan_loss = self.combined_model.train_on_batch(full_batch_input_noise, np.array([1] * self.batch_size))

if j%1000 == 0:

test_data = ((self.x_test.astype(np.float32)-127.5)/127.5).reshape((-1, 784))

test_results = self.classification_model.predict(test_data)

test_results_argmax = np.argmax(test_results, axis = 1)

count = 0

for i in range(len(test_results_argmax)):

if test_results_argmax[i] == self.y_test[i]:

count += 1

print("Accuracy After", j,"iterations: ", (count/len(test_data))*100)

gan = GAN()

gan.train()

Hope you enjoy reading.

If you have any doubt/suggestion please feel free to ask and I will do my best to help or improve myself. Good-bye until next time.

TheAILearner

Mastering Artificial Intelligence

Category Archives: GAN

Implementation of GANs to generated Handwritten Digits

An Introduction to Generative Adversarial Networks (GANs)

Implementing semi-supervised Learning using GANs

Implementing Semi-Supervised GAN

Give me the full code!