Playground project for generative models in PyTorch

In this project I implement and try out several approaches for generating artificial data from scratch.

About the data

As these are all machine learning models, they learn from given data. So far we use two different datasets: MNIST and CelebA. However, it is straightforward to extend the implementation to other datasets.

1. Autoencoders

Architecture details

The autoencoder architecture is fully described by a set of parameters:

  • base_channels: Number of channels after the first convolution
  • conv_blocks_per_decrease: Number of convolutions in each downsizing module
  • channel_increase_factor: Factor by which the channel count increases after each downsizing module
  • encode_factor: Number of downsizing modules
  • latent_dim: Dimension of the encoding vector
  • initial_upsample_size: Starting resolution in the decoder
  • skip_connections: Use ResNet-like skip connections in the downsizing modules
  • auxillary: Allow the encoder to also learn class labels (makes training easier)

For a more exact description see the configs folder; a sketch of how these parameters interact is shown below.

A model summary can be retrieved by running python -m models.Autoencoder -h. Note: the parameters in the summary are currently hardcoded.
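To make the interplay of these parameters concrete, here is a minimal sketch of an encoder built from them. It is an illustration under assumed layer choices, not the exact implementation in this repository:

```python
import torch
import torch.nn as nn

# Minimal sketch of an encoder driven by the parameters above.
# The exact layer layout is an assumption, not this repo's actual code.
def build_encoder(
    in_channels=1,              # 1 for MNIST, 3 for CelebA
    base_channels=16,           # channels after the first convolution
    conv_blocks_per_decrease=2, # convolutions per downsizing module
    channel_increase_factor=2,  # channel growth per downsizing module
    encode_factor=3,            # number of downsizing modules
    latent_dim=128,
    image_size=32,
):
    layers = [nn.Conv2d(in_channels, base_channels, 3, padding=1)]
    channels = base_channels
    for _ in range(encode_factor):
        # conv_blocks_per_decrease convolutions, then halve the resolution
        for _ in range(conv_blocks_per_decrease):
            layers += [nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU()]
        out_channels = channels * channel_increase_factor
        layers += [nn.Conv2d(channels, out_channels, 3, stride=2, padding=1), nn.ReLU()]
        channels = out_channels
    layers.append(nn.Flatten())
    final_res = image_size // 2 ** encode_factor
    layers.append(nn.Linear(channels * final_res * final_res, latent_dim))
    return nn.Sequential(*layers)

encoder = build_encoder()
z = encoder(torch.randn(8, 1, 32, 32))  # -> shape (8, 128)
```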

Convolutional model with linear hidden dimension

An autoencoder that encodes to an n-dimensional linear feature vector, where n depends on the architecture parameters. The default is 128.


Reconstruction results for the standard autoencoder (128 dimensions) on MNIST

t-SNE representation of the hidden space

To see how well the model separates classes, we sample from the test set and visualize the hidden representations using t-SNE. We use MNIST since it has class labels.
t-SNE of the hidden representation for the linear autoencoder on MNIST
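A plot like this can be produced with scikit-learn. Below is a minimal sketch, assuming a trained encoder and an MNIST test loader; `encoder` and `test_loader` are placeholders, not objects from this repository:

```python
import torch
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

# Encode a sample of test images, then project the latent vectors to 2D.
# `encoder` and `test_loader` are placeholders for a trained model and
# a torchvision MNIST test DataLoader.
latents, labels = [], []
with torch.no_grad():
    for images, targets in test_loader:
        latents.append(encoder(images))
        labels.append(targets)
        if sum(map(len, labels)) >= 2000:  # t-SNE is slow on large samples
            break

embedded = TSNE(n_components=2).fit_transform(torch.cat(latents).numpy())
colors = torch.cat(labels).numpy()
plt.scatter(embedded[:, 0], embedded[:, 1], c=colors, cmap="tab10", s=5)
plt.colorbar(label="digit class")
plt.show()
```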

Variational Autoencoders (VAE)

Variational autoencoders, similarly to autoencoders, try to find a good hidden encoding for reconstructing the input data. However, they encode to a mean and a variance, and minimizing the KL divergence to a standard normal distribution is part of the optimization objective. Thus artificial data can be generated by sampling from a standard normal distribution and decoding the samples.
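The core of this idea fits in a few lines. The following sketch shows the reparameterization trick and the VAE objective, assuming hypothetical `encoder` and `decoder` modules where the encoder returns a mean and a log-variance (not this repository's exact code):

```python
import torch
import torch.nn.functional as F

# Minimal sketch of the VAE training objective. `encoder` and `decoder`
# are hypothetical modules; the encoder returns mean and log-variance.
def vae_loss(x, encoder, decoder):
    mu, logvar = encoder(x)
    # Reparameterization trick: z = mu + sigma * eps keeps sampling differentiable.
    z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
    x_hat = decoder(z)
    # Reconstruction term plus KL divergence to the standard normal prior.
    recon = F.binary_cross_entropy(x_hat, x, reduction="sum")
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl

# Generating artificial data afterwards: sample z ~ N(0, I) and decode it,
# e.g. samples = decoder(torch.randn(64, latent_dim))
```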
Reconstruction of given test samples

Reconstructed samples of the VAE on MNIST

Randomly generated artificial samples

Artificially generated MNIST samples

Reconstructed samples from a linear grid in two dimensions
VAE-generated samples on MNIST
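A grid like this can be produced by decoding evenly spaced points of the latent space. A minimal sketch, assuming a `decoder` trained with latent_dim=2 (the name is a placeholder):

```python
import torch

# Decode a linear grid over a 2D latent space (assumes latent_dim == 2).
steps = torch.linspace(-3, 3, 10)
xs, ys = torch.meshgrid(steps, steps, indexing="ij")
grid = torch.stack([xs.flatten(), ys.flatten()], dim=1)  # shape (100, 2)
with torch.no_grad():
    images = decoder(grid)  # e.g. (100, 1, 28, 28) for MNIST
```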

2. Generative Adversarial Networks (GANs)

Vanilla GAN

The original GAN (Goodfellow et al.), based on linear layers in the generator and the discriminator.

Vanilla GAN results on MNIST
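For reference, here is a minimal sketch of one adversarial training step, assuming hypothetical `generator` and `discriminator` modules (the discriminator ending in a sigmoid) rather than this repository's exact code:

```python
import torch
import torch.nn.functional as F

# One GAN training step. `generator` maps latent vectors to images;
# `discriminator` outputs a probability of the input being real.
def gan_step(real, generator, discriminator, opt_g, opt_d, latent_dim=100):
    batch_size = real.size(0)
    ones = torch.ones(batch_size, 1)
    zeros = torch.zeros(batch_size, 1)

    # Discriminator update: real images are labeled 1, generated images 0.
    fake = generator(torch.randn(batch_size, latent_dim))
    d_loss = F.binary_cross_entropy(discriminator(real), ones) \
           + F.binary_cross_entropy(discriminator(fake.detach()), zeros)
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()

    # Generator update: fool the discriminator into outputting 1 for fakes.
    g_loss = F.binary_cross_entropy(discriminator(fake), ones)
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()
    return d_loss.item(), g_loss.item()
```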

DC GAN

A GAN variant that uses convolutional layers in the generator and the discriminator.

DCGAN results on MNIST
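A minimal sketch of a DCGAN-style convolutional generator for 28x28 MNIST images; the layer sizes are assumptions, not this repository's architecture:

```python
import torch.nn as nn

# DCGAN-style generator: project the latent vector (shaped as a 1x1 feature
# map) to a small feature map, then upsample with transposed convolutions.
generator = nn.Sequential(
    nn.ConvTranspose2d(100, 128, 7, 1, 0),  # (100,1,1) -> (128,7,7)
    nn.BatchNorm2d(128),
    nn.ReLU(),
    nn.ConvTranspose2d(128, 64, 4, 2, 1),   # -> (64,14,14)
    nn.BatchNorm2d(64),
    nn.ReLU(),
    nn.ConvTranspose2d(64, 1, 4, 2, 1),     # -> (1,28,28)
    nn.Tanh(),
)
```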

Auxiliary GAN

A DCGAN in which the dataset labels are additionally used as generator input and discriminator output.

Auxiliary DCGAN results on MNIST
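A minimal sketch of how the labels can enter the generator input, assuming ten classes as in MNIST; the helper and names below are illustrative only:

```python
import torch
import torch.nn.functional as F

NUM_CLASSES, LATENT_DIM = 10, 100  # ten digit classes for MNIST

# Hypothetical helper: the generator receives the latent vector
# concatenated with a one-hot encoding of the desired class.
def generator_input(batch_size):
    z = torch.randn(batch_size, LATENT_DIM)
    labels = torch.randint(0, NUM_CLASSES, (batch_size,))
    one_hot = F.one_hot(labels, NUM_CLASSES).float()
    return torch.cat([z, one_hot], dim=1), labels

# The discriminator correspondingly gets a second output head with
# NUM_CLASSES logits, trained with cross-entropy against `labels`,
# in addition to the usual real/fake output.
```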
