From left to right: baseline, comparison method 1, this project's method, comparison method 2
SEGAN is a project that aims to control semantic attributes of images generated by StyleGAN2 by modifying its latent space. In the StyleGAN2 paper, the authors discussed the impact of the latent space on generated human face images: lower layers affect more general semantic attributes, such as gender and skin color, while higher layers affect details, such as smiles and hairstyles. It is worth noting that the latent space here is not the same as the space of the initial noise sampled in the original GAN; we will discuss this more in the next section. They used this discovery for "StyleMix", which mixes styles from different seed images to control the semantic attributes of newly generated images. But this control is not precise: it is difficult to decouple semantic attributes and control them independently this way. Therefore, SEGAN introduces a linear subspace to locate interpretable and controllable dimensions in the latent vectors.
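As a rough illustration of what style mixing does (a sketch only, with hypothetical names and layer counts, not this project's code):

```python
import torch

def style_mix(w_a, w_b, n_layers=14, crossover=6):
    """Sketch of style mixing: lower layers take the style from seed A
    (coarse attributes such as gender), higher layers take the style
    from seed B (fine details such as hairstyle)."""
    per_layer = [w_a if i < crossover else w_b for i in range(n_layers)]
    return torch.stack(per_layer)  # (n_layers, latent_dim), one style per layer
```

Because a whole style vector is swapped per layer, many attributes change together, which is why this kind of mixing cannot decouple individual attributes.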
StyleGAN has an intermediate latent space, which is separate from the initial noise space.
The idea of using a linear subspace comes from EigenGAN, which showed that a linear subspace can find interpretable and controllable dimensions in different generator layers. It succeeded on the original GAN, so the question here is how to apply this method to StyleGAN2, which has a quite different structure. The network architecture will be discussed in a later section.
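For reference, the per-layer subspace in EigenGAN can be sketched roughly like this (a simplified sketch, not the original implementation): each layer learns a basis U, per-direction scales l, and an offset mu, and the sample U·diag(l)·z + mu is added to that layer's feature map.

```python
import torch
import torch.nn as nn

class LinearSubspace(nn.Module):
    """Rough sketch of an EigenGAN-style subspace for one generator layer:
    phi = U @ diag(l) @ z + mu, with U encouraged to stay orthonormal."""

    def __init__(self, subspace_dim, feature_dim):
        super().__init__()
        self.U = nn.Parameter(torch.randn(feature_dim, subspace_dim) * 0.02)  # basis
        self.l = nn.Parameter(torch.ones(subspace_dim))    # importance of each direction
        self.mu = nn.Parameter(torch.zeros(feature_dim))   # subspace origin

    def forward(self, z):                    # z: (batch, subspace_dim)
        return (z * self.l) @ self.U.t() + self.mu

    def orthogonal_penalty(self):
        """Regularizer pushing U^T U towards the identity."""
        eye = torch.eye(self.U.shape[1], device=self.U.device)
        return ((self.U.t() @ self.U - eye) ** 2).sum()
```

The `orthogonal_penalty` term corresponds to the regularization that the notes at the end of this page list as still missing from this project's linear subspace.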
Refer to this note.
- resolution: 64
- learning rate: 0.002
- batch size: 16
- dimension of latent vector: 64
- r1 weight: 10
- regularization interval of discriminator: 16
- regularization interval of generator: 4
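For convenience, the same settings collected in one place (a sketch only; the real flag or key names in the training script may differ):

```python
# Hypothetical training configuration mirroring the list above;
# the actual train.py argument names may be different.
config = dict(
    size=64,          # resolution
    lr=0.002,         # learning rate
    batch=16,         # batch size
    latent=64,        # dimension of the latent vector
    r1=10,            # R1 weight
    d_reg_every=16,   # regularization interval of the discriminator
    g_reg_every=4,    # regularization interval of the generator
)
```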
Directly adding a linear subspace in each block and adding its output to the feature map, as in the original GAN, does not fully apply to the case of StyleGAN2. This would make the network too complex, and more importantly we want to modify semantic attributes by modifying the latent vectors rather than the feature maps directly. Use `--mode=2` to switch to this model.
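A minimal sketch of what this could look like, under the assumption that the subspace perturbs each layer's style (latent) vector before StyleGAN2's modulated convolution rather than the feature map; the names here are illustrative, not the project's actual code:

```python
def apply_subspace_to_style(w, z, U):
    """Illustrative sketch: perturb a per-layer style vector w with a
    linear-subspace sample z @ U^T, so semantic edits act on the latent
    (style) side instead of directly on the feature maps.
    w: (batch, style_dim), z: (batch, subspace_dim), U: (style_dim, subspace_dim)."""
    return w + z @ U.t()
```

The perturbed style would then go through StyleGAN2's usual modulated convolution for that layer.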
The dataset can be found here; it's a very small subset of the Danbooru dataset.
The baseline is the original StyleGAN2; use `--mode=0` to switch to it. There are two more modes for comparison, `--mode=1` and `--mode=3`; their model structures are shown in the following figures.
Some samples generated by modes 0 to 3, from left to right.
The log for the red curve (mode=2) has some problems; that model needs to be retrained. TODO
Model | FID |
---|---|
origin (mode 0) | 133.75 |
comparison 1 (mode 1) | 139.33 |
this project's (mode 2) | 123.79 |
comparison 2 (mode 3) | 163.29 |
Model | PPL |
---|---|
origin (mode 0) | 821.70 |
comparison 1 (mode 1) | 816.25 |
this project's (mode 2) | 818.62 |
comparison 2 (mode 3) | 824.03 |
Some samples generated by modes 0 to 3, from left to right, where the middle images are generated by interpolating the top and bottom latent vectors. It can be seen that, except for the second-to-last image in the last column, they basically follow certain linear rules. ![](/pictures/figure5.png)
The figure above shows controlling semantic attributes of generated results by modifying their latent vectors in the specific layers and dimensions found by the linear subspace, where L is the layer, N is the number of the linear subspace, and D is the dimension.
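As a hedged sketch of what such an edit amounts to (the helper and data layout below are hypothetical): take dimension D of subspace N at layer L and shift that layer's latent vector along it.

```python
def edit_latent(w_per_layer, subspaces, L, N, D, alpha):
    """Hypothetical helper: shift layer L's latent vector along dimension D
    of its N-th linear subspace by strength alpha."""
    direction = subspaces[L][N][:, D]              # (latent_dim,) basis column
    w_per_layer[L] = w_per_layer[L] + alpha * direction
    return w_per_layer
```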
It can be seen that `--mode=2` has the most balanced performance, so this project uses this structure as the default.
- There is no regularization on the linear subspace, so the implementation of the linear subspace in this project actually differs from EigenGAN's; this has to be fixed in the future. TODO
- It would be better to train on a bigger dataset.