Benign Autoencoders

@article{malamud2022benign,
  title={Benign Autoencoders},
  author={Malamud, Semyon and Schrimpf, Andreas and Xu, Teng Andrea and Matera, Giuseppe and Didisheim, Antoine},
  journal={arXiv preprint arXiv:2210.00637},
  year={2022}
}

Abstract

Recent progress in Generative Artificial Intelligence (AI) relies on efficient data representations, often featuring encoder-decoder architectures. We formalize the mathematical problem of finding the optimal encoder-decoder pair and characterize its solution, which we name the "benign autoencoder" (BAE). We prove that BAE projects data onto a manifold whose dimension is the {\it optimal compressibility dimension} of the generative problem. We highlight surprising connections between BAE and several recent developments in AI, such as conditional GANs, context encoders, stable diffusion, stacked autoencoders, and the learning capabilities of generative models. As an illustration, we show how BAE can find optimal, low-dimensional latent representations that improve the performance of a discriminator under a distribution shift. By compressing "malignant" data dimensions, BAE leads to smoother and more stable gradients.

Many generative models are tightly linked to our theoretical framework, see Definition 1 in the paper.

Experiments

The key testable implication of our theory is the existence of an optimal bottleneck (latent) dimension for the encoder: With too few latent dimensions, the model is not rich enough; with too many, it encodes malignant dimensions that hurt (or simply do not improve) performance: The encoded information ``saturates.''

Dependencies

1. Distance Regularized GAN

python3 distance_regularized_gans/resize_celebA.py # Resize celebA 64 x 64
python3 distance_regularized_gans/main.py   # Train Models
python3 distance_regularized_gans/generate.py # FID
python3 distance_regularized_gans/grid_from_models.py # Figure 1

2. Context-Encoders

python3 context_encoders/main.py # Train Models
python3 context_encoders/evaluate.py # Generate In-painted images on test data
python3 context_encoders/lpips_2dirs.py # LPIPS

3. Evaluating the Quality of the Generator with a Discriminator

# Same for fmnist
python3 generator_quality/mnist/train_W.py # Train Discriminator
python3 generator_quality/mnist/train_bae.py # Train AE with Discriminator Penalty
python3 generator_quality/mnist/plot_class_baes.py # Plot

(a) MNIST	(b) FMNIST

For the theoretical findings, we kindly direct the reader to the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
context_encoders		context_encoders
distance_regularized_gans		distance_regularized_gans
generator_quality		generator_quality
plotting		plotting
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Benign Autoencoders

Abstract

Experiments

Dependencies

1. Distance Regularized GAN

2. Context-Encoders

3. Evaluating the Quality of the Generator with a Discriminator

About

Releases

Packages

Languages

License

tengandreaxu/benign-autoencoders

Folders and files

Latest commit

History

Repository files navigation

Benign Autoencoders

Abstract

Experiments

Dependencies

1. Distance Regularized GAN

2. Context-Encoders

3. Evaluating the Quality of the Generator with a Discriminator

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages