initial commit
jzenn committed Oct 30, 2023
0 parents commit e3cea3f
Showing 12 changed files with 2,199 additions and 0 deletions.
43 changes: 43 additions & 0 deletions .github/workflows/static.yml
@@ -0,0 +1,43 @@
# Simple workflow for deploying static content to GitHub Pages
name: Deploy static content to Pages

on:
  # Runs on pushes targeting the default branch
  push:
    branches: ["main"]

  # Allows you to run this workflow manually from the Actions tab
  workflow_dispatch:

# Sets permissions of the GITHUB_TOKEN to allow deployment to GitHub Pages
permissions:
  contents: read
  pages: write
  id-token: write

# Allow only one concurrent deployment, skipping runs queued between the run in-progress and latest queued.
# However, do NOT cancel in-progress runs as we want to allow these production deployments to complete.
concurrency:
  group: "pages"
  cancel-in-progress: false

jobs:
  # Single deploy job since we're just deploying
  deploy:
    environment:
      name: github-pages
      url: ${{ steps.deployment.outputs.page_url }}
    runs-on: ubuntu-latest
    steps:
      - name: Checkout
        uses: actions/checkout@v3
      - name: Setup Pages
        uses: actions/configure-pages@v3
      - name: Upload artifact
        uses: actions/upload-pages-artifact@v2
        with:
          # Upload only the www directory
          path: './www'
      - name: Deploy to GitHub Pages
        id: deployment
        uses: actions/deploy-pages@v2
3 changes: 3 additions & 0 deletions .gitignore
@@ -0,0 +1,3 @@
.DS_Store
dataset
index
41 changes: 41 additions & 0 deletions README.md
@@ -0,0 +1,41 @@
# The SVHN Dataset Is Deceptive for Probabilistic Generative Models Due to a Distribution Mismatch
<div id="top"></div>

[![arxiv-link](https://img.shields.io/badge/Paper-PDF-red?style=flat&logo=arXiv&logoColor=red)](.)
[![Project Page](https://img.shields.io/badge/Project%20Page-610208)](https://jzenn.github.io/svhn-remix)
[![Download](https://img.shields.io/badge/Download-074161)](https://jzenn.github.io/svhn-remix#download)

<span><a href="https://timx.me" target="_blank">Tim&nbsp;Z.&nbsp;Xiao</a><sup>&#42;</sup> &emsp; <b>&middot;</b> &emsp;
<a href="https://jzenn.github.io" target="_blank">Johannes&nbsp;Zenn</a><sup>&#42;</sup> &emsp; <b>&middot;</b> &emsp;
<a href="https://robamler.github.io" target="_blank">Robert&nbsp;Bamler</a>
</span>
<br/>
<sup>&#42;</sup>Equal contribution, order determined by coin flip.



## About The Project
This is the official GitHub repository for our NeurIPS 2023 DistShift Workshop paper [The SVHN Dataset Is Deceptive for Probabilistic Generative Models Due to a Distribution Mismatch](.), in which we propose a new split of the SVHN dataset that does not suffer from the distribution mismatch.
[**Visit the project page**](https://jzenn.github.io/svhn-remix) and [**download the SVHN-Remix**](https://jzenn.github.io/svhn-remix#download) dataset or split.

> The Street View House Numbers (SVHN) dataset [(Netzer et al., 2011)](http://ufldl.stanford.edu/housenumbers/nips2011_housenumbers.pdf) is a popular benchmark dataset in deep learning.
Originally designed for digit classification tasks, the SVHN dataset has been widely used as a benchmark for various other tasks including generative modeling.
However, with this work, we aim to warn the community about an issue of the SVHN dataset as a benchmark for generative modeling tasks: we discover that the official split into training set and test set of the SVHN dataset are not drawn from the same distribution.
We empirically show that this distribution mismatch has little impact on the classification task (which may explain why this issue has not been detected before), but it severely affects the evaluation of probabilistic generative models, such as Variational Autoencoders and diffusion models.
As a workaround, we propose to mix and re-split the official training and test set when SVHN is used for tasks other than classification.
We publish a new split and the indices we used to create it at [https://jzenn.github.io/svhn-remix/](https://jzenn.github.io/svhn-remix/).
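
The mix-and-re-split workaround described above can be sketched roughly as follows. This is only an illustration, not the procedure used to create SVHN-Remix: the function name `remix_split`, the seed, and the choice to keep the official set sizes are assumptions made for this example. To reproduce the published split, use the indices from the project page instead.

```python
import random


def remix_split(n_train, n_test, seed=0):
    """Pool the official train/test items, shuffle them, and split again.

    Each item is identified as ("train", i) or ("test", j), i.e. by its
    position in the official set it came from.
    """
    pool = [("train", i) for i in range(n_train)]
    pool += [("test", j) for j in range(n_test)]
    random.Random(seed).shuffle(pool)  # seeded, so the split is reproducible
    # Keep the official set sizes (an assumption of this sketch).
    return pool[:n_train], pool[n_train:]


if __name__ == "__main__":
    # Official SVHN sizes: 73,257 training and 26,032 test images.
    new_train, new_test = remix_split(73257, 26032)
    print(len(new_train), len(new_test))
```

Because both new sets are drawn from the same shuffled pool, they should follow the same distribution up to sampling noise, which is the property the official split lacks.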


## Citation
If you would like to cite our paper, please use the following BibTeX entry:

```bibtex
@article{xiao2023the,
  title={The SVHN Dataset Is Deceptive for Probabilistic Generative Models Due to a Distribution Mismatch},
  author={Xiao, Tim Z. and Zenn, Johannes and Bamler, Robert},
  journal={NeurIPS 2023 Workshop on Distribution Shifts},
  year={2023}
}
```

<p align="right">(<a href="#top">back to top</a>)</p>
361 changes: 361 additions & 0 deletions www/index.html

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions www/static/css/bulma.css.map.txt

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions www/static/css/bulma.min.css

Large diffs are not rendered by default.

5 changes: 5 additions & 0 deletions www/static/css/fontawesome.all.min.css

Large diffs are not rendered by default.

105 changes: 105 additions & 0 deletions www/static/css/index.css
@@ -0,0 +1,105 @@
body {
  font-family: 'Noto Sans', sans-serif;
}

.content pre {
  padding: 0;
}

.footer {
  padding: 1.5rem 1.5rem 1.5rem;
}

.footer .icon-link {
  font-size: 25px;
  color: #000;
}

.link-block a {
  margin-top: 5px;
  margin-bottom: 5px;
}

.dnerf {
  font-variant: small-caps;
}


.teaser .hero-body {
  padding-top: 0;
  padding-bottom: 3rem;
}

.teaser {
  font-family: 'Google Sans', sans-serif;
}


.publication-title {
}

.publication-banner {
  max-height: 100%;

}

.publication-banner video {
  position: relative;
  left: auto;
  top: auto;
  transform: none;
  object-fit: contain;
}

.publication-header .hero-body {
}

.publication-title {
  font-family: 'Google Sans', sans-serif;
}

.publication-authors {
  font-family: 'Google Sans', sans-serif;
}

.publication-venue {
  color: #555;
  width: fit-content;
  font-weight: bold;
}

.publication-awards {
  color: #ff3860;
  width: fit-content;
  font-weight: bolder;
}

.publication-authors {
}

.publication-authors a {
  color: hsl(204, 86%, 53%) !important;
}

.publication-authors a:hover {
  text-decoration: underline;
}

.author-block {
  display: inline-block;
}


.results-images {
  max-width: 50%;
}

.results-image {
  max-width: 25%;
}

.results-images-container {
  display: flex;
  max-width: 100%;
  margin: 0 auto;
}
443 changes: 443 additions & 0 deletions www/static/images/classifier_loss.svg