Noise2Noise Binning

Axel Ekman, Jian-Hua Chen, Venera Weinhardt, Gerry McDermott, Mark A. Le Gros, Carolyn Larabell

As Presented by: Axel Ekman at CAMERA Workshop October 31 - November 2, 2018

Intro

In terms of signal processing, the optimal digital filter to remove the high-frequency portion of the image is the sinc filter. When decimation is done by an integer factor, area-averaging is usually very close to optimal and produces usually not much aliasing. In this case, downsampling by a factor of 2 can be expressed in the from

Ideal filters like this are unbiased and do not take into account any priors that may be suitable for the image. The basic idea of this method is that we can construct separate signals from the data and train a CNN to do the downsampling.

Recent work of Lehtinen et al. show that instead of needing true signal, CNN filters can be trained using noisy images as both input and training target by minimizing some distance (loss function) L.

Between the noisy observations.

Noise2Noise: Learning Image Restoration without Clean Data

Lehtinen, Jaakko, et al. “Noise2Noise: Learning Image Restoration without Clean Data.”

Proceedings of the 35th International Conference on Machine Learning, PMLR 80:2965-2974, 2018.

Now within the sampling rate of the output image, we can view all pixels corresponding to the same binned pixel as separate observations of the downsampled image. This provides information to optimize some parameterized filter such that we can use the result of Lehtinen et al. to train a CNN downsampler.

where X1 and X2 are two uncorrelated data samples from the high-resolution image. This can be e.g. done by dividing each downsampled pixel into two diagonal regions (the fact that the center-of-mass is the same should take care of some sub-pixel artifacts). One could also choose random samples of the square to construct several permutations of the same image. In practice this made little difference in the results.

Examples

Color images

Below we show the result for 'monarch' in SET14 with artificial Gaussian noise (sigma = 50) compared to the mean-binned image.

Comparison of different filters for BSD300 and SET14 (PSNR/SSIM)

Dataset	Mean	TV	NLM	BM3D	CNN
Gaussian , sigma = 30
BSD300	24.80 / 0.75	29.19 / 0.90	28.76 / 0.88	29.50 / 0.91	30.70 / 0.93
Set14	24.87 / 0.77	29.16 / 0.91	29.15 / 0.90	29.33 / 0.92	30.70 / 0.94
Poisson noise, lambda = 10
BSD300	24.33 / 0.74	28.60 / 0.89	27.83 / 0.85	29.11 / 0.90	30.65 / 0.93
Set14	23.36 / 0.72	27.73 / 0.89	27.38 / 0.86	28.22 / 0.90	30.17 / 0.93

The TV denoising and Non-Local Means were done using the implementations in scikit-ikmage. NLM was done with patch size 5 and patch distance of 6. The BM3D was done using the implementation in pybm3d with default parameters. The reference methods (TV, NLM, BM3D) show the optimal result by minimizing the true loss function using oracle information of the the reference image.

Tomography

Example of a SXT reconstruction of a Human B-cell reconstructed with FBP (Ram-Lak). In this example, the net was trained simultaneously on all projection images.

Binned projections	CNN binned projections

Credit where credit is due

Supported by:

Encoder-decoder neural network implementation adapted from the UNet implementation of jaxony.

Summary function for PyTorch modules adapted from sksq96.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
cnnbin		cnnbin
data		data
images		images
tests		tests
.gitignore		.gitignore
Hyper.ipynb		Hyper.ipynb
README.md		README.md
dev.ipynb		dev.ipynb
image_example.ipynb		image_example.ipynb
setup.py		setup.py
setup_pip.bat		setup_pip.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Noise2Noise Binning

Intro