Replication of the upscalers #152

Open
rom1504 opened this issue Jun 19, 2022 · 5 comments

rom1504 commented Jun 19, 2022

Hey, so we got decent versions of the prior and the basic decoder now.

I think the current code is already able to train upscalers, but we need more documentation for it.

Let's have an upscaler.md explaining:

  • What it is
  • How to prepare the dataset
  • Which hyperparameters to use
  • The command to run the training
  • The expected cost in GPU hours

And then train it!

We can also discuss what the right dataset is, but I figure the laion5B subset we call "laion high resolution" could do the trick (it's 170M images at 1024x1024 or bigger).

I understand only the image (and the CLIP image embedding) is needed, and no text?


nousr commented Jun 19, 2022

Here are some relevant sections of the paper for reference while in this thread:


[screenshots of the relevant upscaler sections of the paper]


lucidrains commented Jun 20, 2022

they are also using the BSR degradation used by Rombach et al (https://github.com/CompVis/latent-diffusion/tree/e66308c7f2e64cb581c6d27ab6fbeb846828253b/ldm/modules/image_degradation, https://github.com/cszn/BSRGAN/blob/main/utils/utils_blindsr.py), which I don't have in the repository yet

tempted to just go with Imagen's noising procedure (on top of the blur) and call it a day (it would be a lot simpler)
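That simpler route could look something like the following; a minimal NumPy sketch of blur-then-noise conditioning augmentation (my own illustration of the Imagen-style procedure, not code from this repository — the function name and default values are made up):

```python
import numpy as np

def degrade_conditioning_image(img, blur_sigma=0.6, noise_level=0.1, rng=None):
    """Blur the low-res conditioning image, then add Gaussian noise
    (roughly Imagen-style noise conditioning augmentation)."""
    rng = np.random.default_rng() if rng is None else rng
    # small separable Gaussian kernel (radius 2 -> 5 taps)
    x = np.arange(-2, 3)
    k = np.exp(-0.5 * (x / blur_sigma) ** 2)
    k /= k.sum()
    blurred = img.astype(float)
    for axis in (0, 1):  # blur rows, then columns
        blurred = np.apply_along_axis(
            lambda row: np.convolve(row, k, mode="same"), axis, blurred)
    noisy = blurred + noise_level * rng.standard_normal(img.shape)
    return np.clip(noisy, 0.0, 1.0)
```

At training time the noise level would be sampled per example (and optionally fed to the upscaler unet as extra conditioning), so a fixed level can be picked at sampling time.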

@lucidrains

ok, 0.11.0 should allow for the different noise schedules across different unets, as in the paper

after adding the BSR image degradation (or some alternative), i think i'm comfortable giving the repository a 1.0
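For reference, per-unet noise schedules are just different beta curves; a self-contained sketch in plain NumPy (not this repository's API — the names here are illustrative), with the cosine schedule following the improved-DDPM formulation:

```python
import numpy as np

def linear_betas(timesteps, beta_start=1e-4, beta_end=2e-2):
    # standard DDPM linear schedule
    return np.linspace(beta_start, beta_end, timesteps)

def cosine_betas(timesteps, s=0.008):
    # cosine schedule from "Improved Denoising Diffusion Probabilistic Models"
    t = np.arange(timesteps + 1) / timesteps
    alphas_bar = np.cos((t + s) / (1 + s) * np.pi / 2) ** 2
    alphas_bar = alphas_bar / alphas_bar[0]
    betas = 1.0 - alphas_bar[1:] / alphas_bar[:-1]
    return np.clip(betas, 0.0, 0.999)

# e.g. cosine for the base unet, linear for the upscaler unet
schedules = [cosine_betas(1000), linear_betas(1000)]
```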

@lucidrains

I understand only the image (and clip image EMB) is needed and no text ?

@rom1504 yup, no text conditioning needed, i think it should all be in the image embedding!

@YUHANG-Ma

Hi all,
I am aiming to train the decoder and the upsampler. Because they have too many parameters, I have decided to train them separately. The README says the upsampler and the decoder net can be trained separately. From my reading of the code, although I can train them separately, I still need to load the parameters of both unet 0 and unet 1 and set the unet number to 1 in order to train only unet 1. I don't know if I am right. If so, I couldn't train unet 0 and unet 1 on two separate machines. I am wondering how I could train the decoder net and the upsamplers separately?
Best,
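For what it's worth, the dispatch pattern being asked about can be sketched with plain-Python stand-ins (these are not the real DALLE2-pytorch classes): a forward pass with a unet-number argument only exercises the selected unet, so in principle each machine only needs that unet's weights.

```python
class Unet:
    """Stand-in for one unet; only holds a name and a toy loss."""
    def __init__(self, name):
        self.name = name
    def loss(self, batch):
        return sum(batch) / len(batch)  # placeholder, not a real loss

class Decoder:
    """Stand-in decoder holding several unets; forward dispatches to one."""
    def __init__(self, unets):
        self.unets = unets
    def forward(self, batch, unet_number):
        # 1-indexed selection: only this unet runs and would get gradients
        return self.unets[unet_number - 1].loss(batch)

decoder = Decoder([Unet("base_64px"), Unet("upscaler_256px")])
loss_first = decoder.forward([0.1, 0.3], unet_number=1)   # machine A
loss_second = decoder.forward([0.2, 0.4], unet_number=2)  # machine B
```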
