
Allow custom models #7

Open
lrq3000 opened this issue Dec 30, 2024 · 3 comments


lrq3000 commented Dec 30, 2024

Please allow selecting custom models, especially the main diffusion rendering one, so we can choose a lightning or turbo model that renders faster in far fewer steps, or one fine-tuned for a specific purpose (e.g., realistic, anime, etc.).

ai-anchorite (Collaborator) commented

You can quite easily do this by editing app.py; the model loaders are quite obvious. Drag the file into a coding LLM and it will be able to switch them out for another Hugging Face-derived model.
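
For illustration, here's a minimal sketch of the kind of edit involved, assuming the loaders fetch checkpoints via huggingface_hub. The repo ids, filenames, and variable name below are hypothetical stand-ins, not the actual app.py code:

```python
from huggingface_hub import hf_hub_download

# Stock loader (illustrative): fetch the default SD1.5-era checkpoint.
# checkpoint_path = hf_hub_download(
#     repo_id="runwayml/stable-diffusion-v1-5",   # hypothetical default
#     filename="v1-5-pruned-emaonly.safetensors",
# )

# Swapped loader: point at a fine-tuned or turbo/lightning derivative
# instead. Only the weight source changes; the code consuming
# checkpoint_path stays the same, so the architecture must still match.
checkpoint_path = hf_hub_download(
    repo_id="someuser/my-custom-sd15-finetune",  # hypothetical repo
    filename="model.safetensors",                # hypothetical file
)
```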

It's the beauty of permissive licences and locally run apps. Once it's on your system, it's yours to do with as you will :)

Not sure I'll be adding a UI model loader here. It's a port of an existing project, and the nature of these things is that something better will come along and replace it before long!

lrq3000 (Author) commented Dec 31, 2024

Oh ok, I see, thank you for the pointers. Exploring coding LLMs is on my to-do list, so I'll give it a try.

Would you accept a PR if I manage to implement it? While I agree there will likely be something better at some point in the future, I don't think this will be replaced anytime soon; it hasn't been so far, at least for creative upscaling, and ClarityAI has existed for at least a year now, I think.

ai-anchorite (Collaborator) commented Jan 1, 2025

Bear in mind that this was a demo project by Finegrain to show that their Refiners library can recreate an existing diffusion task, so it may well not respond as expected to model changes and the like.

There's resizing in the pre-upscale stage, then an ESRGAN upscale prior to the diffusion upscale pass. This affects the expected output image dimensions.

Or something like that. They go into detail on the project here: https://blog.finegrain.ai/posts/reproducing-clarity-upscaler
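
Roughly, the flow described above looks like the sketch below. The function names are illustrative placeholders, not the actual enhancer.py or esrgan_model.py API:

```python
from PIL import Image

# Illustrative stage stubs; the real logic in enhancer.py /
# esrgan_model.py is considerably more involved than this.
def resize_to_working_size(img: Image.Image) -> Image.Image:
    # Stage 1: pre-upscale resize so later stages see expected dimensions
    return img

def esrgan_upscale(img: Image.Image, scale: int) -> Image.Image:
    # Stage 2: fast GAN-based upscale to the target resolution (placeholder)
    return img.resize((img.width * scale, img.height * scale))

def diffusion_refine(img: Image.Image) -> Image.Image:
    # Stage 3: diffusion pass that re-synthesizes fine detail (placeholder)
    return img

def creative_upscale(img: Image.Image, scale: int = 2) -> Image.Image:
    img = resize_to_working_size(img)
    img = esrgan_upscale(img, scale)
    return diffusion_refine(img)
```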

My driving interest in the project was to get the HF demo running locally via Pinokio, especially since it worked with 8GB VRAM! And because I enjoy tweaking app UIs more than using apps, haha. The actual upscaling logic was left alone; the code in enhancer.py and esrgan_model.py is beyond me.

Switching to an inferior low-step model doesn't seem productive; I think most people would accept a couple of extra seconds rather than lose output quality. An anime-specific model could be interesting, though.

There's definitely potential for a much better creative upscaler in here. Personally I dislike SD1.5's unavoidable effect on human/realistic faces, so I don't use it myself, but feel free to fork, tweak, and improve!
