clip #116
Could you please try training Sana together with CLIP, similar to how it's done in SDXL? I experimented with fine-tuning Sana on CLIP embeddings (I modified the caption channels), and the model trained significantly better compared to using pure Gemma.

Comments

Nice. Any comparison to learn about the improvement?

https://wandb.ai/muinez/mysana The 1st run is Gemma. Don't focus on the end of the 2nd run because I broke something there; look at the 3rd run and the beginning to mid-point of the 2nd run.

No idea what the improvement is. Can you explain more?

The model seems to generate more aesthetically pleasing art overall, with improvements in features like eyes and textures. Prompt following has gotten worse, though, because prompts don't fit within the 64-token limit of the CLIP version I used for training. The art also seems more varied and possibly more lively. That could be just my impression, but I'm not the only one who noticed: I shared the results with others, and they also think the CLIP version performs better. It isn't a huge resource hog either; I managed to do this on a modest A6000, and the model adapts quickly. I think it's worth experimenting with if you haven't tried it yet. If you decide to train, maybe try using the CLIP from the SDXL Animagine finetune, then further fine-tune it on longer prompts to improve its understanding of them.
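A minimal sketch of what "fine-tuning on CLIP embeddings with modified caption channels" might look like, assuming a HuggingFace `transformers` CLIP text encoder and an invented projection width; the model name, the 2240 width, and the projection layer are illustrative assumptions, not the exact setup described above:

```python
import torch
from transformers import CLIPTextConfig, CLIPTextModel

# Randomly initialized CLIP text encoder (no pretrained weights needed for
# the shape sketch); dims mirror a ViT-L/14-style text tower. All names and
# sizes here are assumptions for illustration.
cfg = CLIPTextConfig(hidden_size=768, max_position_embeddings=77)
text_encoder = CLIPTextModel(cfg).eval()

# A batch of tokenized prompts; prompts longer than the context window
# (77 here, 64 in the variant mentioned above) would be truncated, which is
# the prompt-following trade-off noted in the thread.
input_ids = torch.randint(0, cfg.vocab_size, (1, 77))
with torch.no_grad():
    hidden = text_encoder(input_ids=input_ids).last_hidden_state  # (1, 77, 768)

# "Modifying the caption channels": Gemma hidden states are wider than
# CLIP's 768, so the diffusion model's caption projection is re-initialized
# to accept the CLIP width instead. 2240 is an assumed transformer width.
caption_proj = torch.nn.Linear(768, 2240)
cond = caption_proj(hidden)  # (1, 77, 2240), fed to the DiT as conditioning
```

The key change is only at the conditioning boundary: the text tower and the projection layer are swapped, while the rest of the diffusion transformer consumes the projected sequence as before.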