Skip to content

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

License

Notifications You must be signed in to change notification settings

elsucht/LyCORIS

 
 

Repository files navigation

LyCORIS - Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

banner image

A project that implements different parameter-efficient fine-tuning algorithms for Stable Diffusion.

This project originated from LoCon (see archive branch).

If you are interested in discussing more details, you can join our Discord server

Discord!

If you want to check more in-depth experiment results and discussions for LyCORIS, you can check our paper

Algorithm Overview

LyCORIS currently contains LoRA (LoCon), LoHa, LoKr, (IA)^3, DyLoRA, Native fine-tuning (aka dreambooth). GLoRA and GLoKr are coming soon. Please check List of Implemented Algorithms and Guidelines for more details.

A simple comparison of some of these methods are provided below (to be taken with a grain of salt)

Full LoRA LoHa LoKr low factor LoKr high factor
Fidelity
Flexibility $^*$ $^†$
Diversity
Size
Training Speed Linear
Training Speed Conv

★ > ◉ > ● > ▲ [> means better and smaller size is better]

$^*$ Flexibility means anything related to generating images not similar to those in the training set, and combination of multiple concepts, whether they are trained together or not $^†$ It may become more difficult to switch base model or combine multiple concepts in this situation

Usage

Image Generation

After sd-webui 1.5.0, LyCORIS models are officially supported by the built-in LoRA system. You can put them in either models/Lora or models/LyCORIS and use the default syntax <lora:filename:multiplier> to trigger it.

When we add new model types, we will always make sure they can be used with the newest version of sd-webui.

As for sd-webui with version < 1.5.0, please check this extension.

Others

As far as we are aware, LyCORIS models are also supported in the following interfaces / online generation services (please help us complete the list!)

However, newer model types may not always be supported. If you encounter this issue, consider requesting the developers of the corresponding interface or website to include support for the new type.

Training

There are three different ways to train LyCORIS models.

  • With kohya-ss/sd-scripts (see a list of compatible graphical interfaces and colabs at the end of the section)
  • With HCP-Diffusion
  • With your own script by using LyCORIS as standalone wrappers for any pytorch modules.

In any case, please install this package in the corresponding virtual environment. You can either install it

  • through pip

    pip install lycoris_lora
  • or from source

    git clone https://github.com/KohakuBlueleaf/LyCORIS
    cd LyCORIS
    pip install .

A detilaed description of the network arguments is provided in docs/Network-Args.md.

kohya script

You can use this package's kohya module to run kohya's training script to train lycoris module for SD models

  • with command line arguments

    accelerate launch train_network.py \
      --network_module lycoris.kohya \
      --network_dim "DIM_FOR_LINEAR" --network_alpha "ALPHA_FOR_LINEAR"\
      --network_args "conv_dim=DIM_FOR_CONV" "conv_alpha=ALPHA_FOR_CONV" \
      "dropout=DROPOUT_RATE" "algo=locon" \
  • with toml files

    accelerate launch train_network.py \
      --config_file example_configs/training_configs/kohya/loha_config.toml

    For your convenience, some example toml files for kohya LyCORIS training are provided in example/training_configs/kohya.

HCP-Diffusion

You can use this package's hcp module to run HCP-Diffusion's training script to train lycoris module for SD models

accelerate launch -m hcpdiff.train_ac_single \
  --cfg example_configs/training_configs/hcp/hcp_diag_oft.yaml

For your convenience, some example yaml files for HCP LyCORIS training are provided in example/training_configs/hcp.

For the moment being the outputs of HCP-Diffusion are not directly compatible with a1111/sdwebui. You can perform conversion with tools/batch_hcp_convert.py.

In the case of pivotal tuning, tools/batch_bundle_convert.py can be further used to convert to and from bundle formats.

As standalone wrappers

See standalone_example.py for full example.

Import create_lycoris and LycorisNetwork from lycoris library, put your preset to LycorisNetwork and then use create_lycoris to create LyCORIS module for your pytorch module.

For example:

from lycoris import create_lycoris, LycorisNetwork

LycorisNetwork.apply_preset(
    {"target_name": [".*attn.*"]}
)
lycoris_net = create_lycoris(
    your_model, 
    1.0, 
    linear_dim=16, 
    linear_alpha=2.0, 
    algo="lokr"
)
lycoris_net.apply_to()

# after apply_to(), your_model() will run with LyCORIS net
lycoris_param = lycoris_net.parameters()
forward_with_lyco = your_model(x)

You can check my HakuPhi project to see how I utilize LyCORIS to finetune the Phi-1.5 models.

Graphical Interfaces and Colabs (via kohya trainer)

You can also train LyCORIS with the following graphical interfaces

and colabs (please help us complete the list!)

However, they are not guaranteed to be up-to-date. In particular, newer types may not be supported. Consider requesting the developers for support or simply use the original kohya script in this case.

Utilities

Extract LoCon

You can extract LoCon from a dreambooth model with its base model.

python3 extract_locon.py <settings> <base_model> <db_model> <output>

Use --help to get more info

$ python3 extract_locon.py --help
usage: extract_locon.py [-h] [--is_v2] [--is_sdxl] [--device DEVICE] [--mode MODE] [--safetensors] [--linear_dim LINEAR_DIM]
                        [--conv_dim CONV_DIM] [--linear_threshold LINEAR_THRESHOLD] [--conv_threshold CONV_THRESHOLD]
                        [--linear_ratio LINEAR_RATIO] [--conv_ratio CONV_RATIO] [--linear_quantile LINEAR_QUANTILE]
                        [--conv_quantile CONV_QUANTILE] [--use_sparse_bias] [--sparsity SPARSITY] [--disable_cp]
                        base_model db_model output_name

Merge LyCORIS back to model

You can merge your LyCORIS model back to your checkpoint(base model)

python3 merge.py <settings> <base_model> <lycoris_model> <output>

Use --help to get more info

$ python3 merge.py --help
usage: merge.py [-h] [--is_v2] [--is_sdxl] [--device DEVICE] [--dtype DTYPE] [--weight WEIGHT] base_model lycoris_model output_name

Change Log

For full log, please see Change.md

2023/12/15 quick fixes of 2.0.2

  • Fix bugs in full module.
  • Related: Fix bugs in stable-diffusion-webui/extensions-builtin/Lora

2023/12/14 quick fixes of 2.0.1

  • Support merge sdxl loras which trained on plain diffusers with Kohya's LoRA implementation.
    • Can be found in LECO or other similar projects.
  • Refactor the batch convert scripts for pivotal bundle and hcp.
  • Change the class name lycoris.kohya.LycorisNetwork to lycoris.kohya.LycorisNetworkKohya to avoid confusion.
  • Fix bugs in merge scripts for Norm module and LoKr module.
  • Fix bugs in scaled weight norms of OFT.
  • Fix bugs in extract scripts for SDXL.
  • Fix bugs in full module which consume 2x vram.
  • Fix bugs in create_network_from_weights which caused bugs in "resume" feature for SDXL.

2023/12/02 update to 2.0.0

  • Start supporting HCP-Diffusion (The reason to name this version "2.0.0")
    • Now LyCORIS support LoHa/LoKr/Diag-OFT algorithm in HCP-Diffusion
    • Add Pivotal tuning utilities
    • Add hcp convert utilities
    • Have no plan at this time to support full/lora and train_norms since HCP can do them natively
  • Add Diag-OFT modules
  • Add standalone usage support
    • Can wrap any pytorch module which contains Linear/Conv2d/LayerNorm/GroupNorm modules
    • Will support more module in the future
  • Add SDXL support in Merge script
  • Add SDXL support in Extract-locon
  • More efficient (speed/vram) implementation for full module
  • Better implementation of custom state_dict
  • Fix errors of dropouts
  • Fix errors of apply_max_norms
  • Fix errors of resume

Todo list

  • Module and Document for using LyCORIS in any other model, Not only SD.
  • Proposition3 in FedPara
    • also need custom backward to save the vram
  • Low rank + sparse representation
    • For extraction
    • For training
  • Support more operation, not only linear and conv2d.
  • Configure varying ranks or dimensions for specific modules as needed.
  • Automatically selecting an algorithm based on the specific rank requirement.
  • Explore other low-rank representations or parameter-efficient methods to fine-tune either the entire model or specific parts of it.
  • More experiments for different task, not only diffusion models.

Citation

@misc{LyCORIS,
      title={Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation}, 
      author={Shin-Ying Yeh and Yu-Guan Hsieh and Zhidong Gao and Bernard B W Yang and Giyeong Oh and Yanmin Gong},
      year={2023},
      eprint={2309.14859},
      archivePrefix={arXiv}
}

About

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%