Skip to content

Latest commit

 

History

History
82 lines (56 loc) · 3.17 KB

README.md

File metadata and controls

82 lines (56 loc) · 3.17 KB

Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models

Nick Stracke1 · Stefan A. Baumann1 · Josh Susskind2 · Miguel A. Bautista2 · Björn Ommer1

1 CompVis Group @ LMU Munich
2 Apple

ECCV 2024

Project Page Paper

This repository contains an implementation of the paper "CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models".

We present LoRAdapter, an approach that unifies both style and structure conditioning under the same formulation using a novel conditional LoRA block that enables zero-shot control. LoRAdapter is an efficient, powerful, and architecture-agnostic approach to condition text-to-image diffusion models, which enables fine-grained control conditioning during generation and outperforms recent state-of-the-art approaches.

teaser

🔥 Updates

  • Implemented B-LoRA implicit content and style disentangle using LoRAdapter
  • Released Code and Weights for inference

💪 TODO

  • Add training Code
  • Add more structure conditioning checkpoints (including SDXL)
  • Experiment with SD3

Setup

Create the conda environment

conda env create -f environment.yaml

Activate the conda environment

conda activate loradapter

Weights

All weights are available on HuggingFace.

For ease of you, you can also use the provided bash script download_weights.sh to automatically download all available weights and place them in the the right directory.

Usage

Sampling works according to the following schema:

python sample.py experiment=<check ./config/experiments>

All currently available experiments are listed in /config/experiments. Feel free to adjust the configs according to you own needs.

B-LoRA

Sampling using the B-LoRA LoRAdapter is possible using the config sample_b-lora_sdxl.yaml. By default this will condition on both content and style of the image. For conditioning on only content or only style, change the adaption_mode to either b-lora_content or b-lora_style. Also set ignore_check to true as we are only loading the checkpoint partially.

For best results provide information about the missing modality via the text prompt or using another LoRAdapter.

🎓 Citation

If you use this codebase or otherwise found our work valuable, please cite our paper:

@misc{stracke2024loradapter,
  title={CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models},
  author={Nick Stracke and Stefan Andreas Baumann and Joshua Susskind and Miguel Angel Bautista and Björn Ommer},
  year={2024},
  eprint={2405.07913},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}