PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

PromptCharm is an interactive system for iterative refinement of text-to-image creation with diffusion models. This repository contains the official implementation of our related paper:

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Zhijie Wang, Yuheng Huang, Da Song, Lei Ma, Tianyi Zhang

2024 ACM CHI Conference on Human Factors in Computing Systems (CHI 2024)

Getting Started

Environments Set-up

Python >= 3.6

We suggest use virtual environment to avoid messing up your own environments.

Create virtual environments (optional)

$ cd ./backend
$ python -m venv ./venv
$ source ./venv/bin/activate

Install

pip install -r requirements.txt

git clone -b promptcharm https://github.com/paulwong16/ecco.git
cd ecco
pip install -e . 

cd ..
git clone https://github.com/paulwong16/daam.git
cd daam
pip install -e .
cd ..

NPM >= 7

Download pre-mined images from diffusion_db and organize them as the followings. You can also follow the notebook in ./backend to do it by yourself.

├── web/dashboard
│   ├── public
│   ├── src
│   │   └── data
│   │       │── diffusion_db
│   │       │   │── 0.jpg
│   │       │   │── 1.jpg
│   │       │   └── ...
│   │       └── ...
│   └── ...
├── backend
└── ...

Install

$ cd ./web/dashboard
$ npm install

Basic Usage

Quick start

$ npm start

Copy the url and open it in browser.

Start backend

$ cd ./backend
$ python main.py --seed [YOUR RANDOM SEED]

Citation

If you found our paper/code useful in your research, please consider citing:

@inproceedings{wang2024promptcharm,
 author = {Wang, Zhijie and Huang, Yuheng and Song, Da and Ma, Lei and Zhang, Tianyi},
 title = {PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement},
 booktitle = {Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems},
 year = {2024},
}

License

This project is released under the MIT license.

Acknowledgement

Kudos to the following projects:

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
backend		backend
figs		figs
web/dashboard		web/dashboard
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Getting Started

Environments Set-up

Python >= 3.6

NPM >= 7

Basic Usage

Quick start

Start backend

Citation

License

Acknowledgement

About

Releases

Packages

Contributors 2

Languages

License

ma-labo/PromptCharm

Folders and files

Latest commit

History

Repository files navigation

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Getting Started

Environments Set-up

Python >= 3.6

NPM >= 7

Basic Usage

Quick start

Start backend

Citation

License

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages