ssm-im

Small experiments in autoregressive (AR) image modeling with Mamba.
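For readers new to the setup, here is a minimal sketch of what "AR image modeling" usually means: the image is flattened into a 1D token sequence and the model is trained to predict each token from the ones before it. The raster (row by row) ordering, the use of raw pixel values as tokens, and the helper names `raster_tokens` and `next_token_pairs` are assumptions for illustration, not necessarily what this repo does.

```python
from typing import List, Tuple


def raster_tokens(image: List[List[int]]) -> List[int]:
    """Flatten a 2D grid of pixel values row by row (raster order)."""
    return [pixel for row in image for pixel in row]


def next_token_pairs(tokens: List[int]) -> Tuple[List[int], List[int]]:
    """Shift by one so that position t is trained to predict token t + 1."""
    return tokens[:-1], tokens[1:]


# A toy 2x2 grayscale "image" (hypothetical data).
image = [[0, 255], [128, 64]]
tokens = raster_tokens(image)
inputs, targets = next_token_pairs(tokens)
```

A sequence model such as Mamba would consume `inputs` and be trained with a cross-entropy loss against `targets`.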

setup

Install with `poetry lock && poetry install`, or use conda and install the dependencies by hand. The dependencies are mostly the standard PyTorch stack plus wandb.

run

Current best: `python main.py --batch-size 64 --lr 1e-3 --n-layer 16 --d-model 512 --clip-grad-val 1`
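As a reading aid, the sketch below shows how these flags could be parsed with `argparse`. It is a hypothetical reconstruction, not the actual `main.py`: only the flag names and values come from the command above, while the types, help strings, and the `build_parser` name are assumptions. In particular, the name `--clip-grad-val` suggests gradient clipping at 1, but whether that is by value or by norm is not stated here.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags in the command above."""
    parser = argparse.ArgumentParser(description="AR image modeling with Mamba")
    parser.add_argument("--batch-size", type=int, help="examples per step")
    parser.add_argument("--lr", type=float, help="learning rate")
    parser.add_argument("--n-layer", type=int, help="number of layers")
    parser.add_argument("--d-model", type=int, help="model width")
    parser.add_argument("--clip-grad-val", type=float,
                        help="gradient clipping threshold (assumed meaning)")
    return parser


# Parse the "current best" configuration exactly as written in the README.
args = build_parser().parse_args(
    ["--batch-size", "64", "--lr", "1e-3", "--n-layer", "16",
     "--d-model", "512", "--clip-grad-val", "1"]
)
```

Note that `argparse` turns dashes into underscores, so the values are read as `args.batch_size`, `args.n_layer`, and so on.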

experiments

Experiments are logged to wandb (Weights & Biases).