Skip to content

Commit

Permalink
1.0.0
Browse files Browse the repository at this point in the history
  • Loading branch information
Muennighoff committed Feb 16, 2024
1 parent 9f7f02c commit 3ac3905
Show file tree
Hide file tree
Showing 8 changed files with 24 additions and 14 deletions.
14 changes: 12 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

![](visuals/octopus.jpg)

This repository provides all materials for the paper [Generative Representational Instruction Tuning](https://arxiv.org/abs/2202.08904). We continue developing the repository and welcome any contributions. If you want to use the code in the exact same way as in the paper, please use the 1.0.0 release at commit hash `TODO`.
This repository provides all materials for the paper [Generative Representational Instruction Tuning](https://arxiv.org/abs/2402.09906). We continue developing the repository and welcome any contributions. If you want to use the code in the exact same way as in the paper, please use the 1.0.0 release at commit hash `TODO`.

- [Inference](#inference)
- [Training](#training)
Expand Down Expand Up @@ -119,6 +119,7 @@ Shortcuts:
- emb/gen/gritlm = embedding, generative, unified
- bf16c = embeddings are cast back to bf16 after pooling and similarity computation is also done in bf16 (simulating how cached embeddings would operate)
- bb/cc/bbcc... = order of bidirectional vs causal attention
- gendups = not using `--use_unique_indices` during training. If not used and training is unified, then data is duplicated worsening performance

The most important ones are:

Expand Down Expand Up @@ -160,7 +161,9 @@ They are explained in more detail in the paper and its appendix. So to e.g. trai
Setup:
```bash
git clone https://github.com/ContextualAI/gritlm`
cd gritlm
pip install -e .
cd gritlm
````
Below are easy examples for getting started:
Expand Down Expand Up @@ -437,5 +440,12 @@ The code is inspired by:
If useful please consider citing 😊
```bibtex
TODO
@misc{muennighoff2024generative,
title={Generative Representational Instruction Tuning},
author={Niklas Muennighoff and Hongjin Su and Liang Wang and Nan Yang and Furu Wei and Tao Yu and Amanpreet Singh and Douwe Kiela},
year={2024},
eprint={2402.09906},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
2 changes: 1 addition & 1 deletion gritlm/__init__.py
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
__version__ = "0.9.4"
__version__ = "1.0.0"

from .gritlm import GritLM
2 changes: 1 addition & 1 deletion scripts/training/train_embonly.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
######################
### Set enviroment ###
######################
cd /home/niklas/gritlm
cd /home/niklas/gritlm/gritlm
source /env/bin/start-ctx-user
conda activate gritlm
export WANDB_PROJECT="gritlm"
Expand Down
2 changes: 1 addition & 1 deletion scripts/training/train_genonly.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
######################
### Set enviroment ###
######################
cd /home/niklas/gritlm
cd /home/niklas/gritlm/gritlm
source /env/bin/start-ctx-user
conda activate gritlm
export WANDB_PROJECT="gritlm"
Expand Down
6 changes: 3 additions & 3 deletions scripts/training/train_gritlm_7b.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,10 +12,10 @@
######################
### Set enviroment ###
######################
cd /home/niklas/gritlm
cd /home/niklas/gritlm/gritlm
source /env/bin/start-ctx-user
conda activate gritlmt2
NCCL_ASYNC_ERROR_HANDLING=1
conda activate gritlm
#NCCL_ASYNC_ERROR_HANDLING=1
export WANDB_PROJECT="gritlm"
# Training setup
GPUS_PER_NODE=8
Expand Down
4 changes: 2 additions & 2 deletions scripts/training/train_gritlm_8x7b.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,11 +12,11 @@
######################
### Set enviroment ###
######################
cd /home/niklas/gritlm
cd /home/niklas/gritlm/gritlm
source /env/bin/start-ctx-user
conda activate gritlm
#NCCL_ASYNC_ERROR_HANDLING=1
export TORCH_NCCL_ASYNC_ERROR_HANDLING=1
#export TORCH_NCCL_ASYNC_ERROR_HANDLING=1
export WANDB_PROJECT="gritlm"
# Training setup
GPUS_PER_NODE=8
Expand Down
6 changes: 3 additions & 3 deletions scripts/training/train_test.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,10 +12,10 @@
######################
### Set enviroment ###
######################
cd /home/niklas/gritlm
cd /home/niklas/gritlm/gritlm
source /env/bin/start-ctx-user
conda activate gritlmt2
NCCL_ASYNC_ERROR_HANDLING=1
conda activate gritlm
#NCCL_ASYNC_ERROR_HANDLING=1
export WANDB_PROJECT="gritlm"
# Training setup
GPUS_PER_NODE=8
Expand Down
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@

setup(
name='gritlm',
version='0.9.4',
version='1.0.0',
description='GritLM',
long_description=readme,
long_description_content_type="text/markdown",
Expand Down

0 comments on commit 3ac3905

Please sign in to comment.