-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cuda device-side runtime error when training on custom dataset for JSON outputs #170
Comments
FYI, this dataset also doesn't work. It fails for another reason, TypeError somewhere else. This could be unrelated, but I'm wondering if the method for injecting prompts/responses is robust to stringified JSON.
|
@SinclairHudson Thanks for flagging this issue. For For |
device-side.txt |
Can you run With above info - I will try to replicate and debug on my end. Also, it could be an issue due to using multiple GPUs (huggingface/transformers#22546). If model is small enough, can you try to pin the weights on one GPU via |
Describe the bug
When attempting to train on this dataset: https://huggingface.co/datasets/azizshaw/text_to_json
To Reproduce
Steps to reproduce the behaviour:
Checkout main branch
Replace the data ingestion portion of llmtune/config.yml with:
And then run
Expected behavior
To my knowledge, this should run without error.
Environment:
This bug doesn't occur on the normal dataset, just on this other one. So, it could be something with a specific token or encoding in this dataset? Or there could be an issue with JSON outputs interfering with YAML syntax in the config.
The text was updated successfully, but these errors were encountered: