You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Could you please share the data format or provide an example row from the mmichat_speech.jsonl file?
In anygpt/src/train/stage2_sft.py, the preprocess function maps raw_datasets to tokenized_datasets. However, I'm a bit confused about how this processing works.
It would be very helpful if you could provide a short example or sample in JSONL format.
The text was updated successfully, but these errors were encountered:
Thank you for the great work!
Could you please share the data format or provide an example row from the mmichat_speech.jsonl file?
In anygpt/src/train/stage2_sft.py, the preprocess function maps raw_datasets to tokenized_datasets. However, I'm a bit confused about how this processing works.
It would be very helpful if you could provide a short example or sample in JSONL format.
The text was updated successfully, but these errors were encountered: