You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My question is, when I directly use the instantiated SFTTrainer to execute trainer.predict, the predictions in the result contains normal logits. However, after processing trainer with train_on_responses_only and then executing trainer.predict, I was surprised to find that the predictions in the result is an empty tuple.
Why does this happen? How can I make it return logits as expected?
The text was updated successfully, but these errors were encountered:
I saw the following code snippet in your qwen2.5 fine-tuning tutorial:
Here,
trainer
is an instance ofSFTTrainer
.My question is, when I directly use the instantiated
SFTTrainer
to executetrainer.predict
, thepredictions
in the result contains normal logits. However, after processingtrainer
withtrain_on_responses_only
and then executingtrainer.predict
, I was surprised to find that thepredictions
in the result is an empty tuple.Why does this happen? How can I make it return logits as expected?
The text was updated successfully, but these errors were encountered: