-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About FT Qwen2.5 #68
Comments
Tagging Yuqian @lavinal712 (who contributed the integration of Qwen2.5) for help. |
I'm currently trying to modify the version of Transformers, but I've encountered a tensor dimension mismatch issue, which might be caused by the latest code in Transformers. I will fix this issue as soon as possible. |
The upgrade of the Transformers library has made it impossible to fine-tune many models. The current code temporarily does not fully support the fine-tuning of Qwen2.5-VL. We apologize for the inconvenience. |
@lyons66 The new branch of this repo |
@zjysteven Hi, I have successfully launched the ft for Qwen2.5-vl-7B. Noticing that after upgrading to 4.49.0.dev0 there are two things that need updating:
But I still got an image features token size mismatch problem: This issue wasn't there when fine-tuning Qwen2 on the same datasets. My training sets have different image sizes, but the patching codes previously worked well. I don't know why this Qwen 2.5 became sensitive then. |
There's actually a little bit more than 1, 2 that needed updates, but we've got all of them covered under the branch If not, would you try again under that branch? Also this time please use our |
What is the |
It works. Thanks a lot for pointing out that! |
Hi, I noticed that in the qwen2_vl.py, we only have Qwen2VLForConditionalGeneration from transformers. However, for Qwen2.5, it requires the dev version of transformers (4.49.0) to import Qwen2_5_VLForConditionalGeneration.
So, how could I apply the fine-tuning for Qwen2.5-7b?
The text was updated successfully, but these errors were encountered: