Replies: 1 comment 2 replies
-
Thanks a lot for your attention to our work @GeorgiaA I was wondering what's the bottleneck of Grounded SAM 2 in your underwater scenarios, because we combine grounding model with tracking model in this pipeline, if the zero-shot performance of grounding model is good in your scenes, you don't have to fine-tune this part. If you're facing tracking issue in SAM 2, you only have to fine-tune SAM 2 on your own image with tracking labels. And you can replace the grounding model with other detectors (if you only need to detect a fixed set of classes) like YOLOv11 and fine-tune YOLO in your own scenarios, which is a more effective way. If you want to use a fine-tuned Grounding Model, you can try some open-source solutions (because we did not release our training code) on fine-tune grounding models: MM-Grounding-DINO or YOLO-World. I hope these solutions help for you |
Beta Was this translation helpful? Give feedback.
-
Hello,
I am working with grounded SAM2 to detect objects in underwater images, but it needs fine-tuning to adapt to an underwater setting. My company is asking for an estimated cost of how much it would cost to fine-tune. I plan to use cloud resources at first to estimate future costs, so I need to get a rough idea of how many hours it would take to fine-tune grounded SAM2. I plan to use an A100 GPU with 80GB memory as suggested in the SAM2 fine-tuning/training documentation. I have around 2,500 images that I could use.
Has anyone fine-tuned grounded SAM2 before? If so how long did it take approximately, how much data were you using, and how many epochs did you fine-tune for?
Any information would be greatly appreciated!
Thanks,
Georgia
Beta Was this translation helpful? Give feedback.
All reactions