Broken/split lines in segmentation #677

Open
johnlockejrr opened this issue Jan 9, 2025 · 1 comment

johnlockejrr commented Jan 9, 2025

I'm trying to train or fine-tune a segmentation model on a fairly uniform, single-column dataset. Whether I train from scratch or fine-tune, I keep ending up with broken/split lines here and there. What could be the issue?

How I tried to train the model:

ketos segtrain -d cuda:0 -f page -t sam_40_mss.txt -q early --min-epochs 50 --resize both --schedule reduceonplateau -i /home/incognito/kraken-train/kraken-5.2.9-py3.11/lib/python3.11/site-packages/kraken/blla.mlmodel -o sam_40_mss_seg/sam_40_mss_seg_v1
ketos segtrain -d cuda:0 -f page -t sam_40_mss.txt -q early --min-epochs 40 -cl --resize both --schedule reduceonplateau -i /home/incognito/kraken-train/kraken-5.2.9-py3.11/lib/python3.11/site-packages/kraken/blla.mlmodel -o sam_40_mss_seg/sam_40_mss_seg_v2
ketos segtrain -d cuda:0 -f page -t sam_40_mss.txt -q early --min-epochs 40 --resize both --schedule reduceonplateau -i BiblIAlong02_se3_2_tl.mlmodel -o sam_40_mss_seg/sam_40_mss_seg_v3
ketos segtrain -d cuda:0 -f page -t sam_40_mss.txt -q early --min-epochs 40 --resize both -tl --schedule reduceonplateau -i BiblIAlong02_se3_2_tl.mlmodel -o sam_40_mss_seg/sam_40_mss_seg_v4
ketos segtrain -d cuda:0 -f page -t sam_40_mss.txt -q early --min-epochs 40 -cl --schedule reduceonplateau -o sam_40_mss_seg/sam_40_mss_seg_v5

Example output:

image

Below is the result of training from scratch: no broken lines, but the polygon masks are awful.

image

johnlockejrr (Author) commented

UPDATE:

I fine-tuned ubma_segmentation.mlmodel on my data and it now seems to do the job. I'm not sure how that model was trained; I'll ask @stweil.

image
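
The exact command used for this last run was not posted. As a rough sketch only, assuming the same flags as the v3 run above with just the base model swapped, it might have looked like the following. The output prefix `sam_40_mss_seg/sam_40_mss_seg_ubma` and the `DRY_RUN` switch are hypothetical names added for illustration; by default the script only prints the command instead of starting training.

```shell
#!/bin/sh
# Sketch: fine-tune from ubma_segmentation.mlmodel with the flags of the v3 run.
# Assumption: only -i and -o differ from the commands earlier in the thread.
set -eu

BASE_MODEL="${BASE_MODEL:-ubma_segmentation.mlmodel}"            # model named in the update
MANIFEST="${MANIFEST:-sam_40_mss.txt}"                           # manifest from the earlier runs
OUT_PREFIX="${OUT_PREFIX:-sam_40_mss_seg/sam_40_mss_seg_ubma}"   # hypothetical output name
DRY_RUN="${DRY_RUN:-1}"                                          # 1 = print only, 0 = run ketos

run_segtrain() {
  # Build the argument list once so the printed and executed commands match.
  set -- ketos segtrain -d cuda:0 -f page -t "$MANIFEST" \
    -q early --min-epochs 40 --resize both --schedule reduceonplateau \
    -i "$BASE_MODEL" -o "$OUT_PREFIX"
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

run_segtrain
```

Setting `DRY_RUN=0` runs the command for real. Adding `-cl` or `-tl`, as in the v2 and v4 runs, would be the obvious variation if the ground-truth lines are centerlines or toplines rather than baselines.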
