-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Low accuracy of own trained model #1
Comments
This is unexpected, and is probably due to the change in tensorflow versions (the code is written with TF 0.7.0 and is probably not compatible with newer version of tensorflow). If you download the pre-trained models and test their accuracy, do you get reasonable performances? |
Thanks for response. My TensorFlow is version 0.12. The pre-trained models achieved similar result as the paper. Actually, the accuracy is reasonable at the beginning of high_res training initialized with low_res, but it goes down when the number of iterations increases. |
Mmm, that's very strange. Probably the training batches weren't correctly built? Blow is my terminal output when re-training the model using this repo. The output when training the high resolution model is in this gist: |
well, the output is different a lot from each other. train_referit_seg_highres.log |
I'm not exactly sure what's wrong. The final accuracy |
Btw, if you are in a hurry and the problem persists, you can try out the Caffe implementation of this project (https://github.com/Seth-Park/text_objseg_caffe implemented by Seth Park), which gives higher performance with a slightly different set of hyperparameters. |
Thanks for your kind help. I'll try the caffe implementation later. |
Seems the CUDA/GPU issue tensorflow/tensorflow#3507, though after upgrading CUDA version to 8.0, the issue is not resolved yet. I am trying to train on CPU only. |
Hi ronghang
I am trying to train the high resolution model using your code. I followed all the instructions in the README, and did not change any parameters in the code, but the the performance of trained model is extremely low, just about 4.5% for overall IoU. Is it possible that you updated the code afterward, but the modified code is not uploaded to the github?
Thanks
The text was updated successfully, but these errors were encountered: