-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tesla P4 - CUDA error cudaErrorIllegalAddress #278
Comments
Just hit the same with a new P4. It crashed but and the process hung, couldn't stop or kill the process, had to reboot.
|
Don't know if this adds anything, but have been trying to debug a bit more too, here's a gist with output from compute-sanitizer during the crash (ignore the 'About to uploadArray', was just me trying to see where it might be having issues. https://gist.github.com/vb216/852194bc9e7307e46cc01f1880134f67 Alot of repetition of CudaConvertRMapToLinePoints hitting address out of bounds on different locations That's with ECC turned off on the GPU incase it made any difference (doesnt seem to have done). Just running the debug build to see if anything more useful comes out of that. |
The |
Great news, keen to test |
Do you have an estimate when this might be released or at lease available to build/test? |
Same issue heren also with a P4 |
This should likely be landing before the weekend |
it seems fixed?
|
Closing as it appears the fixe was successful as confirmed by 2 independent users. |
what is your Tesla P4 clock speed when plotting/farming ? |
While previously I have run bladebit CUDA with my Tesla P4, after noticing a few other people reporting issues with the card I tried again and was able to consistently recreate the crash. For this first failure I was using the Ubuntu binary from https://github.com/Chia-Network/bladebit/actions/runs/4129720923/jobs/7135639600#step:3:5.
https://gist.github.com/altendky/3ad52845cbb71c106dbe276f3d95bba1
After Harold requested debug info, I made #271 to get debug builds with the following results from https://github.com/Chia-Network/bladebit/actions/runs/4149269955
https://gist.github.com/altendky/25ef339f5cfd28345dd641bdd9a1e4bb
bladebit/cuda/GpuStreams.cpp
Line 571 in 62af659
The text was updated successfully, but these errors were encountered: