Validation error & crash on wgpu Vulkan + Windows #6832
Comments
Another issue I reported for an early wgpu 23 version might or might not be related: #6279. If nothing else, the bisection there also pointed to some locking behaviour. It also looks similar to #6323: a compute-heavy workload, and I am getting validation errors of the form
If a single submission runs longer than 60s, you might see that. If that's not the case, I'm not sure what the issue is off the top of my head.
It's definitely not going over 60s: the amount of GPU work is on the order of ~100ms, and putting a submit() after every submit() call still crashes. I've tried downgrading to 22.10 but it still seems to crash. I've also tried adding
But the stack trace is still
With a stack trace pointing to wherever the last submit was, or other similar traces. If you have any tips on what to try or how to investigate, that would be much appreciated!
Description
When running my app (https://github.com/ArthurBrussee/brush), training proceeds steadily for a while, until the app crashes. The symptoms are hard to pin down; the crash happens fairly randomly. Just before the crash the Vulkan validation layer spits out a bunch of errors about semaphores. Most tellingly, some semaphore value seems to be u64::MAX, which Vulkan trips over.
This causes a device loss (possibly?) after which wgpu crashes because of #6378, I think.
I have not been able to reproduce this on Metal, not sure about Vulkan + Linux.
Extra materials
Log with validation errors
log.txt
Platform
wgpu (reproduces on trunk, 23.0, and 23.1), Windows 11, Vulkan, 4070 on driver 566.36.