-
Notifications
You must be signed in to change notification settings - Fork 438
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ET-VK] Using shared memory offsetting in conv2d pw and saving ivec3 pos instead of ivec2 to improve performance. #7817
[ET-VK] Using shared memory offsetting in conv2d pw and saving ivec3 pos instead of ivec2 to improve performance. #7817
Conversation
…pos instead of ivec2 to improve performance. This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7817
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (1 Unrelated Failure)As of commit b89ced3 with merge base e78ed83 ( BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
…ving ivec3 pos instead of ivec2 to improve performance." This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D68400786 |
16d44b6
into
gh/trivedivivek/49/base
…pos instead of ivec2 to improve performance. Pull Request resolved: #7817 This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. ghstack-source-id: 263238733 @exported-using-ghexport Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/)
…pos instead of ivec2 to improve performance. Pull Request resolved: #7817 This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. ghstack-source-id: 263238733 @exported-using-ghexport Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) Co-authored-by: Vivek Trivedi <[email protected]>
…pos instead of ivec2 to improve performance. Pull Request resolved: #7817 This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. ghstack-source-id: 263238733 @exported-using-ghexport Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) Co-authored-by: Vivek Trivedi <[email protected]>
…pos instead of ivec2 to improve performance. Pull Request resolved: pytorch#7817 This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2. ghstack-source-id: 263238733 @exported-using-ghexport Differential Revision: [D68400786](https://our.internmc.facebook.com/intern/diff/D68400786/) Co-authored-by: Vivek Trivedi <[email protected]>
Stack from ghstack (oldest at bottom):
This diff changes conv2d pw op shader to offset shared memory based on thread local index to improve performance. Change also saves pos as ivec3 pos instead of ivec2.
Differential Revision: D68400786