vulkan: implement several ops relevant for ggml_opt #11769

remyoudompheng · 2025-02-09T10:09:33Z

This PR implements several GGML opcodes which are possibly relevant for #10544 (SUM, ARGMAX, SUB, COUNT_EQUAL, OPT_STEP_ADAMW, REPEAT_BACK).
After these patches, it is possible to run test-opt using the Vulkan backend (with a few failures maybe caused by rounding issues?).

Several issues were identified in test-backend-ops:

SUB was not tested at all
REPEAT_BACK has a few cases not supported by the CPU backend (crash with -b CPU)

Several issues were identified in Vulkan CHECK_RESULTS mode:

RWKV_WKV6 was crashing
various buffers were not freed

0cc4m · 2025-02-10T07:00:57Z

ggml/src/ggml-vulkan/vulkan-shaders/count_equal.comp

+        count += uint(data_a[idx] == data_b[idx]);
+    }
+
+    atomicAdd(data_d[0], D_TYPE(count));


This shader crashes my Intel A770. I assume it's this atomicAdd. Maybe there is a way to avoid it?

I'm not sure how to perform reduction with multiple workgroups without adding an extra buffer.
Maybe doing a single atomic per warp helps with your crash?

Does it also crash with this variant : a1633e4 ?

It's surprising this crashes because int32 atomics in compute shaders are required in vulkan 1.0. Does it crash during the compile or while executing? Maybe the compiler would handle uint better?

remyoudompheng added 9 commits February 9, 2025 10:56

vulkan: support memset_tensor

5c1d8a9

vulkan: support GGML_OP_SUM

abf4c2e

vulkan: implement GGML_OP_ARGMAX

deb15e3

vulkan: implement GGML_OP_SUB

148f586

vulkan: implement GGML_OP_COUNT_EQUAL

095f8d1

vulkan: implement GGML_OP_OPT_STEP_ADAMW

9526033

vulkan: fix check_results RWKV_WKV6 crash and memory leaks

e6a2c06

vulkan: implement GGML_OP_REPEAT_BACK

bc34976

tests: remove invalid test-backend-ops REPEAT_BACK tests

941efc0

github-actions bot added testing Everything test related Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Feb 9, 2025

0cc4m reviewed Feb 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vulkan: implement several ops relevant for ggml_opt #11769

vulkan: implement several ops relevant for ggml_opt #11769

remyoudompheng commented Feb 9, 2025

0cc4m Feb 10, 2025

remyoudompheng Feb 10, 2025

jeffbolznv Feb 10, 2025

vulkan: implement several ops relevant for ggml_opt #11769

Are you sure you want to change the base?

vulkan: implement several ops relevant for ggml_opt #11769

Conversation

remyoudompheng commented Feb 9, 2025

0cc4m Feb 10, 2025

Choose a reason for hiding this comment

remyoudompheng Feb 10, 2025

Choose a reason for hiding this comment

jeffbolznv Feb 10, 2025

Choose a reason for hiding this comment