[pull] master from ggerganov:master #149

pull · 2025-01-10T05:45:18Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

* SYCL: refactor ggml_sycl_compute_forward * SYCL: add back GGML_USED(dst) to ggml_sycl_cpy * SYCL: add function name to noop debug * SYCL: Some device info print refactoring and add details of XMX availability

@compilade

llama: add support for QRWKV6 model architecture (#11001) * WIP: Add support for RWKV6Qwen2 Signed-off-by: Molly Sophia <[email protected]> * RWKV: Some graph simplification Signed-off-by: Molly Sophia <[email protected]> * Add support for RWKV6Qwen2 with cpu and cuda GLA Signed-off-by: Molly Sophia <[email protected]> * RWKV6[QWEN2]: Concat lerp weights together to reduce cpu overhead Signed-off-by: Molly Sophia <[email protected]> * Fix some typos Signed-off-by: Molly Sophia <[email protected]> * code format changes Signed-off-by: Molly Sophia <[email protected]> * Fix wkv test & add gla test Signed-off-by: Molly Sophia <[email protected]> * Fix cuda warning Signed-off-by: Molly Sophia <[email protected]> * Update README.md Signed-off-by: Molly Sophia <[email protected]> * Update ggml/src/ggml-cuda/gla.cu Co-authored-by: Georgi Gerganov <[email protected]> * Fix fused lerp weights loading with RWKV6 Signed-off-by: Molly Sophia <[email protected]> * better sanity check skipping for QRWKV6 in llama-quant thanks @compilade Signed-off-by: Molly Sophia <[email protected]> Co-authored-by: compilade <[email protected]> --------- Signed-off-by: Molly Sophia <[email protected]> Co-authored-by: Georgi Gerganov <[email protected]> Co-authored-by: compilade <[email protected]>

…roup_size_control validation error (#11161) * Vulkan: Remove float16 use in shaders * Fix validation error about subgroup_size_control extension

qnixsynapse and others added 3 commits January 10, 2025 08:13

SYCL: Refactor ggml_sycl_compute_forward (#11121)

c6860cc

* SYCL: refactor ggml_sycl_compute_forward * SYCL: add back GGML_USED(dst) to ggml_sycl_cpy * SYCL: add function name to noop debug * SYCL: Some device info print refactoring and add details of XMX availability

Vulkan: Fix float16 use on devices without float16 support + fix subg…

c3f9d25

…roup_size_control validation error (#11161) * Vulkan: Remove float16 use in shaders * Fix validation error about subgroup_size_control extension

pull bot added the ⤵️ pull label Jan 10, 2025

pull bot merged commit c3f9d25 into syther-labs:master Jan 10, 2025

github-actions bot added testing python ggml SYCL Nvidia GPU Vulkan labels Jan 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] master from ggerganov:master #149

[pull] master from ggerganov:master #149

pull bot commented Jan 10, 2025 •

edited

Loading

[pull] master from ggerganov:master #149

[pull] master from ggerganov:master #149

Conversation

pull bot commented Jan 10, 2025 • edited Loading

pull bot commented Jan 10, 2025 •

edited

Loading