llama.cpp/ggml-vulkan-shaders.hpp at 864a99e7a01d9422d2f55618dbe62c8099a2175c - llama.cpp - Gitea: Git with a cup of tea

root/llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-09-22 21:16:20 +00:00

0cc4m 3d7ebf6312

Vulkan Mixture of Experts (MoE) support (#7628 )

* Finish Vulkan mul_mat_id implementation

* Add Vulkan sum_rows and div ops

* Fix MUL_MAT_ID matrix matrix shader

* Fix MUL_MAT_ID matrix vector shader dispatch size

* Fix MUL_MAT_ID matrix vector shader and dispatch code

* Update Vulkan CPU offload for MUL_MAT_ID

* Fix crash when using split mode none and setting a main GPU

2024-06-03 10:59:14 +02:00

8.2 MiB

Raw Blame History

The file is too large to be shown. View Raw