llama.cpp/ggml/src/ggml-sycl
Nicolò Scipione 40c6d79fb5
SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584)
* [SYCL] Move to Compile Time backend selection on oneMKL Interface for NVIDIA backend

Move to compile time selection to backend to avoid latency at run time.
Add it to all mkl gemm calls and only for NVIDIA backend.

Signed-off-by: nscipione <nicolo.scipione@codeplay.com>

* Formatting

* Address PR comments to increase readibility

---------

Signed-off-by: nscipione <nicolo.scipione@codeplay.com>
2024-12-04 09:29:20 +08:00
..
dpct SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584) 2024-12-04 09:29:20 +08:00
backend.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
CMakeLists.txt SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584) 2024-12-04 09:29:20 +08:00
common.cpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
common.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
concat.cpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
concat.hpp [SYCL] add concat through dim 1/2 (#8483) 2024-07-15 19:32:15 +08:00
conv.cpp [SYCL] add conv support (#8688) 2024-07-29 10:50:27 +08:00
conv.hpp [SYCL] add conv support (#8688) 2024-07-29 10:50:27 +08:00
convert.cpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
convert.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
dequantize.hpp Fixed dequant precision issues in Q4_1 and Q5_1 (#9711) 2024-10-03 07:50:44 +01:00
dmmv.cpp [SYCL] Fix DMMV dequantization (#9279) 2024-09-04 16:26:33 +01:00
dmmv.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
element_wise.cpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
element_wise.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
gemm.hpp [SYCL] Add oneDNN primitive support (#9091) 2024-08-22 12:50:10 +08:00
ggml-sycl.cpp SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584) 2024-12-04 09:29:20 +08:00
im2col.cpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
im2col.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
mmq.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
mmq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmvq.cpp fix mul_mat_vec_q and *_vec_q error (#9939) 2024-10-21 14:26:09 +08:00
mmvq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
norm.cpp sycl : Fixes to broken builds and test-backend-ops (#10257) 2024-11-13 09:40:57 +00:00
norm.hpp [SYCL] Fix the sub group size of Intel (#8106) 2024-07-02 10:16:00 +08:00
outprod.cpp SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584) 2024-12-04 09:29:20 +08:00
outprod.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
presets.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
rope.cpp ggml : move rope type enum to ggml.h (#8949) 2024-08-13 21:13:15 +02:00
rope.hpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
softmax.cpp [SYCL] fix scratch size of softmax (#8642) 2024-07-23 15:43:28 +08:00
softmax.hpp [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266) 2024-07-05 13:06:13 +08:00
tsembd.cpp [SYCL] Add TIMESTEP_EMBEDDING OP (#8707) 2024-07-30 14:56:51 +08:00
tsembd.hpp [SYCL] Add TIMESTEP_EMBEDDING OP (#8707) 2024-07-30 14:56:51 +08:00
vecdotq.hpp sycl: Use syclcompat::dp4a (#10267) 2024-11-15 11:09:12 +08:00
wkv6.cpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
wkv6.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00