llama.cpp/ggml/src/ggml-sycl
luoyu-intel 1731d4238f
[SYCL] Add oneDNN primitive support (#9091)
* add onednn

* add sycl_f16

* add dnnl stream

* add engine map

* use dnnl for intel only

* use fp16fp16fp16

* update doc
2024-08-22 12:50:10 +08:00
..
dpct [SYCL] Updated SYCL device filtering (#8901) 2024-08-07 11:25:36 +01:00
backend.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
common.cpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
common.hpp [SYCL] Add oneDNN primitive support (#9091) 2024-08-22 12:50:10 +08:00
concat.cpp [SYCL] add concat through dim 1/2 (#8483) 2024-07-15 19:32:15 +08:00
concat.hpp [SYCL] add concat through dim 1/2 (#8483) 2024-07-15 19:32:15 +08:00
conv.cpp [SYCL] add conv support (#8688) 2024-07-29 10:50:27 +08:00
conv.hpp [SYCL] add conv support (#8688) 2024-07-29 10:50:27 +08:00
convert.cpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
convert.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
dequantize.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
dmmv.cpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
dmmv.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
gemm.hpp [SYCL] Add oneDNN primitive support (#9091) 2024-08-22 12:50:10 +08:00
im2col.cpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
im2col.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
mmq.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
mmq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmvq.cpp [SYCL] Fixing wrong VDR iq4nl value (#8812) 2024-08-02 08:55:17 +08:00
mmvq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
norm.cpp ggml : add epsilon as a parameter for group_norm (#8818) 2024-08-06 10:26:46 +03:00
norm.hpp [SYCL] Fix the sub group size of Intel (#8106) 2024-07-02 10:16:00 +08:00
presets.hpp [SYCL] Add TIMESTEP_EMBEDDING OP (#8707) 2024-07-30 14:56:51 +08:00
rope.cpp ggml : move rope type enum to ggml.h (#8949) 2024-08-13 21:13:15 +02:00
rope.hpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
softmax.cpp [SYCL] fix scratch size of softmax (#8642) 2024-07-23 15:43:28 +08:00
softmax.hpp [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266) 2024-07-05 13:06:13 +08:00
tsembd.cpp [SYCL] Add TIMESTEP_EMBEDDING OP (#8707) 2024-07-30 14:56:51 +08:00
tsembd.hpp [SYCL] Add TIMESTEP_EMBEDDING OP (#8707) 2024-07-30 14:56:51 +08:00
vecdotq.hpp CUDA: refactor and optimize IQ MMVQ (#8215) 2024-07-01 20:39:06 +02:00