llama.cpp/ggml/src/ggml-sycl
slaren 2b1f616b20
ggml : reduce hash table reset cost (#8698)
* ggml : reduce hash table reset cost

* fix unreachable code warnings after GGML_ASSERT(false)

* GGML_ASSERT(false) -> GGML_ABORT("fatal error")

* GGML_ABORT use format string
2024-07-27 04:41:55 +02:00
dpct ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
backend.hpp [SYCL] add concat through dim 1/2 (#8483) 2024-07-15 19:32:15 +08:00
common.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
common.hpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
concat.cpp [SYCL] add concat through dim 1/2 (#8483) 2024-07-15 19:32:15 +08:00
concat.hpp [SYCL] add concat through dim 1/2 (#8483) 2024-07-15 19:32:15 +08:00
convert.cpp [SYCL] Use multi_ptr to clean up deprecated warnings (#8256) 2024-07-10 16:10:49 +01:00
convert.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequantize.hpp Dequant improvements rebase (#8255) 2024-07-03 09:55:34 +08:00
dmmv.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
dmmv.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmq.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
mmq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmvq.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
mmvq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
norm.cpp [SYCL] Use multi_ptr to clean up deprecated warnings (#8256) 2024-07-10 16:10:49 +01:00
norm.hpp [SYCL] Fix the sub group size of Intel (#8106) 2024-07-02 10:16:00 +08:00
presets.hpp [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266) 2024-07-05 13:06:13 +08:00
rope.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
rope.hpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
softmax.cpp [SYCL] fix scratch size of softmax (#8642) 2024-07-23 15:43:28 +08:00
softmax.hpp [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266) 2024-07-05 13:06:13 +08:00
vecdotq.hpp CUDA: refactor and optimize IQ MMVQ (#8215) 2024-07-01 20:39:06 +02:00