llama.cpp/ggml/src/ggml-sycl
Johannes Gäßler cb5fad4c6c
CUDA: refactor and optimize IQ MMVQ (#8215)
* CUDA: refactor and optimize IQ MMVQ

* uint -> uint32_t

* __dp4a -> ggml_cuda_dp4a

* remove MIN_CC_DP4A checks

* change default

* try CI fix
2024-07-01 20:39:06 +02:00
..
dpct llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
backend.hpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
common.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
common.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
convert.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
convert.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequantize.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dmmv.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dmmv.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmq.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmvq.cpp CUDA: refactor and optimize IQ MMVQ (#8215) 2024-07-01 20:39:06 +02:00
mmvq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
presets.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
rope.cpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
rope.hpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
vecdotq.hpp CUDA: refactor and optimize IQ MMVQ (#8215) 2024-07-01 20:39:06 +02:00