.. |
dpct
|
Enabled more data types for oneMKL gemm_batch (#8236)
|
2024-07-05 13:23:25 +01:00 |
backend.hpp
|
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
|
2024-07-05 13:06:13 +08:00 |
common.cpp
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
common.hpp
|
rm get_work_group_size() by local cache for performance (#8286)
|
2024-07-05 10:32:29 +08:00 |
convert.cpp
|
Dequant improvements rebase (#8255)
|
2024-07-03 09:55:34 +08:00 |
convert.hpp
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
dequantize.hpp
|
Dequant improvements rebase (#8255)
|
2024-07-03 09:55:34 +08:00 |
dmmv.cpp
|
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
|
2024-07-05 13:06:13 +08:00 |
dmmv.hpp
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
mmq.cpp
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
mmq.hpp
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
mmvq.cpp
|
[SYCL] Fix the sub group size of Intel (#8106)
|
2024-07-02 10:16:00 +08:00 |
mmvq.hpp
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
norm.cpp
|
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
|
2024-07-05 13:06:13 +08:00 |
norm.hpp
|
[SYCL] Fix the sub group size of Intel (#8106)
|
2024-07-02 10:16:00 +08:00 |
presets.hpp
|
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
|
2024-07-05 13:06:13 +08:00 |
rope.cpp
|
[SYCL] Update SYCL-Rope op and Refactor (#8157)
|
2024-07-01 19:39:06 +08:00 |
rope.hpp
|
[SYCL] Update SYCL-Rope op and Refactor (#8157)
|
2024-07-01 19:39:06 +08:00 |
softmax.cpp
|
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
|
2024-07-05 13:06:13 +08:00 |
softmax.hpp
|
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
|
2024-07-05 13:06:13 +08:00 |
vecdotq.hpp
|
CUDA: refactor and optimize IQ MMVQ (#8215)
|
2024-07-01 20:39:06 +02:00 |