..
dpct
[SYCL] Updated SYCL device filtering ( #8901 )
2024-08-07 11:25:36 +01:00
backend.hpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
common.cpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
common.hpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
concat.cpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
concat.hpp
[SYCL] add concat through dim 1/2 ( #8483 )
2024-07-15 19:32:15 +08:00
conv.cpp
[SYCL] add conv support ( #8688 )
2024-07-29 10:50:27 +08:00
conv.hpp
[SYCL] add conv support ( #8688 )
2024-07-29 10:50:27 +08:00
convert.cpp
[SYCL] Fix SYCL im2col
and convert
Overflow with Large Dims ( #9052 )
2024-08-20 23:06:51 +08:00
convert.hpp
[SYCL] Fix SYCL im2col
and convert
Overflow with Large Dims ( #9052 )
2024-08-20 23:06:51 +08:00
dequantize.hpp
Fixed dequant precision issues in Q4_1 and Q5_1 ( #9711 )
2024-10-03 07:50:44 +01:00
dmmv.cpp
[SYCL] Fix DMMV dequantization ( #9279 )
2024-09-04 16:26:33 +01:00
dmmv.hpp
llama : reorganize source code + improve CMake ( #8006 )
2024-06-26 18:33:02 +03:00
element_wise.cpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
element_wise.hpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
gemm.hpp
[SYCL] Add oneDNN primitive support ( #9091 )
2024-08-22 12:50:10 +08:00
im2col.cpp
[SYCL] Fix SYCL im2col
and convert
Overflow with Large Dims ( #9052 )
2024-08-20 23:06:51 +08:00
im2col.hpp
[SYCL] Fix SYCL im2col
and convert
Overflow with Large Dims ( #9052 )
2024-08-20 23:06:51 +08:00
mmq.cpp
ggml : reduce hash table reset cost ( #8698 )
2024-07-27 04:41:55 +02:00
mmq.hpp
llama : reorganize source code + improve CMake ( #8006 )
2024-06-26 18:33:02 +03:00
mmvq.cpp
fix mul_mat_vec_q and *_vec_q error ( #9939 )
2024-10-21 14:26:09 +08:00
mmvq.hpp
llama : reorganize source code + improve CMake ( #8006 )
2024-06-26 18:33:02 +03:00
norm.cpp
ggml : add epsilon as a parameter for group_norm ( #8818 )
2024-08-06 10:26:46 +03:00
norm.hpp
[SYCL] Fix the sub group size of Intel ( #8106 )
2024-07-02 10:16:00 +08:00
outprod.cpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
outprod.hpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
presets.hpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
rope.cpp
ggml : move rope type enum to ggml.h ( #8949 )
2024-08-13 21:13:15 +02:00
rope.hpp
[SYCL] Update SYCL-Rope op and Refactor ( #8157 )
2024-07-01 19:39:06 +08:00
softmax.cpp
[SYCL] fix scratch size of softmax ( #8642 )
2024-07-23 15:43:28 +08:00
softmax.hpp
[SYCL] Fix WARP_SIZE=16 bug of Intel GPU ( #8266 )
2024-07-05 13:06:13 +08:00
tsembd.cpp
[SYCL] Add TIMESTEP_EMBEDDING
OP ( #8707 )
2024-07-30 14:56:51 +08:00
tsembd.hpp
[SYCL] Add TIMESTEP_EMBEDDING
OP ( #8707 )
2024-07-30 14:56:51 +08:00
vecdotq.hpp
CUDA: refactor and optimize IQ MMVQ ( #8215 )
2024-07-01 20:39:06 +02:00
wkv6.cpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00
wkv6.hpp
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration ( #10133 )
2024-11-07 15:19:10 +08:00