llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-28 12:24:35 +00:00

History

Nicolò Scipione 40c6d79fb5 SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584 ) * [SYCL] Move to Compile Time backend selection on oneMKL Interface for NVIDIA backend Move to compile time selection to backend to avoid latency at run time. Add it to all mkl gemm calls and only for NVIDIA backend. Signed-off-by: nscipione <nicolo.scipione@codeplay.com> * Formatting * Address PR comments to increase readibility --------- Signed-off-by: nscipione <nicolo.scipione@codeplay.com>		2024-12-04 09:29:20 +08:00
..
dpct	SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584 )	2024-12-04 09:29:20 +08:00
backend.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
CMakeLists.txt	SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584 )	2024-12-04 09:29:20 +08:00
common.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
common.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
concat.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
concat.hpp	[SYCL] add concat through dim 1/2 (#8483 )	2024-07-15 19:32:15 +08:00
conv.cpp	[SYCL] add conv support (#8688 )	2024-07-29 10:50:27 +08:00
conv.hpp	[SYCL] add conv support (#8688 )	2024-07-29 10:50:27 +08:00
convert.cpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
convert.hpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
dequantize.hpp	Fixed dequant precision issues in Q4_1 and Q5_1 (#9711 )	2024-10-03 07:50:44 +01:00
dmmv.cpp	[SYCL] Fix DMMV dequantization (#9279 )	2024-09-04 16:26:33 +01:00
dmmv.hpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
element_wise.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
element_wise.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
gemm.hpp	[SYCL] Add oneDNN primitive support (#9091 )	2024-08-22 12:50:10 +08:00
ggml-sycl.cpp	SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584 )	2024-12-04 09:29:20 +08:00
im2col.cpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
im2col.hpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
mmq.cpp	ggml : reduce hash table reset cost (#8698 )	2024-07-27 04:41:55 +02:00
mmq.hpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
mmvq.cpp	fix mul_mat_vec_q and *_vec_q error (#9939 )	2024-10-21 14:26:09 +08:00
mmvq.hpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
norm.cpp	sycl : Fixes to broken builds and test-backend-ops (#10257 )	2024-11-13 09:40:57 +00:00
norm.hpp	[SYCL] Fix the sub group size of Intel (#8106 )	2024-07-02 10:16:00 +08:00
outprod.cpp	SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584 )	2024-12-04 09:29:20 +08:00
outprod.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
presets.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
rope.cpp	ggml : move rope type enum to ggml.h (#8949 )	2024-08-13 21:13:15 +02:00
rope.hpp	[SYCL] Update SYCL-Rope op and Refactor (#8157 )	2024-07-01 19:39:06 +08:00
softmax.cpp	[SYCL] fix scratch size of softmax (#8642 )	2024-07-23 15:43:28 +08:00
softmax.hpp	[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266 )	2024-07-05 13:06:13 +08:00
tsembd.cpp	[SYCL] Add `TIMESTEP_EMBEDDING` OP (#8707 )	2024-07-30 14:56:51 +08:00
tsembd.hpp	[SYCL] Add `TIMESTEP_EMBEDDING` OP (#8707 )	2024-07-30 14:56:51 +08:00
vecdotq.hpp	sycl: Use syclcompat::dp4a (#10267 )	2024-11-15 11:09:12 +08:00
wkv6.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
wkv6.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00