llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-11-14 23:09:53 +00:00

History

Zhiyuan Li 3bcd40b3c5 Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 ) * rwkv6: rename to wkv6 * rwkv6: support avx2 avx512 armv8 armv9 * rwkv6: update cuda file name * rwkv6: rename params * wkv on sycl * sycl: add some ops * sycl: Enhance OP support judgment * wkv6: drop armv9 and tranfer to GGML style ggml-ci * sync : ggml * update the function to use appropriate types * fix define error * Update ggml/src/ggml-cpu.c * add appropriate asserts * move element-wise functions outside * put the declaration outside the loop * rewrite to be more inline with the common pattern for distributing threads * use recommended way GGML_TENSOR_LOCALS --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: Diego Devesa <slarengh@gmail.com> Co-authored-by: Plamen Minev <pacominev@gmail.com> Co-authored-by: Yuri Khrustalev <ykhrustalev@users.noreply.github.com> Co-authored-by: Meng, Hengyu <airdldl@163.com>		2024-11-07 15:19:10 +08:00
..
dpct	[SYCL] Updated SYCL device filtering (#8901 )	2024-08-07 11:25:36 +01:00
backend.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
common.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
common.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
concat.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
concat.hpp	[SYCL] add concat through dim 1/2 (#8483 )	2024-07-15 19:32:15 +08:00
conv.cpp	[SYCL] add conv support (#8688 )	2024-07-29 10:50:27 +08:00
conv.hpp	[SYCL] add conv support (#8688 )	2024-07-29 10:50:27 +08:00
convert.cpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
convert.hpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
dequantize.hpp	Fixed dequant precision issues in Q4_1 and Q5_1 (#9711 )	2024-10-03 07:50:44 +01:00
dmmv.cpp	[SYCL] Fix DMMV dequantization (#9279 )	2024-09-04 16:26:33 +01:00
dmmv.hpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
element_wise.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
element_wise.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
gemm.hpp	[SYCL] Add oneDNN primitive support (#9091 )	2024-08-22 12:50:10 +08:00
im2col.cpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
im2col.hpp	[SYCL] Fix SYCL `im2col` and `convert` Overflow with Large Dims (#9052 )	2024-08-20 23:06:51 +08:00
mmq.cpp	ggml : reduce hash table reset cost (#8698 )	2024-07-27 04:41:55 +02:00
mmq.hpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
mmvq.cpp	fix mul_mat_vec_q and *_vec_q error (#9939 )	2024-10-21 14:26:09 +08:00
mmvq.hpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
norm.cpp	ggml : add epsilon as a parameter for group_norm (#8818 )	2024-08-06 10:26:46 +03:00
norm.hpp	[SYCL] Fix the sub group size of Intel (#8106 )	2024-07-02 10:16:00 +08:00
outprod.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
outprod.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
presets.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
rope.cpp	ggml : move rope type enum to ggml.h (#8949 )	2024-08-13 21:13:15 +02:00
rope.hpp	[SYCL] Update SYCL-Rope op and Refactor (#8157 )	2024-07-01 19:39:06 +08:00
softmax.cpp	[SYCL] fix scratch size of softmax (#8642 )	2024-07-23 15:43:28 +08:00
softmax.hpp	[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266 )	2024-07-05 13:06:13 +08:00
tsembd.cpp	[SYCL] Add `TIMESTEP_EMBEDDING` OP (#8707 )	2024-07-30 14:56:51 +08:00
tsembd.hpp	[SYCL] Add `TIMESTEP_EMBEDDING` OP (#8707 )	2024-07-30 14:56:51 +08:00
vecdotq.hpp	CUDA: refactor and optimize IQ MMVQ (#8215 )	2024-07-01 20:39:06 +02:00
wkv6.cpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00
wkv6.hpp	Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133 )	2024-11-07 15:19:10 +08:00