llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-13 20:14:29 +00:00

History

Diego Devesa be5caccef9 llama : only use default buffer types for the KV cache (#10358 )		2024-11-17 12:25:45 +01:00
..
include	ggml: new optimization interface (ggml/988)	2024-11-17 08:30:29 +02:00
src	llama : only use default buffer types for the KV cache (#10358 )	2024-11-17 12:25:45 +01:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	CUDA: remove DMMV, consolidate F16 mult mat vec (#10318 )	2024-11-17 09:09:55 +01:00