llama.cpp/ggml
2024-11-08 10:11:22 +02:00
Name            Last commit                                             Date
cmake           llama : reorganize source code + improve CMake (#8006)  2024-06-26 18:33:02 +03:00
include         ggml : add ggml_flash_attn_ext_get_prec                 2024-11-08 10:11:21 +02:00
src             metal : use F16 precision in FA kernels                 2024-11-08 10:11:22 +02:00
.gitignore      vulkan : cmake integration (#8119)                      2024-07-13 18:12:39 +02:00
CMakeLists.txt  metal : use F16 precision in FA kernels                 2024-11-08 10:11:22 +02:00