root/llama.cpp
mirror of https://github.com/ggerganov/llama.cpp.git, synced 2024-12-25 02:44:36 +00:00
llama.cpp/ggml at commit 7d787ed96c
Latest commit: slaren, 7d787ed96c, ggml : do not crash when quantizing q4_x_x with an imatrix (#9192), 2024-08-26 19:44:43 +02:00
cmake           llama : reorganize source code + improve CMake (#8006)                2024-06-26 18:33:02 +03:00
include         CPU/CUDA: Gemma 2 FlashAttention support (#8542)                      2024-08-24 21:34:59 +02:00
src             ggml : do not crash when quantizing q4_x_x with an imatrix (#9192)    2024-08-26 19:44:43 +02:00
.gitignore      vulkan : cmake integration (#8119)                                    2024-07-13 18:12:39 +02:00
CMakeLists.txt  Vulkan Optimizations and Fixes (#8959)                                2024-08-14 18:32:53 +02:00