mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-12 11:40:17 +00:00

quantize

You can also use the GGUF-my-repo space on Hugging Face to build your own quants without any setup.

Note: The space is synced from the llama.cpp main branch every 6 hours.

Llama 2 7B