# quantize
You can also use the GGUF-my-repo space on Hugging Face to build your own quants without any setup.
Note: it is synced from llama.cpp `main` every 6 hours.
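For orientation before the tables below, a local quantization run typically converts a model to GGUF and then quantizes it to one of the K-quant types listed. This is a minimal sketch: `llama-quantize` is the binary name in current llama.cpp builds, but the model paths here are hypothetical placeholders.

```bash
# Convert the original model to an F16 GGUF first (e.g. with
# convert_hf_to_gguf.py from this repo), then quantize it down.
# Usage: llama-quantize <input.gguf> <output.gguf> <type>
./llama-quantize ./models/mymodel/ggml-model-f16.gguf \
                 ./models/mymodel/ggml-model-Q4_K_M.gguf \
                 Q4_K_M
```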
## Llama 2 7B

| Quantization | Bits per Weight (BPW) |
|--------------|-----------------------|
| Q2_K         | 3.35                  |
| Q3_K_S       | 3.50                  |
| Q3_K_M       | 3.91                  |
| Q3_K_L       | 4.27                  |
| Q4_K_S       | 4.58                  |
| Q4_K_M       | 4.84                  |
| Q5_K_S       | 5.52                  |
| Q5_K_M       | 5.68                  |
| Q6_K         | 6.56                  |
## Llama 2 13B

| Quantization | Bits per Weight (BPW) |
|--------------|-----------------------|
| Q2_K         | 3.34                  |
| Q3_K_S       | 3.48                  |
| Q3_K_M       | 3.89                  |
| Q3_K_L       | 4.26                  |
| Q4_K_S       | 4.56                  |
| Q4_K_M       | 4.83                  |
| Q5_K_S       | 5.51                  |
| Q5_K_M       | 5.67                  |
| Q6_K         | 6.56                  |
## Llama 2 70B

| Quantization | Bits per Weight (BPW) |
|--------------|-----------------------|
| Q2_K         | 3.40                  |
| Q3_K_S       | 3.47                  |
| Q3_K_M       | 3.85                  |
| Q3_K_L       | 4.19                  |
| Q4_K_S       | 4.53                  |
| Q4_K_M       | 4.80                  |
| Q5_K_S       | 5.50                  |
| Q5_K_M       | 5.65                  |
| Q6_K         | 6.56                  |
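To read these tables in practical terms: BPW times the parameter count gives an approximate on-disk size. A rough back-of-the-envelope check, using the nominal 7e9 parameter count for Llama 2 7B (the actual count is closer to 6.7B, so real files come out slightly smaller):

```bash
# Approximate file size in GB = params (billions) * BPW / 8 bits-per-byte.
# Example: Llama 2 7B at Q4_K_M (4.84 BPW).
echo "scale=2; 7 * 4.84 / 8" | bc   # prints 4.23 -> roughly 4.2 GB
```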