mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-24 02:14:35 +00:00


quantize

You can also use the GGUF-my-repo space on Hugging Face to build your own quants without any setup.

Note: The space is synced from the llama.cpp main branch every 6 hours.

Llama 2 7B