mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-28 12:24:35 +00:00

History

Iwan Kawrakow 0e826d12a5 quantize: be able to specify the token embedding tensor type		2024-03-22 16:27:34 +02:00
..
CMakeLists.txt	build : link against build info instead of compiling against it (#3879 )	2023-11-02 08:50:16 +02:00
quantize.cpp	quantize: be able to specify the token embedding tensor type	2024-03-22 16:27:34 +02:00
README.md	readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (#3340 )	2023-09-27 18:30:36 +03:00

quantize

TODO

Llama 2 7B