mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-29 12:54:35 +00:00


quantize

TODO

Llama 2 7B