Commit Graph

2 Commits

Author SHA1 Message Date
Georgi Gerganov
d3e2cedb79
ggml : move FP16 <-> FP32 stuff to ggml-impl.h
ggml-ci
2023-10-30 16:39:23 +02:00
Georgi Gerganov
d69d777c02
ggml : quantization refactoring (#3833)
* ggml : factor all quantization code in ggml-quants

ggml-ci

* ggml-quants : fix Zig and Swift builds + quantize tool

ggml-ci

* quantize : --pure option for disabling k-quant mixtures

---------

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>
2023-10-29 18:32:28 +02:00