llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-25 10:54:36 +00:00

unbounded 5f939498d5 ggml : unit test for quantization functions (#953)	2023-04-22 12:10:39 +03:00

* Unit test for quantization functions
  Use the ggml_internal_get_quantize_fn function to loop through all quantization formats and run a sanity check on the result.
  Also add a microbenchmark that times these functions directly without running the rest of the GGML graph.
* test-quantize-fns: CI fixes
  Fix issues uncovered in CI
  - need to use sizes divisible by 32*8 for loop unrolling
  - use intrinsic header that should work on Mac
* test-quantize: remove
  Per PR comment, subsumed by test-quantize-fns
* test-quantize: fix for q8_0 intermediates
CMakeLists.txt	ggml : unit test for quantization functions (#953)	2023-04-22 12:10:39 +03:00
test-double-float.c	all : be more strict about converting float to double (#458)	2023-03-28 19:48:20 +03:00
test-quantize-fns.cpp	ggml : unit test for quantization functions (#953)	2023-04-22 12:10:39 +03:00
test-quantize-perf.cpp	ggml : unit test for quantization functions (#953)	2023-04-22 12:10:39 +03:00
test-tokenizer-0.cpp	llama : well-defined static initialization of complex objects (#927)	2023-04-17 17:41:53 +03:00
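
The commit message above describes a sanity check that quantizes data and checks the result. As a rough illustration of that idea only, the sketch below runs a round trip through a made-up 8-bit block format and compares the RMS error against a threshold. It does not use ggml: the BlockQ8 layout, the function names (quantize, dequantize, rmse, make_test_data, roundtrip_ok), the synthetic data, and the 0.01 threshold are all assumptions chosen for this example, not the contents of test-quantize-fns.cpp.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical block format for illustration (NOT ggml's layout):
// 32 floats share one float scale and are stored as signed 8-bit values.
constexpr std::size_t kBlock = 32;

struct BlockQ8 {
    float scale;
    std::int8_t q[kBlock];
};

// Quantize a buffer whose length is a multiple of kBlock.
std::vector<BlockQ8> quantize(const std::vector<float>& x) {
    assert(x.size() % kBlock == 0);
    std::vector<BlockQ8> out(x.size() / kBlock);
    for (std::size_t b = 0; b < out.size(); ++b) {
        // The largest magnitude in the block determines the scale.
        float amax = 0.0f;
        for (std::size_t i = 0; i < kBlock; ++i)
            amax = std::max(amax, std::fabs(x[b * kBlock + i]));
        const float scale = amax / 127.0f;
        const float inv = (scale != 0.0f) ? 1.0f / scale : 0.0f;
        out[b].scale = scale;
        for (std::size_t i = 0; i < kBlock; ++i) {
            long r = std::lround(x[b * kBlock + i] * inv);
            r = std::max(-127L, std::min(127L, r));  // guard against rounding overshoot
            out[b].q[i] = static_cast<std::int8_t>(r);
        }
    }
    return out;
}

// Reverse the mapping: value = scale * stored integer.
std::vector<float> dequantize(const std::vector<BlockQ8>& blocks) {
    std::vector<float> y(blocks.size() * kBlock);
    for (std::size_t b = 0; b < blocks.size(); ++b)
        for (std::size_t i = 0; i < kBlock; ++i)
            y[b * kBlock + i] = blocks[b].scale * static_cast<float>(blocks[b].q[i]);
    return y;
}

// Root-mean-square difference between two equally sized buffers.
float rmse(const std::vector<float>& a, const std::vector<float>& b) {
    assert(a.size() == b.size() && !a.empty());
    double sum = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double d = static_cast<double>(a[i]) - static_cast<double>(b[i]);
        sum += d * d;
    }
    return static_cast<float>(std::sqrt(sum / static_cast<double>(a.size())));
}

// Deterministic synthetic input so the check is reproducible.
std::vector<float> make_test_data(std::size_t n) {
    std::vector<float> v(n);
    for (std::size_t i = 0; i < n; ++i)
        v[i] = 0.1f + 2.0f * std::cos(static_cast<float>(i));
    return v;
}

// Sanity check: quantize, dequantize, and require a small RMS error.
bool roundtrip_ok(std::size_t n, float max_rmse) {
    const std::vector<float> x = make_test_data(n);
    const std::vector<float> y = dequantize(quantize(x));
    return rmse(x, y) <= max_rmse;
}
```

A test driver could call roundtrip_ok(32 * 8, 0.01f) from main and return a nonzero exit code on failure, which is the usual convention for a test executable registered with CTest. A check that covers several formats would repeat the same round trip once per format, each with its own error threshold.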