llama.cpp/ggml
efb6ae9630 feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019)
* implemented argmax kernel

* tpig -> tgpig

* change to strides

* contiguous assertions

* kernel working and tested

* argmax simd parallel implementation

* added 2 new tests for argmax in test-backend-ops

* cosmetic fixes

* added 3 test cases for perf eval

* add test_argmax in make_test_cases_perf

* Update test-backend-ops.cpp

Co-authored-by: Diego Devesa <slarengh@gmail.com>

2024-12-03 20:04:49 +02:00
include ggml : move AMX to the CPU backend (#10570) 2024-11-29 21:54:58 +01:00
src feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019) 2024-12-03 20:04:49 +02:00
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt ggml : automatic selection of best CPU backend (#10606) 2024-12-01 16:12:41 +01:00