llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-28 20:34:37 +00:00

Pierrick Hymbert a016026a3a server: continuous performance monitoring and PR comment (#6283)	2024-03-27 20:26:49 +01:00

* server: bench: init
* server: bench: reduce list of GPU nodes
* server: bench: fix graph, fix output artifact
* ci: bench: add mermaid in case of image cannot be uploaded
* ci: bench: more resilient, more metrics
* ci: bench: trigger build
* ci: bench: fix duration
* ci: bench: fix typo
* ci: bench: fix mermaid values, markdown generated
* typo on the step name

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* ci: bench: trailing spaces
* ci: bench: move images in a details section
* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

bench.yml	server: continuous performance monitoring and PR comment (#6283)	2024-03-27 20:26:49 +01:00
build.yml	[SYCL] fix no file in win rel (#6314)	2024-03-27 09:47:06 +08:00
close-issue.yml	ci : close inactive issue, increase operations per run (#6270)	2024-03-24 10:57:06 +02:00
code-coverage.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
docker.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
editorconfig.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
gguf-publish.yml	gguf.py : fix CI for publishing GGUF package (#3532)	2023-10-07 22:14:10 +03:00
nix-ci-aarch64.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
nix-ci.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
nix-flake-update.yml	ci: nix-flake-update: new token with pr permissions (#4879)	2024-01-11 17:22:34 +00:00
nix-publish-flake.yml	workflows: nix-flakestry: drop tag filters	2023-12-31 13:14:58 -08:00
python-check-requirements.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
python-lint.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00
server.yml	common: llama_load_model_from_url split support (#6192)	2024-03-23 18:07:00 +01:00
zig-build.yml	ci: apply concurrency limit for github workflows (#6243)	2024-03-22 19:15:06 +02:00