llama.cpp/.github/workflows
Pierrick Hymbert a016026a3a
server: continuous performance monitoring and PR comment (#6283)
* server: bench: init

* server: bench: reduce list of GPU nodes

* server: bench: fix graph, fix output artifact

* ci: bench: add mermaid in case of image cannot be uploaded

* ci: bench: more resilient, more metrics

* ci: bench: trigger build

* ci: bench: fix duration

* ci: bench: fix typo

* ci: bench: fix mermaid values, markdown generated

* typo on the step name

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* ci: bench: trailing spaces

* ci: bench: move images in a details section

* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
2024-03-27 20:26:49 +01:00
..
bench.yml server: continuous performance monitoring and PR comment (#6283) 2024-03-27 20:26:49 +01:00
build.yml [SYCL] fix no file in win rel (#6314) 2024-03-27 09:47:06 +08:00
close-issue.yml ci : close inactive issue, increase operations per run (#6270) 2024-03-24 10:57:06 +02:00
code-coverage.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
docker.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
editorconfig.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
gguf-publish.yml gguf.py : fix CI for publishing GGUF package (#3532) 2023-10-07 22:14:10 +03:00
nix-ci-aarch64.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
nix-ci.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
nix-flake-update.yml ci: nix-flake-update: new token with pr permissions (#4879) 2024-01-11 17:22:34 +00:00
nix-publish-flake.yml workflows: nix-flakestry: drop tag filters 2023-12-31 13:14:58 -08:00
python-check-requirements.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
python-lint.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00
server.yml common: llama_load_model_from_url split support (#6192) 2024-03-23 18:07:00 +01:00
zig-build.yml ci: apply concurrency limit for github workflows (#6243) 2024-03-22 19:15:06 +02:00