llama.cpp/prometheus.yml at 400d5d722d7edf7de0cf24a18c42b183c65047d2 - llama.cpp - Gitea: Git with a cup of tea

root/llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-25 10:54:36 +00:00

Pierrick Hymbert a016026a3a

server: continuous performance monitoring and PR comment (#6283 )

* server: bench: init

* server: bench: reduce list of GPU nodes

* server: bench: fix graph, fix output artifact

* ci: bench: add mermaid in case of image cannot be uploaded

* ci: bench: more resilient, more metrics

* ci: bench: trigger build

* ci: bench: fix duration

* ci: bench: fix typo

* ci: bench: fix mermaid values, markdown generated

* typo on the step name

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* ci: bench: trailing spaces

* ci: bench: move images in a details section

* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

2024-03-27 20:26:49 +01:00

10 lines

183 B

YAML

Raw Blame History

 global:
   scrape_interval:     10s
   external_labels:
     llamacpp: 'server'
 scrape_configs:
   - job_name: 'llama.cpp server'
     static_configs:
       - targets: ['localhost:8080']