llama.cpp/examples/server/tests/features/steps
2024-03-08 12:25:04 +01:00
..
steps.py server: metrics: add llamacpp:prompt_seconds_total and llamacpp:tokens_predicted_seconds_total, reset bucket only on /metrics. Fix values cast to int. Add Process-Start-Time-Unix header. (#5937) 2024-03-08 12:25:04 +01:00