Merge 7323304092 into 30caac3a68

2024-12-25 19:04:35 +00:00 · 2024-12-24 16:45:36 +08:00 · 2024-12-24 16:45:36 +08:00 · 94d9f6d381
commit 94d9f6d381
parent 30caac3a68 7323304092
1 changed files with 1 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -198,6 +198,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo
 <details>
 <summary>Infrastructure</summary>

+- [llmaz](https://github.com/InftyAI/llmaz) - ☸️ Effortlessly serve state-of-the-art LLMs on Kubernetes, see [llama.cpp example](https://github.com/InftyAI/llmaz/tree/main/docs/examples/llamacpp) here.
 - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
 - [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs
 - [llama_cpp_canister](https://github.com/onicai/llama_cpp_canister) - llama.cpp as a smart contract on the Internet Computer, using WebAssembly