diff --git a/README.md b/README.md
index 4d24dd591..19bd54a53 100644
--- a/README.md
+++ b/README.md
@@ -178,6 +178,7 @@ Unless otherwise noted these projects are open-source with permissive licensing:
 **Infrastructure:**
+- [llmaz](https://github.com/InftyAI/llmaz) - ☸️ Effortlessly serve state-of-the-art LLMs on Kubernetes, see [llama.cpp example](https://github.com/InftyAI/llmaz/tree/main/docs/examples/llamacpp) here.
 - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
 - [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs