From a45433ba209ee0b33d02c7dc4c31f29894ad83a6 Mon Sep 17 00:00:00 2001 From: Benson Wong Date: Wed, 1 Jan 2025 23:14:54 -0800 Subject: [PATCH] readme : add llama-swap to infrastructure section (#11032) * list llama-swap under tools in README * readme: add llama-swap to Infrastructure --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index d6d1958c8..0126da89c 100644 --- a/README.md +++ b/README.md @@ -201,6 +201,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp - [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs - [llama_cpp_canister](https://github.com/onicai/llama_cpp_canister) - llama.cpp as a smart contract on the Internet Computer, using WebAssembly +- [llama-swap](https://github.com/mostlygeek/llama-swap) - transparent proxy that adds automatic model switching with llama-server