mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-14 04:24:30 +00:00

History

ochafik 0abfa36ca7 `tool-call`: move usage examples to examples/agent		2024-09-27 05:10:30 +01:00
..
fastify.py	`tool-call`: move usage examples to examples/agent	2024-09-27 05:10:30 +01:00
README.md	`tool-call`: move usage examples to examples/agent	2024-09-27 05:10:30 +01:00
run.py	`tool-call`: move usage examples to examples/agent	2024-09-27 05:10:30 +01:00
tools.py	`tool-call`: move usage examples to examples/agent	2024-09-27 05:10:30 +01:00

README.md

Agents / Tool Calling w/ llama.cpp

Install prerequisite: uv (used to simplify python deps)

Run llama-server w/ jinja templates:

make -j LLAMA_CURL=1 llama-server
./llama-server \
  -mu https://huggingface.co/lmstudio-community/Meta-Llama-3.1-70B-Instruct-GGUF/resolve/main/Meta-Llama-3.1-70B-Instruct-Q4_K_M.gguf \
  --jinja \
  -c 8192 -fa

Run some tools inside a docker container (check http://localhost:8088/docs once running):
```
docker run -p 8088:8088 -w /src \
  -v $PWD/examples/agent:/src \
  --rm -it ghcr.io/astral-sh/uv:python3.12-alpine \
  uv run fastify.py --port 8088 tools.py
```
Warning

The command above gives tools (and your agent) access to the web (and read-only access to examples/agent/**. If you're concerned about unleashing a rogue agent on the web, please explore setting up proxies for your docker (and contribute back!)

Run the agent with a given goal:

uv run examples/agent/run.py \
  --tool-endpoint http://localhost:8088 \
  --goal "What is the sum of 2535 squared and 32222000403?"