mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-26 19:34:35 +00:00

It's like simple-chat, but it uses smart pointers to avoid manual
memory cleanup, so there are fewer memory leaks in the code. It
avoids printing multiple dots, splits the code into smaller
functions, and uses no exception handling.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>

2024-11-25 22:56:24 +01:00


llama.cpp/example/run

This example demonstrates minimal usage of llama.cpp for running models.

./llama-run Meta-Llama-3.1-8B-Instruct.gguf
...
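
The commit note for this example mentions smart pointers replacing manual memory cleanup, with no exception handling. The sketch below illustrates that general pattern only: it wraps a C-style handle in a `std::unique_ptr` with a custom deleter and reports failure through return values. The names `demo_handle`, `demo_handle_create`, `demo_handle_free`, and `live_handles_after_scope` are hypothetical stand-ins and are not part of llama.cpp or of the example's actual source.

```cpp
#include <cassert>
#include <memory>
#include <new>

// Hypothetical C-style resource; a stand-in for any handle that a C API
// hands out and expects the caller to free. Not a llama.cpp type.
struct demo_handle {
    int id;
};

static int g_live_handles = 0; // handles created but not yet freed

// Returns nullptr on allocation failure instead of throwing.
demo_handle * demo_handle_create(int id) {
    demo_handle * h = new (std::nothrow) demo_handle{id};
    if (h) {
        ++g_live_handles;
    }
    return h;
}

void demo_handle_free(demo_handle * h) {
    if (!h) {
        return;
    }
    delete h;
    --g_live_handles;
}

// Deleter that forwards to the C-style free function.
struct demo_handle_deleter {
    void operator()(demo_handle * h) const { demo_handle_free(h); }
};

using demo_handle_ptr = std::unique_ptr<demo_handle, demo_handle_deleter>;

// Creates a handle in an inner scope and returns how many handles are
// still alive once that scope has ended. Errors are signalled by a
// negative return value rather than by an exception.
int live_handles_after_scope() {
    {
        demo_handle_ptr h(demo_handle_create(1));
        if (!h) {
            return -1;
        }
        assert(g_live_handles == 1);
    } // h leaves scope here, so demo_handle_free runs automatically
    return g_live_handles;
}
```

Because the deleter is part of the pointer type, every exit path from the scope releases the handle, including early returns, which is what removes the need for explicit cleanup calls.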
