llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-09-23 05:26:19 +00:00

History

Radoslav Gerganov 5e31828d3e ggml : add RPC backend (#6829 ) * ggml : add RPC backend The RPC backend proxies all operations to a remote server which runs a regular backend (CPU, CUDA, Metal, etc). * set TCP_NODELAY * add CI workflows * Address review comments * fix warning * implement llama_max_devices() for RPC * Address review comments * Address review comments * wrap sockfd into a struct * implement get_alignment and get_max_size * add get_device_memory * fix warning * win32 support * add README * readme : trim trailing whitespace * Address review comments * win32 fix * Address review comments * fix compile warnings on macos	2024-05-14 14:27:19 +03:00
..
ISSUE_TEMPLATE	server: init functional tests (#5566 )	2024-02-24 12:28:55 +01:00
workflows	ggml : add RPC backend (#6829 )	2024-05-14 14:27:19 +03:00

Radoslav Gerganov 5e31828d3e

* ggml : add RPC backend

The RPC backend proxies all operations to a remote server which runs a
regular backend (CPU, CUDA, Metal, etc).

* set TCP_NODELAY

* add CI workflows

* Address review comments

* fix warning

* implement llama_max_devices() for RPC

* Address review comments

* Address review comments

* wrap sockfd into a struct

* implement get_alignment and get_max_size

* add get_device_memory

* fix warning

* win32 support

* add README

* readme : trim trailing whitespace

* Address review comments

* win32 fix

* Address review comments

* fix compile warnings on macos

2024-05-14 14:27:19 +03:00

ISSUE_TEMPLATE

server: init functional tests (#5566 )

2024-02-24 12:28:55 +01:00

workflows

ggml : add RPC backend (#6829 )

2024-05-14 14:27:19 +03:00