mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2025-01-13 04:00:16 +00:00
llama : refactor tensor offloading as callback
This commit is contained in:
parent
da936188d8
commit
15267192c0