mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2024-12-27 03:44:35 +00:00
llama : refactor tensor offloading as callback
This commit is contained in:
parent
da936188d8
commit
1e9c5443c2