Commit Graph

5 Commits

Author | SHA1 | Message | Date
slaren | cd6f5dec92 | improved memory management | 2023-07-21 00:44:35 +02:00
slaren | de69f8f20d | initial implementation of delayed graph allocation | 2023-07-20 15:57:48 +02:00
slaren | cb205c0d13 | automatically calculate compute buffer sizes (without graph allocator) | 2023-07-20 02:42:36 +02:00
slaren | 295f85654a | allocators wip | 2023-07-19 02:43:44 +02:00
    renamed ggml_backend functions
    changed ggml_buffer and ggml_backend to always be used as pointers
    rename ggml_tensor::params -> op_params
slaren | 0d2b66c638 | ggml backend interface wip | 2023-07-16 14:56:46 +02:00
    refactor ggml-cuda