llama.cpp/examples/gguf-split
Pierrick Hymbert d0d5de42e5
gguf-split: split and merge gguf per batch of tensors (#6135)
* gguf-split: split and merge gguf files per tensor

* gguf-split: build with make toolchain

* gguf-split: rename `--split-tensors-size` to `--split-max-tensors`. Set general.split_count KV to all split

* split : minor style + fix compile warnings

* gguf-split: remove --upload not implemented

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2024-03-19 12:05:44 +01:00
CMakeLists.txt gguf-split: split and merge gguf per batch of tensors (#6135) 2024-03-19 12:05:44 +01:00
gguf-split.cpp gguf-split: split and merge gguf per batch of tensors (#6135) 2024-03-19 12:05:44 +01:00
README.md gguf-split: split and merge gguf per batch of tensors (#6135) 2024-03-19 12:05:44 +01:00

GGUF split Example

CLI to split / merge GGUF files.

Command line options:

  • --split: split a GGUF file into multiple GGUF files (the default operation).
  • --split-max-tensors: maximum number of tensors in each split (default: 128).
  • --merge: merge multiple GGUF files into a single GGUF file.
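
To make the effect of --split-max-tensors concrete, here is a minimal sketch of the arithmetic it implies. The helpers `split_count` and `split_sizes` are hypothetical names used for illustration and are not part of gguf-split. The sketch assumes that tensors are assigned to splits in order, that each split is filled up to the limit, and that the last split takes the remainder.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of gguf-split): number of output files
// needed when each file may hold at most max_tensors tensors.
std::size_t split_count(std::size_t n_tensors, std::size_t max_tensors) {
    assert(max_tensors > 0);
    return (n_tensors + max_tensors - 1) / max_tensors; // ceiling division
}

// Hypothetical helper: number of tensors placed in each split, in order.
// Assumption: every split is filled to the limit except possibly the last.
std::vector<std::size_t> split_sizes(std::size_t n_tensors, std::size_t max_tensors) {
    assert(max_tensors > 0);
    std::vector<std::size_t> sizes;
    for (std::size_t remaining = n_tensors; remaining > 0; ) {
        const std::size_t take = remaining < max_tensors ? remaining : max_tensors;
        sizes.push_back(take);
        remaining -= take;
    }
    return sizes;
}
```

Under these assumptions, a model with 291 tensors and the default limit of 128 would yield three splits holding 128, 128, and 35 tensors.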