llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-13 04:00:16 +00:00

History

compilade ed9f252118 gguf-py : decouple adding metadata from writing in GGUFWriter (#7827 ) Main changes of this PR is to consolidate GGUFWriter.add_key and GGUFWriter.add_val into GGUFWriter.add_key_value. In addition use_temp_file is now opt-in instead of opt-out defaulting to False. Also GGUFWriter now does not require output file name until when actually writing to it. And GGUFWriter doesn't really need to eagerly prepare the data layout of the metadata		2024-06-09 12:34:29 +10:00
..
__init__.py	convert-hf : support direct Q8_0 conversion (#7234 )	2024-05-13 14:10:51 -04:00
constants.py	llama : add jina v2 base code (#7596 )	2024-06-06 10:22:41 +03:00
gguf_reader.py	gguf-py : fix and simplify quantized shape round-trip (#7483 )	2024-05-25 11:11:48 +10:00
gguf_writer.py	gguf-py : decouple adding metadata from writing in GGUFWriter (#7827 )	2024-06-09 12:34:29 +10:00
gguf.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981 )	2023-11-11 08:04:50 +03:00
lazy.py	convert-hf : support direct Q8_0 conversion (#7234 )	2024-05-13 14:10:51 -04:00
py.typed	convert : various script cleanups/fixes + merges and special token handling (#2842 )	2023-08-30 11:25:50 +03:00
quants.py	gguf-py : fix and simplify quantized shape round-trip (#7483 )	2024-05-25 11:11:48 +10:00
tensor_mapping.py	llama : add jina v2 base code (#7596 )	2024-06-06 10:22:41 +03:00
vocab.py	Move convert.py to examples/convert-legacy-llama.py (#7430 )	2024-05-30 21:40:00 +10:00