llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-26 11:24:35 +00:00

Georgi Gerganov f445c0e68c llama : fix llm_build_k_shift to use correct n_rot (#4889)	2024-01-12 13:01:56 +02:00
* llama : fix llm_build_k_shift to use correct n_rot
  ggml-ci
* llama : always use hparams.n_rot for ggml_rope_custom
  ggml-ci
* convert : fix persimmon conversion to write correct n_rot
__init__.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2023-11-11 08:04:50 +03:00
constants.py	llama : differentiate the KV dims in the attention (#4657)	2024-01-02 13:51:28 +02:00
gguf_reader.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2023-11-11 08:04:50 +03:00
gguf_writer.py	llama : differentiate the KV dims in the attention (#4657)	2024-01-02 13:51:28 +02:00
gguf.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2023-11-11 08:04:50 +03:00
py.typed	convert : various script cleanups/fixes + merges and special token handling (#2842)	2023-08-30 11:25:50 +03:00
tensor_mapping.py	llama : fix llm_build_k_shift to use correct n_rot (#4889)	2024-01-12 13:01:56 +02:00
vocab.py	py : open merges file as 'utf-8' (#4566)	2023-12-21 19:07:34 +02:00