llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-11-11 21:39:52 +00:00

History

Xuan Son Nguyen 97bdd26eee Refactor lora adapter support (#8332 ) * lora: load to devide buft * add patch tensor function * correct tensor patch * llama_lora_adapter_apply * correct ggml_backend_tensor_copy * add llm_build_mm * fix auto merge * update based on review comments * add convert script * no more transpose A * add f16 convert * add metadata check * add sanity check * fix ftype * add requirements * fix requirements * fix outfile * conversion: only allow selected models * fix types * cuda : do not use dmmv if the tensor does not have enough cols * llama : lora fixes * do not disable mmap with lora Co-authored-by: slaren <slarengh@gmail.com> * llm_build_lora_mm_id * convert_lora : MoE LoRA conversion support * convert_lora : prefer safetensors, similarly to convert_hf * convert_hf : simplify modify_tensors for InternLM2 * convert_lora : lazy conversion * llama : load and use alpha from LoRA adapters * llama : use llm_build_lora_mm in most model graphs * auto scale * Revert "auto scale" This reverts commit `42415a4874`. * remove redundant params * Apply suggestions from code review Co-authored-by: slaren <slarengh@gmail.com> * change kv metadata * move add_type to __init__ * convert_hf : move add_type to main() * convert_lora : use the GGUFWriter from Model instead of overwriting it --------- Co-authored-by: slaren <slarengh@gmail.com> Co-authored-by: Francis Couture-Harpin <git@compilade.net>		2024-07-15 20:50:47 +02:00
..
requirements-all.txt	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
requirements-compare-llama-bench.txt	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
requirements-convert_hf_to_gguf_update.txt	py : use cpu-only torch in requirements.txt (#8335 )	2024-07-07 14:23:38 +03:00
requirements-convert_hf_to_gguf.txt	py : use cpu-only torch in requirements.txt (#8335 )	2024-07-07 14:23:38 +03:00
requirements-convert_legacy_llama.txt	py : switch to snake_case (#8305 )	2024-07-05 07:53:33 +03:00
requirements-convert_llama_ggml_to_gguf.txt	py : switch to snake_case (#8305 )	2024-07-05 07:53:33 +03:00
requirements-convert_lora_to_gguf.txt	Refactor lora adapter support (#8332 )	2024-07-15 20:50:47 +02:00
requirements-pydantic.txt	pydantic : replace uses of __annotations__ with get_type_hints (#8474 )	2024-07-14 19:51:21 -04:00
requirements-test-tokenizer-random.txt	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00