llama.cpp/requirements
Xuan Son Nguyen 97bdd26eee
Refactor lora adapter support (#8332)
* lora: load to device buft

* add patch tensor function

* correct tensor patch

* llama_lora_adapter_apply

* correct ggml_backend_tensor_copy

* add llm_build_mm

* fix auto merge

* update based on review comments

* add convert script

* no more transpose A

* add f16 convert

* add metadata check

* add sanity check

* fix ftype

* add requirements

* fix requirements

* fix outfile

* conversion: only allow selected models

* fix types

* cuda : do not use dmmv if the tensor does not have enough cols

* llama : lora fixes

* do not disable mmap with lora

Co-authored-by: slaren <slarengh@gmail.com>

* llm_build_lora_mm_id

* convert_lora : MoE LoRA conversion support

* convert_lora : prefer safetensors, similarly to convert_hf

* convert_hf : simplify modify_tensors for InternLM2

* convert_lora : lazy conversion

* llama : load and use alpha from LoRA adapters

* llama : use llm_build_lora_mm in most model graphs

* auto scale

* Revert "auto scale"

This reverts commit 42415a4874.

* remove redundant params

* Apply suggestions from code review

Co-authored-by: slaren <slarengh@gmail.com>

* change kv metadata

* move add_type to __init__

* convert_hf : move add_type to main()

* convert_lora : use the GGUFWriter from Model instead of overwriting it

---------

Co-authored-by: slaren <slarengh@gmail.com>
Co-authored-by: Francis Couture-Harpin <git@compilade.net>
2024-07-15 20:50:47 +02:00
requirements-all.txt                         py : type-check all Python scripts with Pyright (#8341)                 2024-07-07 15:04:39 -04:00
requirements-compare-llama-bench.txt         py : type-check all Python scripts with Pyright (#8341)                 2024-07-07 15:04:39 -04:00
requirements-convert_hf_to_gguf_update.txt   py : use cpu-only torch in requirements.txt (#8335)                     2024-07-07 14:23:38 +03:00
requirements-convert_hf_to_gguf.txt          py : use cpu-only torch in requirements.txt (#8335)                     2024-07-07 14:23:38 +03:00
requirements-convert_legacy_llama.txt        py : switch to snake_case (#8305)                                       2024-07-05 07:53:33 +03:00
requirements-convert_llama_ggml_to_gguf.txt  py : switch to snake_case (#8305)                                       2024-07-05 07:53:33 +03:00
requirements-convert_lora_to_gguf.txt        Refactor lora adapter support (#8332)                                   2024-07-15 20:50:47 +02:00
requirements-pydantic.txt                    pydantic : replace uses of __annotations__ with get_type_hints (#8474)  2024-07-14 19:51:21 -04:00
requirements-test-tokenizer-random.txt       py : type-check all Python scripts with Pyright (#8341)                 2024-07-07 15:04:39 -04:00