llama.cpp/src
pculliton e57dc62057
llama: Add support for Gemma2ForCausalLM (#8156)
* Inference support for Gemma 2 model family

* Update convert-hf-to-gguf.py, constants, and tensor mappings

* cleanup

* format fix

* Fix special token vocab bug

* Don't add space prefix

* fix deleted lines

* Update src/llama.cpp

Co-authored-by: slaren <slarengh@gmail.com>

* Add model type names

* Add control vector

* Fix model type identification

---------

Co-authored-by: Andrei Betlen <abetlen@gmail.com>
Co-authored-by: slaren <slarengh@gmail.com>
2024-06-27 21:00:43 -07:00
..
CMakeLists.txt llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
llama.cpp llama: Add support for Gemma2ForCausalLM (#8156) 2024-06-27 21:00:43 -07:00
unicode-data.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
unicode-data.h llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
unicode.cpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
unicode.h llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00