llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-09-22 21:16:20 +00:00

History

fairydreaming 807b0c49ff Inference support for T5 and FLAN-T5 model families (#5763 ) * llama : add inference support and model types for T5 and FLAN-T5 model families * llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token() * common, llama-cli, llama-batched : add support for encoder-decoder models * convert-hf : handle shared token embeddings tensors in T5Model * convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models) * convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model * convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5 --------- Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>		2024-07-04 15:46:11 +02:00
..
CMakeLists.txt	tests : add _CRT_SECURE_NO_WARNINGS for WIN32 (#8231 )	2024-07-04 13:53:42 +03:00
llama.cpp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
unicode-data.cpp	Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258 )	2024-07-02 12:18:10 -04:00
unicode-data.h	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
unicode.cpp	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
unicode.h	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00