llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-09-22 21:16:20 +00:00

History

fairydreaming 807b0c49ff Inference support for T5 and FLAN-T5 model families (#5763 ) * llama : add inference support and model types for T5 and FLAN-T5 model families * llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token() * common, llama-cli, llama-batched : add support for encoder-decoder models * convert-hf : handle shared token embeddings tensors in T5Model * convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models) * convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model * convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5 --------- Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>		2024-07-04 15:46:11 +02:00
..
.editorconfig	gguf : new file format with flexible meta data (beta) (#2398 )	2023-08-21 23:07:43 +03:00
ggml-vocab-aquila.gguf	Work on the BPE tokenizer (#3252 )	2023-10-03 09:16:26 +02:00
ggml-vocab-baichuan.gguf	Add more tokenizer tests (#3742 )	2023-10-24 09:17:17 +02:00
ggml-vocab-bert-bge.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-bert-bge.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-bert-bge.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-command-r.gguf	command-r : add BPE pre-tokenization (#7063 )	2024-05-05 08:19:30 +03:00
ggml-vocab-command-r.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-command-r.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-coder.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-deepseek-coder.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-coder.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-llm.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-deepseek-llm.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-llm.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-falcon.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-falcon.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-falcon.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-gpt2.gguf	gpt2 : Add gpt2 architecture integration (#4555 )	2023-12-28 15:03:57 +01:00
ggml-vocab-gpt-2.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-gpt-2.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-gpt-2.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-gpt-neox.gguf	Add more tokenizer tests (#3742 )	2023-10-24 09:17:17 +02:00
ggml-vocab-llama-bpe.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-llama-bpe.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-llama-bpe.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-llama-spm.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-llama-spm.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-llama-spm.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-mpt.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-mpt.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-mpt.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-phi-3.gguf	Per token attributes (#7685 )	2024-06-04 09:17:17 +02:00
ggml-vocab-phi-3.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-phi-3.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-qwen2.gguf	llama : add BPE pre-tokenization for Qwen2 (#7114 )	2024-05-08 15:06:43 +03:00
ggml-vocab-qwen2.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-qwen2.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-refact.gguf	tests : add test-tokenizer-0.sh + fix some tokenizers (#7036 )	2024-05-04 08:32:32 +03:00
ggml-vocab-refact.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-refact.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-stablelm.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-starcoder.gguf	llama : fix BPE pre-tokenization (#6920 )	2024-04-29 16:58:41 +03:00
ggml-vocab-starcoder.gguf.inp	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00
ggml-vocab-starcoder.gguf.out	Inference support for T5 and FLAN-T5 model families (#5763 )	2024-07-04 15:46:11 +02:00