llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-26 11:24:35 +00:00

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	29ae62d2ae	llama : fix embeddings (#5796) * llama : fix embeddings ggml-ci * llama : do not use KV cache for non-causal models ggml-ci * embeddings : fix llama_batch_init arg * llama : add pooling switch * llama : distinguish token vs sequence embeddings ggml-ci * llama : assert pooling tensor * llama : simplify causal mask condition ggml-ci * llama : assert input batch with pooling enabled * readme : update API changes list	2024-03-04 22:31:20 +02:00

1 Commit