llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-24 02:14:35 +00:00

History

Pierrick Hymbert 9e359a4f47 server: continue to update other slots on embedding concurrent request (#5699 ) * server: #5655 - continue to update other slots on embedding concurrent request. * server: tests: add multi users embeddings as fixed * server: tests: adding OAI compatible embedding concurrent endpoint * server: tests: adding OAI compatible embedding with multiple inputs	2024-02-24 19:16:04 +01:00
..
steps.py	server: continue to update other slots on embedding concurrent request (#5699 )	2024-02-24 19:16:04 +01:00

Pierrick Hymbert 9e359a4f47

server: continue to update other slots on embedding concurrent request (#5699 )

* server: #5655 - continue to update other slots on embedding concurrent request.

* server: tests: add multi users embeddings as fixed

* server: tests: adding OAI compatible embedding concurrent endpoint

* server: tests: adding OAI compatible embedding with multiple inputs

2024-02-24 19:16:04 +01:00

steps.py

server: continue to update other slots on embedding concurrent request (#5699 )

2024-02-24 19:16:04 +01:00