Block a user
root
synced commits to refs/pull/9186/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
24195561fa
Merge
63b6e73500
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9209/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
b1895ccda9
Merge
951f1d9053
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9217/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
274da00a11
Merge 71cf0e1c0f3248fb34f32fc06a7e0c5b4bd658e2 into
0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9322/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
99a18ea03e
Merge 5f9c6fb2a47d5626f915ef5ff0633ace50087b9e into
0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/3025/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
be7187c5e3
Merge
a7f5c74795
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/8354/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
a5e45c230c
Merge
244811d856
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/8633/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
c701f9651e
Merge
7e492b3e0e
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/8998/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
34febe60a9
Merge a43f8e0089acc29d5f55eed6c99730e3eedb6c8b into
78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
root
synced commits to refs/pull/9078/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
a58ce7231f
Merge
60e6e2af36
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/9096/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
e1a2242249
Merge
7323304092
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9186/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
0c66c66249
Merge
63b6e73500
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
root
synced commits to refs/pull/9209/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
e4384c9d04
Merge
951f1d9053
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9251/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
170eee1b45
Merge
06e3e3bf51
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
root
synced commits to refs/pull/9322/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
36301028b7
Merge 5f9c6fb2a47d5626f915ef5ff0633ace50087b9e into
78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9328/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
17cd4bf742
Merge
424e3a52fe
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9355/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
7ed193d71a
Merge
444b757bce
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9400/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
4274dc04e1
Merge
2d79a7077c
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9401/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
81f273c7c0
Merge
161bf2205d
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9403/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
ed50c6779b
Merge
1e8646b3e8
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)