Block a user
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/8210/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
e491a55a77
Merge
3277bb88e5
into bd35cb0ae3
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
root
synced commits to refs/pull/8837/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
2076b2d5c2
Merge
02c75452c1
into bd35cb0ae3
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
root
synced commits to refs/pull/9034/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
1d0d408e38
Merge
ccb45186d0
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
root
synced commits to refs/pull/9078/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
3cc6539f3f
Merge
60e6e2af36
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9090/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
1d3a11d02a
Merge
9373e2ba58
into bd35cb0ae3
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9096/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
46fb0c8625
Merge
7323304092
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9131/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
bf860de72f
Merge
81a37ca577
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
root
synced commits to refs/pull/9186/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
24195561fa
Merge
63b6e73500
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9209/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
b1895ccda9
Merge
951f1d9053
into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9217/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
274da00a11
Merge 71cf0e1c0f3248fb34f32fc06a7e0c5b4bd658e2 into
0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9322/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
99a18ea03e
Merge 5f9c6fb2a47d5626f915ef5ff0633ace50087b9e into
0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/3025/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
be7187c5e3
Merge
a7f5c74795
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/8354/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
a5e45c230c
Merge
244811d856
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/8633/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
c701f9651e
Merge
7e492b3e0e
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/8998/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
34febe60a9
Merge a43f8e0089acc29d5f55eed6c99730e3eedb6c8b into
78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
root
synced commits to refs/pull/9078/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
a58ce7231f
Merge
60e6e2af36
into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without
GGML_CDEF_PUBLIC
(#9338)
root
synced commits to refs/pull/9096/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
e1a2242249
Merge
7323304092
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9186/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
0c66c66249
Merge
63b6e73500
into 78203641fe
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)