root
synced new reference gg/cmake-dedup-link to root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
root
synced new reference gg/cmake-defaults to root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
078be074a7
log : print if build is debug [no ci]
2948768e25
common : reimplement the logger
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to gg/metal-zero-allocs at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
root
synced new reference gg/metal-zero-allocs to root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/8210/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
e491a55a77
Merge 3277bb88e5 into bd35cb0ae3
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
root
synced commits to refs/pull/8837/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
2076b2d5c2
Merge 02c75452c1 into bd35cb0ae3
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
root
synced commits to refs/pull/9034/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
1d0d408e38
Merge ccb45186d0 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
root
synced commits to refs/pull/9078/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
3cc6539f3f
Merge 60e6e2af36 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9090/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
1d3a11d02a
Merge 9373e2ba58 into bd35cb0ae3
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9096/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
46fb0c8625
Merge 7323304092 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9131/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
bf860de72f
Merge 81a37ca577 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
root
synced commits to refs/pull/9186/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
24195561fa
Merge 63b6e73500 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9209/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
b1895ccda9
Merge 951f1d9053 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/9217/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
274da00a11
Merge 71cf0e1c0f3248fb34f32fc06a7e0c5b4bd658e2 into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
root
synced commits to refs/pull/9322/merge at root/llama.cpp from mirror
2024-09-13 08:46:18 +00:00
99a18ea03e
Merge 5f9c6fb2a47d5626f915ef5ff0633ace50087b9e into 0abc6a2c25
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
root
synced commits to refs/pull/3025/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
be7187c5e3
Merge a7f5c74795 into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without GGML_CDEF_PUBLIC (#9338)
root
synced commits to refs/pull/8354/merge at root/llama.cpp from mirror
2024-09-13 00:36:17 +00:00
a5e45c230c
Merge 244811d856 into e6b7801bd1
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
e665744317
llava : fix the script error in MobileVLM README (#9054)
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
2a825116b6
cmake : fix for builds without GGML_CDEF_PUBLIC (#9338)