This website requires JavaScript.
Explore
Help
Sign In
root
/
llama.cpp
Watch
1
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggerganov/llama.cpp.git
synced
2024-09-22 21:16:20 +00:00
Code
Issues
Actions
16
Packages
Projects
Releases
Wiki
Activity
All Workflows
build.yml
close-issue.yml
docker.yml
editorconfig.yml
gguf-publish.yml
labeler.yml
nix-ci-aarch64.yml
nix-ci.yml
nix-flake-update.yml
nix-publish-flake.yml
python-check-requirements.yml
python-lint.yml
python-type-check.yml
server.yml
Actor
All actors
root
Status
All status
success
failure
waiting
running
CUDA: enable Gemma FA for HIP/Pascal (#9581)
#213
:
Commit
a5b57b08ce
pushed by
root
master
2024-09-22 21:16:20 +00:00
0s
llama: remove redundant loop when constructing ubatch (#9574)
#204
:
Commit
ecd5d6b65b
pushed by
root
master
2024-09-22 13:06:19 +00:00
0s
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)
#199
:
Commit
d09770cae7
pushed by
root
b3799
2024-09-22 21:36:17 +00:00
0s
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)
#198
:
Commit
d09770cae7
pushed by
root
master
2024-09-22 04:56:20 +00:00
0s
Update CUDA graph on scale change plus clear nodes/params (#9550)
#194
:
Commit
41f477879f
pushed by
root
b3798
2024-09-22 09:36:17 +00:00
0s
CI: Provide prebuilt windows binary for hip (#9467)
#193
:
Commit
e948a7da7a
pushed by
root
b3797
2024-09-22 09:36:17 +00:00
0s
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG
#192
:
Commit
d9ce02ae82
pushed by
root
sl/fix-debug-alloc
2024-09-22 09:36:17 +00:00
0s
Update CUDA graph on scale change plus clear nodes/params (#9550)
#191
:
Commit
41f477879f
pushed by
root
master
2024-09-21 12:36:19 +00:00
0s
ggml : fix builds (#0)
#187
:
Commit
d13edb17ed
pushed by
root
b3795
2024-09-21 21:36:17 +00:00
0s
CUDA: fix sum.cu compilation for CUDA < 11.7 (#9562)
#186
:
Commit
5cb12f6839
pushed by
root
b3790
2024-09-21 21:36:17 +00:00
0s
quantize : improve type name parsing (#9570)
#185
:
Commit
63351143b2
pushed by
root
master
2024-09-21 04:26:20 +00:00
0s
examples : flush log upon ctrl+c (#9559)
#181
:
Commit
d39e26741f
pushed by
root
b3789
2024-09-21 15:36:17 +00:00
0s
perplexity : do not escape input data by default (#9548)
#180
:
Commit
722ec1eb51
pushed by
root
b3788
2024-09-21 15:36:17 +00:00
0s
examples : flush log upon ctrl+c (#9559)
#179
:
Commit
d39e26741f
pushed by
root
master
2024-09-20 20:16:21 +00:00
0s
llama : make llm_tokenizer more private
#176
:
Commit
6e873e561a
pushed by
root
gg/tokenizer-cleanup
2024-09-21 15:36:17 +00:00
0s
server : add rerank endpoint
#174
:
Commit
5f95dccea8
pushed by
root
gg/rerank
2024-09-20 21:36:17 +00:00
0s
server : clean-up completed tasks from waiting list (#9531)
#172
:
Commit
6026da52d6
pushed by
root
b3787
2024-09-20 15:36:17 +00:00
0s
imatrix : disable prompt escape by default (#9543)
#171
:
Commit
eca0fab44e
pushed by
root
b3786
2024-09-20 15:36:17 +00:00
0s
server : clean-up completed tasks from waiting list (#9531)
#170
:
Commit
6026da52d6
pushed by
root
master
2024-09-20 12:06:20 +00:00
0s
llama-bench : add time-to-first-byte stat
#167
:
Commit
ff231de553
pushed by
root
gg/ttfb
2024-09-20 15:36:17 +00:00
0s
llama : add "rank" pooling type
#166
:
Commit
f03bcd84e7
pushed by
root
gg/rerank
2024-09-19 19:46:21 +00:00
0s
ggml : fix n_threads_cur initialization with one thread (#9538)
#164
:
Commit
64c6af3195
pushed by
root
b3785
2024-09-19 21:36:17 +00:00
0s
llama : use reserve/emplace_back in sampler_sample (#9534)
#163
:
Commit
6443ddd985
pushed by
root
b3783
2024-09-19 21:36:17 +00:00
0s
Update ggml/src/ggml.c
#162
:
Commit
6b0248c29a
pushed by
root
sl/fix-omp-one-thread
2024-09-19 21:36:17 +00:00
0s
ggml : fix n_threads_cur initialization with one thread (#9538)
#160
:
Commit
64c6af3195
pushed by
root
master
2024-09-19 11:36:20 +00:00
0s
server : match OAI structured output response (#9527)
#156
:
Commit
8a308354f6
pushed by
root
b3782
2024-09-19 15:36:17 +00:00
0s
server : fix OpenSSL build (remove obsolete `LOG_INFO`) (#9529)
#155
:
Commit
f799155ab8
pushed by
root
b3781
2024-09-19 15:36:17 +00:00
0s
minor change
#153
:
Commit
c90a43a237
pushed by
root
pr_add_intel_amx_support
2024-09-19 15:36:17 +00:00
0s
server : match OAI structured output response (#9527)
#151
:
Commit
8a308354f6
pushed by
root
master
2024-09-18 19:16:20 +00:00
0s
server : clean-up completed tasks from waiting list
#148
:
Commit
e01cdda168
pushed by
root
gg/server-remove-waiting
2024-09-19 15:36:17 +00:00
0s
First
Previous
1
2
3
4
Next
Last