Default Branch

924518e2e5 · Reset color before we exit (#11205) · Updated 2025-01-12 18:23:10 +00:00

Branches

865066621b · llama.swiftui : improve bench · Updated 2023-12-17 17:37:22 +00:00 · root · 2825 behind, 12 ahead

f86b9d152c · lookup : minor · Updated 2023-12-17 15:25:28 +00:00 · root · 2823 behind, 9 ahead

d2f1e0dacc · Merge branch 'cuda-cublas-opts' into gg/phi-2 · Updated 2023-12-17 06:41:46 +00:00 · root · 2821 behind, 17 ahead

b0547d2196 · gguf-py : fail fast on nonsensical special token IDs · Updated 2023-12-15 23:06:42 +00:00 · root · 2823 behind, 1 ahead

c8554b80be · Merge branch 'master' of https://github.com/ggerganov/llama.cpp into ceb/fix-cuda-warning-flags · Updated 2023-12-13 17:06:01 +00:00 · root · 2835 behind, 12 ahead

e1241d9b46 · metal : switch to execution barriers + fix one of the barriers · Updated 2023-12-13 11:56:45 +00:00 · root · 2846 behind, 47 ahead

fc5f334689 · readme : add API change notice · Updated 2023-12-07 10:35:02 +00:00 · root · 2848 behind, 15 ahead

af99c6fbfc · llama : remove memory_f16 and kv_f16 flags · Updated 2023-12-05 16:18:16 +00:00 · root · 2860 behind, 26 ahead

3cb1c348b3 · metal : try to improve batched decoding · Updated 2023-12-01 20:01:58 +00:00 · root · 2865 behind, 2 ahead

eb594c0f7d · alloc : fix build with debug · Updated 2023-12-01 08:46:05 +00:00 · root · 2889 behind, 14 ahead

5b74310e6e · build : enable libstdc++ assertions for debug builds · Updated 2023-11-30 23:18:24 +00:00 · root · 2874 behind, 1 ahead

bb39b87964 · ggml : restore abort() in GGML_ASSERT · Updated 2023-11-28 00:27:09 +00:00 · root · 2893 behind, 1 ahead

87f4102a70 · llama : revert n_threads_batch logic · Updated 2023-11-27 19:47:35 +00:00 · root · 2894 behind, 3 ahead

6272b6764a · use stride=128 if built for tensor cores · Updated 2023-11-27 18:09:14 +00:00 · root · 2897 behind, 3 ahead

8d8b76d469 · lookahead : add comments · Updated 2023-11-26 09:26:55 +00:00 · root · 2909 behind, 9 ahead

21b70babf7 · straightforward /v1/models endpoint · Updated 2023-11-24 16:22:39 +00:00 · root · 2910 behind, 12 ahead

f8e9f11428 · common : add -dkvc arg for enabling kv cache dumps · Updated 2023-11-23 16:47:56 +00:00 · root · 2916 behind, 4 ahead

f824902623 · YaRN : correction to GPT-NeoX implementation · Updated 2023-11-15 22:10:52 +00:00 · root · 2948 behind, 1 ahead

d0445a2eff · better documentation · Updated 2023-11-10 00:38:20 +00:00 · root · 2965 behind, 3 ahead

47d604fa2d · fix issues · Updated 2023-11-05 12:20:22 +00:00 · root · 2979 behind, 3 ahead