Default Branch

master
Some checks are pending
flake8 Lint / Lint (push) Waiting to run
Python Type-Check / pyright type-check (push) Waiting to run

9ba399dfa7 · server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967) · Updated 2024-12-24 20:33:04 +00:00

Branches

graph-profiler
Some checks are pending
Python check requirements.txt / check-requirements (push) Waiting to run
Python Type-Check / pyright type-check (push) Waiting to run

d4051c81ee · profiler: initial support for profiling graph ops · Updated 2024-12-24 23:37:59 +00:00

0
1

1e7e3384e1 · minor · Updated 2024-12-24 07:42:53 +00:00

9
1

fe9235d795 · Force max subgroup size for coopmat shaders · Updated 2024-12-18 07:26:27 +00:00

21
1

4fbb801a9d · ggml : update ggml_backend_cpu_device_supports_op · Updated 2024-12-17 16:09:02 +00:00

31
3

3e92f4ecbe · cont [no ci] · Updated 2024-12-15 10:36:03 +00:00

43
2

7e9208e408 · scripts : change build path to "build-bench" for compare-commits.sh · Updated 2024-12-15 09:47:30 +00:00

43
1

fb18934a97 · gguf-py : bump version to 0.11.0 · Updated 2024-12-11 21:13:31 +00:00

64
0
Included

4f3a7e279b · Force max subgroup size for coopmat shaders · Updated 2024-12-10 20:27:04 +00:00

72
2

1bf38cffdf · server/bench: · Updated 2024-12-10 16:18:16 +00:00

77
1

b8d1b1a5e1 · server : fix infill prompt format · Updated 2024-12-08 20:12:11 +00:00

82
1

a6648b9df7 · server : chunked prefill support · Updated 2024-12-08 07:48:18 +00:00

86
1

a8046c888a · use calloc instead of malloc · Updated 2024-12-04 16:24:35 +00:00

117
3
gg/server-fix-spec-ctx-shift
Some checks failed
Python Type-Check / pyright type-check (push) Has been cancelled

81611bef72 · server : add tests · Updated 2024-12-04 11:11:26 +00:00

117
3

33d7b70c88 · server : do not speculate during prompt processing · Updated 2024-12-03 08:58:43 +00:00

130
1

335f48ae16 · Make sure Vulkan instance is destroyed properly on program exit · Updated 2024-11-29 07:42:00 +00:00

155
1

3c8a2a83fe · shmem experiments · Updated 2024-11-26 13:17:38 +00:00

193
3
gg/metal-mul-mv-new-save2
Some checks failed
Python check requirements.txt / check-requirements (push) Has been cancelled
flake8 Lint / Lint (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled

dafedd33d2 · 4x4 -> 4x · Updated 2024-11-26 12:54:02 +00:00

193
2
gg/metal-mul-mv-new
Some checks failed
Python check requirements.txt / check-requirements (push) Has been cancelled
flake8 Lint / Lint (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled

bf3494345e · metal : some mul_mv experiments · Updated 2024-11-26 12:48:50 +00:00

193
1

b83cae088c · speculative : add infill mode · Updated 2024-11-26 09:14:17 +00:00

198
1
compilade/mamba2
Some checks failed
Python check requirements.txt / check-requirements (push) Has been cancelled
flake8 Lint / Lint (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled

1ee6c482d0 · Merge branch 'master' into compilade/mamba2 · Updated 2024-11-25 17:06:56 +00:00

207
24