Default Branch

master
Some checks are pending
Python check requirements.txt / check-requirements (push) Waiting to run
flake8 Lint / Lint (push) Waiting to run
Python Type-Check / pyright type-check (push) Waiting to run

c05e8c9934 · gguf-py: fixed local detection of gguf package (#11180) · Updated 2025-01-11 09:42:31 +00:00

Branches

75b3a09602 · test-backend-ops : add TQ1_0 and TQ2_0 comments for later · Updated 2024-09-04 19:00:21 +00:00    root

795
33

f648ca2cee · llama : add llama_sampling API + move grammar in libllama · Updated 2024-09-03 07:31:54 +00:00    root

802
1

40fa68cb46 · readme : add API change notice · Updated 2024-09-02 15:32:24 +00:00    root

811
3

375de5b1f8 · llama : use unused n_embd_k_gqa in k_shift · Updated 2024-09-02 01:59:24 +00:00    root

811
41

a95225cdfd · metal : another fix for the fa kernel · Updated 2024-08-26 12:08:38 +00:00    root

835
1

aa931d0375 · metal : fix fa kernel · Updated 2024-08-26 10:09:50 +00:00    root

835
1

6494509801 · backup · Updated 2024-08-26 08:58:54 +00:00    root

845
2

ccb45186d0 · docs : remove references · Updated 2024-08-26 06:52:17 +00:00    root

839
2

8062650343 · llama : fix simple splits when the batch contains embeddings · Updated 2024-08-21 19:09:03 +00:00    root

850
19

9127800d83 · wip · Updated 2024-08-16 23:51:06 +00:00    root

883
2

62d7b6c87f · cuda : re-add q4_0 · Updated 2024-08-14 10:37:03 +00:00    root

879
3

93ec58b932 · server : fix typo in comment · Updated 2024-08-14 02:12:26 +00:00    root

881
4

faaac59d16 · llama : support NUL bytes in tokens · Updated 2024-08-12 01:00:03 +00:00    root

892
1

73bc9350cd · gguf-py : Numpy dequantization for grid-based i-quants · Updated 2024-08-10 03:47:31 +00:00    root

912
2

9329953a61 · llama : avoid double tensor copy when saving session to buffer · Updated 2024-08-07 20:03:34 +00:00    root

920
2

7764ab911d · update guide · Updated 2024-08-07 14:01:02 +00:00    root

921
1

cad8abb49b · add tool to allow plotting tensor allocation maps within buffers · Updated 2024-08-06 20:09:51 +00:00    root

929
1

6e299132e7 · clip : style changes · Updated 2024-08-06 08:44:29 +00:00    root

1253
56

16dab13bde · correct cmd name · Updated 2024-08-05 16:15:33 +00:00    root

938
1

bddcc5f985 · llama : better replace_all · Updated 2024-08-04 10:42:08 +00:00    root

954
1