Default Branch

master

9ba399dfa7 · server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967) · Updated 2024-12-24 20:33:04 +00:00

Branches

95dc4d7270 · Merge 'origin/master' into steering · Updated 2023-05-19 20:19:57 +00:00    root

3771
9

40ec4882c4 · ggml : use F16C conversion when available · Updated 2023-05-17 17:05:51 +00:00    root

3833
1

a3e6d62283 · cuda : alternative q4_q8 kernel · Updated 2023-05-12 14:02:39 +00:00    root

3867
8

e116eb638c · ggml : speed-up Q5_0 + Q5_1 at 4 threads · Updated 2023-05-11 15:51:56 +00:00    root

3869
20

4baa85633a · Fix build · Updated 2023-05-07 01:44:07 +00:00    root

3877
5

31ff9e2e83 · ci : add cublas to windows release · Updated 2023-05-03 21:21:20 +00:00    root

3892
1

102cd98074 · ggml : Q4_3c using 2x "Full range" approach · Updated 2023-04-23 11:56:44 +00:00    root

3973
8

71e6ae3779 · ggml : continue from #729 (wip) · Updated 2023-04-22 15:49:07 +00:00    root

3973
7

a0242a833c · Minor, plus rebase on master · Updated 2023-04-22 14:07:10 +00:00    root

3973
2

4b8d5e3890 · llama : quantize attention results · Updated 2023-04-22 08:35:13 +00:00    root

3978
1

1506737499 · Add mmap pages stats (disabled by default) · Updated 2023-04-16 16:22:30 +00:00    root

4028
1

36ddd12924 · llama : add flash attention (demo) · Updated 2023-04-05 19:12:04 +00:00    root

4094
1

c9c820ff36 · Added support for _POSIX_MAPPED_FILES if defined in source (#564) · Updated 2023-03-28 21:26:25 +00:00    root

4328
8

4aeee216fd · Regroup q4_1 dot addition for better numerics. · Updated 2023-03-24 20:20:57 +00:00    root

4209
2

66ea164e1d · Kahan summation on Q4_1 · Updated 2023-03-23 03:28:51 +00:00    root

4236
2

711224708d · Break up loop for numeric stability · Updated 2023-03-23 02:14:44 +00:00    root

4236
2

3a0dcb3920 · Implement server mode. · Updated 2023-03-22 17:34:19 +00:00    root

4237
5
dev

a169bb889c · Gate signal support on being on a unixoid system. (#74) · Updated 2023-03-13 03:08:01 +00:00    root

4336
0
Included
sl/no-fatal-error-for-arm-features

Deleted by Ghost 2024-12-24 10:24:35 +00:00

sl/fix-fs-wstring

Deleted by Ghost 2024-12-24 10:24:35 +00:00