Mirror of https://github.com/ggerganov/llama.cpp.git, synced 2025-01-08 09:41:45 +00:00
ggml: Add POOL2D OP for GPU acceleration to the Vulkan backend in the MobileVLM model. (#9763)
llama : switch KQ multiplication to F32 precision by default (#10015)