root - Gitea: Git with a cup of tea

root synced commits to refs/pull/9058/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

86311ef006 Merge fc6abde7aa into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

Compare 2 commits »

root synced commits to refs/pull/9096/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

81a72fa6be Merge 7323304092 into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

Compare 2 commits »

root synced commits to refs/pull/9209/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

946cef3314 Merge 951f1d9053 into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 4 commits »

root synced commits to refs/pull/9325/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

3e1ef1e6c6 Merge b979fc97ba into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9396/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

68a5abdc87 Merge e83d2707d3 into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

d09770cae7 ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)

Compare 7 commits »

root synced commits to refs/pull/9407/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

34c686c3f7 Merge adf3bce13b into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 4 commits »

root synced commits to refs/pull/9449/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

b42f7a1450 Merge 25d4599e19 into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9510/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

c4aba398ca Merge 5f95dccea8 into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 4 commits »

root synced commits to refs/pull/9525/merge at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

fdd143a581 Merge 95ce058c2b into a5b57b08ce

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

d09770cae7 ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)

Compare 14 commits »

root synced commits to refs/pull/9526/head at root/llama.cpp from mirror 2024-09-22 13:06:19 +00:00

0fb0b4eab3 mtgpu: map cublasOperation_t to mublasOperation_t (sync code to latest)

a3ad2c9971 mtgpu: enable unified memory

43ff5f36c2 mtgpu: disable flash attention on qy1 (MTT S80); disable q3_k and mul_mat_batched_cublas

e40b33dcad mtgpu: add mp_21 support

a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581)

Compare 26 commits »

root synced commits to refs/pull/9526/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

196729704a Merge df79623dc8 into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

d09770cae7 ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)

Compare 4 commits »

root synced commits to refs/pull/9532/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

a5d31ba932 Merge a829583c97 into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9541/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

e0226c5439 Merge c42ec2f8bb into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9544/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

5f7b32e86a Merge 4af076b494 into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9571/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

177f766404 Merge eec216c57a into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9577/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

eb6ea94d2d Merge 3ae8374b59 into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9579/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

8d72461e8d Merge 33b692934f into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/pull/9581/merge at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

855a62a9f1 Merge 0ad9572f8b into ecd5d6b65b

ecd5d6b65b llama: remove redundant loop when constructing ubatch (#9574)

2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)

Compare 3 commits »

root synced commits to refs/tags/b3800 at root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00

root synced new reference refs/tags/b3800 to root/llama.cpp from mirror 2024-09-22 04:56:21 +00:00