root
synced commits to refs/pull/9058/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
86311ef006
Merge fc6abde7aa into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
root
synced commits to refs/pull/9096/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
81a72fa6be
Merge 7323304092 into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
root
synced commits to refs/pull/9209/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
946cef3314
Merge 951f1d9053 into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9325/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
3e1ef1e6c6
Merge b979fc97ba into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9396/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
68a5abdc87
Merge e83d2707d3 into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
d09770cae7
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)
root
synced commits to refs/pull/9407/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
34c686c3f7
Merge adf3bce13b into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9449/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
b42f7a1450
Merge 25d4599e19 into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9510/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
c4aba398ca
Merge 5f95dccea8 into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9525/merge at root/llama.cpp from mirror
2024-09-22 13:06:19 +00:00
fdd143a581
Merge 95ce058c2b into a5b57b08ce
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
d09770cae7
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)
0fb0b4eab3
mtgpu: map cublasOperation_t to mublasOperation_t (sync code to latest)
a3ad2c9971
mtgpu: enable unified memory
43ff5f36c2
mtgpu: disable flash attention on qy1 (MTT S80); disable q3_k and mul_mat_batched_cublas
e40b33dcad
mtgpu: add mp_21 support
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
root
synced commits to refs/pull/9526/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
196729704a
Merge df79623dc8 into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
d09770cae7
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)
root
synced commits to refs/pull/9532/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
a5d31ba932
Merge a829583c97 into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9541/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
e0226c5439
Merge c42ec2f8bb into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9544/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
5f7b32e86a
Merge 4af076b494 into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9571/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
177f766404
Merge eec216c57a into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9577/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
eb6ea94d2d
Merge 3ae8374b59 into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9579/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
8d72461e8d
Merge 33b692934f into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced commits to refs/pull/9581/merge at root/llama.cpp from mirror
2024-09-22 04:56:21 +00:00
855a62a9f1
Merge 0ad9572f8b into ecd5d6b65b
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)