root
synced commits to refs/pull/9096/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
49c1a6217e
Merge
7323304092
into 912c331d3d
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9131/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
68f8f86697
Merge
81a37ca577
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
root
synced commits to refs/pull/9251/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
781adf9878
Merge
06e3e3bf51
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
root
synced commits to refs/pull/9331/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
e3eacba3fc
Merge
904111af8c
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
root
synced commits to refs/pull/9407/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
aeffeb4ae0
Merge
adf3bce13b
into 912c331d3d
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9482/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
188a0a562d
Merge
e08b907760
into 912c331d3d
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
root
synced commits to refs/pull/9510/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
e85e864795
Merge
5f95dccea8
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9532/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
c7b3c139a2
Merge
a829583c97
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9541/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
56cd436389
Merge
c42ec2f8bb
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9544/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
9b07c845ed
Merge
4af076b494
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9557/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
b1c09b186e
Merge
f9c2155158
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
root
synced commits to refs/pull/9571/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
64715503cc
Merge
c4d6f343d4
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9577/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
0563bbab56
Merge
3ae8374b59
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
root
synced commits to refs/pull/9579/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
5532846cb3
Merge
33b692934f
into c35e586ea5
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
912c331d3d
Fix merge error in #9454 (#9589)
root
synced commits to refs/pull/9586/merge at root/llama.cpp from mirror
2024-09-22 21:16:21 +00:00
4940dbba45
Merge
db660f5a40
into 912c331d3d
912c331d3d
Fix merge error in #9454 (#9589)
a5b57b08ce
CUDA: enable Gemma FA for HIP/Pascal (#9581)
ecd5d6b65b
llama: remove redundant loop when constructing ubatch (#9574)
2a63caaa69
RWKV v6: RWKV_WKV op CUDA implementation (#9454)
root
synced and deleted reference refs/tags/refs/pull/9526/merge at root/llama.cpp from mirror
2024-09-22 21:16:20 +00:00