Aaron Miller
|
9db90cbe12
|
f16 mv broadcasting fix (gqa fix)
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
bc4b5ed1cb
|
Fixes for subgroup size to bring AMD and NVIDIA inline with eachother for all kernels.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
de589ced7c
|
Change this back to be in agreement with metal and our previous softmax kernel.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
f1c9bc1821
|
Add q6_k getrows and mul*vec kernel.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
5509f74318
|
Minor cleanup.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
93306f16d0
|
Consolidate code for mat x vec kernels and use subgroups more extensively.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
2c24d67e7b
|
Don't crash on available devices if we can't even create an instance.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
addac25293
|
Set the singleton to nullptr here.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
68aca6be08
|
Only use vulkan with known quant that work.
|
2023-10-05 13:39:18 -04:00 |
|
Aaron Miller
|
8bea719879
|
vulkan: disambiguate gpus with the same name
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
68cf1df6fb
|
Throw an exception when allocation fails for vulkan.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
b7e2e691d4
|
Completely revamp how we do object management with the vulkan backend and
stop using so many static objects so we can tear down and bring up vulkan
on new devices in the same runtime.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
45c8778b49
|
Switch to a dynamic dispatch table instead of linking hard against libvulkan.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
48a45ea435
|
Remove warning which fails on windows.
|
2023-10-05 13:39:18 -04:00 |
|
niansa
|
ba15dfd0be
|
Nomic vulkan backend licensed under the Software for Open Models License (SOM), version 1.0.
|
2023-10-05 13:39:18 -04:00 |
|