Aaron Miller
|
9bc52ebae3
|
attempted speedups
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
cd0257ed0d
|
q4_1 mat*mat
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
4809890d80
|
rm commented dbg print
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
b78a94bc6d
|
q6k mm works
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
3327d84a7f
|
perf: use bigger threadgroups in mm
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
46385ee0d5
|
misc vulkan cleanup
make pushconts consistent w/ dispatch, avoid a double free
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
f0cd38b9ad
|
add mat*mat ops
|
2023-11-03 17:22:22 -04:00 |
|
Aaron Miller
|
020b1745a0
|
vulkan: implement neox mode for rope
|
2023-11-03 17:22:21 -04:00 |
|
Aaron Miller
|
ff4212d20f
|
q8 mat*vec
|
2023-11-03 17:22:21 -04:00 |
|
Aaron Miller
|
9db90cbe12
|
f16 mv broadcasting fix (gqa fix)
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
bc4b5ed1cb
|
Fixes for subgroup size to bring AMD and NVIDIA inline with eachother for all kernels.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
32289aa447
|
Fixes for norm.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
06d4b21598
|
Fix offset into the qh and now we have working vulkan accelerated for gguff'd llama.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
f1c9bc1821
|
Add q6_k getrows and mul*vec kernel.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
4b223ec432
|
Refactor getrows to use common code and get ready for q6_k.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
601905e75e
|
Move the subgroups and printf into common.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
93306f16d0
|
Consolidate code for mat x vec kernels and use subgroups more extensively.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
77135a3bf5
|
Add a common boilerplate code via include and elim copy pasta
|
2023-11-03 17:22:21 -04:00 |
|
Cebtenzzre
|
6b6c73a9e3
|
kompute : don't fail build because of -Warray-bounds
There are some warnings in debug builds that are likely to be false
positives.
|
2023-11-03 17:22:21 -04:00 |
|
Adam Treat
|
2c24d67e7b
|
Don't crash on available devices if we can't even create an instance.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
bd5f6399bb
|
Don't try and install kompute artifacts.
|
2023-10-05 13:39:18 -04:00 |
|
Aaron Miller
|
beee57266f
|
Make kompute actually include external SDK headers when requested
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
b7e2e691d4
|
Completely revamp how we do object management with the vulkan backend and
stop using so many static objects so we can tear down and bring up vulkan
on new devices in the same runtime.
|
2023-10-05 13:39:18 -04:00 |
|
Adam Treat
|
45c8778b49
|
Switch to a dynamic dispatch table instead of linking hard against libvulkan.
|
2023-10-05 13:39:18 -04:00 |
|
Aaron Miller
|
8563fa001f
|
remove dynamic deps from kompute build
should no longer have new external deps other than libvulkan
```
ubuntu@ip-172-31-1-24:~/repo/gpt4all/gpt4all-backend/build$ ldd ./libllamamodel-mainline-avxonly.so
linux-vdso.so.1 (0x00007ffcb53bb000)
libvulkan.so.1 => /lib/x86_64-linux-gnu/libvulkan.so.1 (0x00007f239dab5000)
libstdc++.so.6 => /lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007f239d800000)
libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f239d719000)
libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f239da95000)
libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f239d400000)
/lib64/ld-linux-x86-64.so.2 (0x00007f239dd1d000)
```
|
2023-10-05 13:39:18 -04:00 |
|
niansa
|
ba15dfd0be
|
Nomic vulkan backend licensed under the Software for Open Models License (SOM), version 1.0.
|
2023-10-05 13:39:18 -04:00 |
|