Created Feature matrix (markdown)

2024-11-13 14:29:52 +00:00 · 2024-03-05 12:36:14 +00:00 · 2024-03-05 12:36:14 +00:00 · 27130d440e
commit 27130d440e
parent 26c482f18e
1 changed files with 11 additions and 0 deletions
--- a/Feature-matrix.md
+++ b/Feature-matrix.md
@ -0,0 +1,11 @@
+# llama.cpp feature matrix
+
+|                      | **CPU (AVX2)** | **CPU (ARM NEON)** | **Metal** | **cuBLAS** |    **rocBLAS**   | **SYCL** | **CLBlast** | **Vulkan** | **Kompute** |
+|:--------------------:|:--------------:|:------------------:|:---------:|:----------:|:----------------:|:--------:|:-----------:|:----------:|:-----------:|
+| **K-quants**         | ✅              | ✅                  | ✅         | ✅          | ✅                | ✅        | ✅           | ✅          | 🚫           |
+| **I-quants**         | ✅ (SLOW)       | ✅ (SLOW)           | ✅ (SLOW)  | ✅          | ✅                | Partial¹        | 🚫           | 🚫          | 🚫           |
+| **Multi-GPU**        | N/A            | N/A                | N/A       | ✅          | ❓                | 🚫        | ❓           | ✅          | ❓           |
+|  **K cache quants**  | ✅              | ❓                  | ❓         | ✅          | Only q8_0 (SLOW) | ❓        | ✅           | 🚫          | 🚫           |
+| **MoE architecture** | ✅              | ❓                  | ✅         | ✅          | ✅                | ❓        | Only -ngl 0 | 🚫          | 🚫           |
+
+* ¹: IQ3_S and IQ1_S, see #5886