mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2024-11-13 14:29:52 +00:00
Updated Feature matrix (markdown)
parent
b1c6785367
commit
c0eb5b983a
@ -14,4 +14,4 @@
|
||||
* ²: Only with `-ngl 0`
|
||||
* ³: Only `-ctk q8_0`, inference is 50% slower
|
||||
* ⁴: Slower than K-quants of comparable size
|
||||
* ⁵: Slower than hipBLAS/cuBLAS on similar cards
|
||||
* ⁵: Slower than cuBLAS/rocBLAS on similar cards
|
Loading…
Reference in New Issue
Block a user