llama.cpp/examples/speculative
2023-09-13 08:50:46 +02:00
..
CMakeLists.txt speculative : PoC for speeding-up inference via speculative sampling (#2926) 2023-09-03 15:12:08 +03:00
speculative.cpp speculative: add --n-gpu-layers-draft option (#3063) 2023-09-13 08:50:46 +02:00