llama.cpp/examples/speculative
2023-09-20 11:03:18 +03:00
..
CMakeLists.txt speculative : PoC for speeding-up inference via speculative sampling (#2926) 2023-09-03 15:12:08 +03:00
speculative.cpp llama : improve llama_batch API + simplify parallel example 2023-09-20 11:03:18 +03:00