llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-26 11:24:35 +00:00

Shanshan Shen 9a4b79bcfa CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454)	2024-11-26 18:08:37 +08:00

* improve inferencing performance for ascend npu.

Co-authored-by: Frank Mai <thxCode@thxcode0824@gmail.com>

* some modification after review
* some modifications after review
* restore some modifications
* restore some modifications

---------

Co-authored-by: shanshan shen <shanshanshen333@gmail.com>
Co-authored-by: Frank Mai <thxCode@thxcode0824@gmail.com>

include	ggml : add support for dynamic loading of backends (#10469)	2024-11-25 15:13:39 +01:00
src	CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454)	2024-11-26 18:08:37 +08:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	ggml : add support for dynamic loading of backends (#10469)	2024-11-25 15:13:39 +01:00