llama.cpp/ggml/src/ggml-cann/kernels
2024-08-05 21:10:37 +08:00
..
ascendc_kernels.h cann: support q4_0 model (#8822) 2024-08-05 12:22:30 +08:00
CMakeLists.txt cann: support q4_0 model (#8822) 2024-08-05 12:22:30 +08:00
dup.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
get_row_f16.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
get_row_f32.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
get_row_q4_0.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
get_row_q8_0.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
quantize_f16_q8_0.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
quantize_f32_q8_0.cpp [CANN] Add Ascend NPU backend (#6035) 2024-07-17 14:23:50 +03:00
quantize_float_to_q4_0.cpp cann: fix buffer_num and runtime speed slowly error (#8865) 2024-08-05 21:10:37 +08:00