llama.cpp/cpy.cuh at 996e47978074234da58eae6f84d611098e1e0d47 - llama.cpp - Gitea: Git with a cup of tea

root/llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-10 02:31:46 +00:00

bssrdf 8c60a8a462

increase cuda_cpy block size (ggml/996)

Co-authored-by: bssrdf <bssrdf@gmail.com>

2024-10-26 10:33:56 +03:00

10 lines

298 B

Plaintext

Raw Blame History

 #include "common.cuh"
 #define CUDA_CPY_BLOCK_SIZE 64
 void ggml_cuda_cpy(ggml_backend_cuda_context & ctx, const ggml_tensor * src0, ggml_tensor * src1);
 void ggml_cuda_dup(ggml_backend_cuda_context & ctx, ggml_tensor * dst);
 void* ggml_cuda_cpy_fn(const ggml_tensor * src0, ggml_tensor * src1);