llama.cpp/scale.cuh at 95fb0aefab568348da159efdd370e064d1b35f97 - llama.cpp - Gitea: Git with a cup of tea

root/llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-09-23 05:26:19 +00:00

slaren ae1f211ce2

cuda : refactor into multiple files (#6269 )

2024-03-25 13:50:23 +01:00

6 lines

135 B

Plaintext

Raw Blame History

 #include "common.cuh"
 #define CUDA_SCALE_BLOCK_SIZE 256
 void ggml_cuda_op_scale(ggml_backend_cuda_context & ctx, ggml_tensor * dst);