mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2024-11-14 23:09:53 +00:00
8f275a7c45
Some checks are pending
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/full-cuda.Dockerfile platforms:linux/amd64 tag:full-cuda]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/full-musa.Dockerfile platforms:linux/amd64 tag:full-musa]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/full.Dockerfile platforms:linux/amd64,linux/arm64 tag:full]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-cli-cuda.Dockerfile platforms:linux/amd64 tag:light-cuda]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-cli-intel.Dockerfile platforms:linux/amd64 tag:light-intel]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-cli-musa.Dockerfile platforms:linux/amd64 tag:light-musa]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-cli.Dockerfile platforms:linux/amd64,linux/arm64 tag:light]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-server-cuda.Dockerfile platforms:linux/amd64 tag:server-cuda]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-server-intel.Dockerfile platforms:linux/amd64 tag:server-intel]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-server-musa.Dockerfile platforms:linux/amd64 tag:server-musa]) (push) Waiting to run
Publish Docker image / Push Docker image to Docker Hub (map[dockerfile:.devops/llama-server.Dockerfile platforms:linux/amd64,linux/arm64 tag:server]) (push) Waiting to run
Nix CI / nix-eval (macos-latest) (push) Waiting to run
Nix CI / nix-eval (ubuntu-latest) (push) Waiting to run
Nix CI / nix-build (macos-latest) (push) Waiting to run
Nix CI / nix-build (ubuntu-latest) (push) Waiting to run
flake8 Lint / Lint (push) Waiting to run
Python Type-Check / pyright type-check (push) Waiting to run
* ggml: Add POOL2D OP for GPU ACC to the Vulkan. - The MobileVLM model now supports inference acceleration through GPU by utilizing the Vulkan backend. - A GGML_OP_POOL_2D shader has been added. (Pooling) - The encoding performance of the CLIP model improved from 2.8s on the CPU to 0.7s on the GPU. Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com> * [fix] Correct the incorrect order of the parameters. fix casting to int. Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com> --------- Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com> |
||
---|---|---|
.. | ||
acc.comp | ||
add.comp | ||
argsort.comp | ||
clamp.comp | ||
CMakeLists.txt | ||
concat.comp | ||
copy.comp | ||
cos.comp | ||
dequant_f32.comp | ||
dequant_funcs.comp | ||
dequant_head.comp | ||
dequant_iq4_nl.comp | ||
dequant_q2_k.comp | ||
dequant_q3_k.comp | ||
dequant_q4_0.comp | ||
dequant_q4_1.comp | ||
dequant_q4_k.comp | ||
dequant_q5_0.comp | ||
dequant_q5_1.comp | ||
dequant_q5_k.comp | ||
dequant_q6_k.comp | ||
dequant_q8_0.comp | ||
diag_mask_inf.comp | ||
div.comp | ||
gelu_quick.comp | ||
gelu.comp | ||
generic_binary_head.comp | ||
generic_head.comp | ||
generic_unary_head.comp | ||
get_rows_quant.comp | ||
get_rows.comp | ||
group_norm.comp | ||
im2col.comp | ||
leaky_relu.comp | ||
mul_mat_split_k_reduce.comp | ||
mul_mat_vec_base.comp | ||
mul_mat_vec_nc.comp | ||
mul_mat_vec_p021.comp | ||
mul_mat_vec_q2_k.comp | ||
mul_mat_vec_q3_k.comp | ||
mul_mat_vec_q4_k.comp | ||
mul_mat_vec_q5_k.comp | ||
mul_mat_vec_q6_k.comp | ||
mul_mat_vec.comp | ||
mul_mm.comp | ||
mul.comp | ||
norm.comp | ||
pad.comp | ||
pool2d.comp | ||
relu.comp | ||
repeat.comp | ||
rms_norm.comp | ||
rope_head.comp | ||
rope_neox.comp | ||
rope_norm.comp | ||
scale.comp | ||
silu.comp | ||
sin.comp | ||
soft_max.comp | ||
square.comp | ||
sum_rows.comp | ||
tanh.comp | ||
timestep_embedding.comp | ||
types.comp | ||
upscale.comp | ||
vulkan-shaders-gen.cpp |