CUDA: lower GPU latency + fix Windows performance (#3110) · d54a4027a6 - llama.cpp - Gitea: Git with a cup of tea

root/llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-11-11 21:39:52 +00:00

CUDA: lower GPU latency + fix Windows performance (#3110)

This commit is contained in:

Johannes Gäßler

2023-09-11 19:55:51 +02:00

committed by

GitHub

parent 1b0d09259e

commit d54a4027a6

No known key found for this signature in database

GPG Key ID: 4AEE18F83AFDEB23

1 changed files with 572 additions and 608 deletions

1180

ggml-cuda.cu

View File

File diff suppressed because it is too large Load Diff