ggml : optimize for ppc64le using VSX intrinsics (ggml/784)

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-24 10:24:35 +00:00

* optimize for ppc64le using VSX intrinsics

* 1. code clean up by removing comments about overflow concern.

2. fix typo in suffix of scaling.

* Continue to fix typo in suffix of scaling for QK_K <> 256

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

This commit is contained in:

Hong Bo PENG

2024-05-12 17:17:18 +08:00

committed by

Georgi Gerganov

parent 4f0263633b

commit 0d26d8ccd8

1 changed files with 2167 additions and 2 deletions

2169

ggml-quants.c

View File

File diff suppressed because it is too large Load Diff

ggml : optimize for ppc64le using VSX intrinsics (ggml/784)

2169 ggml-quants.c View File

2169

ggml-quants.c

View File