Georgi Gerganov
|
231cff5f6f
|
sync : ggml
|
2024-08-27 22:41:27 +03:00 |
|
Georgi Gerganov
|
06658ad7c3
|
metal : separate scale and mask from QKT in FA kernel (#9189)
* metal : separate scale and mask from QKT in FA kernel
* metal : ne01 check no longer necessary
* metal : keep data in local memory
|
2024-08-26 18:31:02 +03:00 |
|
Georgi Gerganov
|
fc18425b6a
|
ggml : add SSM Metal kernels (#8546)
* ggml : add ggml_ssm_conv metal impl
* ggml : add ssm_scan metal impl
ggml-ci
|
2024-08-26 17:55:36 +03:00 |
|
slaren
|
0c41e03ceb
|
metal : gemma2 flash attention support (#9159)
|
2024-08-26 11:08:59 +02:00 |
|
slaren
|
87e397d00b
|
ggml : fix quant dot product with odd number of blocks (#8549)
* ggml : fix iq4_nl dot product with odd number of blocks
* ggml : fix odd blocks for ARM_NEON (#8556)
* ggml : fix iq4_nl dot product with odd number of blocks
* ggml : fix q4_1
* ggml : fix q5_0
* ggml : fix q5_1
* ggml : fix iq4_nl metal
ggml-ci
* ggml : fix q4_0
* ggml : fix q8_0
ggml-ci
* ggml : remove special Q4_0 code for first 2 blocks
* ggml : fix sumf redefinition
---------
Co-authored-by: slaren <slarengh@gmail.com>
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-07-19 17:17:27 +02:00 |
|
Georgi Gerganov
|
c917b67f06
|
metal : template-ify some of the kernels (#8447)
ggml-ci
|
2024-07-13 18:32:33 +03:00 |
|
Clint Herron
|
07a3fc0608
|
Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)
|
2024-07-02 12:18:10 -04:00 |
|
Georgi Gerganov
|
f3f65429c4
|
llama : reorganize source code + improve CMake (#8006)
* scripts : update sync [no ci]
* files : relocate [no ci]
* ci : disable kompute build [no ci]
* cmake : fixes [no ci]
* server : fix mingw build
ggml-ci
* cmake : minor [no ci]
* cmake : link math library [no ci]
* cmake : build normal ggml library (not object library) [no ci]
* cmake : fix kompute build
ggml-ci
* make,cmake : fix LLAMA_CUDA + replace GGML_CDEF_PRIVATE
ggml-ci
* move public backend headers to the public include directory (#8122)
* move public backend headers to the public include directory
* nix test
* spm : fix metal header
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* scripts : fix sync paths [no ci]
* scripts : sync ggml-blas.h [no ci]
---------
Co-authored-by: slaren <slarengh@gmail.com>
|
2024-06-26 18:33:02 +03:00 |
|