Georgi Gerganov
|
f00780b2ee
|
llama : sync gguf-llama.cpp with latest llama.cpp (#2608)
* llama : sync gguf-llama.cpp with latest llama.cpp
* minor : indentation + assert
* llama : refactor gguf_buffer and gguf_ctx_buffer
* llama : minor
|
2023-08-14 16:28:44 +03:00 |
|
Georgi Gerganov
|
62490f1380
|
gguf : use UNIX line ending
|
2023-08-14 13:04:35 +03:00 |
|
Georgi Gerganov
|
0c19ae70d5
|
simple : minor style changes
|
2023-08-14 12:58:12 +03:00 |
|
Georgi Gerganov
|
56a1f32072
|
Merge branch 'master' into gguf
|
2023-08-14 10:14:05 +03:00 |
|
M. Yusuf Sarıgöz
|
60d540831b
|
gguf : roper closing of file
|
2023-08-12 21:42:31 +03:00 |
|
M. Yusuf Sarıgöz
|
202eab04d3
|
gguf : quantization is working
|
2023-08-12 16:39:05 +03:00 |
|
M. Yusuf Sarıgöz
|
fa7c39540c
|
gguf : start implementing quantization (WIP)
|
2023-08-12 15:55:58 +03:00 |
|
M. Yusuf Sarıgöz
|
c4f02b4f74
|
gguf : start implementing quantization (WIP)
|
2023-08-12 12:01:17 +03:00 |
|
M. Yusuf Sarıgöz
|
4fa017a1f9
|
gguf : start implementing quantization (WIP)
|
2023-08-12 10:40:56 +03:00 |
|
M. Yusuf Sarıgöz
|
781b9ec3f5
|
gguf : write metadata in gguf_file_saver (WIP)
|
2023-08-11 18:01:26 +03:00 |
|
M. Yusuf Sarıgöz
|
e7d346c37c
|
gguf : start implementing gguf_file_saver (WIP)
|
2023-08-11 09:52:01 +03:00 |
|
M. Yusuf Sarıgöz
|
c3a65c4bbe
|
gguf-util.h : update note
|
2023-08-02 11:16:23 +03:00 |
|
M. Yusuf Sarıgöz
|
cf365fbc20
|
gguf : gguf counterpart of llama-util.h
|
2023-08-02 11:13:56 +03:00 |
|