Commit Graph

36 Commits

Author SHA1 Message Date
Kerfuffle
e06cbcee73
gguf : add Python script to convert GGMLv3 LLaMA models to GGUF (#2682)
* First pass at converting GGMLv3 LLaMA models to GGUF

* Cleanups, better output during conversion

* Fix vocab space conversion logic

* More vocab conversion fixes

* Add description to converted GGUF files

* Improve help text, expand warning

* Allow specifying name and description for output GGUF

* Allow overriding vocab and hyperparams from original model metadata

* Use correct params override var name

* Fix wrong type size for Q8_K

Better handling of original style metadata

* Set default value for gguf add_tensor raw_shape KW arg
2023-08-21 17:45:52 +03:00
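The GGMLv3-to-GGUF converter introduced above has to begin by parsing the legacy model header before it can emit GGUF metadata. A minimal sketch of that first step, assuming the GGJT-style layout llama.cpp's LLaMA files used (a uint32 magic and version followed by seven uint32 hyperparameters; the field names here are illustrative, not the converter's actual code):

```python
import struct

def read_ggml_header(buf: bytes):
    """Parse magic, version, and the fixed hyperparameter block."""
    magic, version = struct.unpack_from("<II", buf, 0)
    names = ("n_vocab", "n_embd", "n_mult", "n_head", "n_layer", "n_rot", "ftype")
    values = struct.unpack_from("<7I", buf, 8)
    return magic, version, dict(zip(names, values))

# Build a fake header to exercise the parser (0x67676A74 is ASCII "ggjt"
# read little-endian; the hyperparameter values are made up).
fake = struct.pack("<II7I", 0x67676A74, 3, 32000, 4096, 256, 32, 32, 128, 1)
magic, version, hp = read_ggml_header(fake)
```

The converter then maps fields like these onto GGUF key-value pairs, which is where the commit's vocab and hyperparameter override options come in.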
klosax
6a69a693cb
gguf.py : fix rope scale kv 2023-08-21 13:23:10 +02:00
klosax
bd5a57901b
gguf.py : fix for falcon 40b 2023-08-19 01:01:52 +02:00
klosax
147a99bd3a
gguf.py : reverse GGUF_MAGIC 2023-08-17 23:24:04 +02:00
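The `reverse GGUF_MAGIC` change above concerns how the uint32 magic constant appears on disk: packed little-endian, 0x46554747 serializes to the ASCII bytes `GGUF` at the start of the file, while a byte-reversed constant would serialize as `FUGG`. A quick demonstration:

```python
import struct

# The GGUF on-disk magic: written as a little-endian uint32, these
# four bytes read back as ASCII "GGUF" at the start of the file.
GGUF_MAGIC = 0x46554747

assert struct.pack("<I", GGUF_MAGIC) == b"GGUF"

# The byte-reversed constant would put "FUGG" at the start of the
# file instead, which is why the ordering of the constant matters.
assert struct.pack("<I", 0x47475546) == b"FUGG"
```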
klosax
640ddc4259
gguf.py : gptneox mapping 2023-08-17 21:43:10 +02:00
M. Yusuf Sarıgöz
fc3a523211
gguf.py : write tensors in a single pass (#2644)
* gguf : single pass for writing tensors + refactoring writer

* gguf : single pass for writing tensors + refactoring writer

* gguf : single pass for writing tensors + refactoring writer

* gguf : style fixes in simple conversion script

* gguf : refactor gptneox conversion script

* gguf : rename h5 to hf (for HuggingFace)

* gguf : refactor pth to gguf conversion script

* gguf : rm file_type key and method

* gguf.py : fix vertical alignment

* gguf.py : indentation

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-08-17 21:57:39 +03:00
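The single-pass refactor above avoids buffering all tensor data in memory: because each tensor's aligned start position can be computed from byte sizes alone, the tensor-info section can be written first and the raw data streamed once afterwards. A minimal sketch of the offset planning (illustrative names, not the actual gguf.py API; 32 is GGUF's default alignment):

```python
ALIGN = 32  # GGUF's default data alignment

def aligned(offset: int, align: int = ALIGN) -> int:
    """Round offset up to the next multiple of align."""
    return (offset + align - 1) // align * align

def plan_offsets(sizes):
    """Return the aligned start offset of each tensor's data blob."""
    offsets, pos = [], 0
    for n in sizes:
        pos = aligned(pos)
        offsets.append(pos)
        pos += n
    return offsets

# Three tensors of 100, 64 and 7 bytes land at offsets 0, 128 and 192.
print(plan_offsets([100, 64, 7]))
```

With the offsets known up front, the writer can emit every tensor-info record in one pass and then copy each tensor's bytes directly to the file.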
Georgi Gerganov
acaa98234a
convert.py : fix HF tensor permuting / unpacking
ggml-ci
2023-08-17 21:06:45 +03:00
Georgi Gerganov
e0429d38e4
convert-new.py : output gguf (#2635)
* convert-new.py : output gguf (WIP)

* convert-new.py : add gguf key-value pairs

* llama : add hparams.ctx_train + no longer print ftype

* convert-new.py : minor fixes

* convert-new.py : vocab-only option should work now

* llama : fix tokenizer to use llama_char_to_byte

* tests : add new ggml-vocab-llama.gguf

* convert-new.py : tensor name mapping

* convert-new.py : add map for skipping tensor serialization

* convert-new.py : convert script now works

* gguf.py : pick some of the refactoring from #2644

* convert-new.py : minor fixes
2023-08-17 17:19:52 +03:00
Georgi Gerganov
c8ee87f141
gguf.py : merge all files in gguf.py 2023-08-16 19:55:49 +03:00
klosax
b6056c3db8
gguf.py : add tensor data layout 2023-08-15 19:53:44 +02:00
klosax
cedb4870c6
gguf.py : add token types 2023-08-14 22:08:40 +02:00
Georgi Gerganov
62490f1380
gguf : use UNIX line ending 2023-08-14 13:04:35 +03:00
klosax
5c5a95ba2d
gguf.py : don't add empty strings 2023-08-14 11:22:06 +02:00
M. Yusuf Sarıgöz
91d4bfd536 convert : write more metadata for LLaMA 2023-08-13 13:29:46 +03:00
klosax
d2ce9cfe8d
gguf.py : n_layer --> n_block 2023-08-13 00:01:20 +02:00
klosax
5d81a715d4
gguf.py : no need to convert tensors twice 2023-08-12 21:45:45 +02:00
klosax
4357e692ac
gguf.py : use custom alignment if present 2023-08-07 13:51:26 +02:00
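Honoring a custom alignment, as in the commit above, comes down to padding before each tensor blob: GGUF records the alignment under the `general.alignment` metadata key, with 32 as the conventional default when the key is absent. A small sketch of the padding computation (illustrative helper, not the script's actual code):

```python
def pad_to(pos: int, align: int = 32) -> bytes:
    """Zero bytes needed to advance pos to the next multiple of align."""
    return b"\x00" * (-pos % align)

# 28 zero bytes bring position 100 up to 128; position 64 needs none.
print(len(pad_to(100, 32)))
print(len(pad_to(64, 64)))
```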
klosax
278ada9572
gguf.py : bytesarray for gpt2bpe tokenizer 2023-08-04 04:07:57 +02:00
klosax
49380a23a3
gguf.py : accumulate kv and tensor info data + special tokens 2023-08-01 23:37:48 +02:00
M. Yusuf Sarıgöz
7aa0a0e7f7 gguf : support custom alignment value 2023-07-31 09:59:36 +03:00
klosax
ccd81a751b
gguf.py : add layer norm eps and merges 2023-07-30 14:48:14 +02:00
M. Yusuf Sarıgöz
32e037ffbe gguf : fix set is not subscriptable 2023-07-30 01:01:13 +03:00
klosax
9577821487
gguf.py : support any type 2023-07-29 21:29:07 +02:00
M. Yusuf Sarıgöz
0f5e57f01d gguf : handle already encoded string 2023-07-29 19:56:06 +03:00
M. Yusuf Sarıgöz
cc3dd7f042 gguf : write tokenizer data 2023-07-29 13:30:22 +03:00
M. Yusuf Sarıgöz
8a76dd8a85 gguf : write tensors one by one 2023-07-29 13:17:28 +03:00
M. Yusuf Sarıgöz
c861e234f4 gguf : write tensors one by one 2023-07-29 12:49:01 +03:00
M. Yusuf Sarıgöz
0c219fb5b5 gguf : fix writing gguf arrays 2023-07-29 12:42:54 +03:00
M. Yusuf Sarıgöz
93f7f7aef7 gguf : write tensors one by one and code reuse 2023-07-29 12:34:35 +03:00
M. Yusuf Sarıgöz
ea5f9ad2ca gguf : fix writing gguf arrays 2023-07-29 12:25:43 +03:00
M. Yusuf Sarıgöz
06f423a8e1 gguf : write sample tensors to read 2023-07-29 10:26:26 +03:00
M. Yusuf Sarıgöz
1495735aac gguf : fix writing tensors 2023-07-29 00:26:22 +03:00
M. Yusuf Sarıgöz
11ef380c2a
GGUF : write tensor (#2426)
* WIP: Write tensor

* GGUF : Support writing tensors in Python

* refactor : rm unused import and upd todos

* fix : fix errors upd writing example

* rm example.gguf

* gitignore *.gguf

* undo formatting
2023-07-28 11:34:16 +03:00
Georgi Gerganov
158be8f7f4
gguf.py : some code style changes 2023-07-27 15:37:06 +03:00
M. Yusuf Sarıgöz
c85d3178b3
refactor : reduce code duplication and better API (#2415) 2023-07-27 10:29:29 +03:00
M. Yusuf Sarıgöz
bae6b125f6
wip : implement GGUF (#2397)
* Add LLAMA_DEFAULT_RMS_EPS so we can change the default (#2384)

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

* WIP: Python class to write GGUF, incomplete C API for reading

---------

Co-authored-by: Kawrakow <48489457+ikawrakow@users.noreply.github.com>
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2023-07-26 18:21:13 +03:00