llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-26 03:14:35 +00:00

Daniel Bevenius 263978904c finetune : rename feed-forward tensors (w1/w2/w3) (#4839)	2024-02-13 15:15:42 +02:00

* finetune: rename feed-forward tensors (w1/w2/w3)

  This commit renames the feed-forward tensors w1, w2 and w3 to ffn_gate, ffn_down and ffn_up respectively. The motivation for this change is to make it easier to understand the purpose of the tensors. This also seems to be in line with the names used in the llama_layer struct in llama.cpp.

  Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com>

* train-text-from-scratch: rename ff tensors

  This commit renames the feed-forward tensors w1, w2 and w3 to ffn_gate, ffn_down and ffn_up respectively. The motivation for this change is to make it easier to understand the purpose of the tensors. This also seems to be in line with the names used in the llama_layer struct in llama.cpp.

  Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com>

---------

Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com>
baby-llama	ggml : change ggml_scale to take a float instead of tensor (#4573)	2023-12-21 23:20:49 +02:00
batched	examples : add passkey test (#3856)	2024-01-08 11:14:04 +02:00
batched-bench	llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)	2024-01-31 17:30:17 +02:00
batched.swift	swift : fix prompt tokenization logic (#4321)	2023-12-04 15:43:45 +02:00
beam-search	llama : remove token functions with `context` args in favor of `model` (#3720)	2023-10-23 22:40:03 +03:00
benchmark	2-bit quantizations (#4897)	2024-01-14 09:45:56 +02:00
convert-llama2c-to-ggml	ggml : remove n_dims from ggml_tensor (#4469)	2023-12-14 16:52:08 +01:00
embedding	llama : support batched embeddings (#5466)	2024-02-13 14:06:58 +02:00
export-lora	sync : ggml (#5452)	2024-02-12 09:16:06 +02:00
finetune	finetune : rename feed-forward tensors (w1/w2/w3) (#4839)	2024-02-13 15:15:42 +02:00
gguf	gguf : simplify example dependencies	2023-12-21 23:08:14 +02:00
imatrix	Adding some imatrix tools (#5302)	2024-02-04 10:39:58 +02:00
infill	Remove unused data and add fixes (#5154)	2024-01-27 15:25:55 +01:00
jeopardy	parallel : add option to load external prompt file (#3416)	2023-10-06 16:16:38 +03:00
llama-bench	refactor : switch to emplace_back to avoid extra object (#5291)	2024-02-03 13:23:37 +02:00
llama.android	android : use release cmake build type by default (#5123)	2024-01-25 19:05:51 +02:00
llama.swiftui	llama.swiftui : update models layout (#4826)	2024-01-12 14:48:00 +02:00
llava	llava : remove prog parameter from ArgumentParser (#5457)	2024-02-12 10:38:44 +02:00
lookahead	english : use `typos` to fix comments and logs (#4354)	2023-12-12 11:53:36 +02:00
lookup	lookup: add print for drafting performance (#5450)	2024-02-11 12:44:51 +01:00
main	main : ctrl+C print timing in non-interactive mode (#3873)	2024-02-11 15:35:50 +02:00
main-cmake-pkg	main-cmake-pkg : fix build issue (#4665)	2023-12-29 16:18:20 +02:00
parallel	llama : KV cache view API + better KV cache management (#4170)	2023-11-23 19:07:56 +02:00
passkey	examples : add passkey test (#3856)	2024-01-08 11:14:04 +02:00
perplexity	refactor : switch to emplace_back to avoid extra object (#5291)	2024-02-03 13:23:37 +02:00
quantize	refactor : switch to emplace_back to avoid extra object (#5291)	2024-02-03 13:23:37 +02:00
quantize-stats	refactor : switch to emplace_back to avoid extra object (#5291)	2024-02-03 13:23:37 +02:00
save-load-state	llama : minimize size used for state save/load (#4820)	2024-01-13 18:29:43 +02:00
server	server : allow to specify tokens as strings in logit_bias (#5003)	2024-02-11 15:38:14 +02:00
simple	simple : update error message for KV cache check (#4324)	2023-12-04 18:04:21 +02:00
speculative	speculative : threading options (#4959)	2024-01-16 13:04:32 +02:00
sycl	[SYCL] update guide of SYCL backend (#5254)	2024-02-02 15:53:27 +08:00
tokenize	tokenize example: Respect normal add BOS token behavior (#4126)	2023-11-18 14:48:17 -07:00
train-text-from-scratch	finetune : rename feed-forward tensors (w1/w2/w3) (#4839)	2024-02-13 15:15:42 +02:00
alpaca.sh	alpaca.sh : update model file name (#2074)	2023-07-06 19:17:50 +03:00
base-translate.sh	examples : improve base-translate.sh script (#4783)	2024-01-06 11:40:24 +02:00
chat-13B.bat	Create chat-13B.bat (#592)	2023-03-29 20:21:09 +03:00
chat-13B.sh	examples : read chat prompts from a template file (#1196)	2023-05-03 20:58:11 +03:00
chat-persistent.sh	llama : fix session saving/loading (#3400)	2023-10-03 21:04:01 +03:00
chat-vicuna.sh	examples : add chat-vicuna.sh (#1854)	2023-06-15 21:05:53 +03:00
chat.sh	main : log file (#2748)	2023-08-30 09:29:32 +03:00
CMakeLists.txt	ggml : add unified SYCL backend for Intel GPUs (#2690)	2024-01-28 17:56:23 +02:00
gpt4all.sh	examples : add -n to alpaca and gpt4all scripts (#706)	2023-04-13 16:03:39 +03:00
json-schema-to-grammar.py	chmod : make scripts executable (#2675)	2023-08-23 17:29:09 +03:00
llama2-13b.sh	gitignore : changes for Poetry users + chat examples (#2284)	2023-07-21 13:53:27 +03:00
llama2.sh	gitignore : changes for Poetry users + chat examples (#2284)	2023-07-21 13:53:27 +03:00
llama.vim	llama.vim : added api key support (#5090)	2024-01-23 08:51:27 +02:00
llm.vim	llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879)	2023-08-30 09:50:55 +03:00
make-ggml.py	make-ggml.py : compatibility with more models and GGUF (#3290)	2023-09-27 19:25:12 +03:00
Miku.sh	MIKU MAYHEM: Upgrading the Default Model for Maximum Fun 🎉 (#2287)	2023-07-21 11:13:18 +03:00
pydantic_models_to_grammar.py	examples : make pydantic scripts pass mypy and support py3.8 (#5099)	2024-01-25 14:51:24 -05:00
pydantic-models-to-grammar-examples.py	examples : make pydantic scripts pass mypy and support py3.8 (#5099)	2024-01-25 14:51:24 -05:00
reason-act.sh	chmod : make scripts executable (#2675)	2023-08-23 17:29:09 +03:00
server-llama2-13B.sh	chmod : make scripts executable (#2675)	2023-08-23 17:29:09 +03:00