mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2024-12-26 03:14:35 +00:00
f99e1e456e
* fix: llama-3 ignore_merges * test: add test for llama-3 bpe ignore_merges * fix: set ignore_merges only for llama-3 * fix: test-tokenizer-1-bpe --ingore-merges detection * fix: copy to fix fallthrough * fix: change ignore_merges to bool * fix: add ignore merges tests to cmake * llama : alternative merge ignore logic --------- Co-authored-by: Haoxiang Fei <feihaoxiang@idea.edu.cn> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
109 lines
1.8 KiB
Plaintext
109 lines
1.8 KiB
Plaintext
ied 4 ½ months
|
||
__ggml_vocab_test__
|
||
Führer
|
||
__ggml_vocab_test__
|
||
|
||
__ggml_vocab_test__
|
||
|
||
__ggml_vocab_test__
|
||
|
||
__ggml_vocab_test__
|
||
|
||
__ggml_vocab_test__
|
||
|
||
__ggml_vocab_test__
|
||
|
||
|
||
__ggml_vocab_test__
|
||
|
||
|
||
|
||
__ggml_vocab_test__
|
||
|
||
|
||
|
||
|
||
__ggml_vocab_test__
|
||
|
||
|
||
__ggml_vocab_test__
|
||
Hello world
|
||
__ggml_vocab_test__
|
||
Hello world
|
||
__ggml_vocab_test__
|
||
Hello World
|
||
__ggml_vocab_test__
|
||
Hello World
|
||
__ggml_vocab_test__
|
||
Hello World!
|
||
__ggml_vocab_test__
|
||
Hello, world!
|
||
__ggml_vocab_test__
|
||
Hello, world!
|
||
__ggml_vocab_test__
|
||
this is 🦙.cpp
|
||
__ggml_vocab_test__
|
||
w048 7tuijk dsdfhu
|
||
__ggml_vocab_test__
|
||
нещо на Български
|
||
__ggml_vocab_test__
|
||
កាន់តែពិសេសអាចខលចេញ
|
||
__ggml_vocab_test__
|
||
🚀 (normal) 😶🌫️ (multiple emojis concatenated) ✅ (only emoji that has its own token)
|
||
__ggml_vocab_test__
|
||
Hello
|
||
__ggml_vocab_test__
|
||
Hello
|
||
__ggml_vocab_test__
|
||
Hello
|
||
__ggml_vocab_test__
|
||
Hello
|
||
__ggml_vocab_test__
|
||
Hello
|
||
__ggml_vocab_test__
|
||
Hello
|
||
Hello
|
||
__ggml_vocab_test__
|
||
(
|
||
__ggml_vocab_test__
|
||
|
||
=
|
||
__ggml_vocab_test__
|
||
' era
|
||
__ggml_vocab_test__
|
||
Hello, y'all! How are you 😁 ?我想在apple工作1314151天~
|
||
__ggml_vocab_test__
|
||
3
|
||
__ggml_vocab_test__
|
||
33
|
||
__ggml_vocab_test__
|
||
333
|
||
__ggml_vocab_test__
|
||
3333
|
||
__ggml_vocab_test__
|
||
33333
|
||
__ggml_vocab_test__
|
||
333333
|
||
__ggml_vocab_test__
|
||
3333333
|
||
__ggml_vocab_test__
|
||
33333333
|
||
__ggml_vocab_test__
|
||
333333333
|
||
__ggml_vocab_test__
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
🚀 (normal) 😶🌫️ (multiple emojis concatenated) ✅ 🦙🦙 3 33 333 3333 33333 333333 3333333 33333333 3.3 3..3 3...3 កាន់តែពិសេសអាច😁 ?我想在apple工作1314151天~ ------======= нещо на Български ''''''```````""""......!!!!!!?????? I've been 'told he's there, 'RE you sure? 'M not sure I'll make it, 'D you like some tea? We'Ve a'lL
|
||
__ggml_vocab_test__
|
||
Việt
|
||
__ggml_vocab_test__
|