Commit Graph

  • ea336bfa33 rwkv eos Concedo 2023-05-29 22:40:27 +0800
  • 6b3373cb81 revert bad fix Concedo 2023-05-29 22:06:12 +0800
  • 248367605e
    Work around for recalculating logits in cached prompts (Fixes #1585) (#1609) master-2483676 DannyDaemonic 2023-05-29 05:13:40 -0700
  • ef16d09a51 fix for older gcc, updated lite Concedo 2023-05-29 18:54:15 +0800
  • 44c83c6eba Merge remote-tracking branch 'upstream/master' into cached-logits-bandaid Danny Daemonic 2023-05-29 02:57:57 -0700
  • 3a73ebe8d2 Merge branch 'master' into concedo_experimental Concedo 2023-05-29 16:47:32 +0800
  • 254a9ff12c Merge commit 'ebc5d0651a1af44a2aecf503c1ceecede1ef99c4' into concedo_experimental Concedo 2023-05-29 16:26:24 +0800
  • 30ff1133f5 allow users to rename models for use in horde Concedo 2023-05-29 16:01:05 +0800
  • 97b39f875c fixed fstat64 build error on mac Concedo 2023-05-29 15:50:07 +0800
  • 0773028d52 1) make gpt_params_parse can jump over some predefined unknown args so we can reuse the gpt_params_parse function 2) fixed the grpc server error Liu Ming 2023-05-29 14:07:13 +0800
  • 0e730dd23b
    Adding git in container package dependencies (#1621) Jiří Podivín 2023-05-29 06:45:50 +0200
  • 96165b1201 pick from master changhz 2023-05-28 23:47:42 -0400
  • 530eb57fe4 fix the error of no ending Liu Ming 2023-05-29 08:37:34 +0800
  • 56895e28f6
    get vocabulary for exporting training checkpoint to llama compatible model file xaedes 2023-05-29 02:25:18 +0200
  • 4b81c32d5b
    add export of training checkpoint to llama compatible model file xaedes 2023-05-29 01:27:09 +0200
  • 2da5c8cf24
    set default model.type for unknown models with few layers xaedes 2023-05-29 01:20:55 +0200
  • bf4d9b3b81
    add llama_get_vocab to get the vocabulary as output parameters xaedes 2023-05-29 01:20:26 +0200
  • 42cf4d8433
    Merge branch 'master' into master Henri Vasserman 2023-05-29 01:05:19 +0300
  • 33b6957177 Fixed failing to return result on stopping token. digiwombat 2023-05-28 16:45:05 -0400
  • 89475fb320
    slightly improve how cross entropy loss is compute xaedes 2023-05-28 22:40:58 +0200
  • 5f5aa20078
    remove trailing whitespace xaedes 2023-05-28 22:00:56 +0200
  • 1fbd19abe1
    use ggml_cross_entropy_loss in text training example xaedes 2023-05-28 22:00:26 +0200
  • f056a04a80
    add tests for cross_entropy_loss backward pass xaedes 2023-05-28 21:59:17 +0200
  • 71aaf8dedf
    add ggml_cross_entropy_loss with backward pass for faster training xaedes 2023-05-28 21:57:38 +0200
  • 3b126f654f
    LLAMA_DEBUG adds debug symbols (#1617) master-3b126f6 Johannes Gäßler 2023-05-28 21:01:02 +0200
  • 6c58f64a3b --ctx_size flag to --ctx-size to match common.cpp digiwombat 2023-05-28 14:17:36 -0400
  • b38d41ef52 --memory_f32 flag to --memory-f32 to match common.cpp digiwombat 2023-05-28 13:58:25 -0400
  • 655899db89 Add ignore_eos option to generation settings. digiwombat 2023-05-28 13:49:45 -0400
  • 1b78ed2081
    Only show -ngl option when relevant + other doc/arg handling updates (#1625) master-1b78ed2 Kerfuffle 2023-05-28 11:48:57 -0600
  • 337aea1139
    examples : add --alias option to gpt_params to set use friendly model name (#1614) master-337aea1 Vladimir Zorin 2023-05-28 20:14:24 +0300
  • bb051d9723
    opencl : no need to allocate cl_mem on heap (#1612) master-bb051d9 Howard Su 2023-05-29 01:13:36 +0800
  • ca74884f66
    opencl : use strstr to check if fp16 supported (#1611) master-ca74884 Howard Su 2023-05-29 01:09:56 +0800
  • 2c9ee7a052
    Apply suggestions from code review Randall Fitzgerald 2023-05-28 09:34:11 -0700
  • 74c6f36bf1
    Editorconfig suggested fixes Henri Vasserman 2023-05-28 19:19:34 +0300
  • 05cb629c8e
    replace inefficient repeat backward pass with dedicated repeat_back operation xaedes 2023-05-28 18:00:17 +0200
  • c47df09842
    simplify backward pass for SQRT xaedes 2023-05-28 17:32:01 +0200
  • 15ddc4903b Merge remote-tracking branch 'slyecho/server_refactor' digiwombat 2023-05-28 11:09:32 -0400
  • 36758b1009 Setting the ftype argument of the script as optional Jiri Podivin 2023-05-28 16:39:51 +0200
  • 7186d655a1
    seed and gen params Henri Vasserman 2023-05-28 17:03:01 +0300
  • 7740301db9 Set unspecified generation settings back to default. (Notes below) digiwombat 2023-05-28 09:18:47 -0400
  • dda915cac4 Added capturing the stopping word and sending it along with the final JSON. digiwombat 2023-05-28 08:43:38 -0400
  • 2e5c5ee224 Changed JSON names to match the parameter name rather than the variable name. digiwombat 2023-05-28 08:12:48 -0400
  • 23928f2887 Added generation_settings to final json object. digiwombat 2023-05-28 08:04:05 -0400
  • 5eacb84223 Display a warning if -ngl is supplied without support. KerfuffleV2 2023-05-28 05:48:36 -0600
  • e8efd75492 Initial timeout code and expanded json return on completion. digiwombat 2023-05-28 07:44:31 -0400
  • 28f1196f65 adjust default rep pen range Concedo 2023-05-28 19:36:21 +0800
  • 177868e68a Changed to params/args digiwombat 2023-05-28 06:29:11 -0400
  • a70095e961 Fix derp in ngl ifdef KerfuffleV2 2023-05-28 04:17:44 -0600
  • f40f6e8252 Documentation and arg help/handling updates KerfuffleV2 2023-05-28 03:58:03 -0600
  • 764a21ce0f Only show -ngl option when relevant + add warning for --memory-f32 option KerfuffleV2 2023-05-28 03:36:52 -0600
  • 549291fe61
    keep processed from the beginning Henri Vasserman 2023-05-28 12:08:37 +0300
  • df0e0d094c
    Forgot to remove some testing code. Randall Fitzgerald 2023-05-23 06:22:30 -0700
  • f93fe36c5b
    Add all generation parameters to server.cpp and allow resetting context Randall Fitzgerald 2023-05-23 06:16:54 -0700
  • 51e09944ce
    server rewrite Henri Vasserman 2023-05-28 02:42:18 +0300
  • c01c7d2caf Adding git in container package dependencies Jiri Podivin 2023-05-28 09:33:33 +0200
  • 7d159bacd7 updated kobold lite Concedo 2023-05-28 11:23:20 +0800
  • 0d308e2ef2 remove excessive codes and prints liang 2023-05-28 08:45:51 +0800
  • 1f40a789e6
    Didn't see the already defined top_k var. Randall Fitzgerald 2023-05-27 17:10:09 -0700
  • e84b802161
    Change top_k type. Randall Fitzgerald 2023-05-27 17:07:45 -0700
  • fdce8951ac
    Merge branch 'ggerganov:master' into master Randall Fitzgerald 2023-05-27 19:57:37 -0400
  • d20f36b93c
    Removed unnecessary last_prompt_token set Randall Fitzgerald 2023-05-27 16:46:05 -0700
  • 36c86d794d
    Automate Context resetting and minor fixes Randall Fitzgerald 2023-05-27 16:43:08 -0700
  • dbf50b4afc
    Merge 95dc4d7270 into a6704643b6 Henri Vasserman 2023-05-27 13:51:52 -0700
  • a6704643b6
    ggml : add support for the RISCV architecture (#1616) master-a670464 apcameron 2023-05-27 21:03:25 +0100
  • 6dea8d9f2e LLAMA_DEBUG adds debug symbols JohannesGaessler 2023-05-27 21:28:08 +0200
  • 66ed19d01f
    Corrected dashes in the help lines. Randall Fitzgerald 2023-05-27 11:51:21 -0700
  • dcc426e2de Merge branch 'master' into concedo_experimental Concedo 2023-05-28 01:08:39 +0800
  • 48cb16a51a
    Merge branch 'ggerganov:master' into master Randall Fitzgerald 2023-05-27 13:08:03 -0400
  • 0df7d63e5b
    Include server in releases + other build system cleanups (#1610) master-0df7d63 Kerfuffle 2023-05-27 11:04:14 -0600
  • 5d9f5b28a6 rwkv integration completed Concedo 2023-05-28 00:48:56 +0800
  • 9fdaa1d250
    Add more defs Henri Vasserman 2023-05-27 19:17:53 +0300
  • 81996ea82f Minor style fix in cast KerfuffleV2 2023-05-27 06:10:51 -0600
  • 2c5f38ba92 Apply style/style suggestions for server example KerfuffleV2 2023-05-27 06:00:26 -0600
  • b70917d2b4 Include server in releases, other build system cleanups KerfuffleV2 2023-05-27 05:08:58 -0600
  • 97c9b77c4f
    Add documentation about CLBlast (#1604) Henri Vasserman 2023-05-27 18:47:55 +0300
  • a4648c1e7c
    Merge 'origin/master' into hipblas Henri Vasserman 2023-05-27 18:22:39 +0300
  • 55e0fbf024 wip integrating new rwkv Concedo 2023-05-27 22:45:28 +0800
  • 98edbdba63 ADD Support for the RISCV Architecture Andrew Cameron 2023-05-27 15:34:40 +0100
  • a4e5f65659 Ensure ext_buffer is null terminated Howard Su 2023-05-27 21:04:42 +0800
  • 6c5c79a2a3 Use strstr to check if fp16 supported Howard Su 2023-05-27 20:13:49 +0800
  • 8c1a3746ba No need to allocate cl_mem on heap Howard Su 2023-05-27 20:17:41 +0800
  • 605f3fe290
    Add --alias option to gpt_params to set use friendly model name Vladimir Zorin 2023-05-27 17:25:37 +0300
  • 0ecb1bbbeb
    [CI] Fix openblas (#1613) master-0ecb1bb Henri Vasserman 2023-05-27 17:24:06 +0300
  • e84e30db2f
    fiiixx Henri Vasserman 2023-05-27 16:56:46 +0300
  • 4be5c264a7
    fix license copy Henri Vasserman 2023-05-27 16:40:34 +0300
  • 42d8271ee3
    syntax error fix Henri Vasserman 2023-05-27 16:24:03 +0300
  • 699bd99d4e
    Fix openblas build Henri Vasserman 2023-05-27 16:22:44 +0300
  • 93618031c7
    ggml : add ggml_tensor_overhead() master-9361803 Georgi Gerganov 2023-05-27 16:19:56 +0300
  • 83c54e6da5
    [CI] CLBlast: Fix directory name (#1606) Henri Vasserman 2023-05-27 15:18:25 +0300
  • 6d47258e41 n_session_consumed should just be size_t, but the cache code casts to (int) Danny Daemonic 2023-05-27 03:36:44 -0700
  • 573b690e16 Work around for recalculating logits in cached prompts Danny Daemonic 2023-05-27 03:20:03 -0700
  • fe63bfdb0f Revert "allow 2048 blasbatchsize" Concedo 2023-05-27 18:13:27 +0800
  • 97c5cca4e5 OpenCL: Don't load gpu layers into RAM, add mul_f32 kernel 0cc4m 2023-05-27 12:00:56 +0200
  • 94dc5c2324 allow 2048 blasbatchsize Concedo 2023-05-27 17:47:18 +0800
  • 92a0d77712 Merge branch 'master' into concedo_experimental Concedo 2023-05-27 17:44:14 +0800
  • abfdfb702e added top_a sampler Concedo 2023-05-27 17:32:37 +0800
  • bdbda1b17a
    ggml : sync ggml core (minor additions, e.g. ggml_get_tensor_by_name()) Georgi Gerganov 2023-05-27 12:22:05 +0300
  • ebc5d0651a Use events instead of clFinish, where possible 0cc4m 2023-05-27 10:03:35 +0200
  • 01a0f206df added support for starcoder, which is basically gpt2 Concedo 2023-05-27 13:35:40 +0800
  • 6d7749c98f no difference Concedo 2023-05-27 12:42:19 +0800