Commit Graph

  • 99c5b27654
    ggml : refactor quantized processing functions (#509) master-99c5b27 Stephan Walter 2023-03-28 17:13:01 +0000
  • 1229722c61
    Merge branch 'master' into q-refactor Georgi Gerganov 2023-03-28 20:11:56 +0300
  • a0c2401359
    ggml : minor Georgi Gerganov 2023-03-28 20:10:14 +0300
  • cbddf4661b
    Get mmap() working with WIN32 MSVC Justine Tunney 2023-03-28 09:27:41 -0700
  • 692ce3164e
    py : removed unused model variable and verified that the code functions correctly with vocab_only setting. Also confirmed that the code works as expected after running with reduced memory usage due to deletion of no-longer-needed variable. (#547) DooWoong Lee (David) 2023-03-29 02:02:34 +0900
  • 96f9c0506f
    ci : make ctest verbose, hopefully we see what is wrong with the sanitizer master-96f9c05 Georgi Gerganov 2023-03-28 20:01:09 +0300
  • d502bc7c9d
    tests : free llama context at the end of the test master-d502bc7 Georgi Gerganov 2023-03-28 19:51:55 +0300
  • 436e561931
    all : be more strict about converting float to double (#458) master-436e561 Stephan Walter 2023-03-28 16:48:20 +0000
  • 21e9ce7574
    perplexity : add <cmath> Georgi Gerganov 2023-03-28 19:40:01 +0300
  • 20e1e84884
    deploy : add a Package.swift for SwiftPM support (#393) master-20e1e84 Jed Fox 2023-03-28 11:39:01 -0500
  • 61733d3b49
    all : prefer float over double where appropriate Georgi Gerganov 2023-03-28 19:11:31 +0300
  • e4881686b4
    Make WIN32 mmap() improvements (#341) oKatanaaa 2023-03-21 01:46:44 +0400
  • f68345e9b1
    Fix softmax in perplexity.cpp Stephan Walter 2023-03-26 12:36:55 +0200
  • 3a42193b3d
    Test equivalence of round, SILU implementations Stephan Walter 2023-03-25 17:00:29 +0100
  • 54b75a77fb
    Be more strict about converting float to double Stephan Walter 2023-03-24 10:26:44 +0100
  • c1f885067c
    ggml : introduce structs for the q4 data blocks (#356) master-c1f8850 Stephan Walter 2023-03-28 15:56:03 +0000
  • 6a3b29a923
    ggml : rename quant struct variables + fix ARM_NEON Georgi Gerganov 2023-03-28 18:52:33 +0300
  • e0670260fb
    gitignore : add "embedding" Georgi Gerganov 2023-03-28 18:34:35 +0300
  • ce3f7adc85
    Fix linking on mingw32 anzz1 2023-03-28 18:14:04 +0300
  • 28ba975aea
    Check the existence of f16_model_path_base in quantize.py (#574) dotpy314 2023-03-28 23:06:28 +0800
  • 25248d7391 Use the same threshold for OpenBLAS and ggml thread limiting Maël Kerbiriou 2023-03-28 16:51:45 +0200
  • 2e6c295bc7
    CMake: Add explicit F16C option (x86) anzz1 2023-03-28 17:43:32 +0300
  • a6bdc47cba
    Fix usage of F16C intrinsics in AVX code (#563) master-a6bdc47 slaren 2023-03-28 16:26:55 +0200
  • 40c8e68122 Check the existence of f16_model_path_base in quantize.py Jincheng Miao 2023-03-28 22:13:16 +0800
  • 7b8dbcb78b
    main.cpp fixes, refactoring (#571) master-7b8dbcb anzz1 2023-03-28 17:09:55 +0300
  • 51266e4ae7
    n_keep help update anzz1 2023-03-28 16:54:29 +0300
  • ebf09a1919
    * -> & anzz1 2023-03-28 16:02:40 +0300
  • fcabe9b8b2
    found this one on the floor anzz1 2023-03-28 15:49:40 +0300
  • 021bdf237a
    main.cpp fixes, refactoring anzz1 2023-03-28 15:43:16 +0300
  • 911782cfdd Use more accurate function names Slaren 2023-03-28 14:29:09 +0200
  • 7c97743ea6 Fix linker error for tests kirillsurkov 2023-03-28 13:40:25 +0300
  • bf30406f50 Merge branch 'master' into concedo Concedo 2023-03-28 17:13:38 +0800
  • 99590bf992
    CI: github runner avx512f detection fix (windows) anzz1 2023-03-28 11:31:49 +0300
  • 9e03cba624
    Merge branch 'master' into feat-instruct-cpp anzz1 2023-03-28 09:17:04 +0300
  • 5ffcf7e2de
    Add instruct.cpp to Makefile anzz1 2023-03-28 09:14:09 +0300
  • 4b8efff0e3
    Add embedding example to Makefile (#540) master-4b8efff RJ Adriaansen 2023-03-28 08:11:09 +0200
  • 30317983c4
    Added support for _POSIX_MAPPED_FILES if defined in source CoderRC 2023-03-27 20:47:37 -0400
  • ab6ac3d335 Fix usage of F16C intrinsics in AVX code Slaren 2023-03-27 23:33:49 +0200
  • 926e49e78e
    Merge branch 'ggerganov:master' into master R.Kaufmann 2023-03-27 21:11:08 +0200
  • 120bda4c18
    Update alpaca.sh anzz1 2023-03-27 17:22:15 +0300
  • 4971df4f5c
    remove -ins --instruct cmdline arg anzz1 2023-03-27 17:06:03 +0300
  • c8048f0cbd
    instruct.cpp, continue on empty line, endless instruct mode, refactors anzz1 2023-03-27 16:32:48 +0300
  • 46ddbb22bf allow url params Concedo 2023-03-27 17:40:05 +0800
  • 6cf5bde4c3
    Revert 7e53955 (#542) anzz1 2023-03-27 10:51:27 +0300
  • b983f4fc70 Removed unused model variable and verified that the code functions correctly with vocab_only setting. Also confirmed that the code works as expected after running with reduced memory usage due to deletion of no-longer-needed variable. fritzprix 2023-03-27 14:35:02 +0900
  • 7e5395575a
    Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542) Marco Matthies 2023-03-27 06:55:26 +0200
  • 1bf20a34b0
    CI: fix subdirectory path globbing anzz1 2023-03-27 07:52:56 +0300
  • 13dedf2dbe Fix missing ggml link in cmake for examples/* on w64-mingw32 Marco Matthies 2023-03-27 01:29:19 +0200
  • 58b3cbc20b
    Add embedding example to Makefile RJ Adriaansen 2023-03-26 23:34:42 +0200
  • 53a187d2c0 Add script to convert old ggml files to newer version Jakub Horak 2023-03-26 22:47:45 +0200
  • b8a80f9994 Add __SSE3__ macro for MSVC too because why not anzz1 2023-03-22 05:47:05 +0200
  • 67ad7c445c MSVC cvt intrinsics anzz1 2023-03-22 04:36:29 +0200
  • c43d45696f Enable F16C/CVT16 vector extensions on MSVC anzz1 2023-03-22 04:05:49 +0200
  • 50ea0027a4 Enable Fused-Multiply-Add (FMA) instructions on MSVC anzz1 2023-03-22 03:46:48 +0200
  • 3a6b10d22d Refactor quantized processing functions Stephan Walter 2023-03-25 18:40:28 +0100
  • 34c1072e49
    ci: add debug build to sanitizer build matrix (#527) master-34c1072 Erik Scholz 2023-03-26 17:48:40 +0200
  • d54bed0e37
    ci: add debug build to sanitzier build matrix Green Sky 2023-03-26 16:26:22 +0200
  • 939ad2d3a5
    Fix undefined variables in debug build, remove unused variables (#531) master-939ad2d Stephan Walter 2023-03-26 15:34:02 +0000
  • 8fca119295 Fix undefined variables in debug build, remove unused variables Stephan Walter 2023-03-26 17:16:15 +0200
  • 35e76863db Treat empty line input as "no input" Jakub Horak 2023-03-26 16:50:02 +0200
  • 8c2ec5e21d
    Add support for linux/arm64 platform during Docker Builds (#514) master-8c2ec5e Juan Calderon-Perez 2023-03-26 10:48:42 -0400
  • 417bd2d677 Add backwards-compatibility for older model format Jakub Horak 2023-03-26 16:23:11 +0200
  • 92d10215ec Introduce structs for the q4 data blocks Stephan Walter 2023-03-21 16:14:59 +0100
  • b391579db9
    Update README and comments for standalone perplexity tool (#525) master-b391579 Stephan Walter 2023-03-26 13:14:01 +0000
  • 7a87d31f4f
    [main] fix infinite generation (-n == -1) (#523) master-7a87d31 anzz1 2023-03-26 16:06:10 +0300
  • e2152356dc Update README and comments for standalone perplexity tool Stephan Walter 2023-03-26 14:59:13 +0200
  • 7312191d1f Update README and comments for standalone perplexity tool Stephan Walter 2023-03-26 14:59:13 +0200
  • d4fd8ccbf8
    [main] fix infinite generation (-n == -1) anzz1 2023-03-26 15:35:01 +0300
  • 8e1fb49abf
    Merge branch 'ggerganov:master' into master R.Kaufmann 2023-03-26 13:00:52 +0200
  • 348d6926ee
    Add logo to README.md Georgi Gerganov 2023-03-26 10:20:49 +0300
  • 053b20c8ca merged complete Concedo 2023-03-26 14:55:43 +0800
  • 33b5d2c376 Merge branch 'master' into concedo Concedo 2023-03-26 14:52:14 +0800
  • 57474944d6 Merge branch 'master' into concedo Concedo 2023-03-26 14:52:08 +0800
  • 33e35b8fe8
    Exit from interactive mode if input stream is bad (#491) master-33e35b8 Harald Fernengel 2023-03-26 07:25:46 +0200
  • 7dca16bcfd Add AVX2 implementation of quantize_row_q4_1 Slaren 2023-03-26 01:01:16 +0100
  • 1582a04085 Exit from interactive mode if input stream is bad Harald Fernengel 2023-03-25 12:48:39 +0100
  • 7a7e0acf1a
    Merge branch 'ggerganov:master' into master R.Kaufmann 2023-03-26 00:23:18 +0100
  • 633dec89af
    Add platform to versioned builds Juan Calderon-Perez 2023-03-25 19:18:10 -0400
  • 840645dea7 trace logits to a file Maël Kerbiriou 2023-03-24 21:59:07 +0100
  • bbf5b04a95
    Add support for linux/arm64 platform Juan Calderon-Perez 2023-03-25 19:15:41 -0400
  • 19726169b3
    CI: Run other sanitizer builds even if one fails (#511) master-1972616 anzz1 2023-03-26 00:13:28 +0200
  • f732695cd5
    Clarify console output in convert-pth-to-ggml.py (#512) jp-x-g 2023-03-25 14:53:55 -0700
  • a157fe69ca
    Clarify console output in convert-pth-to-ggml.py jp-x-g 2023-03-25 14:50:25 -0700
  • aca5a9e74d
    CI: Run other sanitizer builds even if one fails anzz1 2023-03-25 23:44:47 +0200
  • 2f7bf7dd7c
    CMake / CI additions (#497) master-2f7bf7d anzz1 2023-03-25 23:38:11 +0200
  • 43523220a4 Remove perplexity from main Gary Linscott 2023-03-25 13:33:42 -0700
  • 7392ad629d update from merge Gary Linscott 2023-03-25 13:30:40 -0700
  • 34ab526843
    (Windows) Set console to UTF-8 on init (#420) master-34ab526 anzz1 2023-03-25 22:29:22 +0200
  • c3d3cd2d45 Merge branch 'master' into batch_perplexity Gary Linscott 2023-03-25 13:24:22 -0700
  • 098eb922b8
    Merge branch 'ggerganov:master' into master R.Kaufmann 2023-03-25 21:23:56 +0100
  • c3dc4dbb1e merge master anzz1 2023-03-25 22:20:45 +0200
  • c2b25b6912
    Fix colors enabling on WIN32 master-c2b25b6 Georgi Gerganov 2023-03-25 21:53:39 +0200
  • 79b2b266db
    If n_predict == -1, generate forever Georgi Gerganov 2023-03-25 21:51:41 +0200
  • 004fddfed7
    CI: Add sanitizer build (Ubuntu) anzz1 2023-03-25 21:44:27 +0200
  • 779c37f916
    cmake: make sanitizers linking #468 anzz1 2023-03-25 21:39:05 +0200
  • e2d490dafd
    Inifinite generation via context swapping (#71) Georgi Gerganov 2023-03-25 21:36:22 +0200
  • 2d27013343
    test avx-512f only when possible anzz1 2023-03-25 21:28:31 +0200
  • 03f7e33560
    Cleanup STL headers + fix embedding examples + minor stuff master-03f7e33 Georgi Gerganov 2023-03-25 20:51:14 +0200
  • 55ad42af84
    Move chat scripts into "./examples" Georgi Gerganov 2023-03-25 20:36:52 +0200
  • 459e93cce0
    Add AVX2 implementation of dequantize_row_q4_1 (#505) master-459e93c slaren 2023-03-25 19:31:48 +0100