llama.cpp/tests/test-double-float.cpp

// These tests may take a long time!
// They are to prove that conversion from double to float of various functions in ggml.c doesn't affect the result.
// This is done by checking all finite (non-NaN, non-infinite) floats.

#undef NDEBUG
#include <cassert>
#if !defined(__riscv) && !defined(__s390__) && !defined(__ARM_NEON)
#include <immintrin.h>
#endif
#include <cmath>
#include <cstdint>
#include <cstring>

#pragma GCC diagnostic push
#pragma GCC diagnostic ignored "-Wdouble-promotion"

// ggml.c::quantize_row_q4_0_ref
inline static uint8_t round_orig(float v0) { return ((int8_t) (round(v0))) + 8; }

// ggml.c::ggml_silu_f32
inline static float silu_orig(float x) {
    return x/(1.0 + exp(-x));
}

#pragma GCC diagnostic pop

// ggml.c::quantize_row_q4_0_ref
inline static uint8_t round_float(float v0) { return (int8_t)roundf(v0) + 8; }

// ggml.c::ggml_silu_f32
inline static float silu_float(float x) {
    return x/(1.0f + expf(-x));
}

int main(void) {
    uint32_t x = UINT32_MAX;
    do {
        float f;
        memcpy(&f, &x, sizeof(x));
        assert(!std::isfinite(f) || (round_orig(f) == round_float(f)));
    } while (x--);

#ifdef __F16C__
    // GELU and SILU implementations are used with a FP16 lookup table.
    // The original and float-only results are not equal for all inputs after converting to FP16.
    // GELU is an approximation anyway (tanh), not tested here.
    // For SILU, verify that the results are at least the closest floating point numbers, if the FP16 values don't match.
    for (x = 0; x <= UINT16_MAX; x++) {
        float f = _cvtsh_ss(x);
        const float so = silu_orig(f);
        const float sf = silu_float(f);
        assert(   (_cvtss_sh(so, 0) == _cvtss_sh(sf, 0))
               || (nextafterf(so, sf) == sf)
               || (nextafterf(sf, so) == so));
    }
#endif
}
all : be more strict about converting float to double (#458) * Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-03-28 16:48:20 +00:00			`// These tests may take a long time!`
			`// They are to prove that conversion from double to float of various functions in ggml.c doesn't affect the result.`
			`// This is done by checking all finite (non-NaN, non-infinite) floats.`

			`#undef NDEBUG`
tests : Fix compilation warnings (Linux/GCC) (#2451) * fix hellaswag print format, cast away warning in test-double-float * c++11 cannot use designated initializers * add static to test-grad0.c internal functions * use memcpy in test-double-float.c * port c tests to c++ * use initializer list for ggml_init_params 2023-08-02 08:06:19 +00:00			`#include <cassert>`
ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861) * ggml : move FP16 <-> FP32 stuff to ggml-impl.h ggml-ci * tests : fix ARM build * ggml : explicitly initialize deprecated type traits * ggml : add math.h to ggml-impl.h * ggml : remove duplicate static assert macros * ggml : prefix lookup tables with ggml_ ggml-ci * ggml-impl : move extern "C" to start of file 2023-10-30 17:19:15 +00:00			`#if !defined(__riscv) && !defined(__s390__) && !defined(__ARM_NEON)`
all : be more strict about converting float to double (#458) * Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-03-28 16:48:20 +00:00			`#include <immintrin.h>`
gguf : support big endian platform (#3552) * check whether platform is 390x if yes->do not import immintrin.h * support s390x big endian * support --bigendian option for s390x 1. verified with baichuan7b-chat with float 16 on s390x 2. verified with baichuan7b-chat 3. verified with chinese-alpaca-2-13b-f16 * update format based on editor-config checker result * Update convert-baichuan-hf-to-gguf.py * 1. check in ggml.c if endianess is not match 2. update GGUF version 3. change get_pack_prefix to property 4. update information log * always use "GGUF" as beginng of GGUF file * Compare "GGUF" with file header char by char 1. Set GGUF_MAGIC to "GGUF" string instead of int value 2. Compare "GGUF" char by char to ensure its byte order 3. Move bytes swap code from convert.py to gguf.py write_tensor_data --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-10-20 11:19:40 +00:00			`#endif`
tests : Fix compilation warnings (Linux/GCC) (#2451) * fix hellaswag print format, cast away warning in test-double-float * c++11 cannot use designated initializers * add static to test-grad0.c internal functions * use memcpy in test-double-float.c * port c tests to c++ * use initializer list for ggml_init_params 2023-08-02 08:06:19 +00:00			`#include <cmath>`
			`#include <cstdint>`
			`#include <cstring>`
all : be more strict about converting float to double (#458) * Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-03-28 16:48:20 +00:00
			`#pragma GCC diagnostic push`
			`#pragma GCC diagnostic ignored "-Wdouble-promotion"`

ggml : minor naming changes (#8433) * ggml : minor naming changes ggml-ci * ggml : use PRId64 [no ci] * ggml : revert FA K/Q names 2024-07-12 07:46:02 +00:00			`// ggml.c::quantize_row_q4_0_ref`
all : be more strict about converting float to double (#458) * Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-03-28 16:48:20 +00:00			`inline static uint8_t round_orig(float v0) { return ((int8_t) (round(v0))) + 8; }`

			`// ggml.c::ggml_silu_f32`
			`inline static float silu_orig(float x) {`
			`return x/(1.0 + exp(-x));`
			`}`

			`#pragma GCC diagnostic pop`

ggml : minor naming changes (#8433) * ggml : minor naming changes ggml-ci * ggml : use PRId64 [no ci] * ggml : revert FA K/Q names 2024-07-12 07:46:02 +00:00			`// ggml.c::quantize_row_q4_0_ref`
all : be more strict about converting float to double (#458) * Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-03-28 16:48:20 +00:00			`inline static uint8_t round_float(float v0) { return (int8_t)roundf(v0) + 8; }`

			`// ggml.c::ggml_silu_f32`
			`inline static float silu_float(float x) {`
			`return x/(1.0f + expf(-x));`
			`}`

			`int main(void) {`
			`uint32_t x = UINT32_MAX;`
			`do {`
tests : Fix compilation warnings (Linux/GCC) (#2451) * fix hellaswag print format, cast away warning in test-double-float * c++11 cannot use designated initializers * add static to test-grad0.c internal functions * use memcpy in test-double-float.c * port c tests to c++ * use initializer list for ggml_init_params 2023-08-02 08:06:19 +00:00			`float f;`
			`memcpy(&f, &x, sizeof(x));`
			`assert(!std::isfinite(f) \|\| (round_orig(f) == round_float(f)));`
all : be more strict about converting float to double (#458) * Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-03-28 16:48:20 +00:00			`} while (x--);`

			`#ifdef __F16C__`
			`// GELU and SILU implementations are used with a FP16 lookup table.`
			`// The original and float-only results are not equal for all inputs after converting to FP16.`
			`// GELU is an approximation anyway (tanh), not tested here.`
			`// For SILU, verify that the results are at least the closest floating point numbers, if the FP16 values don't match.`
			`for (x = 0; x <= UINT16_MAX; x++) {`
			`float f = _cvtsh_ss(x);`
			`const float so = silu_orig(f);`
			`const float sf = silu_float(f);`
			`assert( (_cvtss_sh(so, 0) == _cvtss_sh(sf, 0))`
			`\|\| (nextafterf(so, sf) == sf)`
			`\|\| (nextafterf(sf, so) == so));`
			`}`
			`#endif`
			`}`