* Generalize quantize_fns for simpler FP16 handling * Remove call to ggml_cuda_mul_mat_get_wsize * ci : disable FMA for mac os actions --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> |
||
|---|---|---|
| .. | ||
| CMakeLists.txt | ||
| test-double-float.c | ||
| test-grad0.c | ||
| test-opt.c | ||
| test-quantize-fns.cpp | ||
| test-quantize-perf.cpp | ||
| test-sampling.cpp | ||
| test-tokenizer-0.cpp | ||