llama.cpp

cmdr2 f54a4ba11e Support pure float16 add/sub/mul/div operations in the CUDA (and CPU) backend (ggml/1121)	2025-03-03 18:18:11 +02:00
* Support float16-to-float16 add/sub/mul/div operations in the CUDA backend
* Add fp16 support for add/sub/mul/div on the CPU backend
* Add test cases for fp16 add/sub/mul/div
cmake	cmake: Fix ggml backend dependencies and installation (#11818)	2025-02-27 09:42:48 +02:00
include	ggml : upgrade init_tensor API to return a ggml_status (#11854)	2025-02-28 14:41:47 +01:00
src	Support pure float16 add/sub/mul/div operations in the CUDA (and CPU) backend (ggml/1121)	2025-03-03 18:18:11 +02:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	CUDA: compress mode option and default to size (#12029)	2025-03-01 12:57:22 +01:00