llama.cpp

History

Johannes Gäßler e1e8e0991f CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199 )		2025-04-30 23:12:59 +02:00
..
cmake	scripts : update sync + fix cmake merge	2025-03-27 10:09:29 +02:00
include	CUDA: fix q_nope_absorbed prec for DS 2 Lite f16 (#13137 )	2025-04-28 09:29:26 +02:00
src	CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199 )	2025-04-30 23:12:59 +02:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871 )	2025-04-21 18:13:51 +02:00