llama.cpp

Nicolò Scipione b460d16ae8 sycl: Add reorder to Q6_K mmvq implementation (#13885)	2025-06-09 11:47:07 +02:00
* Add Reorder to Q6_K mmvq implementation
* Address PR comments: clean up comments
* Remove unused parameter after refactoring q4_k
* Adding inline to function and removing unnecessary reference to int
Signed-off-by: nscipione <nicolo.scipione@codeplay.com>
cmake	cmake: Factor out CPU architecture detection (#13883)	2025-05-29 12:50:25 +02:00
include	ggml : remove ggml_graph_import and ggml_graph_export declarations (ggml/1247)	2025-06-01 13:43:57 +03:00
src	sycl: Add reorder to Q6_K mmvq implementation (#13885)	2025-06-09 11:47:07 +02:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (#14013)	2025-06-05 11:57:42 +02:00