llama.cpp

History

Jeff Bolz a4837577aa vulkan: use aligned loads for flash attention mask (#12853 ) Rewrite the stride logic for the mask tensor in the FA shader to force the stride to be aligned, to allow using more efficient loads.		2025-04-12 10:44:48 +02:00
..
cmake	scripts : update sync + fix cmake merge	2025-03-27 10:09:29 +02:00
include	ggml : add bilinear upscale support (ggml/1185)	2025-04-11 00:17:47 +03:00
src	vulkan: use aligned loads for flash attention mask (#12853 )	2025-04-12 10:44:48 +02:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	ggml : add logging for native build options/vars (whisper/2935)	2025-03-30 08:33:31 +03:00