llama.cpp

Jeff Bolz	02115dcd9a	2025-05-09 09:23:41 +02:00
vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)

This assert fired running Qwen_Qwen3-30B-A3B-Q2_K.gguf:
GGML_ASSERT(nei0 * nei1 <= 3072);
The tensor is 8 x 512. Increase this array size to accommodate.

cmake	scripts : update sync + fix cmake merge	2025-03-27 10:09:29 +02:00
include	CUDA: fix bad asserts for partial offload (#13337)	2025-05-06 13:58:51 +02:00
src	vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)	2025-05-09 09:23:41 +02:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	whisper: remove MSVC warnings pragmas (whisper/3090)	2025-05-07 17:28:36 +03:00