llama.cpp/ggml/src/ggml-vulkan
Jeff Bolz 02115dcd9a
vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)
This assert fired running Qwen_Qwen3-30B-A3B-Q2_K.gguf:

GGML_ASSERT(nei0 * nei1 <= 3072);

The tensor is 8 x 512, i.e. 4096 elements, which exceeds the old limit of 3072. Increase the array size to 4096 to accommodate it.
2025-05-09 09:23:41 +02:00
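
The arithmetic behind this commit can be checked with a minimal sketch. The helper `row_ids_fit` and the constants `kOldLimit` and `kNewLimit` are hypothetical names introduced here for illustration; they are not identifiers from ggml-vulkan.cpp, and the actual array lives in the Vulkan shader code, which this listing does not show.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from ggml-vulkan.cpp): true when a tensor of
// shape nei0 x nei1 fits in a fixed-size array holding `capacity` elements.
constexpr bool row_ids_fit(int64_t nei0, int64_t nei1, int64_t capacity) {
    return nei0 * nei1 <= capacity;
}

// Limits named in the commit message; the constant names are assumptions.
constexpr int64_t kOldLimit = 3072;
constexpr int64_t kNewLimit = 4096;

// The 8 x 512 tensor from the commit message has exactly 4096 elements.
static_assert(8 * 512 == 4096, "8 x 512 is 4096 elements");
// It overflows the old limit, which is why the assert fired...
static_assert(!row_ids_fit(8, 512, kOldLimit), "old limit is too small");
// ...and fits exactly under the new one.
static_assert(row_ids_fit(8, 512, kNewLimit), "new limit is large enough");
```

Because the new limit equals the tensor size exactly, any larger tensor would still trip the same kind of check.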
cmake            cmake: fix ggml-shaders-gen compiler paths containing spaces (#12747)  2025-04-04 10:12:40 -03:00
vulkan-shaders   vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)      2025-05-09 09:23:41 +02:00
CMakeLists.txt   vulkan: Add bfloat16 support (#12554)                                  2025-05-01 20:49:39 +02:00
ggml-vulkan.cpp  vulkan: Allow up to 4096 elements for mul_mat_id row_ids (#13326)      2025-05-09 09:23:41 +02:00