llama.cpp/ggml
uvos 34c961b181
CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)
When fattn-wmma was ported over to warp64, various bits that also touch fattn-vec were
converted to a selectable warp size. However, the fattn-vec kernels don't work with
64-wide warps for now, so we need to avoid launching them with parameters for warp64.
2025-03-12 10:14:11 +01:00
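The gist of the fix, as a minimal sketch (all names below are hypothetical illustrations, not ggml's actual API): select launch parameters per kernel family, so that fattn-vec keeps warp-32 semantics even on devices whose native warp size is 64 (e.g. AMD GPUs under HIP), while kernels actually ported to warp64, such as fattn-wmma, may use the device's warp size.

```cpp
// Hypothetical sketch of the warp-size selection logic described above.
enum class kernel_family { FATTN_VEC, FATTN_WMMA };

static int launch_warp_size(kernel_family family, int device_warp_size) {
    switch (family) {
        case kernel_family::FATTN_VEC:
            // fattn-vec assumes 32 lanes per warp; launching it with
            // warp64 parameters breaks the kernel, so pin it to 32.
            return 32;
        case kernel_family::FATTN_WMMA:
            // Ported to a selectable warp size, so the device's native
            // warp size (32 or 64) is safe here.
            return device_warp_size;
    }
    return 32;
}

// Example use when building a launch configuration, one warp per row:
//   dim3 block(launch_warp_size(kernel_family::FATTN_VEC, device_warp_size), nwarps, 1);
```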
cmake cmake: Fix ggml backend dependencies and installation (#11818) 2025-02-27 09:42:48 +02:00
include ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (#12154) 2025-03-06 02:26:10 +01:00
src CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315) 2025-03-12 10:14:11 +01:00
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt opencl: use OpenCL C standard supported by the device (#12221) 2025-03-10 09:57:00 -07:00