llama.cpp

History

Neo Zhang Jianyu 08d5986290 [SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035 ) * opt performance by reorder for Intel GPU * detect hw type and save opt feature, and print opt feature * correct name * support optimize graph once when compute graph, record the opt status in tensor->extra, make CI passed * add env variable GGML_SYCL_DISABLE_OPT for debug * use syclex::architecture replace the custom hw define, update the guide for GGML_SYCL_DISABLE_OPT * add performance data * mv getrows functions to separeted files * fix global variables --------- Co-authored-by: arthw <14088817+arthw@users.noreply.github.com>		2025-02-24 22:33:23 +08:00
..
backend	[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035 )	2025-02-24 22:33:23 +08:00
development	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
android.md	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
build.md	MUSA: support ARM64 and enable dp4a .etc (#11843 )	2025-02-21 09:46:23 +02:00
cuda-fedora.md	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
docker.md	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
install.md	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
llguidance.md	llguidance build fixes for Windows (#11664 )	2025-02-14 12:46:08 -08:00