llama.cpp/ggml
Akarshan Biswas b49a8ff96b
SYCL: Add mrope kernel (#13755)
* SYCL: Add mrope kernel

* feat: Optimize rope operations with vectorization

Uses `sycl::vec` to load and store two elements at a time,
significantly improving performance in `rope_norm`,
`rope_neox`, and `rope_multi`. This reduces the number of memory
accesses and leverages SIMD instructions for faster execution.

* Use ceil_div
2025-05-30 19:40:57 +05:30
..
cmake cmake: Factor out CPU architecture detection (#13883) 2025-05-29 12:50:25 +02:00
include ggml : add ggml_repeat_4d (#13824) 2025-05-27 15:53:55 +02:00
src SYCL: Add mrope kernel (#13755) 2025-05-30 19:40:57 +05:30
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817) 2025-05-27 18:39:07 +02:00