llama.cpp/ggml/src/ggml-opencl
lhez d84635b1b0
opencl: improve profiling (#12442)
* opencl: more profiling timing

* opencl: generate trace for profiling

* opencl: reduce profiling overhead

* Populate profiling timing info at the end rather than after each
  kernel run

* opencl: fix for chrome tracing
2025-03-18 12:54:55 -07:00
kernels          opencl: Noncontiguous norm, rms_norm, disable fp16 for some ops (#12217)   2025-03-07 00:20:35 +00:00
CMakeLists.txt   opencl: use OpenCL C standard supported by the device (#12221)             2025-03-10 09:57:00 -07:00
ggml-opencl.cpp  opencl: improve profiling (#12442)                                         2025-03-18 12:54:55 -07:00